What does IRS individual taxpayer data say about taxes paid by race in 2018–2021?

Checked on December 17, 2025
Disclaimer: Factually can make mistakes. Please verify important information or breaking news. Learn more.

This fact-check may be outdated. Consider refreshing it to get the most current information.

Executive summary

The IRS does not collect race or ethnicity on tax returns, so there are no official IRS tables that directly report taxes paid by race for 2018–2021; analyses that show racial patterns in tax burdens rely on imputation or matching methods developed by Treasury, researchers and advocacy groups [1] [2] [3]. Treasury’s Office of Tax Analysis and outside groups have imputed race/Hispanic origin into tax records to estimate how tax expenditures and enforcement outcomes vary by race [4] [3].

1. No direct IRS breakdown: the data gap that shapes every claim

The fundamental fact is plain: the IRS does not ask taxpayers their race or ethnicity on tax forms, and it does not publish tax liabilities or payments by race in its standard Statistics of Income or Data Book releases [1] [5] [6]. That absence means any statement about “taxes paid by race” for 2018–2021 is not reporting a direct IRS measurement but an inference built on other data or statistical imputation [1] [3].

2. How analysts get race into tax data: imputation and re‑weighting

To analyze racial patterns, Treasury and academic teams impute race/Hispanic origin using observable fields in tax records — income, filing status, age, dependents, names and ZIP Code Tabulation Areas — then re‑weight samples to match demographic distributions from surveys [3]. Treasury’s Individual Tax Model and subsequent papers explain that imputed “RH weights” are used to estimate distributional impacts and the benefits of tax expenditures by race [3] [4].

3. What Treasury and others found when they imputed race

Treasury research and follow‑on papers examine how large tax expenditures and credits are distributed across imputed racial groups; those analyses have documented disparities in who benefits from particular tax provisions, though the exact magnitudes depend on modeling choices and the imputation method [4] [2]. The Treasury feature announcing the work emphasizes the need to refine the approach and that results come from imputed—not self‑reported—race [2].

4. Audit disparities: independent studies find uneven enforcement

Separate from tax liability estimates, several studies examining IRS audit selection find disparities tied to race even though race is not observed in IRS records. For example, research using partial identification methods reports that Black taxpayers face audit rates several times higher than non‑Black taxpayers, with much of the gap concentrated among Earned Income Tax Credit claimants [7] [6]. Congressional and academic reporting has echoed concerns that Black taxpayers may be audited at higher rates than their share of the population would predict [6].

5. Why imputation matters — and what it cannot prove alone

Imputation permits important analysis but carries limits: imputations depend on correlates like names and geography that imperfectly infer race, and different imputation choices change estimated distributions of tax benefits [3] [4]. The Government Accountability Office, Treasury, and independent analysts all stress that absent direct collection or secure linkage to self‑reported demographic sources, conclusions remain model‑dependent [8] [3].

6. Policy and political stakes behind the data debate

Calls to collect race on tax forms or to enable secure linkages reflect concerns that the tax code can exacerbate or mitigate racial inequities — proponents argue better data would reveal who benefits from tax breaks and who bears enforcement costs, while critics fear privacy harms and reduced compliance if taxpayers distrust the IRS [9] [8]. Treasury’s work on imputation is presented as a compromise: learning more without changing tax forms [2] [3].

7. What the public can reliably conclude about 2018–2021

Available sources do not provide IRS‑reported totals of taxes paid by race for 2018–2021 because the agency lacks race/ethnicity fields; published estimates for those years come from imputed analyses and independent studies that document disparities in benefits and enforcement but differ in methods and results [1] [3] [4] [7]. Any specific dollar totals or shares by race for 2018–2021 must therefore be tied to a named imputation study and interpreted in light of that study’s assumptions [4] [3].

8. Recommended next steps for clearer public understanding

Policymakers and researchers should (a) increase transparency about imputation methods and uncertainty in published estimates [3] [4], (b) consider secure, legally vetted ways to link tax records to self‑reported demographic data where permissible [8] [2], and (c) prioritize audits of selection systems and algorithms to reduce disparate enforcement outcomes shown in audit research [7] [6].

Limitations: this summary relies solely on the provided materials; available sources do not contain an IRS table that directly reports taxes paid by race for 2018–2021, nor do they present a single, uncontested set of dollar figures for those years [1] [5].

Want to dive deeper?
How did total federal income taxes paid vary by race from 2018 to 2021 according to IRS data?
What methodology does the IRS use to estimate taxpayers' race and ethnicity in individual data files?
How do income distribution and filing rates explain racial differences in taxes paid 2018–2021?
What role did COVID-19 tax credits and stimulus payments play in racial disparities in tax burdens in 2020–2021?
Are there state-level patterns or policy factors that affected taxes paid by race between 2018 and 2021?