Appendix A: Implementation Code¶
A.1 Quantile Regression Forest Implementation¶
The following code demonstrates the implementation of Quantile Regression Forests for variable imputation:
from quantile_forest import RandomForestQuantileRegressor
qrf = RandomForestQuantileRegressor(
n_estimators=100,
min_samples_leaf=1,
random_state=0
)A.2 PyTorch Optimization for Reweighting¶
The reweighting optimization uses PyTorch for gradient-based optimization:
import torch
# Initialize with log of original weights
log_weights = torch.log(original_weights)
log_weights.requires_grad = True
# Adam optimizer
optimizer = torch.optim.Adam([log_weights], lr=0.1)
# Optimization loop
for iteration in range(5000):
weights = torch.exp(log_weights)
achieved = weights @ loss_matrix
relative_errors = (achieved - targets) / targets
loss = torch.mean(relative_errors ** 2)
optimizer.zero_grad()
loss.backward()
optimizer.step()Appendix B: Tables¶
Table A1: Complete List of Imputed Variables¶
Variables Imputed from IRS Public Use File (67 variables)¶
Income Variables:
employment_income
partnership_s_corp_income
social_security
taxable_pension_income
tax_exempt_pension_income
long_term_capital_gains
short_term_capital_gains
taxable_ira_distributions
self_employment_income
qualified_dividend_income
non_qualified_dividend_income
rental_income
taxable_unemployment_compensation
taxable_interest_income
tax_exempt_interest_income
estate_income
miscellaneous_income
farm_income
alimony_income
farm_rent_income
non_sch_d_capital_gains
long_term_capital_gains_on_collectibles
unrecaptured_section_1250_gain
salt_refund_income
Deductions and Adjustments:
interest_deduction
unreimbursed_business_employee_expenses
pre_tax_contributions
charitable_cash_donations
self_employed_pension_contribution_ald
domestic_production_ald
self_employed_health_insurance_ald
charitable_non_cash_donations
alimony_expense
health_savings_account_ald
student_loan_interest
investment_income_elected_form_4952
early_withdrawal_penalty
educator_expense
deductible_mortgage_interest
Tax Credits:
cdcc_relevant_expenses
foreign_tax_credit
american_opportunity_credit
general_business_credit
energy_efficient_home_improvement_credit
amt_foreign_tax_credit
excess_withheld_payroll_tax
savers_credit
prior_year_minimum_tax_credit
other_credits
Qualified Business Income Variables:
w2_wages_from_qualified_business
unadjusted_basis_qualified_property
business_is_sstb
qualified_reit_and_ptp_income
qualified_bdc_income
farm_operations_income
estate_income_would_be_qualified
farm_operations_income_would_be_qualified
farm_rent_income_would_be_qualified
partnership_s_corp_income_would_be_qualified
rental_income_would_be_qualified
self_employment_income_would_be_qualified
Other Tax Variables:
traditional_ira_contributions
qualified_tuition_expenses
casualty_loss
unreported_payroll_tax
recapture_of_investment_credit
Variables Imputed from Survey of Income and Program Participation (1 variable)¶
tip_income
Variables Imputed from Survey of Consumer Finances (3 variables)¶
networth
auto_loan_balance
auto_loan_interest
Variables Imputed from American Community Survey (2 variables)¶
rent
real_estate_taxes