Working Papers
Bounding a Class of Parameters in Measurement Error Models under Data Combination, Job Market Paper (Draft Coming Soon)
[Abstract]
Validation data that includes both the measurements and the true values provides an effective way to address measurement error. However, obtaining this data can be challenging due to issues related to data collection and matching. This paper studies measurement error models when the measurement and the true value are observed in two different datasets that cannot be matched. We consider a class of parameters that are essential for understanding (non-classical) measurement error and derive their bounds by solving linear programming problems. Our framework allows for an extensive set of identifying assumptions, ensuring flexible structures for the measurement error. We demonstrate our approach using two unmatched datasets: one with reported welfare benefits and the other with actual benefits.
Assessing Measurement Error in Linear Instrumental Variables Models
[Abstract] | [Draft]
In linear regression analysis, it is common to use instruments to address measurement error in the regressor. However, bias can still arise if the measurement error correlates with either the true variable, other regressors, or the instrument. This paper develops a sensitivity analysis framework for linear instrumental variables (IV) models that accounts for such concerns. We establish bounds for the parameter of interest using a set of sensitivity parameters that restrict the consistent deviations of the measurement from the true variable. We illustrate our methods in an empirical study that uses twins data to analyze the effect of schooling level on wages.
Testing Beta Pricing Models with Latent Factors, with Kunpeng Li, Qi Li, and Guofu Zhou
[Abstract] | [SSRN]
We propose a test for assessing the validity of linear asset pricing models with latent factors. We establish the asymptotic normality of the test statistic under the null as both the cross-sectional (n) and time-series (T) dimensions grow large. To improve power against sparse alternatives, we incorporate a screening statistic. Simulations demonstrate favorable finite-sample performance. Applying our test to a broad set of anomalies, we find that a small number of latent factors cannot explain their returns. We also revisit the influential study of Giglio and Xiu (2021), which develops a novel method for extracting macro-risk premia, and show that a new extraction approach is needed to account for the more complicated pricing errors revealed by our test.
Boundary Adjusted, Polynomial Adaptive, Nonparametric Kernel Conditional Density Estimation, with Qi Li and Jeffrey S. Racine (Draft Available Upon Request)
[Abstract]
We consider local polynomial nonparametric kernel estimation of conditional density functions g(y|x) when bounds [a,b] on the response Y may be known or unknown. When bounds are known, we use kernel functions for the response that exploit their presence, while when bounds are unknown we advocate using the empirical support [min(Y_i), max(Y_i)] in their place. A key feature of our approach is the use of data-driven methods for joint selection of the order of the approximating local polynomial, p, and the bandwidths for the response and predictors, h_y and h_x. This approach is able to automatically adjust to the presence of unknown bounds and automatically adapt the degree of the approximating polynomial and the bandwidths to the underlying data-generating process, thereby freeing the practitioner from having to select any of these key parameters in an otherwise ad hoc manner. Theoretical underpinnings are provided, and simulations along with an illustrative application indicate that this approach is particularly useful when the distribution of Y is bounded but the bounds are not known a priori, while it remains competitive with existing methods in infinite-support settings. An R package that implements these methods is available for interested readers.
Works in Progress
Tighter Gini-Frisch Bounds