Numerical differentiation is widely used in econometrics and many other areas of quantitative economic analysis. Many functions that need to be differentiated in econometric analysis need to be estimated from the data. For example, estimating the approximate variance of an estimator often requires estimating the derivatives of the moment conditions that define the estimator. Many estimators are also obtained by finding the zeros of the first order condition of the sample objective functions.

The estimated functions can be either non-differentiable or difficult to differentiate analytically. Oftentimes the estimated functions are complex and can be challenging to compute even numerically. Empirical researchers often apply numerical differentiation methods which depend on taking a finite number of differences of the objective function at discrete points, either explicitly or implicitly through the use of software routines, to the estimated functions from the sample in order to approximate the derivative of the unknown true functions.

A key tuning parameter that determines how well the numerical derivatives approximate the analytic derivatives is the step size used in the finite differencing operation. Empirical researchers often find that different step sizes can lead to very different numerical derivative estimates. While the importance of numerical derivatives has not gone unnoticed in econometrics, statistics and mathematics, the results that are available in the existing literature are very limited in scope.

The goal of this project is to take an important step to provide a systematic framework for understanding the conditions on the step size in numerical di^erentiation that are needed to obtain the optimal quality of approximation. These conditions involve subtle tradeoffs between the complexity of the function that needs to be differentiated and the amount of information that is available in the sample of data, and the degree of smoothness of the expectation of the function with respect to the sampling distribution. Empirical process theory provides a powerful tool for analyzing the complex of functions in the presence of randomly sampled data.

This project focuses on analyzing the use of numerical derivatives in estimating the asymptotic variance of estimators and in obtaining extreme estimators through gradient based optimization routines. The PIs' first goal is to give general sufficient consistency conditions that allow for nondifferentiable and discontinuous moment functions in consistent variance estimation. The precise rate conditions for the step size in numerical differentiation that we obtain depend on the tradeoff between bias and the degree of nonsmoothness of the moment condition. These general conditions can be specialized for certain continuous models, for which choosing a smaller step size can only be beneficial in reducing the asymptotic bias. However, the asymptotic bias will be dominated by the statistical noise once it falls below a certain threshold.

The second goal of this project is to analyze a class of estimators that are based on numerically differentiating a finite sample objective function, and provide conditions under which numerical derivative based optimization methods deliver consistent and asymptotic normal parameter estimates. The conditions for numerical extreme estimators require that the step size used in the numerical derivative has to converge to zero at specific rates when the sample size increases to infinity. The conditions required for the consistency of the asymptotic variance and for the convergence of the estimator itself can be different. The PIs seek extensive results that cover finite dimensional parametric models, infinite dimensional semiparametric models, and models that are defined by U-processes involving multiple layers of summation over the sampling data.

The proposed project involves joint work with Professor Aprajit Mahajan from Stanford University.

Project Report

This project involves developing theoretical statistical properties of the use of numerical derivatives in econometric analysis and validating the theoretical results through computer simulations. A major result is that when numerical derivative is being used to estimate the variance-covariance matrix of consistent estimator, the condition on the step size that is needed for consistent can be shown to be much weaker than the known results available in the literature. The precise condition depends on the degree of smoothness of the objective function that is to be differentiated. For smooth functions the choice of the step size does not have to balance the tradeoff between the bias and the variance, and only needs balance the reduction of the bias with the lower bound imposed by the machine precision. A second main result is that when numerical derivative is being used to optimize an objective function to obtain an econometric estimator, it can substantially change the statistical properties of the resulting estimator. For a class of objective functions that has been extensively analyzed in econometrics and statistics, the use of numerical differentiation can transform a nonstandard parametric problem into a smooth but nonparametric estimation problem. Numerical methods are widely used in many different areas including public finance, macroeconomics, labor economics and industrial organization. This project is among the first to analyze the statistical impact of numerical methods on econometric procedures. We believe that the results that will be developed will be relevant for many areas of empirical research that make extensive use of these techniques. Our project bridges the gap between mathematics, statistics and economics. Mathematical studies of numerical algorithms often only focus on its analytical properties and not its statistical properties. However, statistical properties of numerical algorithms are at least as important as, if not more important than, their analytical properties as they are often applied to econometrics routines implemented using real life statistical data. This project also relates closely to a very interesting literature of empirical processes which analyze how the complexity of a class of function relates to the uniform statistical convergence properties under suitable sampling assumptions. The use of empirical process theory brings together a deeper understanding of the rules that a researcher needs to understand when using numerical methods. Numerical analysis and econometrics are traditionally two very different areas in the economics research community. This project has the potential of bringing these two groups of researchers closer and bridging the gap between them, and as a result can improve the institutional structure within economics research communities.

Agency
National Science Foundation (NSF)
Institute
Division of Social and Economic Sciences (SES)
Type
Standard Grant (Standard)
Application #
1024504
Program Officer
Nancy Lutz
Project Start
Project End
Budget Start
2010-09-15
Budget End
2014-08-31
Support Year
Fiscal Year
2010
Total Cost
$157,280
Indirect Cost
Name
Stanford University
Department
Type
DUNS #
City
Stanford
State
CA
Country
United States
Zip Code
94305