EPSRC Reference: |
EP/J006742/1 |
Title: |
Semiparametric Sample Selection Models with Applications in Biostatistics, Economics and Environmetrics |
Principal Investigator: |
Marra, Dr G |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Statistical Science |
Organisation: |
UCL |
Scheme: |
First Grant - Revised 2009 |
Starts: |
01 May 2012 |
Ends: |
30 June 2013 |
Value (£): |
99,723
|
EPSRC Research Topic Classifications: |
Statistics & Appl. Probability |
|
|
EPSRC Industrial Sector Classifications: |
No relevance to Underpinning Sectors |
|
|
Related Grants: |
|
Panel History: |
|
Summary on Grant Application Form |
The issue of sample selection bias (SSB) arises in many real data applications where a subset of a sample of observations is systematically excluded due to a particular attribute. This exclusion can lead to distorted results because the sample used for statistical analysis would not be representative of all statistical units at population level.
Let us consider the example in which the researcher is interested in estimating the effect of education on wage in a sample of women, accounting for other variables such as experience and age. Here individuals with zero wage are typically excluded from the sample, and because many features of these individuals are likely to differ from those with positive wage, statistical analysis based only on the positive wage individuals will yield biased estimates. Specifically, when not accounting for SSB, as education increases by say 1 unit, wage increases by 1.2 units. If we account for SSB, then the latter value increases to 2.1. The corrected estimate is crucial for policy planning and decision-making in that it can lead, for instance, to larger or smaller investments in education and skills as compared to the case of a biased smaller estimate.
Additional issues are that the effect of education (as well as experience and age) can exhibit complicated patterns, and that these variables can have a different effect on (i) the probability that the individual has wage different from zero, and on (ii) the magnitude of the wage value for individuals whose wage is greater than zero. In the current example, it is believed, for example, that the effect of education on wage is positive up to a certain level (say 15 years) after which the effect tends to plateau or perhaps decline, and that experience and age have different impacts on (i) and (ii). This piece of information is also crucial in that policies targeting specific categories of individuals can be designed, hence maximising the use of economic resources.
It is recognized that current methods do not address the issues mentioned above satisfactorily. The aim of this project is to provide the statistical theory, a numerical method and software for semiparametric sample selection modelling, where these issues can be simultaneously and fully dealt with.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
|
Further Information: |
|
Organisation Website: |
|