EPSRC Reference: |
EP/T02772X/1 |
Title: |
Change-point analysis in high dimensions |
Principal Investigator: |
Wang, Dr T |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Statistical Science |
Organisation: |
UCL |
Scheme: |
New Investigator Award |
Starts: |
01 March 2021 |
Ends: |
31 August 2021 |
Value (£): |
229,022
|
EPSRC Research Topic Classifications: |
Statistics & Appl. Probability |
|
|
EPSRC Industrial Sector Classifications: |
No relevance to Underpinning Sectors |
|
|
Related Grants: |
|
Panel History: |
|
Summary on Grant Application Form |
Modern applications routinely generate time-ordered datasets, where many covariates are simultaneously measured over time. Examples include wearable technologies recording the health state of individuals from multi-sensor feedbacks, internet traffic data collected by tens of thousands of routers and functional magnetic resonance imaging (fMRI) scans that record the evolution of certain chemical contrast in different areas of the brain. The explosion in number of such high-dimensional data streams calls for methodological advances for their analysis.
Change-point analysis is an essential statistical technique used in identifying abrupt changes in such data streams. The identified 'change-points' often signal interesting or abnormal events, and can be used to carve up the data streams into shorter segments that are easier to analyse.
Classical change-point analysis methods identify changes in a single variable over time. However, they often suffer from significant performance loss in high-dimensional datasets when applied componentwise. The area of high-dimensional change-point analysis grew out of the need to respond to the challenge created by high-dimensional data streams. A few methods have been proposed in this relatively new area. However, they often require simplifying assumptions that restrict their usefulness in many applications.
In this proposal, I will develop new methods that can handle more realistic data settings. Specifically, I will develop (1) an algorithm that can monitor the data stream 'online' as data points are observed one after another, so that it responds to changes as quickly as possible while maintaining a low rate of false alarms; (2) a change-point procedure that can handle highly correlated component series, a situation that is very common in multi-sensor measurements; (3) a robust method for change-point estimation in the presence of missing or contaminated data. I will provide theoretical performance guarantees for the developed methods and implement them in open-source R packages.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
|
Further Information: |
|
Organisation Website: |
|