EPSRC logo

Details of Grant 

EPSRC Reference: EP/G026858/1
Title: Imbalanced Data Set Modelling and Classification for Life Threatening/ Safety Critical Applications
Principal Investigator: Hong, Professor X
Other Investigators:
Harris, Professor C
Researcher Co-Investigators:
Project Partners:
Department: Sch of Systems Engineering
Organisation: University of Reading
Scheme: Standard Research
Starts: 01 October 2009 Ends: 30 September 2012 Value (£): 102,020
EPSRC Research Topic Classifications:
Artificial Intelligence
EPSRC Industrial Sector Classifications:
No relevance to Underpinning Sectors
Related Grants:
Panel History:
Panel DatePanel NameOutcome
21 Oct 2008 ICT Prioritsation Panel (Oct 2008) Announced
Summary on Grant Application Form
Machine learning from imbalanced data sets is related to a broad range of very important problems in many engineering and scientific disciplines, e.g. medical diagnostics, signal detection and machine/material fault detection. Apart from the highly practical value, data learning from imbalanced data sets is also of high theoretical interest. Because the performance metrics used in conventional classifier construction may break down when applied to the imbalanced data sets, this has motivated considerable researches in machine learning communities aimed at a variety of learning methodologies for the imbalanced data setsDespite significant research in machine learning for imbalanced data, there is still a need and/or a lack of general methodologies that are able to deliver the capability of knowledge discovery as demanded by many hugely important applications. For example, it is highly beneficial to discover new noninvasive biological markers from clinical data, which can improve early medical diagnostics results, in order to start early treatment of a cancer. The motivation of the proposed research can be illustrated by another example. In material science, suppose that new materials with exceptional properties, e.g. strength, are required for new mechanical structures, e. g. military vehicles. For this purpose, a sample of experimental trials is performed to obtain a new material together with the measurements of the properties. It is highly desirable that the properties/behaviours could be discovered, by resort of data modelling using a small sample, rather than performing many more unnecessary and very expensive engineering experiments (large sample).This proposal is concerned with the development of a new modelling approach which builds upon the state-of-the-art nonlinear modelling methodologies and is specifically designed for pattern recognition using the imbalanced data sets. The objectives of the research include the modelling, classification, class probability (risk) prediction and knowledge discovery from the imbalanced data sets which are commonly found in many associated applications.
Key Findings
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Potential use in non-academic contexts
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Impacts
Description This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Summary
Date Materialised
Sectors submitted by the Researcher
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Project URL:  
Further Information:  
Organisation Website: http://www.rdg.ac.uk