Details of Grant 

EPSRC Reference: GR/M87146/01
Title: AN INTEGRATED MULTIPLE-LEVEL STATISTICAL MODEL FOR SPEECH PATTERN PROCESSING
Principal Investigator: Russell, Professor M
Other Investigators:
Researcher Co-Investigators:
Project Partners:
Defence Science and Technology Laborator
Department: Electronic, Electrical and Computer Eng
Organisation: University of Birmingham
Scheme: Standard Research (Pre-FEC)
Starts: 07 February 2000
Ends: 06 February 2003
Value (£): 179,084
EPSRC Research Topic Classifications:
Human Communication in ICT
EPSRC Industrial Sector Classifications:
Creative Industries
No relevance to Underpinning Sectors
Related Grants:
Panel History:  
Summary on Grant Application Form
The impressive improvements in speech recognition performance which Hidden Markov Model-based systems have achieved on controlled, large vocabulary tasks have not been matched by fundamental improvements in our approach to speech modelling. We still do not understand how to incorporate speech knowledge as a computationally useful constraint in a practical model for speech recognition. The goal of this research is to develop a novel statistical framework for speech pattern modelling, where the relationship between symbolic and acoustic representations of speech is regulated by an articulatory-based intermediate representation which captures inherent constraints of the speech production process. This will require a new, rigorous theory of data-driven, multiple-level statistical modelling, and will involve the derivation of a mathematical framework, extension of training and recognition algorithms, and experimental evaluation on a standard speech corpus.
If successful, better modelling of phenomena such as co-articulation and articulatory effort should result in improved recognition of natural speech. Models which avoid assumptions of random variation should offer improved noise robustness, and the distillation of speaker characteristics into a compact intermediate representation should facilitate fast speaker-adaptation. Success would also constitute a significant step towards a unified framework for speech pattern modelling, capable of supporting recognition and synthesis.
Key Findings
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Project URL:  
Further Information:  
Organisation Website: http://www.bham.ac.uk