Details of Grant

EPSRC Reference:

EP/X015459/1

Title:

Learning of safety critical model predictive controllers for autonomous systems

Principal Investigator:

Fleming, Dr JM

Other Investigators:

Researcher Co-Investigators:

Project Partners:

Dynamotion srl

University of Padua (Padova)

Department:

Wolfson Sch of Mech, Elec & Manufac Eng

Organisation:

Loughborough University

Scheme:

New Investigator Award

Starts:

01 June 2023

Ends:

31 May 2026

Value (£):

400,125

EPSRC Research Topic Classifications:

Artificial Intelligence	Control Engineering
Robotics & Autonomy

EPSRC Industrial Sector Classifications:

Transport Systems and Vehicles

Related Grants:

Panel History:

Panel Date	Panel Name	Outcome
05 Oct 2022	Engineering Prioritisation Panel Meeting 5 and 6 October 2022	Announced

Summary on Grant Application Form

Modern autonomous systems such as mobile robots and autonomous vehicles rely heavily on feedback controllers for motion control, particularly for path-following and obstacle avoidance, where they are employed to follow a trajectory set by a higher-level motion planner in a hierarchical control scheme. Model Predictive Control (MPC) is a popular controller choice for obstacle avoidance, as it allows constraints to be specified to ensure that the mobile robot or autonomous vehicle does not collide with obstacles. The behaviour of MPC is well understood from years of theoretical development and industrial practice, providing strong safety assurances, but considerable time and expert knowledge is required to implement it, especially in safety-critical applications such as autonomous vehicles.

In recent years, research on deep Reinforcement Learning (RL) has provided new methods to automatically find nonlinear feedback controllers for challenging control problems. But unlike MPC, existing RL methods typically have no guarantees of stability or of constraint satisfaction, and for safety-critical applications it is difficult to verify their behaviour.

To combine the predictability and safety guarantees of MPC with the power and convenience of modern RL methods, this project will develop methods to automatically learn MPC controllers in actor-critic RL frameworks, considering motion control and obstacle avoidance problems for autonomous vehicles. This will be a direct application of recent mathematical results showing that convex optimisations, such as MPC, can be employed as a trainable layer in RL frameworks such as PyTorch, allowing them to be learned. The goal is to enable rapid design and prototyping of path-following type MPC without requiring expert-knowledge of the underlying MPC algorithm, therefore reducing development time and cost and improving safety and reliability of future mobile robots and autonomous vehicles.

To ensure the new algorithms are practically applicable, an example application of motorcycle path-following and stability assistance will be used to guide their development. The problem of stabilising a two-wheeled vehicle in forward motion to follow a predefined path, for example via steering actuation, is challenging and has important applications in the emerging area of active safety systems for motorcycles and scooters.

For long term impact and to encourage adoption of the new methods by autonomous systems researchers, the new methods developed will be included in an open-source software library published on Github.

Key Findings

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Potential use in non-academic contexts

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Impacts

Description	This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Summary
Date Materialised

Sectors submitted by the Researcher

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Project URL:

Further Information:

Organisation Website:

http://www.lboro.ac.uk