Details of Grant

EPSRC Reference:

EP/N017447/1

Title:

Numeric Domain Model Acquisition from Action Traces

Principal Investigator:

Gregory, Dr P

Other Investigators:

Researcher Co-Investigators:

Project Partners:

Department:

Sch of Computing, Media & the Arts

Organisation:

Teesside University

Scheme:

First Grant - Revised 2009

Starts:

01 February 2016

Ends:

30 June 2017

Value (£):

100,460

EPSRC Research Topic Classifications:

Artificial Intelligence

EPSRC Industrial Sector Classifications:

No relevance to Underpinning Sectors

Related Grants:

Panel History:

Panel Date	Panel Name	Outcome
20 Oct 2015	EPSRC ICT Prioritisation Panel - Oct 2015	Announced

Summary on Grant Application Form

Many industrial and commercial applications of planning technology have to reason about numbers. For example, in the area of autonomous systems and robotics, an autonomous robot often has to reason about its position in space, power levels and storage capacities. We call the internal representation that the robot has of its environment its model of the world. It is essential for these models to be easy to construct and ideally, they should be automatically constructed.

This project concerns the subject of learning formal models of state-transition systems from observation of those systems operating. Consider an observer unfamiliar with the game of chess: by observing a sequence of moves, much can be learnt about the rules of the game. Watching a bishop for long enough demonstrates that only diagonal moves are possible for this piece.

Domain model acquisition, or automated modelling, is the problem of allowing a computer to learn its own world model by observing actions. We will develop new methods of automated modelling for state transition systems with numeric state variables. And when we refer to domain model acquisition, we refer to the learning of any state transition system from example data that includes sequences of state transitions, whether that is in the context of automated planning, general game playing, interactive narrative, workplace rostering, or any other type of underlying problem.

We first plan to learn models of domain with a common restriction of numeric variables in planning; the restriction to action costs. This restriction means that each ground action has a constant cost, and that the only numeric variable accumulates the sum of these individual action costs over the length of the plan. This accumulated value is the optimisation variable. Board games with action costs are those in which a score is accumulated throughout the play of the game.

Successful completion of the first stage means that we have a domain model acquisition algorithm to learn models that include action costs. This class of planning domains is an important subset of numeric planning domains. However, many planning domains contain more complex numeric properties and, in particular, arbitrary numeric variables and constraints. The second stage, therefore, will concentrate on developing algorithms to learn these constraints.

Completion of the project will allow models of many different problems to be learnt simply from observation. Examples include such things as capacity limits for certain resources, dimensional constraints for positioning items and strength of friendship level requirement to enable certain actions within a social-network aware interactive narrative setting.

Key Findings

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Potential use in non-academic contexts

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Impacts

Description	This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Summary
Date Materialised

Sectors submitted by the Researcher

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Project URL:

Further Information:

Organisation Website:

http://www.tees.ac.uk