SLIDE 1

Incorporating Learning in BDI Agents

Stéphane Airiau (1), Lin Padgham (2), Sebastian Sardina (2), Sandip Sen (3)

(1) ILLC - University of Amsterdam
(2) RMIT University, Melbourne, Australia
(3) University of Tulsa, OK, USA

ALAMAS+ALAg 2008
Workshop at AAMAS

Estoril, Portugal, 2008

SLIDE 2

BDI Agents in a nutshell

Belief, Desire, Intentions:
- Beliefs: knowledge about the world and the agent's own internal state
- Desires (or goals): what the agent has decided to work towards achieving
- Intentions: how the agent has decided to tackle these goals

No planning from first principles: agents use a plan library

(library of partially instantiated plans to be used to achieve the goals)

Practical reasoning agents: quickly reason and react to asynchronous events.
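To make the three attitudes concrete, here is a minimal Python sketch (an illustration only, not any particular BDI platform): the agent's state is just its beliefs, its goals, its adopted intentions and a static plan library.

```python
# Minimal sketch of BDI agent state (illustrative, not a real BDI engine).
from dataclasses import dataclass, field

@dataclass
class AgentState:
    beliefs: dict = field(default_factory=dict)        # knowledge about the world / internal state
    goals: list = field(default_factory=list)          # desires the agent works towards
    intentions: list = field(default_factory=list)     # plan instances adopted to tackle the goals
    plan_library: list = field(default_factory=list)   # partially instantiated plans (no planning from scratch)

state = AgentState(beliefs={"battery_low": False}, goals=["deliver_package"])
```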

SLIDE 3

Definition (Plan): e : ψ ← P, where
- e is an event that triggers the plan
- ψ is the context condition under which the plan can be applied
- P is the body of the plan (a succession of actions and/or subgoals)

Goal-plan tree

[Goal-plan tree figure: goal G can be handled by plan P1 or P2 (OR); P1 decomposes into subgoals SG1 and SG2 (AND), achieved by plans P3 and P4; P2 has subgoal SG3, achieved by plan P5. Pi: plans, G: goal, SGi: subgoals.]

Failure recovery: when a step fails, causing a plan to fail, an alternative plan is tried.

e.g. if both P1 and P2 are applicable and P4 fails, P2 can be tried.
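Below is a minimal Python sketch of this plan structure and of failure recovery, under the assumption that a plan is simply a triple (trigger event, context condition, body) and that the body reports success or failure; the names are illustrative, not the authors' implementation.

```python
# Sketch of a plan e : psi <- P and of failure recovery over applicable plans.
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Plan:
    event: str                        # e: the event that triggers the plan
    context: Callable[[dict], bool]   # psi: whether the plan is applicable given the beliefs
    body: Callable[[dict], bool]      # P: runs actions/subgoals, returns True on success

def achieve(event: str, beliefs: dict, library: List[Plan]) -> bool:
    """Try applicable plans in turn; if one fails, fall back to an alternative."""
    applicable = [p for p in library if p.event == event and p.context(beliefs)]
    for plan in applicable:           # e.g. if P1 (via P4) fails, P2 is tried next
        if plan.body(beliefs):
            return True
    return False                      # no alternative left: the goal fails
```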


SLIDE 5

BDI execution algorithm

1. Take the next event (internal/external)
2. Modify any goals, beliefs, intentions (a new event may cause an update of the beliefs, causing a modification of the goals and/or intentions)
3. Select an applicable plan to respond to this event
4. Place this plan in the intention base
5. Take the next step on a selected intention (may execute an action or generate a new event); a toy sketch of this cycle is given below

[BDI architecture diagram: input events enter the (dynamic) event queue; the reasoning/deliberation cycle consults the (dynamic) beliefs and the (static) plan library, updates the (dynamic) intentions, and outputs actions.]

BDI agents are well suited for complex applications with soft real-time reasoning and control requirements.
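As a concrete illustration, a toy version of the 5-step cycle is sketched below; the Plan structure and event format are assumptions made for the example, not the authors' implementation (execution is single-intention and non-interleaved to keep the sketch short).

```python
# Toy sketch of the BDI execution cycle (steps 1-5 above), illustrative only.
from collections import deque
from dataclasses import dataclass
from typing import Callable, List, Optional

@dataclass
class Plan:
    event: str
    context: Callable[[dict], bool]
    body: List[Callable[[dict], Optional[dict]]]        # each step may post a new event

def bdi_cycle(event_queue: deque, beliefs: dict, plan_library: List[Plan]) -> None:
    intentions: List[Plan] = []
    while event_queue:
        event = event_queue.popleft()                    # 1. take the next event
        beliefs.update(event.get("belief_updates", {}))  # 2. revise beliefs (and hence goals/intentions)
        applicable = [p for p in plan_library
                      if p.event == event["type"] and p.context(beliefs)]
        if not applicable:
            continue                                     # no plan can respond to this event
        plan = applicable[0]                             # 3. select an applicable plan
        intentions.append(plan)                          # 4. place it in the intention base
        for step in plan.body:                           # 5. take the next step(s) of the intention:
            new_event = step(beliefs)                    #    execute an action or post a new (sub)goal event
            if new_event is not None:
                event_queue.append(new_event)
        intentions.remove(plan)
```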





SLIDE 9

Issues

- BDI agents lack learning capabilities to modify their behavior (e.g. in case of frequent failures).
- Plans and context conditions are programmed by a user.

In a complex environment, context conditions may be hard to capture precisely:
- too loose: the plan is deemed applicable when it is not → failures
- too tight: the plan is deemed not applicable when it actually is → a goal may not appear achievable when it is

Research goal: add learning capabilities to adapt and refine the context conditions of plans.

A first step: use a decision tree (DT) in addition to the context condition; each plan has a decision tree telling whether it is applicable.
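One way to realise this first step is sketched below: each plan keeps a decision tree over the world attributes, trained on its past successes and failures, and consulted alongside the hand-written context condition. The use of scikit-learn's DecisionTreeClassifier and the accessor names are assumptions for illustration; the slides do not prescribe a particular implementation.

```python
# Sketch: a per-plan decision tree estimating the plan's applicability.
from sklearn.tree import DecisionTreeClassifier

class LearnedApplicability:
    def __init__(self):
        self.tree = DecisionTreeClassifier()
        self.X, self.y = [], []          # past world states and success (1) / failure (0) labels
        self.trained = False

    def record(self, world_state, succeeded: bool) -> None:
        self.X.append(list(world_state))
        self.y.append(1 if succeeded else 0)

    def refit(self) -> None:
        if len(set(self.y)) > 1:         # need at least one success and one failure
            self.tree.fit(self.X, self.y)
            self.trained = True

    def success_probability(self, world_state) -> float:
        if not self.trained:
            return 1.0                   # before learning, rely on the programmed context condition
        # classes_ is [0, 1] here because refit() requires both labels to be present
        return float(self.tree.predict_proba([list(world_state)])[0][1])

def applicable(plan_context, learned: LearnedApplicability, world_state, threshold=0.5) -> bool:
    # hand-written context condition AND learned estimate of success
    return plan_context(world_state) and learned.success_probability(world_state) >= threshold
```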

SLIDE 10

Example of a DT

The environment is described by three boolean attributes: a, b and c.

[Decision-tree figure: internal nodes test the boolean attributes a, b and c (true/false branches); each leaf records the observed counts of successes (+) and failures (-), e.g. 110+/0-, 40+/10-, 20+/5-, 4+/25-, 1+/35-, 1+/50-.]

Context condition converted from the decision tree: (a ∧ b) ∨ (a ∧ ¬b ∧ c) ∨ (a ∧ ¬b ∧ ¬c).
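The conversion can be read directly off the tree: every root-to-leaf path ending in a predominantly positive leaf contributes one conjunct to a DNF formula. The nested-dict tree format and the leaf counts below are illustrative assumptions, not the representation used in the slides.

```python
# Sketch: turning a decision tree into a DNF context condition.
def positive_paths(node, path=()):
    """Yield the attribute literals along each path that ends in a mostly-positive leaf."""
    if "attribute" not in node:                          # leaf: success/failure counts
        if node["pos"] > node["neg"]:
            yield path
        return
    attr = node["attribute"]
    yield from positive_paths(node["true"], path + ((attr, True),))
    yield from positive_paths(node["false"], path + ((attr, False),))

def to_dnf(tree) -> str:
    conjuncts = [" ∧ ".join(a if v else "¬" + a for a, v in p) or "true"
                 for p in positive_paths(tree)]
    return " ∨ ".join("(" + c + ")" for c in conjuncts) if conjuncts else "false"

# Illustrative tree over the attributes a, b, c (counts chosen for the example)
example = {"attribute": "a",
           "true":  {"attribute": "b",
                     "true":  {"pos": 40, "neg": 10},
                     "false": {"attribute": "c",
                               "true":  {"pos": 110, "neg": 0},
                               "false": {"pos": 20, "neg": 5}}},
           "false": {"pos": 1, "neg": 50}}
print(to_dnf(example))   # (a ∧ b) ∨ (a ∧ ¬b ∧ c) ∨ (a ∧ ¬b ∧ ¬c)
```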




SLIDE 13

Learning Issues

When to collect data?

In case of failure:
- did the failure occur because the current plan was not applicable? → Correct data
- did it fail because other plans below it were mistakenly chosen? → Incorrect data
(one possible recording rule is sketched at the end of this slide)

[Goal-plan tree figure: top-level goal G0 handled by plan P01 or P02 (OR); P01 has subgoals SG1, SG2, SG3 (AND), with candidate plans P11/P12, P21/P22 and P31; P02 has subgoals SG4 and SG5, with plans P41 and P51. Pi: plans, Gi: goals, SGi: subgoals.]

When to start using the decision tree?
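For the data-collection question, one possible recording rule is sketched below (an assumption for illustration, not the exact criterion used in the slides): a failure counts as a training example for a plan only if every sub-plan choice it made was already backed by a reasonably trained DT, so the failure is unlikely to come from a mistaken choice lower in the tree.

```python
# Sketch: only record a failure for a plan's DT when the choices below it were reliable.
MIN_INSTANCES = 20   # assumed threshold for considering a sub-plan's DT "well trained"

def should_record_failure(executed_subplans) -> bool:
    return all(sub.num_training_instances >= MIN_INSTANCES for sub in executed_subplans)

def on_plan_outcome(world_state, succeeded, executed_subplans, dataset) -> None:
    if succeeded or should_record_failure(executed_subplans):
        dataset.append((world_state, succeeded))   # usable (likely correct) example for this plan
    # otherwise discard: the failure may stem from mistaken plan choices below
```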

SLIDE 14

Initial Experiments

Three mechanisms for plan selection:
- CL: all trees are learnt at the same time; all data is used
- BU (bottom-up learning): DTs higher in the hierarchy wait for the DTs below to be formed
- PS (probabilistic selection): plans are selected according to the frequency of success provided by the decision tree (a sketch follows below)

When to start using the DT:

- after k instances have been observed, for CL and BU (k large)
- after a few instances, for PS (5-10, to have an initial DT)
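A minimal sketch of the PS mechanism, assuming each plan exposes a success_probability() estimate from its decision tree (an assumed interface, not an API from the slides):

```python
# Sketch: probabilistic selection among applicable plans.
import random

def probabilistic_select(applicable_plans, world_state):
    # weight each plan by its DT-estimated success frequency; keep a small floor for exploration
    weights = [max(p.success_probability(world_state), 1e-3) for p in applicable_plans]
    return random.choices(applicable_plans, weights=weights, k=1)[0]
```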

SLIDE 15

Initial Results

Setup: 17 plans; the world state is defined by six boolean attributes; the depth of the goal-plan tree is 4. All context conditions are set to true. k = 100.

[Plot: frequency of success vs. number of instances (200 to 1400) for probabilistic selection (PS), BU and CL, in a non-deterministic world where an action may fail with probability 0.1.]

SLIDE 16

Conclusion
- Though in theory one needs to wait for the DTs below to be accurate before collecting data for a DT higher up, in practice the DTs handle the spurious data as noise.
- Using PS, the context conditions are learnt faster and are accurate.

Future Work
- Test on larger goal-plan trees.
- Try better criteria for deciding when to start using the DTs.

SLIDE 17

Contacts

Stéphane Airiau: stephane@illc.uva.nl
Lin Padgham: lin.padgham@rmit.edu.au
Sebastian Sardina: sebastian.sardina@rmit.edu.au
Sandip Sen: sandip@utulsa.edu
