• Aucun résultat trouvé

Agen ts and en vironmen ts

N/A
N/A
Protected

Academic year: 2022

Partager "Agen ts and en vironmen ts"

Copied!
5
0
0

Texte intégral

(1)

Intelligent A gents

Chapter2

Chapter21

Reminders

Assignment0(lisprefresher)due1/28

Lisp/emacs/AIMAtutorial:11-1todayandMonday,271Soda

Chapter22

Outline

♦Agentsandenvironments

♦Rationality

♦PEAS(Performancemeasure,Environment,Actuators,Sensors)

♦Environmenttypes

♦Agenttypes

Chapter23

Agen ts and en vironmen ts

?agent percepts sensors

actions environment

actuators

Agentsincludehumans,robots,softbots,thermostats,etc.

Theagentfunctionmapsfrompercepthistoriestoactions:

f:P→A

Theagentprogramrunsonthephysicalarchitecturetoproducef

Chapter24

V acuum-cleaner w orld

A B

Percepts:locationandcontents,e.g.,[A,Dirty]

Actions:Left,Right,Suck,NoOp

Chapter25

A v acuum-cleaner agen t

PerceptsequenceAction[A,Clean]Right[A,Dirty]Suck[B,Clean]Left[B,Dirty]Suck[A,Clean],[A,Clean]Right[A,Clean],[A,Dirty]Suck...

functionReflex-Vacuum-Agent([location,status])returnsanaction

ifstatus=DirtythenreturnSuckelseiflocation=AthenreturnRightelseiflocation=BthenreturnLeft

Whatistherightfunction?Canitbeimplementedinasmallagentprogram?

Chapter26

(2)

Rationalit y

Fixedperformancemeasureevaluatestheenvironmentsequence–onepointpersquarecleanedupintimeT?–onepointpercleansquarepertimestep,minusonepermove?–penalizefor>kdirtysquares?

Arationalagentchooseswhicheveractionmaximizestheexpectedvalueoftheperformancemeasuregiventheperceptsequencetodate

Rational6=omniscient–perceptsmaynotsupplyallrelevantinformationRational6=clairvoyant–actionoutcomesmaynotbeasexpectedHence,rational6=successful

Rational⇒exploration,learning,autonomy

Chapter27

PEAS

Todesignarationalagent,wemustspecifythetaskenvironment

Consider,e.g.,thetaskofdesigninganautomatedtaxi:

Performancemeasure??

Environment??

Actuators??

Sensors??

Chapter28

PEAS

Todesignarationalagent,wemustspecifythetaskenvironment

Consider,e.g.,thetaskofdesigninganautomatedtaxi:

Performancemeasure??safety,destination,profits,legality,comfort,...

Environment??USstreets/freeways,traffic,pedestrians,weather,...

Actuators??steering,accelerator,brake,horn,speaker/display,...

Sensors??video,accelerometers,gauges,enginesensors,keyboard,GPS,...

Chapter29

In ternet shopping agen t

Performancemeasure??

Environment??

Actuators??

Sensors??

Chapter210

In ternet shopping agen t

Performancemeasure??price,quality,appropriateness,efficiency

Environment??currentandfutureWWWsites,vendors,shippers

Actuators??displaytouser,followURL,fillinform

Sensors??HTMLpages(text,graphics,scripts)

Chapter211

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??Deterministic??Episodic??Static??Discrete??Single-agent??

Chapter212

(3)

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??Episodic??Static??Discrete??Single-agent??

Chapter213

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??YesNoPartlyNoEpisodic??Static??Discrete??Single-agent??

Chapter214

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??YesNoPartlyNoEpisodic??NoNoNoNoStatic??Discrete??Single-agent??

Chapter215

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??YesNoPartlyNoEpisodic??NoNoNoNoStatic??YesSemiSemiNoDiscrete??Single-agent??

Chapter216

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??YesNoPartlyNoEpisodic??NoNoNoNoStatic??YesSemiSemiNoDiscrete??YesYesYesNoSingle-agent??

Chapter217

En vironmen t typ es

SolitaireBackgammonInternetshoppingTaxiObservable??YesYesNoNoDeterministic??YesNoPartlyNoEpisodic??NoNoNoNoStatic??YesSemiSemiNoDiscrete??YesYesYesNoSingle-agent??YesNoYes(exceptauctions)No

Theenvironmenttypelargelydeterminestheagentdesign

Therealworldis(ofcourse)partiallyobservable,stochastic,sequential,dynamic,continuous,multi-agent

Chapter218

(4)

Agen t typ es

Fourbasictypesinorderofincreasinggenerality:–simplereflexagents–reflexagentswithstate–goal-basedagents–utility-basedagents

Allthesecanbeturnedintolearningagents

Chapter219

Simple reflex agen ts

Agent

Environment

Sensors

What the worldis like now

What action Ishould do nowCondition−action rules

Actuators

Chapter220

Example

functionReflex-Vacuum-Agent([location,status])returnsanaction

ifstatus=DirtythenreturnSuckelseiflocation=AthenreturnRightelseiflocation=BthenreturnLeft

(setqjoe(make-agent:name’joe:body(make-agent-body):program(make-reflex-vacuum-agent-program)))

(defunmake-reflex-vacuum-agent-program()#’(lambda(percept)(let((location(firstpercept))(status(secondpercept)))(cond((eqstatus’dirty)’Suck)((eqlocation’A)’Right)((eqlocation’B)’Left)))))

Chapter221

Reflex agen ts with state

Agent

Environment

Sensors

What action Ishould do now State

How the world evolves

What my actions do

Condition−action rules

Actuators What the worldis like now

Chapter222

Example

functionReflex-Vacuum-Agent([location,status])returnsanactionstatic:lastA,lastB,numbers,initially

ifstatus=Dirtythen...

(defunmake-reflex-vacuum-agent-with-state-program()(let((last-Ainfinity)(last-Binfinity))#’(lambda(percept)(let((location(firstpercept))(status(secondpercept)))(incflast-A)(incflast-B)(cond((eqstatus’dirty)(if(eqlocation’A)(setqlast-A0)(setqlast-B0))’Suck)((eqlocation’A)(if(>last-B3)’Right’NoOp))((eqlocation’B)(if(>last-A3)’Left’NoOp)))))))

Chapter223

Goal-based agen ts

Agent

Environment

Sensors

What it will be like if I do action A

What action Ishould do now State

How the world evolves

What my actions do

Goals

Actuators What the worldis like now

Chapter224

(5)

Utilit y-based agen ts

Agent

Environment

Sensors

What it will be like if I do action A

How happy I will be in such a state

What action Ishould do now State

How the world evolves

What my actions do

Utility

Actuators What the worldis like now

Chapter225

Learning agen ts

Performance standard

Agent

Environment

Sensors

Performance element changes

knowledgelearning goals

Problem generator feedback

Learning element Critic

Actuators

Chapter226

Summary

Agentsinteractwithenvironmentsthroughactuatorsandsensors

Theagentfunctiondescribeswhattheagentdoesinallcircumstances

Theperformancemeasureevaluatestheenvironmentsequence

Aperfectlyrationalagentmaximizesexpectedperformance

Agentprogramsimplement(some)agentfunctions

PEASdescriptionsdefinetaskenvironments

Environmentsarecategorizedalongseveraldimensions:observable?deterministic?episodic?static?discrete?single-agent?

Severalbasicagentarchitecturesexist:reflex,reflexwithstate,goal-based,utility-based

Chapter227

Références

Documents relatifs

Guide d’identification des plantes exotiques envahissant les milieux aquatiques et les berges du bassin Loire-Bretagne. Fédération des conservatoires d’espaces naturels,

Si le manomètre continue à indiquer pleine pression et que le nettoyeur se met en marche et s’arrête sans cesse, la cause peut être une fuite au niveau de la pompe, du

BRUNET-MoRET pour reconstituer, sur la riviere Capot (Martinique), non pas des variations d'échelles, mais les variations du tarage qui se produisent un nombre respectable

Et C'est pourquoi Et C'est pourquoi Quand on la voit, Quand on la voit, Elle est si ronde, Elle est si ronde, La Lune blonde La Lune blonde. Mais une nuit elle maigrit Mais une

– 1 رﺎﻴﺘﻟا ﺔﻬﺟ ﻪﻴﻠﻋ ﻦّﻴﺑو ، ﺔﻴﺋﺎﺑﺮﻬﻜﻟا ةراﺪﻠﻟ ﺎﻴﺤﻴﺿﻮﺗ ﻼﻜﺷ ﻢﺳرا ﻦﻳﺮﺗﻮﺘﻟا ﻲﺘﻬﺟو.. ﻟا

Montrer que la probabilité pour qu’un véhicule contrôlé soit en bon état (c’est-à-dire ait de bons freins et un bon éclairage) est 0,809 6c. Au cours d’un contrôle,

Tracer la courbe C et préciser ses tangentes aux points d’abscisses respectives 0 et α.. Amérique du Sud 2

Déduire des questions précédentes que l’équation (1) admet deux solu- tions opposées.. Déterminer les tangentes communes aux courbes Γ