The rise of AI
Agent
Environment
state
reward action
numerical value
Learning:
Exploration/exploitation: Not always
take the action that is believed to be optimal to allow exploration.
Generalization: Generalize the
experience gained in some states to other states.