tildy-mdp

This is a fun project, inspired by talk of richard sutton - Tutorial: Introduction to Reinforcement Learning with Function Approximation

Play with this repo

python3 learn_mdp.py

About the project

Here the user is a reinforcement learning agent and he tries to find the optimal policy to gain maximum rewards. The environment has two states A and B. User can take 2 actions - 1,2 . Based on user's action in a state he gets positive or negative reward/feedback.

If you decide to play this game then following is the optimal policy

State	Action
A	2
B	1

This repository can be used for educational purposes. This repo can be used to explain the following concepts of Reinforcement Learning -

MDP
Exploration vs Exploitation Dilemma
Introduction to RL.

Feel free to improve this project. Pull Requests are welcome.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
learn_mdp.py		learn_mdp.py
true model of the world.jpeg		true model of the world.jpeg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

learn_mdp.py

learn_mdp.py

true model of the world.jpeg

true model of the world.jpeg

Repository files navigation

tildy-mdp

Play with this repo

About the project

About

Releases

Packages

Contributors 2

Languages

License

vaibhawvipul/tildy-mdp

Folders and files

Latest commit

History

Repository files navigation

tildy-mdp

Play with this repo

About the project

About

Topics

Resources

License

Stars

Watchers

Forks

Languages