ai

Hand-crafted AI algorithms made with tender loving care (and NumPy).

This repo contains:

An implementation of Natural Evolution Strategies (NES), using the OpenAI variant where sigma is fixed for simplicity.
An implementation of Covariance Matrix Adaptation (CMA-ES), along with an adapter for pycma.
A few pretrained networks in ./nets.
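
For readers new to the fixed-sigma NES variant, here is a minimal sketch of the update loop on a toy problem. It is not the code from main.py: the function name, the hyperparameters, the toy reward, and the choice to standardize rewards within each generation are all assumptions made for illustration.

```python
import numpy as np


def nes_fixed_sigma(reward_fn, theta0, sigma=0.1, alpha=0.01,
                    pop_size=50, generations=300, seed=0):
    """Maximize reward_fn with a fixed-sigma NES update (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    theta = np.array(theta0, dtype=float)
    for _ in range(generations):
        # Sample one Gaussian perturbation per population member.
        eps = rng.standard_normal((pop_size, theta.size))
        rewards = np.array([reward_fn(theta + sigma * e) for e in eps])
        std = rewards.std()
        if std == 0.0:
            continue  # every candidate scored the same, so there is no signal
        # Assumption: standardize rewards so the step size does not depend
        # on the scale of the reward function.
        advantages = (rewards - rewards.mean()) / std
        # Monte Carlo estimate of the gradient of the expected reward.
        grad_estimate = eps.T @ advantages / (pop_size * sigma)
        theta += alpha * grad_estimate
    return theta


# Hypothetical toy objective: get as close as possible to a fixed target.
target = np.array([1.0, -2.0, 3.0, 0.5, -1.0])


def toy_reward(x):
    return -float(np.sum((x - target) ** 2))


theta0 = np.zeros(5)
theta = nes_fixed_sigma(toy_reward, theta0)
print("initial reward:", toy_reward(theta0))
print("final reward:  ", toy_reward(theta))
```

Because sigma never shrinks, the parameters end up hovering near the optimum rather than converging onto it exactly.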

Lunar Lander

After far too much training with a low sigma, NES was able to mostly solve Lunar Lander.

It sometimes fails, though it usually comes close.

To test it yourself, make sure nets/LunarLanderContinuous-v2-16.pkl exists, then run:

python main.py --env LunarLanderContinuous-v2 --eval

An agent was also trained using CMA-ES (the --cma option). After ~220 generations it looks like this:

The resulting agent is more robust and successfully deactivates the boosters after landing. I think this is because CMA-ES can fine-tune better by adapting sigma. I ought to try sigma adaptation for my NES agent too.
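
To illustrate why adapting sigma might help with fine-tuning, here is a small sketch comparing a fixed step size against an adaptive one on a toy sphere function. Note that this is neither the NES nor the CMA-ES code from this repo: it uses a (1+1)-ES with the classic 1/5 success rule, which is a much simpler scheme than the step-size control CMA-ES uses. All names and constants are assumptions chosen for the demo.

```python
import numpy as np


def sphere(x):
    """Toy loss: squared distance from the origin."""
    return float(np.sum(x ** 2))


def one_plus_one_es(loss_fn, x0, sigma0, steps, adapt, seed=0):
    """(1+1)-ES minimizer; if adapt is True, sigma follows the 1/5 success rule."""
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    best = loss_fn(x)
    sigma = sigma0
    for _ in range(steps):
        candidate = x + sigma * rng.standard_normal(x.size)
        loss = loss_fn(candidate)
        success = loss < best
        if success:
            x, best = candidate, loss
        if adapt:
            # Grow sigma after a success and shrink it after a failure.
            # These factors balance out at a success rate of 1/5.
            sigma *= 1.5 if success else 1.5 ** -0.25
    return best


x0 = np.full(10, 3.0)
fixed_loss = one_plus_one_es(sphere, x0, sigma0=0.5, steps=2000, adapt=False)
adaptive_loss = one_plus_one_es(sphere, x0, sigma0=0.5, steps=2000, adapt=True)
print("fixed sigma loss:   ", fixed_loss)
print("adaptive sigma loss:", adaptive_loss)
```

With a fixed sigma, the search stalls once the remaining distance to the optimum is small compared to the step size, while the adaptive version keeps shrinking its steps and continues to improve.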

See the CMA-ES agent with:

python main.py --env LunarLanderContinuous-v2 --eval --save LunarLanderContinuous-v2-16-CMA.pkl
