Skip to content

UlisseMini/ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ai

Hand crafted AI algorithms made with tender loving care (and numpy)

This repo contains

Lunar Lander

After way too much training NES with a low sigma it was able to mostly solve Lunar Lander

winning lunar lander

Sometimes it fails, though it usually comes close

failing lunar lander

To test it yourself make sure nets/LunarLanderContinuous-v2-16.pkl exists then run

python main.py --env LunarLanderContinuous-v2 --eval

An agent was also trained using Covariance-Matrix Adaptation (the --cma option). After ~220 generations it looks like this

cma lunar lander

The resulting agent is more robust, and successfully deactivates the boosters after landing. I think this is because CMA-ES can fine-tune better by adapting sigma, I ought to try sigma-adaptation for my NES agent too.

See the CMA-ES agent with

python main.py --env LunarLanderContinuous-v2 --eval --save LunarLanderContinuous-v2-16-CMA.pkl

About

Basic neuroevolution from scratch, playing openai gym games

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages