reinforcement-learning Double Q Learning Model built referencing this paper: https://arxiv.org/abs/1509.06461 Used Open AI Gym for environment: https://github.com/openai/gym