Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
-
Updated
Oct 14, 2022 - Python
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
Implementation of td policy evaluation and q-learning on a grid world.
Genetic policy optimiser for specific environments using cellular automata
A Python-based repository with implementations of RL algorithms, featuring visualization tools and benchmarks
Policy evaluation in the total discounted reward setting in RL.
A reinforcement learning project for crowd-dynamics in a very narrow corridor
This project involves training the game "Misere Tic Tac Toe" to the computer using Robust and Efficient Reinforcement Learning Techniques
A repository to store a copy of a research paper for an Economics Senior Thesis.
🐍 Implementation of the REINFORCEjs library from Kaparthy in Python
Implementation of RL Algorithms in Openai Gym Frozen-Lake Environment
Exploratory Data Analysis and GLM modelling using the European Union Social Survey dataset (ESS). Aim: Can we predict voter preferences using logistic regression?
Effective Programming Practice with Python: Replication of González, Libertad (2013)
Difference-in-Differences to identify the causal effect of NPIs during Covid-19: evidence from Denmark and Sweden
Evaluation of post-lockdown policies using social contacts and risk of professional exposure.
The primary objective of the project is to assess the effectiveness of opioid drug regulations in three U.S. states.
Various reinforcement learning algorithms implemented on the frozen lake grid world.
Book chapter: Central and Eastern European Economies after the Ukrainian War — Between a Rock and a Hard Place: Chapter 5. Inflation Shock and Monetary Policy
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
This includes sample reinfrocement learning algorithms .Currently working on an approach to use RL for more comlex navigation issues
Dynamic Programming for Finite Markov Decision Processes
Add a description, image, and links to the policy-evaluation topic page so that developers can more easily learn about it.
To associate your repository with the policy-evaluation topic, visit your repo's landing page and select "manage topics."