Reinforcement Learning Algorithms

Overview

This repository provides implementations of various reinforcement learning (RL) algorithms, including:

Proximal Policy Optimization (PPO)
Vanilla Policy Gradient (VPG)
Deep Q-Network (DQN) and its variants: DDQN, N-step DQN, PER

The implementation supports:

Atari and Classic Control environments
Continuous and Discrete Action Spaces (for PPO and VPG)
PyTorch-based Deep Learning Models
gymnasium instead of gym
Custom function and class design, distinct from OpenAI Baselines

Installation

Prerequisites

Ensure you have Python 3.11+ installed. Then, install the dependencies:

pip install -r requirements.txt

Cloning the Repository

git clone https://github.com/mmarouen/agentic-ai-system
cd agentic-ai-system

Usage

Training an Agent

To train an agent with PPO, run:

python pg_train.py --algorithm ppo --map-name CartPole-v1

For VPG:

python pg_train.py --algorithm vpg --map-name LunarLander-v2

For DQN:

python dqn_train.py --algorithm dqn --map-name PongNoFrameskip-v4

Training will store training results (tensorboard, config files, model files) under the runs subfolder named map-name-exp-name argument. Example:

python pg_train.py --algorithm ppo --map-name CartPole-v1 --exp-name test-run

will create the folder CartPole-v1-test-run under runs

Hyperparameter Configuration

Pass hyperparameters as command-line arguments:

python pg_train.py --algorithm ppo --map-name CartPole-v1 --lr-policy 0.0003 --gamma 0.97 --shared-network False

Play

To use the trained agent with PPO or VPG run:

python pg_play.py --folder ppo-test --map-name CartPole-v1

For DQN:

python dqn_train.py --algorithm dqn --map-name PongNoFrameskip-v4

Results

Below are reward curves for selected games and algorithms:

PPO on ALE/BeamRider-v5

DQN on PongNoFrameskip-v4

DQN on Acrobot-v1

Additional results are available in the results/ folder.

Features

✅ Custom RL framework based on gymnasium and PyTorch
✅ Supports both discrete and continuous action spaces (PPO & VPG)
✅ Modular and extensible design for easy customization
✅ Supports Atari and Classic Control environments
✅ Implements multiple flavors of DQN (DQN, Double DQN, Dueling DQN, etc.)

Contributing

Contributions are welcome! Please submit a pull request or open an issue to discuss improvements.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

Im reachable at my email at azzouz.marouen@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
config		config
lib		lib
oai		oai
results		results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dp_TSP.py		dp_TSP.py
dqn_play.py		dqn_play.py
dqn_train.py		dqn_train.py
pg_play.py		pg_play.py
pg_train.py		pg_train.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Algorithms

Overview

Installation

Prerequisites

Cloning the Repository

Usage

Training an Agent

Hyperparameter Configuration

Play

Results

PPO on ALE/BeamRider-v5

DQN on PongNoFrameskip-v4

DQN on Acrobot-v1

Features

Contributing

License

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

License

mmarouen/agentic-ai-system

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Algorithms

Overview

Installation

Prerequisites

Cloning the Repository

Usage

Training an Agent

Hyperparameter Configuration

Play

Results

PPO on ALE/BeamRider-v5

DQN on PongNoFrameskip-v4

DQN on Acrobot-v1

Features

Contributing

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages