🧠 PyTorch PPO & SAC Implementations for Continuous Control

Overview

This repository contains clean, modular implementations of two foundational Deep Reinforcement Learning algorithms:

Proximal Policy Optimization (PPO)
Soft Actor-Critic (SAC)

Both are implemented using PyTorch and are suited for continuous action spaces.

This repo is designed for:

🔬 Researchers
📚 Students
🧑‍💻 Developers building DRL pipelines

Keywords: Soft Actor-Critic, PPO, Reinforcement Learning, PyTorch, DRL, RL, Actor Critic, Continuous Control, OpenAI Gym, Off-policy, On-policy, Deep RL, SOTA

✨ Features

🧱 Modular structure with reusable components (Actor, Critic, Memory)
🧮 PPO implementation with GAE, clipping, entropy regularisation
🔁 SAC with automatic entropy tuning and twin Q-networks
🔍 Logging of rewards and action distributions
⚙️ Easy to integrate into any RL environment

📦 Installation

git clone https://github.com/your-username/ppo-sac-pytorch.git
cd ppo-sac-pytorch
pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
ppo.py		ppo.py
sac.py		sac.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 PyTorch PPO & SAC Implementations for Continuous Control

Overview

✨ Features

📦 Installation

About

Uh oh!

Releases

Packages

Languages

RodolpheFmd/Modelling-PyTorch-DRL

Folders and files

Latest commit

History

Repository files navigation

🧠 PyTorch PPO & SAC Implementations for Continuous Control

Overview

✨ Features

📦 Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages