ppo-agent
There are 28 repositories under ppo-agent topic.
pythonlessons/Reinforcement_Learning
Reinforcement learning tutorials
bitsauce/Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
davide97l/rl-policies-attacks-defenses
Adversarial attacks on Deep Reinforcement Learning (RL)
abhilash1910/Deep_Reinforcement_Learning_Trading
Deep Reinforcement Learning for Trading
Atharv24/SnakeGym
Multi agent gym environment based on the classic Snake game with implementations of various reinforcement learning algorithms in pytorch
mohammadzainabbas/Reinforcement-Learning-CS
💡 Grasp - Pick-and-place with a robotic hand 👨🏻💻
7enTropy7/Racer_AI
Developed an highly customizable OpenAI gym environment and trained a stable_baselines3 PPO agent. Used the expert agent for Imitation Learning with DAgger
dschori/Ackerbot
Reinforcement Learning based navigation
ImSOLty/On-The-Waves
🚤🏖️BOATS DO VZHHHHH BBBDROOM, BEEEEP, BEEEP, GNAA, HONK, VZHHHHHHHHHHHHHH🏖️🚤
jookie/jojoBot
Financial trading strategies using deep reinforcement learning (DRL). It offers a frameworks for quantitative finance, enabling practitioners to create, test, and implement investments strategies.
c2d08y/LearningBot
A deep reinforcement learning Bot for https://kana.byha.top:444/
jfpettit/flare
Modular Reinforcement Learning in PyTorch.
JulianCatnip/atst-walker-agent
Concept and development of a walking AT-ST Walker (Starwars) ML-agent.
pranshurastogi29/BTC_mining_fees_optimization_RL
In this project, I have tried to use DeepRL for optimizing the selection of transactions done by the miner to increase the fee when they execute a block on the chain
fracapuano/Quinto
Repository for the final project of the "Computational Intelligence" course @ PoliTo, 2022/2023
iamvigneshwars/ai-walkers-ppo-pytorch
AI agent learns to walk, run, hop and crawl with out any given data using proximal policy optimisation.
AnastasiaML/Computational-Intelligence-And-Deep-Learning-Techniques-In-Developing-Intelligent-Agents-For-Games
The aim of this repository is the analysis and study of computer intelligence and in-depth learning techniques in the development of intelligent gaming agents.
GerTheMessiah/Snake-AI
Short own implementation of the game snake. In this project I'am using the ray library together with ray tune and a custom PPO model.
harikris001/Super-Mario-Reinforcement_Learning
Reinforcement Learning in Super Mario using Pytorch and PPO
PetropoulakisPanagiotis/igae
State Representations as Incentives for Reinforcement Learning Agents: A Sim2Real Analysis on Robotic Grasping
RsGoksel/Snake-Game_PPO-Solution
Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy to understand code.
ellkrauze/gc-ml
Java Garbage Collector Performance Tuning with Reinforcement Learning Methods
navneet1083/textsum-tune
This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.
roeey777/Splendor-AI
AI agents for the boardgame Splendor
strcoder4007/Mario-Reinforcement-Learning
Training a Mario reinforcement learning agent using Open AI Gym and Stable Baselines 3 PPO algorithm.
00Utkarsh00/ML-DOOM
Automated gaming using machine learning
IvanBirkmaier/ppo_agent
This repository contains the code for a project paper for a Master's module in the field of reinforcement learning. The aim of the project is to explore and implement Proximal Policy Optimization (PPO) agents to learn and play the 7x7 Hex game.
jookie/jojostock1
An adaptive Machine Reinforcement Learning (MRL) system is being developed to gather and analyze media data using web scraping, training models to predict outcomes in areas like stock market trends, sports events, and other performance domains. It continuously refines its strategies based on real-time data and evolving patterns.