RedTachyon

Reinforcement Learning researcher, AI enthusiast

Paris

Pinned Repositories

cogment-lab
A toolkit for practical Human-AI cooperation research
Language: Python 13 3 52
Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Language: Python 7.5k 42 473837
PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Language: Python 2.7k 18 377420
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python 34.9k 1.1k 1.8k 8.6k
aiphysicist
My implementation of Wu's and Tegmark's AI Physicist
Language: Python 1 2 10
alcove
A basic template for Farama projects
Language: Shell 3 1 00
coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
Language: Python 26 3 142
CrowdAI
This will be a PhD thesis someday
Language: C# 6 3 01
Ferry
WiP gRPC Gymnasium API
Language: Rust 5 1 00
mingpt-rs
A toy implementation of Karpathy's minGPT with tch in Rust
Language: Rust 3 1 01

RedTachyon's Repositories

RedTachyon/coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
Language: Python 26 3 142
RedTachyon/CrowdAI
This will be a PhD thesis someday
Language: C# 6 3 01
RedTachyon/Ferry
WiP gRPC Gymnasium API
Language: Rust 5 1 00
RedTachyon/llm-zth
Language: Jupyter Notebook 1 1 0
RedTachyon/tutor-at-home
Language: Python 1 3 01
RedTachyon/redtachyonme
Language: HTML 0 2 00
RedTachyon/anterion
Open-source software engineer
Language: Python 0 0
RedTachyon/AutoGPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language: JavaScript 0 0
RedTachyon/cogment-verse
Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)
Language: Python 0 0
RedTachyon/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
Language: Python 0 0
RedTachyon/decimal-float-toy-vite
Language: TypeScript
RedTachyon/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
Language: Python 0 0
RedTachyon/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
Language: Python 1 0
RedTachyon/instructor
structured outputs for llms
Language: Python 0 0
RedTachyon/keras
Deep Learning for humans
Language: Python 0 0
RedTachyon/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Language: Python 0 0
RedTachyon/LaVague
Copilot for web automation
Language: Python 0 0
RedTachyon/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda 0 0
RedTachyon/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python 0 0
RedTachyon/mincv
Language: TypeScript 1 0
RedTachyon/OSWorld
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Language: Python 0 0
RedTachyon/pong-wars
Language: HTML 0 0
RedTachyon/redtachyon
2 0
RedTachyon/SWE-agent
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models
Language: Python 0 0
RedTachyon/task-standard
METR Task Standard
Language: TypeScript 0 0
RedTachyon/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language: Python 0 0
RedTachyon/TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
Language: Jupyter Notebook 0 0
RedTachyon/typarse
Simple type-based argument parsing
Language: Python 2 0
RedTachyon/vimGPT
Browse the web with GPT-4V and Vimium
Language: Python 0 0
RedTachyon/wildcats-ai
This will one day be an actually working AI agent
Language: Jupyter Notebook 1 0