ZachariahPang's Stars
Delgan/loguru
Python logging made (stupidly) simple
microsoft/DeepSpeedExamples
Example models using DeepSpeed
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
vimalabs/VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
svenstaro/genact
🌀 A nonsense activity generator
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
xai-org/grok-1
Grok open release
ajeetdsouza/zoxide
A smarter cd command. Supports all major shells.
dave1010/tree-of-thought-prompting
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning
ExpectationMax/simple_gpu_scheduler
Simple scheduler for running jobs on GPUs
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps to showcase Meta Llama for WhatsApp & Messenger.
brentyi/tyro
CLI interfaces & config objects, from types
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
qgallouedec/panda-gym
Set of robotic environments based on PyBullet physics engine and gymnasium.
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
ryanchenstats/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
bstadie/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
beartype/beartype
Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites