ZachariahPang

learning machine learning

ZachariahPang's Stars

Delgan/loguru
Python logging made (stupidly) simple
Language: Python, 19.6k stars, 694 forks
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language: Python, 6k stars, 1k forks
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python, 1.2k stars, 163 forks
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language: Python, 31.6k stars, 3.9k forks
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python, 2.1k stars, 206 forks
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
6.3k stars, 383 forks
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
Language: JavaScript, 3.3k stars, 556 forks
vimalabs/VIMABench
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Language: Python 27334
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Language: Python, 19.6k stars, 2.7k forks
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Language: Python, 2.9k stars, 199 forks
svenstaro/genact
🌀 A nonsense activity generator
Language: Rust, 9.6k stars, 410 forks
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python 46518
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language: Python, 3.7k stars, 665 forks
xai-org/grok-1
Grok open release
Language: Python, 49.5k stars, 8.3k forks
ajeetdsouza/zoxide
A smarter cd command. Supports all major shells.
Language: Rust, 22k stars, 535 forks
dave1010/tree-of-thought-prompting
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning
65960
ExpectationMax/simple_gpu_scheduler
Simple scheduler for running jobs on GPUs
Language: Python 17112
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language: Python, 6.1k stars, 610 forks
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
Language: Python, 26.9k stars, 1.4k forks
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language: Jupyter Notebook, 11.9k stars, 1.7k forks
brentyi/tyro
CLI interfaces & config objects, from types
Language: Python 47425
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python, 5.4k stars, 613 forks
qgallouedec/panda-gym
Set of robotic environments based on PyBullet physics engine and gymnasium.
Language: Python 547114
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language: Jupyter Notebook, 7k stars, 482 forks
ryanchenstats/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Language: Python 1
bstadie/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Language: Python 11
beartype/beartype
Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.
Language: Python, 2.6k stars, 57 forks
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Language: Python, 2.8k stars, 460 forks
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Language: Jupyter Notebook, 2.6k stars, 156 forks
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
2.8k stars, 229 forks
