WMWMW1's Stars
happyapplehorse/agere
A tool for building and driving workflows tailored to AI tasks; it can be used to construct AI agents.
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
WMWMW1/LM-from-scratch
LM from scratch
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
pinecone-io/examples
Jupyter Notebooks to help you get hands-on with Pinecone vector databases
snap-stanford/MLAgentBench
DLLXW/baby-llama2-chinese
A repository for pretraining from scratch plus SFT of a small-parameter Chinese LLaMA2; a single 24 GB GPU is enough to obtain a chat-llama2 with basic Chinese Q&A ability.
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
assafelovic/gpt-researcher
LLM-based autonomous agent that performs comprehensive online research on any given topic
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
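The repository above implements PPO's clipped surrogate objective; as a rough illustration (not code from the repo), the core of that objective for a single action can be sketched in plain Python as:

```python
import math

def ppo_clipped_objective(logp_new, logp_old, advantage, clip_eps=0.2):
    """Sketch of the PPO clipped surrogate objective for one sample.

    logp_new / logp_old: log-probabilities of the taken action under the
    current and old policies; advantage: estimated advantage A(s, a).
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s)
    ratio = math.exp(logp_new - logp_old)
    # Clip the ratio into [1 - eps, 1 + eps]
    clipped = max(min(ratio, 1.0 + clip_eps), 1.0 - clip_eps)
    # PPO maximizes the minimum of the unclipped and clipped terms,
    # which removes the incentive for large policy updates
    return min(ratio * advantage, clipped * advantage)
```

For example, when the new and old log-probabilities are equal the ratio is 1 and the objective reduces to the advantage itself; when the ratio exceeds 1 + eps, the clipped term caps the update.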
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, and a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama3 for WhatsApp & Messenger.
yangjianxin1/Firefly-LLaMA2-Chinese
Firefly Chinese LLaMA-2 large model; supports continued pretraining of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a collection of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
binance/binance-spot-api-docs
Official Documentation for the Binance Spot APIs and Streams
bgavran/Category_Theory_Machine_Learning
List of papers studying machine learning through the lens of category theory
mlii/mfrl
Mean Field Multi-Agent Reinforcement Learning
BorealisAI/mtmfrl
Multi Type Mean Field Reinforcement Learning
jasonyip184/Index-Tracking-Portfolio-Optimization
ggerganov/llama.cpp
LLM inference in C/C++
qmfin/index_data
Dataset for index tracking
OrigamiDream/gato
Unofficial Gato: A Generalist Agent
mingkai-zheng/GENIUS
Can GPT-4 Perform Neural Architecture Search?