BiEchi's Stars
deepfakes/faceswap
Deepfakes Software For All
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
prasmussen/gdrive
Google Drive CLI Client
apple/ml-ferret
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
wkentaro/gdown
Google Drive Public File Downloader when Curl/Wget Fails
paperswithcode/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
nerfies/nerfies.github.io
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
glotlabs/gdrive
Google Drive CLI Client
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
SeanLee97/AnglE
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
jbloomAus/SAELens
Training Sparse Autoencoders on Language Models
DigiRL-agent/digirl
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Berkeley-NLP/Agent-Eval-Refine
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
Violet24K/HowToUIUC
Guide for surviving at UIUC (under development)
Evian-Zhang/Zhihu2Markdown
transform Zhihu article to HTML and Markdown files
samzhangjy/ZhihuToMarkdown
一个将知乎文章自动转换为Markdown的小工具,使用Python编写
Unified-Language-Model-Alignment/src
jdvin/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
neelnanda-io/neelutils
Random utils for personal use