BugCreat0r's Stars
clash-verge-rev/clash-verge-rev
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)
haozheji/exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
agi-templar/Stable-Alignment
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
NAOSI-DLUT/Campus2024
2024届互联网校招信息汇总
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
jefferyYu/ESAFN
Dataset and codes for our paper "Entity-Sensitive Attention and Fusion Network for Entity-Level Multimodal Sentiment Classification".
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
microsoft/DeepSpeedExamples
Example models using DeepSpeed
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
salaniz/pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
git-lfs/git-lfs
Git extension for versioning large files
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
toshas/torch-discounted-cumsum
Fast Discounted Cumulative Sums in PyTorch
thomfoster/minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
TheExGenesis/rlhf-magic