gao-xiao-bai's Stars
gao-xiao-bai/StrategyLLM
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
GAIR-NLP/ProX
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
ars22/scaling-LLM-math-synthetic-data
Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NL2Code/CodeR
Aider-AI/aider
aider is AI pair programming in your terminal
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
renqibing/CodeAttack
[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
nus-apr/auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite and 38.40% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
bigcode-project/octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
Pythagora-io/gpt-pilot
The first real AI developer
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
princeton-nlp/SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
open-compass/DevBench
A Comprehensive Benchmark for Software Development.
ninechapter-algorithm/leetcode-linghu-templete
算法面试必备,推荐刷题网站www.lintcode.com。北大学霸的《LeetCode刷题模板》+V领取: jiuzhangfeifei
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wdndev/llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
gao-xiao-bai/JsonTuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
OpenLMLab/MOSS-RLHF
MOSS-RLHF
greenbellpepper/GreenPepper
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.