gao-xiao-bai

gao-xiao-bai's Stars

gao-xiao-bai/StrategyLLM
1
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
5k275
GAIR-NLP/ProX
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
Language:Python18312
ars22/scaling-LLM-math-synthetic-data
Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
24
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
Language:Python70084
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14k1.3k
NL2Code/CodeR
15217
Aider-AI/aider
aider is AI pair programming in your terminal
Language:Python21.2k2k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.7k1.1k
renqibing/CodeAttack
[ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
Language:Python252
nus-apr/auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite and 38.40% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
Language:Python2.7k285
bigcode-project/octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
Language:Jupyter Notebook43227
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language:Python3.9k305
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
Language:Python1.2k352
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
Language:Python13.6k1.4k
Pythagora-io/gpt-pilot
The first real AI developer
Language:Python31.6k3.2k
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.4k453
princeton-nlp/SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
Language:Python1.9k331
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python33.5k3.8k
open-compass/DevBench
A Comprehensive Benchmark for Software Development.
Language:Python855
ninechapter-algorithm/leetcode-linghu-templete
算法面试必备，推荐刷题网站www.lintcode.com。北大学霸的《LeetCode刷题模板》+V领取: jiuzhangfeifei
3.2k776
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language:Python20.1k2.5k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python29.5k4.4k
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML3.5k404
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Language:Jupyter Notebook31653
gao-xiao-bai/JsonTuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
Language:Python10
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.6k167
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.3k101
greenbellpepper/GreenPepper
857119
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python6.9k1.8k