MLSysOps/Code-Agent-Survey

A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.

MIT

Code-Agent-Survey

A survey of Code Agents for improving development productivity. These agents aim to help

SWE (Software Engineer)
MLE (Machine Learning Engineer)
DS (Data Scientist)
DA (Data Analyst)

Table of Contents

Paper with Code
Opensource Projects / Company Products
Foundation Models

Paper with Code

Paper	Year	Publisher	Type	Institution	Code
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering	2024	Arxiv	GitHub issue fixing	Princeton
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?	2024	ICLR	benchmark github issue fixing	Princeton
DevBench: A Comprehensive Benchmark for Software Development	2024	Arxiv	benchmark LLM for dev lifecycle	Shanghai AI Laboratory etc
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering	2024	Arxiv	code generation for CodeContests	CodiumAI
ChatDev: Communicative Agents for Software Development	2023	Arxiv	design, coding, and testing	Tsinghua University etc
MetaGPT: The Multi-Agent Framework	2023	Arxiv	Multi-Agent Framework. using the software collaboration as an example	DeepWisdom etc
Data Interpreter: An LLM Agent For Data Science	2024	Arxiv	address data science problem	DeepWisdom etc
Agentless: Demystifying LLM-based Software Engineering Agents	2024	Arxiv	agentless method for swe-bench	UIUC
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation	2023	ICML	code generation benchmark with a thousand data science questions	The University of Hong Kong etc
AutoCodeRover: Autonomous Program Improvement	2024	[Arxiv]		NUS

Opensource Projects / Company Products

Name	Type	Target	Contributor	Code / Product
OpenDevin: Code Less, Make More	write code, fix bugs, and ship features.	SWE	OpenDevin Community
Devon: An open-source pair programmer	Codebase exploration Config writing Test writing Bug fixing Architecture exploration	SWE	entropy-research
gpt-engineer	write and execute software code	SWE	gpt-engineer-org
Aider is AI pair programming in your terminal	start a new project or work with an existing git repo.	SWE	paul-gauthier
Cover-Agent	automate and enhance the generation of tests (currently mostly unit tests)	QA Engineer	CodiumAI
PR-Agent	Automated Pull Request Analysis, Feedback, Suggestions	SWE	CodiumAI
GPT PILOT	VS Code extension that aims to provide the first real AI developer companion	SWE	Pythagora-io
Claude Engineer	assist with a wide range of software development tasks	SWE	Doriandarko
cognition AI	An applied AI lab building end-to-end software agents.	SWE	cognition.ai	Product
Tabby	self-hosted AI coding assistant, Github Copilot alternative	SWE	tabbyml
Sweep AI	issue PR, unit test	SWE	Sweep AI	Product
Continue AI	GitHub Copilot alternative, VS Code and JetBrains extension	SWE	Continue
Hex Magic AI	text to sql/python data analysis code, copilot to understand and fix code issues	MLE / DS	Hex	Product
datagpt	chatbot to SQL, automatic data analysis	DA	datagpt	Product
pandas-ai	Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc).	DA	pandas AI
bito	generate, explain and review code	SWE	bito	[Product]

Foundation Model (Code Specific)

Name	Paper	Year	Blog	Institution	GitHub
Code Llama: Open Foundation Models for Code	arxiv	2023	link	Meta
CodeQwen1.5-7B	arxiv	2024	link	Alibaba
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation	N.A.	2024	link	HuggingFace etc.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence	arxiv	2024	link	DeepSeek
Codestral	N.A.	2024	link	Mistral	[Hugging Face]