Large Language Model for Software Engineering

The collection is actively updated with the help of an internal literature search engine.

Model List

2023 (234 Models)

NoCrypt/fast-repo	TabbyML/SantaCoder-1B	michaelfeil/ct2fast-starchat-alpha
mantra-coding/alBERTo	michaelfeil/ct2fast-starcoderbase	michaelfeil/ct2fast-starcoder
bigcode/tiny_starcoder_py	kevinpro/Vicuna-13B-CoT	jiezhou1996/test
Soliai/Soli	bigcode/gpt_bigcode-santacoder	michaelfeil/ct2fast-gpt_bigcode-santacoder
dushigao/yolov4	bigcode/starcoder	rustformers/bloom-ggml
rustformers/bloomz-ggml	mishasadhaker/codet5_large_typescript	sahil2801/instruct-codegen-16B
jokerLang/aa	CNXT/CHaTx	sadiqj/camlcoder
omegaodin/replit-replit-code-v1-3b	replit/replit-code-v1-3b	teknium/Replit-v1-CodeInstruct-3B-fp16
teknium/Replit-v1-CodeInstruct-3B	kkhan/gpt2-medium-iba-txt	4bit/Replit-v1-CodeInstruct-3B
bigscience/bloomz-560m	bigscience/bloomz-1b1	bigscience/bloomz-1b7
bigscience/bloomz-3b	bigscience/bloomz-7b1	bigscience/bloomz
Neupane9Sujal/Text_Summarization	betelguesestudios/ChatDBD	azizp128/emotion-predictor-indobert
zirui3/starcoder-ft-zh	zjunlp/CaMA-13B-LoRA	zjunlp/CaMA-13B-Diff
Aryan2003/roberta_job	zchflyer/test11	EnterNameBros/DialoGPT-small-Senko-san-ver-2
dev2bit/es2bash-mt5	omegaodin/gpt2	Fsoft-AIC/Codebert-docstring-inconsistency
HuggingFaceH4/starchat-alpha	AsakusaRinne/LLamaSharpSamples	AlexWortega/wortegaLM-1b
huolongguo10/check_sec_tiny	NeoDim/starcoder-GGML	NeoDim/starcoderbase-GGML
NeoDim/starchat-alpha-GGML	christinacdl/moderate_severe_depression_model	KinglyCrow/pythia-3b-deduped-sft-r1-python-finetuned
Binaryy/blender-bot-distill-finetuned	Fredithefish/CrimsonPajama	showpiece/donut4cover_of_books
OdiaGenAI/odiagenAI-model-v1	NatLee/openpose-keras-model	pratikcha/DummyModelTest
up201806461/BFP-combined	baotoan2002/GPT-2	brandit/atharv.1
BlackBull/yeet	wandisun/generate_testcase	pszemraj/bart-large-code-instructiongen
redlinezh/redlinezh	erichilarysmithsr/Quality-of-Life-Games	AlexWortega/wortegaLM
rishiraj/starchat	bigcode/starcoder-megatron	bigcode/starcoderbase-megatron
bigcode/santacoder	bigscience/bloom-1b7	bigscience/bloom-560m
bigcode/starpii	bigcode/starcoderbase	APJ23/MultiHeaded_Sentiment_Analysis_Model
lentan/replit	bigcode/starencoder	jitesh/emotion-english
TinaLiHF/fined-tuned-T5small	tmnam20/codebert-code-summarization	Vipitis/santacoder-finetuned-the-stack-glsl
Vipitis/santacoder-finetuned-Shadertoys-fine	Vipitis/santacoder-finetuned-Shadertoys	tabbleman/test
huolongguo10/check_sec	HelloImSteven/AppleScript-Summarizer	duncan93/video
alexpaul/QI-Large-v1	JeanL-0/ChatAnswering-PTBR	jitroy07/BOT
Rirou360/test	RafMuz/alpaca7B-lora	Akhil0-o/saved_model_links
TrippingFollowing39/AMOGUS	Akhil0-o/saved_model_body	MrRainbow/RainbowGPT
Akhil0-o/Phishing_detection	Ilangraterol/Dataset_model	AlexWortega/instruct_rugptlarge
MLRush/chinese-chat-30m	MLRush/chinese-lm-30m	ParsaKgvr/mmdGPT
ParsaKgvr/mmdBERT	dorkai/codeX-1.0	OtterDev/otterchat
Phonecharger/WLAsw1	MatthiasPi/ActiveLearningModel-WAR-WassersteinActiveRegression	Wannita/baseline_codecompletion
ybelkada/bloom-1b7-8bit	kelly233/test_model	ArmelR/AlpacaCode512
bigscience/bloom-3b	lambdasec/santafixer	ybelkada/bloom-560m-8bit
PromptKing/GTA5_PROCESS_LEARNING_AI	Qrstud/ANCs	HTP/CHaTx
LYFCJJ/anythingv45-cjj-diffusers	hakurei/instruct-12b	Dirus/GPTOWN
TeamGZG/toxic-comment-classification-project	MarTinSForZZa/Innerversal	newsrx/bloomz-7b1
0x7194633/pyGPT-50M	dhnchandan/huggingface	RomanTeucher/PythonCoder
bigscience/bloom	edbeeching/llama-se-rl-adapter	TheEeeeLin/test
olivierdehaene/optimized-santacoder	Mauquoi-00/Teenage_Gender_Classification	Esly35i/Esmoli
zee2221/ai_me	urmom12349823/AItext	manstepharder/hangi
Sentdex/GPyT	lxe/Cerebras-GPT-2.7B-Alpaca-SP	akone/bloomgpt
TSjB/mbart-large-52-qm-ru-v1	Wannita/PyCoder	mazeratti/creative
TabbyML/NeoX-1.3B	pszemraj/bart-base-code-instructiongen	AlexWortega/instruct_rugptMedium
vernin/maylora	valooo/test	amongusrickroll68/MeloMind
amongusrickroll68/TextImagine-1.0-March-2023	badmatr11x/distilroberta-base-offensive-hateful-speech-text-multiclassification	Techh/speed_car_policee
Ar4ikov/gpt2-650k-stable-diffusion-prompt-generator	bigscience/distill-bloom-1b3	CAUKiel/JavaBERT
emre/java-RoBERTa-Tara-small	Ashokajou51/NonToxicCivilBert	thevyasamit/bert_fake_news_classification
namikazi25/DCNN_on_CIFAR_10	mdoshi2612/fake-news-detector	CAUKiel/JavaBERT-uncased
shibing624/code-autocomplete-distilgpt2-python	shibing624/code-autocomplete-gpt2-base	aarnphm/multi-length-text-classification-pipeline
NITINNANNAPANENI/Ll	rockmiin/ml-codeparrot	Naina07/Fine_tune
bigscience/bloom-1b1	bigscience/distill-bloom-1b3-10x	wittyicon/Text-Alchemy
razent/cotext-1-cc	omarelsayeed/wav2vec2_ar_anz2	whybeyoung/test
KonghaYao/MagicPrompt_SD_V1	zabir-alnazi/fatima-fellowship-ai-gen-detector	Abdullah007/image-classification-ResNet50
AlexWortega/instruct_rugptSmall	sjiang1/codecse	daeunj/828A
Ajibola/PaViT	changwh5/BigBiGAN-MNIST-150epoch	Azarthehulk/Image_preprocessing_basics
nishakathiriya/DR-model	AcrossTheUniverseZ/ATUZGenerator	Roy029/sno_empty
imharesh/Shabbat	pavanBuduguppa/asr_inverse_text_normalization	NeyroTech/PicKHK
rapples/png2emb	AlexWortega/taskGPT2-xl-v0.2a	marlenezw/AutoVC_Voice_Conversion
mrm8488/santacoder-finetuned-the-stack-clojure	BrendaTellez/sounds	BrendaTellez/SoundClassificationCNNRNN
samkenxstream/AlgoSilicon	samkenxstream/HierarchyMartialsAI	ilahazs/rokashibasakiv1
bigscience/bloom-7b1	bigscience/bloom-560m-intermediate	bigscience/bloom-1b1-intermediate
bigscience/bloom-3b-intermediate	bigscience/bloom-7b1-intermediate	bigscience/bloomz-mt
bigscience/bloomz-7b1-mt	bigscience/bloomz-7b1-p3	bigscience/bloomz-p3
bigscience/bloom-1b7-intermediate	mrm8488/santacoder-finetuned-the-stack-swift	Neighhhbor/Test_model
muhtasham/santacoder-finetuned-the-stack-cobol	muhtasham/santacoder-finetuned-the-stack-assembly	HuggingFaceH4/bloomz-7b1
zkep/detr	loubnabnl/santacoder-code-to-text	mrm8488/santacoder-finetuned-the-stack-bash-shell
Thyral/Testing	noahshinn024/santacoder-ts	el-profesor/code_t5
K8778/universe	CarperAI/diff-codegen-6b-v2	CarperAI/diff-codegen-2b-v2
CarperAI/diff-codegen-350m-v2	96harsh56/bert_test2	aminian/ML-final-project
microsoft/codereviewer	facebook/incoder-1B	facebook/incoder-6B
MrFitzmaurice/roberta-finetuned-topic-5	mble/nameToStdName	aadvari/movie-recommender
aparnabhat/kannada-ner	Kaliel456/Lynn	bigcode/santacoder-megatron
Salesforce/codegen2-1B	Salesforce/codegen2-3_7B	Salesforce/codegen2-7B

2022 (38 Models)

mrm8488/bloom-560m-finetuned-the-stack-rust	smallcloudai/codify_medium_multi	smallcloudai/codify_3b_multi
anjandash/JavaBERT-small	anjandash/JavaBERT-mini	saikatc/NatGen
Nokia/nlgp-docstring	alecsharpie/codegen_350m_html	alecsharpie/codegen_350m_css
CarperAI/diff-codegen-350m-v1	giulio98/codegen-350M-multi-xlcost-v2	giulio98/codegen-350M-multi-xlcost
Nokia/nlgp-natural	model-attribution-challenge/bloom-560m	CarperAI/FIM-NeoX-1.3B
model-attribution-challenge/bloom-2b5	huggingface/CodeBERTa-language-id	codeparrot/codeparrot-small-code-to-text
moyix/csrc_774m	codeparrot/unixcoder-java-complexity-prediction	codeparrot/codeparrot-small-text-to-code
bigscience/bloom-optimizer-states	model-attribution-challenge/bloom-350m	little-star/good_model
codeparrot/codeparrot-small-multi	bigscience/bloom-intermediate	bigscience/tr11-176B-logs
codeparrot/codeparrot-small	huggingface/CodeBERTa-small-v1	codeparrot/codeparrot
lvwerra/test_card	razent/spbert-mlm-base	razent/spbert-mlm-wso-base
razent/spbert-mlm-zero	razent/cotext-2-cc	razent/cotext-1-ccg
ietz/distilroberta-base-finetuned-jira-qt-issue-titles-and-bodies	ietz/distilroberta-base-finetuned-jira-qt-issue-title

2021 (2 Models)

mrm8488/codeBERTaJS	mrm8488/CodeBERTaPy

Popular Model List

Year-Id	Model Name	Paper	# of Parameters / Model Size	Open Source?
2023-4	CodeGen2	link	[1B, 3.7B, 7B, 16B]	source
2023-3	CodeT5+	link	[220M, 770M, 2B, 6B, 16B]	source
2023-2	StarCoder	link	[15B]	source
2023-1	CodeGeeX	link	[13B]	source
2022-2	InCoder	link	[1.3B/2.62GB, 6B/26.6GB]	source
2022-1	CodeGen	link	[350M, 2B, 6B, 16B]	source
2021-1	CodeT5	link	[770M/892MB]	source
2020-2	GraphCodeBERT	link		source
2020-1	CodeBERT	link		source
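
To compare the models above by size, the parameter column can be turned into numbers. The following is a minimal sketch, not part of this repository: the function name `parse_param_counts` is hypothetical, and it assumes that parameter counts are written as a number followed by `M` or `B`, while on-disk sizes use `MB` or `GB` and should be skipped.

```python
import re

# A number followed by M (million) or B (billion), but not by another letter,
# so that on-disk sizes such as "892MB" or "2.62GB" are not mistaken for counts.
_PARAM_RE = re.compile(r"(\d+(?:\.\d+)?)([MB])(?![A-Za-z])")
_SCALE = {"M": 10**6, "B": 10**9}


def parse_param_counts(cell):
    """Return the parameter counts found in one table cell as integers."""
    return [round(float(number) * _SCALE[unit])
            for number, unit in _PARAM_RE.findall(cell)]


# Cells copied from the CodeGen and CodeT5 rows of the table above.
print(parse_param_counts("[350M, 2B, 6B, 16B]"))
print(parse_param_counts("[770M/892MB]"))
```

Rows with an empty size cell, such as GraphCodeBERT and CodeBERT, yield an empty list.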

Paper List

Task Tags

Year-Id	Title	Venue Name(Type)
2023-1	Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning	TSE(J)
2022-27	Fast Changeset-based Bug Localization with BERT	ICSE(C)
2022-26	An Empirical Study on the Usage of Transformer Models for Code Completion	TSE(J)
2022-25	DualSC: Automatic Generation and Summarization of Shellcode via Transformer and Dual Learning	SANER(C)
2022-24	Source Code Summarization with Structural Relative Position Guided Transformer	SANER(C)
2022-23	Aspect-Based API Review Classification: How Far Can Pre-Trained Transformer Model Go?	SANER(C)
2022-22	Can Identifier Splitting Improve Open-Vocabulary Language Model of Code?	SANER(C)
2022-21	Evaluation of Context-Aware Language Models and Experts for Effort Estimation of Software Maintenance Issues	ICSME(C)
2022-20	Automating code review activities by large-scale pre-training	FSE(C)
2022-19	VulCurator: A Vulnerability-fixing Commit Detector	FSE(C)
2022-18	AutoPruner: Transformer-based Call Graph Pruning	FSE(C)
2022-17	Can pre-trained code embeddings improve model performance? Revisiting the use of code embeddings in software engineering tasks	EMSE(J)
2022-16	Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding	ICSE(C)
2022-15	Jigsaw: Large Language Models meet Program Synthesis	ICSE(C)
2022-14	Natural Attack for Pre-trained Models of Code	ICSE(C)
2022-13	Using Pre-Trained Models to Boost Code Review Automation	ICSE(C)
2022-12	What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code	ICSE(C)
2022-11	A Light Bug Triage Framework for Applying Large Pre-trained Language Model	ASE(C)
2022-10	AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models	ASE(C)
2022-9	Compressing Pre-trained Models of Code into 3 MB	ASE(C)
2022-8	PRCBERT: Prompt Learning for Requirement Classification using BERT-based Pretrained Language Models	ASE(C)
2022-7	Prompt-tuned Code Language Model as a Neural Knowledge Base for Type Inference in Statically-Typed Partial Code	ASE(C)
2022-6	Few-shot training LLMs for project-specific code-summarization	ASE(C)
2022-5	Diet code is healthy: simplifying programs for pre-trained models of code	FSE(C)
2022-4	Discrepancies among pre-trained deep neural networks: a new threat to model zoo reliability	FSE(C)
2022-3	Effective and scalable fault injection using bug reports and generative language models	FSE(C)
2022-2	An extensive study on pre-trained models for program understanding and generation	ISSTA(C)
2022-1	Using pre-trained language models to resolve textual and semantic merge conflicts (experience paper)	ISSTA(C)
2021-7	Studying the Usage of Text-To-Text Transfer Transformer to Support Code-Related Tasks	ICSE(C)
2021-6	Traceability Transformed: Generating more Accurate Links with Pre-Trained BERT Models	ICSE(C)
2021-5	Code Prediction by Feeding Trees to Transformers	ICSE(C)
2021-4	Traceability Transformed: Generating more Accurate Links with Pre-Trained BERT Models	ICSE(C)
2021-3	DeepMemory: Model-based Memorization Analysis of Deep Neural Language Models	ASE(C)
2021-2	What do pre-trained code models know about code?	ASE(C)
2021-1	Does reusing pre-trained NLP model propagate bugs?	FSE(C)
2020-3	Achieving Reliable Sentiment Analysis in the Software Engineering Domain using BERT	ICSME(C)
2020-2	Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go?	ICSME(C)
2020-1	Multi-task Learning based Pre-trained Language Model for Code Completion	ASE(C)

Paper Stats

Venue Stats

Venue	Count
ICSE	10
FSE	7
ASE	9
ISSTA	2
TSE	2
TOSEM	0
EMSE	1
ICSME	3
SANER	4
MSR	0

Year Stats

Year	Count
2023	1
2022	27
2021	7
2020	3
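
The venue and year counts above can be recomputed from the Paper List whenever an entry is added. The sketch below is one possible way to do it, not the script used by this repository: the `tally` helper is hypothetical, and `ROWS` holds only a handful of rows copied from the Paper List for illustration. It assumes the year is the part of Year-Id before the hyphen and that the venue type is the parenthesised suffix.

```python
from collections import Counter

# A few rows copied from the Paper List table: (Year-Id, Venue Name(Type)).
# Titles are omitted because the tallies do not need them.
ROWS = [
    ("2023-1", "TSE(J)"),
    ("2022-27", "ICSE(C)"),
    ("2022-26", "TSE(J)"),
    ("2022-25", "SANER(C)"),
    ("2021-7", "ICSE(C)"),
    ("2020-1", "ASE(C)"),
]


def tally(rows):
    """Return (venue_counts, year_counts) for (year_id, venue) pairs."""
    venues = Counter()
    years = Counter()
    for year_id, venue in rows:
        # "2022-27" -> "2022": the year is everything before the first hyphen.
        years[year_id.split("-", 1)[0]] += 1
        # "ICSE(C)" -> "ICSE": drop the "(C)" or "(J)" type suffix.
        venues[venue.split("(", 1)[0]] += 1
    return venues, years


venue_counts, year_counts = tally(ROWS)
print(dict(venue_counts))
print(dict(year_counts))
```

Run over the full Paper List, both tallies should sum to the same total, which is a quick consistency check for the two tables.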

Considered Venues

Conferences

Software Engineering Domain:
- ICSE: International Conference on Software Engineering
- FSE: The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
- ASE: IEEE/ACM International Conference on Automated Software Engineering
- ISSTA: International Symposium on Software Testing and Analysis
- ICSME: IEEE International Conference on Software Maintenance and Evolution
- MSR: IEEE Working Conference on Mining Software Repositories
- SANER: IEEE International Conference on Software Analysis, Evolution, and Reengineering

Journals

Software Engineering Domain:
- TSE: IEEE Transactions on Software Engineering
- TOSEM: ACM Transactions on Software Engineering and Methodology
- EMSE: Empirical Software Engineering

Contribution

The easiest way to contribute is to submit a paper with verified information via GitHub issues. The URL of the paper alone is enough. The maintainer will add it accordingly and keep you updated in the issue conversation.

Alternatively, you can create a pull request. For that, you need to strictly follow the format.

Any other suggestions to improve this repository are also welcome via GitHub issues.

Xin-Zhou-smu/LLM4SE
