Pinned Repositories
astminer
A library for mining of path-based representations of code (and more)
blog-cvicse
cmu-seai
CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI)
code-docstring-corpus
Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
CodeXGLUE
CodeXGLUE
compy-learn
ComPy-Learn is a framework for exploring program representations for ML4CODE tasks.
datasets
source{d} datasets ("big code") for source code analysis and machine learning on source code
devign
Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks
mineSStuBs
Hosts our tool for mining simple "stupid'' bugs (SStuBs).
ML-meets-SE
ZzYuanYaozZ's Repositories
ZzYuanYaozZ/cmu-seai
CMU Lecture: Machine Learning In Production / AI Engineering / Software Engineering for AI-Enabled Systems (SE4AI)
ZzYuanYaozZ/neural-program-analysis
Awesome papers of nueral program analysis
ZzYuanYaozZ/ML-meets-SE
ZzYuanYaozZ/CodeXGLUE
CodeXGLUE
ZzYuanYaozZ/astminer
A library for mining of path-based representations of code (and more)
ZzYuanYaozZ/ProGraML
Graph-based Program Representation & Models for Deep Learning
ZzYuanYaozZ/MSR_20_Code_vulnerability_CSV_Dataset
A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries
ZzYuanYaozZ/mineSStuBs
Hosts our tool for mining simple "stupid'' bugs (SStuBs).
ZzYuanYaozZ/devign
Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks
ZzYuanYaozZ/compy-learn
ComPy-Learn is a framework for exploring program representations for ML4CODE tasks.
ZzYuanYaozZ/code-docstring-corpus
Preprocessed Python functions and docstrings for automated code documentation (code2doc) and automated code generation (doc2code) tasks.
ZzYuanYaozZ/datasets
source{d} datasets ("big code") for source code analysis and machine learning on source code
ZzYuanYaozZ/StackOverflow-Question-Code-Dataset
StaQC: a systematically mined dataset containing around 148K Python and 120K SQL domain question-code pairs, as described in "StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow" (WWW'18)
ZzYuanYaozZ/blog-cvicse