codebert
There are 33 repositories under codebert topic.
neulab/code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
dessertlab/EVIL
EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language from descriptions in natural language. The approach leverages Neural Machine Translation (NMT) techniques and a dataset that we developed for this work.
RepoAnalysis/RepoSnipy
Neural search engine for discovering semantically similar Python repositories on GitHub
jorge-martinez-gil/graphcodebert-interpretability
Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks
jorge-martinez-gil/small-code-models
Repository about small code models
philippnormann/malicious-payload-detection
🕵️♂️ ML project to identify malicious web payloads, aimed at boosting the effectiveness of WAFs and IDSs.
dessertlab/Targeted-Data-Poisoning-Attacks
This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).
EhsanMashhadi/ISSRE2023-BugSeverityPrediction
Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.
jorge-martinez-gil/ensemble-codesim
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures
jorge-martinez-gil/graphcodebert-feature-integration
Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration
sssszh/Vulnerability-Detection
Fine-tuning CodeBERT for Vulnerability Detection
ML4SE2022/Group4
Fine-tuning CodeBERT with AST-based Vectors for Code Translation
RepoAnalysis/RepoSim
This repository contains experiments on comparing the similarity of Python repositories using ML models.
daimakram/Bug-Detection-Code-Summarization
Performs Code Summarization, Bug Detection, Bug Removal using different Natural language processing models including Garph CodeBERT, GREAT, GNN, CoText etc.
MarttiWu/codeopt
CodeOpt: A framework for optimizing code performance using Two-Stage Sampling, Few-Shot Learning, and Iterative Self-Reflection with support for Genetic Algorithm Inspired Chain-of-Thought (GA-COT).
RepoMining/RepoSim4Py
A project for determining the similarity of python repositories based on embedding approach
Ahmedfir/java-business-locations
extracts business-logic code locations.
blackscythe123/IRSE
The study uses the IRSE/FIRE dataset and explores the impact of combining original C code data with Python-derived silver-standard
bosszii2709/ai-dataset-generator
🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.
hishamp3/codeDetection
Django implementation of CodeBERT for detecting vulnerable code.
khushnood-rafique/Transformer-Based-Unit-Test-Generation
This study compares three transformer-based mod- els—CodeT5, CodeBERT, and CodeGen.
sarvagyakrcs/s0.dev
The modern web development landscape is plagued by a peculiar paradox: despite the abundance of UI components and design systems, developers still spend countless hours reimplementing similar interfaces. S0 addresses this challenge by introducing a novel approach that combines advanced vector search capabilities.
shruti10-designer/NNDL-Autograding
Auto-grading of C programs using Machine Learning and Deep Learning models such as random forest, CNN, LSTM etc and code embedding models such as CodeBERT. Also published a paper for the same in IEEE (14th ICCNT Conference)
Vaibhav06Jha28/ChainSage
"AI-powered vulnerability detection for Solidity smart contracts using Mistral + CodeBERT"
aleksibovellan/ai-python-code-validator
AI/ML Trained Python Code Validator with Gradio Web Interface
GianRomani/Neural_search_engine
Neural search engine for questions/answers from StackOverflow
hishamp3/codeXGLUE
CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation.
Radowan98/ZSVulD
Implementation and dataset for A Zero-Shot Framework for Cross-Project Vulnerability Detection in Source Code (Empirical Software Engineering, 2026).
yegmor/CoCLR-ML_Reproducibility_Challenge_2021
Reproducibility report ofCoSQA: 20,000+ Web Queries for Code Search and QuestionAnswering for ML Reproducibility Challenge 2021
kordy0-0/ai-dataset-generator
🛠️ Generate AI training datasets easily, transforming complex information from documents into structured data for model fine-tuning.
santoshpremi/AutoMCP
AutoMCP: Dynamic Tool Synthesis via Code Pattern Mining and Self-Regenerative Agent Architectures
ZakriaComputerEngineer/Automated-Bug-Report-Classification-to-Improve-Source-Code-Quality
This repository is source code of conference paper "From Bug Reports to Code Quality: A Transformer-Based Classification Approach"