codebert

There are 33 repositories under the codebert topic.

neulab/code-bert-score
CodeBERTScore: an automatic metric for code generation, based on BERTScore
Language: Jupyter Notebook 203 4 1015
dessertlab/EVIL
EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language from descriptions in natural language. The approach leverages Neural Machine Translation (NMT) techniques and a dataset that we developed for this work.
Language: Python 27 4 24
RepoAnalysis/RepoSnipy
Neural search engine for discovering semantically similar Python repositories on GitHub
Language: Python 26 1 06
jorge-martinez-gil/graphcodebert-interpretability
Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks
Language: Python 20 2 01
jorge-martinez-gil/small-code-models
Repository about small code models
Language: Python 19 2 02
philippnormann/malicious-payload-detection
🕵️‍♂️ ML project to identify malicious web payloads, aimed at boosting the effectiveness of WAFs and IDSs.
Language: Jupyter Notebook 13 1 00
dessertlab/Targeted-Data-Poisoning-Attacks
This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).
Language: Python 12 1 02
EhsanMashhadi/ISSRE2023-BugSeverityPrediction
Code for our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs", accepted at ISSRE 2023.
Language: Java 10 1 02
jorge-martinez-gil/ensemble-codesim
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures
Language: Java 9 1 02
jorge-martinez-gil/graphcodebert-feature-integration
Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration
Language: Python 9 1 04
sssszh/Vulnerability-Detection
Fine-tuning CodeBERT for Vulnerability Detection
Language: Python 8 1 00
ML4SE2022/Group4
Fine-tuning CodeBERT with AST-based Vectors for Code Translation
Language: C# 5 3 01
RepoAnalysis/RepoSim
This repository contains experiments comparing the similarity of Python repositories using ML models.
Language: Jupyter Notebook 4 0 02
daimakram/Bug-Detection-Code-Summarization
Performs code summarization, bug detection, and bug removal using different natural language processing models, including GraphCodeBERT, GREAT, GNN, and CoText.
Language: Jupyter Notebook 3 2 00
MarttiWu/codeopt
CodeOpt: A framework for optimizing code performance using Two-Stage Sampling, Few-Shot Learning, and Iterative Self-Reflection with support for Genetic Algorithm Inspired Chain-of-Thought (GA-COT).
Language: Python 2 1 0
RepoMining/RepoSim4Py
A project for determining the similarity of Python repositories using an embedding-based approach
Language: Jupyter Notebook 2 0 02
Ahmedfir/java-business-locations
Extracts business-logic code locations.
Language: Java 1 3 12
blackscythe123/IRSE
The study uses the IRSE/FIRE dataset and explores the impact of combining original C code data with Python-derived silver-standard
Language: Jupyter Notebook 1
bosszii2709/ai-dataset-generator
🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.
Language: Python 1
hishamp3/codeDetection
Django implementation of CodeBERT for detecting vulnerable code.
Language: Python 1 1 00
khushnood-rafique/Transformer-Based-Unit-Test-Generation
This study compares three transformer-based models: CodeT5, CodeBERT, and CodeGen.
Language: Python 1
roshan112-3/Auto-grading-C-programming-assignments-with-CodeBERT-and-Random-Forest-Regressor
Language: Jupyter Notebook 10
sarvagyakrcs/s0.dev
The modern web development landscape is plagued by a peculiar paradox: despite the abundance of UI components and design systems, developers still spend countless hours reimplementing similar interfaces. S0 addresses this challenge by introducing a novel approach that combines advanced vector search capabilities.
Language: Python 1 1 0
shruti10-designer/NNDL-Autograding
Auto-grading of C programs using machine learning and deep learning models such as random forest, CNN, and LSTM, together with code embedding models such as CodeBERT. A paper on this work was also published with IEEE (14th ICCNT Conference).
Language: Jupyter Notebook 1 1 00
Vaibhav06Jha28/ChainSage
"AI-powered vulnerability detection for Solidity smart contracts using Mistral + CodeBERT"
Language: Python 1
aleksibovellan/ai-python-code-validator
AI/ML Trained Python Code Validator with Gradio Web Interface
Language: Python 0 1 00
GianRomani/Neural_search_engine
Neural search engine for questions/answers from StackOverflow
Language: Jupyter Notebook 0 1 00
hishamp3/codeXGLUE
CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation.
Language: Jupyter Notebook 0 1 00
Radowan98/ZSVulD
Implementation and dataset for A Zero-Shot Framework for Cross-Project Vulnerability Detection in Source Code (Empirical Software Engineering, 2026).
Language: Python 00
yegmor/CoCLR-ML_Reproducibility_Challenge_2021
Reproducibility report of "CoSQA: 20,000+ Web Queries for Code Search and Question Answering" for the ML Reproducibility Challenge 2021
Language: Python 0 0 00
kordy0-0/ai-dataset-generator
🛠️ Generate AI training datasets easily, transforming complex information from documents into structured data for model fine-tuning.
Language: Python
santoshpremi/AutoMCP
AutoMCP: Dynamic Tool Synthesis via Code Pattern Mining and Self-Regenerative Agent Architectures
Language: Jupyter Notebook
ZakriaComputerEngineer/Automated-Bug-Report-Classification-to-Improve-Source-Code-Quality
This repository contains the source code for the conference paper "From Bug Reports to Code Quality: A Transformer-Based Classification Approach"
Language: Jupyter Notebook
