budsus's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
microsoft/Data-Science-For-Beginners
10 Weeks, 20 Lessons, Data Science for All!
cloudcommunity/Free-Certifications
A curated list of free courses & certifications.
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
salesforce/CodeGen
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
isadorasophia/murder
Murder is a pixel art ECS game engine.
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Elgg/Elgg
A social networking engine in PHP/MySQL
makcedward/nlp
:memo: This repository recorded my NLP journey.
textpattern/textpattern
A flexible, elegant, fast and easy-to-use content management system written in PHP.
zhudotexe/kani
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
codemeta/codemeta
Minimal metadata schemas for science software and code, in JSON-LD
rsennrich/Bleualign
Machine-Translation-based sentence alignment tool for parallel text
facebookresearch/Neural-Code-Search-Evaluation-Dataset
evaluation dataset consisting of natural language query and code snippet pairs
Symbolk/Code2Graph
Towards converting multilingual source code into one language-agnostic graph representation.
ppashakhanloo/CodeTrek
A powerful relational representation of source code
facebookresearch/speech_translation
Demo and samples for universal speech translator
sola-st/IdBench
A benchmark for evaluating embeddings of identifiers in source code.
limiw/open-source-discussions
JMHReif/nodes2021-aura-training
Training materials for NODES 2021 training on Neo4j Aura
martysai/source-code-summarization
Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.
canggihpw/thesisdtetiugm
Latex Template for Thesis Writing at DTETI UGM
worldofcyberskills/Blackbird
Blackbird:- An OSINT tool to search fast for accounts by username across 131 sites.
IeuanWalker/Semantic-Web-Book-Search-Application
Project from University where i created a web application that sources data using semantic web technologies.
mranahmd/nbow2-text-class
Data and code used in the 2016 RepL4NLP ACL paper, "Learning Word Importance with the Neural Bag-of-Words Model"
poojaruhal/RP-class-comment-classification
budsus/codeprep
A toolkit for pre-processing large source code corpora