iNeil77

ELLIS PhD Candidate @UKPLab

Frankfurt, Germany

iNeil77's Stars

ibm-granite/dolomite-engine
A highly efficient library for large scale distributed training
Language:Python3213
bigcode-project/bigcodebench
BigCodeBench: The Next Generation of HumanEval
Language:Python13311
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python55133
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python46838
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python1.8k172
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
65392
nomic-ai/contrastors
Train Models Contrastively in Pytorch
Language:Python47836
bitextor/neural-document-aligner
Document aligner which uses neural technologies to search matches across bilingual documents
Language:Python72
UKPLab/acl2024-ircoder
Data creation, training and eval scripts for the IRCoder paper
Language:Python7
davidfraser/pyan
pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.
Language:Python616122
iNeil77/vllm-code-harness
Run code inference-only benchmarks quickly using vLLM
Language:Python7
BK-SCOSS/sctokenizer
A Source Code Tokenizer
Language:Python134
ASE-REEF/REEF-data
12
s2e-lab/SecurityEval
Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" published in MSR4P&S'22.
Language:Python5312
FormAI-Dataset/FormAI-dataset
17
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.2k93
theblackcat102/evol-dataset
evol augment any dataset online
Language:Python557
codefuse-ai/Awesome-Code-LLM
A curated list of language modeling research for code and related datasets.
1.1k78
NL2Code/NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
Language:HTML3411
tobymao/sqlglot
Python SQL Parser and Transpiler
Language:Python6.1k610
FSoft-AI4Code/CodeText-parser
⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file
Language:Python355
stas00/ml-engineering
Machine Learning Engineering Open Book
Language:Python10.3k616
mbasso/awesome-wasm
😎 Curated list of awesome things regarding WebAssembly (wasm) ecosystem.
8.7k497
Toloka/crowd-kit
Control the quality of your labeled data with the Python tools you already know.
Language:Python20515
dmarx/anthology-of-modern-ml
Collection of important articles to be treated as a textbook
Language:Jupyter Notebook52328
TQRG/security-patches-dataset
☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV); tools to automatically update the data are provided.
Language:Jupyter Notebook7925
Jur1cek/gcj-dataset
Collected solutions from Google Code Jam programming competition (2008-2020).
589
arpitbbhayani/system-design-questions
Problem statements on System Design and Software Architecture as part of Arpit's System Design Masterclass
Language:Python1.8k395
AgileRL/AgileRL
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.
Language:Python55341
ManifoldRG/NEKO_Archive
The NEKO Project is an open source effort to build a model of equivalent scale and capability as that reported in DeepMind’s 2022 paper, A Generalist Agent.
101
