gereka
Math (Graph Theory) PhD @ Lehigh University. NLP postdoc @ Marmara University. Currently principal NLP research engineer @ Huawei Turkey R&D
Huawei Turkey R&D Center
gereka's Stars
thunlp/hyperbolic_llm
deedy5/duckduckgo_search
Search for words, documents, images, videos, news, and maps, and translate text, using the DuckDuckGo.com search engine. Also downloads files and images to a local hard drive.
huggingface/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
mrbesher/nanodecoder
A tiny decoder
yoheinakajima/prettygraph
An experimental UI for text-to-knowledge-graph generation
mims-harvard/PrimeKG
Precision Medicine Knowledge Graph (PrimeKG)
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
alon-albalak/data-selection-survey
A Survey on Data Selection for Language Models
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
karpathy/llm.c
LLM training in simple, raw C/CUDA
huggingface/text-clustering
Easily embed, cluster and semantically label text datasets
huggingface/cosmopedia
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
kennethleungty/Neural-Network-Architecture-Diagrams
Diagrams for visualizing neural network architecture (Created with diagrams.net)
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
ai-forever/mgpt
Multilingual Generative Pretrained Model
Triang-jyed-driung/RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, with code and related websites
google-deepmind/concordia
A library for generative social simulation
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
kesimeg/LORA-turkish-clip
Finetuning CLIP using LoRA for the Turkish language
allenai/papermage
A library supporting NLP and CV research on scientific papers
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for LLMs and ML models