chkla's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
adityatelange/hugo-PaperMod
A fast, clean, responsive Hugo theme.
arcee-ai/mergekit
Tools for merging pretrained large language models.
github/scripts-to-rule-them-all
Set of boilerplate scripts describing the normalized script pattern that GitHub uses in its projects.
jaymody/picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
imaurer/awesome-llm-json
Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
leondz/garak
LLM vulnerability scanner
topoteretes/cognee
Deterministic LLMs Outputs for AI Applications and AI Agents
srush/annotated-mamba
Annotated version of the Mamba paper
danieldeutsch/sacrerouge
SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.
IBM/fastfit
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
IBM/unitxt
🦄 Unitxt: a python library for getting data fired up and set for training and evaluation
bloomberg/dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
epfml/DenseFormer
vinid/NegotiationArena
MoritzLaurer/synthetic-data-blog
This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data
UChicago-pol-methods/plsc-40601-CI-ML
Advanced Topics in Causal Inference PLSC 40601
gesiscss/WebBot
Browser extension to simulate browsing behaviour in search engines.
ZHZisZZ/emulated-disalignment
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
flairNLP/CleanCoNLL
The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
yuzhaouoe/pretraining-data-packing
phenixace/narrative-framing
FelixBenning/pyrfd
Pytorch implementation of RFD
SocialScienceDataLab/models-all-the-way
'Models all the way down' by Asya Magazinnik (Hertie School)
EADMSummerSchool2024/EADMSummerSchool2024.github.io
EADM Summer School
mrwunderbar666/ner_tool_comparison
Systematically comparing NER Tools
SocialScienceDataLab/group-inequalities-decomposition
Towards more life-course-sensitive decompositions of group-inequalities: Two approaches applied to the Gender Pension Gap by Carla Rowold (Oxford & MPIDR)