leonardtang's Stars
josh-ashkinaze/plurals
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
BCHSI/philter-ucsf
Open source clinical text de-identification
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
voxel51/zcore
cvs-health/langfair
LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
microsoft/Trace
End-to-end Generative Optimization for AI Agents
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
madaan/minimal-text-diffusion
A minimal implementation of diffusion models for text generation
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs.
spcl/graph-of-thoughts
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
amudide/switch_sae
Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)
google/oss-fuzz-gen
LLM powered fuzzing via OSS-Fuzz.
zeno-ml/zeno-build
Build, evaluate, understand, and fix LLM-based apps
Mihaiii/backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
sam-paech/antislop-sampler
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
holistic-ai/holisticai
This is an open-source tool to assess and improve the trustworthiness of AI systems.
honeyhiveai/realign
Realign is a testing and simulation framework for AI applications.
ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
Aleph-Alpha/scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for training large language models.
Libr-AI/do-not-answer
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
karpathy/llm.c
LLM training in simple, raw C/CUDA
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
aiverify-foundation/moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
Repello-AI/whistleblower
Whistleblower is a tool for leaking system prompts and capability discovery of any API accessible LLM App. Built for developers, security red-teams and folks who want to know what's going on inside the LLM App they use daily
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
black-forest-labs/flux
Official inference repo for FLUX.1 models