Pinned Repositories
ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
art-langgraph
best-hn
deductive-reasoning
Train your own SOTA deductive reasoning model
email-deep-research
open_deep_research_training
Training setup for LangChain's Open Deep Research
OpenPipe
Turn expensive prompts into cheap fine-tuned models
pii-redaction
Detect and redact PII locally with SOTA performance
rl-experiments
OpenPipe Reinforcement Learning Experiments
Summary-RL
Train an agent to generate high-quality summaries
OpenPipe's Repositories
OpenPipe/ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
OpenPipe/OpenPipe
Turn expensive prompts into cheap fine-tuned models
OpenPipe/deductive-reasoning
Train your own SOTA deductive reasoning model
OpenPipe/pii-redaction
Detect and redact PII locally with SOTA performance
OpenPipe/open_deep_research_training
Training setup for LangChain's Open Deep Research
OpenPipe/Summary-RL
Train an agent to generate high-quality summaries
OpenPipe/rl-experiments
OpenPipe Reinforcement Learning Experiments
OpenPipe/email-deep-research
OpenPipe/best-hn
OpenPipe/art-langgraph
OpenPipe/art-notebooks
Notebooks to demonstrate ART (Agent Reinforcement Trainer) in practice!
OpenPipe/step-one
This repo is only used for searching Reddit
OpenPipe/trpc-openapi
OpenAPI support for tRPC 🧩 - with streaming :)
OpenPipe/art-star-count
Display ART repository star count on a tablet
OpenPipe/tsoa
Build OpenAPI-compliant REST APIs using TypeScript and Node
OpenPipe/vllm-lora
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenPipe/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
OpenPipe/axolotl
Go ahead and axolotl questions
OpenPipe/mistral-client-js
JS Client library for Mistral AI platform
OpenPipe/openapi-typescript-codegen
Node.js library that generates TypeScript or JavaScript clients based on the OpenAPI specification
OpenPipe/trl
Train transformer language models with reinforcement learning.
OpenPipe/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenPipe/vllm-completions
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenPipe/ArcticInference
OpenPipe/S3LoRAResolver
OpenPipe/sglang
SGLang is a fast serving framework for large language models and vision language models.
OpenPipe/skypilot-catalog
OpenPipe/verl
verl: Volcano Engine Reinforcement Learning for LLMs