gpt-oss

There are 60 repositories under gpt-oss topic.

  • ollama/ollama

    Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

    Language:Go152k8578.2k13.1k
  • unsloth

    unslothai/unsloth

    Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

    Language:Python45.5k2622.5k3.7k
  • sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Language:Python17.9k1173.2k2.9k
  • oumi-ai/oumi

    Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

    Language:Python8.5k6279644
  • InternLM/xtuner

    A Next-Generation Training Engine Built for Ultra-Large MoE Models

    Language:Python4.8k38577364
  • taishi-i/awesome-ChatGPT-repositories

    A curated list of resources dedicated to open source GitHub repositories related to ChatGPT and OpenAI API

  • yichuan-w/LEANN

    RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

    Language:Python2.6k2441243
  • papersgpt/papersgpt-for-zotero

    A powerful Zotero AI and MCP plugin with ChatGPT, Gemini, Claude, Grok, DeepSeek, OpenRouter, Kimi, GLM, SiliconFlow, GPT-oss, Gemma 3, Qwen 3

    Language:JavaScript1.9k136758
  • aws-samples/easy-model-deployer

    Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

    Language:Python7212417
  • milisp/codexia

    A powerful GUI and Toolkit for Codex CLI - run secure background agents. FileTree + note system, and more

    Language:TypeScript48188
  • Everywhere

    DearVa/Everywhere

    A context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.

    Language:C#45448
  • sgl-project/awesome-sglang

    Make SGLang go brrr

  • kdeps

    kdeps/kdeps

    All-in-one offline-ready AI framework for building Dockerized full-stack applications with declarative PKL configuration, featuring integrated open-source LLMs for AI-powered APIs and workflows

    Language:Go27163
  • local-ai-zone/local-ai-zone.github.io

    Discover the Best AI Models for Your PC

    Language:HTML12
  • GGUFloader/gguf-loader

    Run ChatGPT OSS, Groke 2 Locally — GGUF Loader for 120B/20B Models | Open Source & Offline

    Language:Python11057
  • AmanPriyanshu/GPT-OSS-MoE-ExpertFingerprinting

    ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture

    Language:HTML10001
  • danny50610/bpe-tokeniser

    PHP port for openai/tiktoken (most)

    Language:PHP9100
  • milisp/plux

    💡AI finder/explorer. One click @files via a visual filetree and save insights in a notepad. build with Tauri

    Language:TypeScript9002
  • adarshM84/OpenTalkGpt

    A Chrome extension hosts an Ollama UI web server on localhost and other servers, helping you manage models and chat with any open-source model. 🚀💻✨

    Language:HTML8100
  • Perpetue237/agentsculptor

    agentsculptor is an experimental AI-powered development agent designed to analyze, refactor, and extend Python projects automatically. It uses an OpenAI-like planner–executor loop on top of a vLLM backend, combining project context analysis, structured tool calls, and iterative refinement. It has only been tested with gpt-oss-120b via vLLM.

    Language:Python610
  • KaartikDev/beacon-offline-agent

    Offline AI for emergencies with local context and cited answers from community Knowledge Packs.

    Language:Jupyter Notebook400
  • zacharytamas/harmony-ts

    JavaScript/TypeScript bindings for wasm of OpenAI's Harmony format parser

    Language:JavaScript400
  • elizabethsiegle/chat-w-taylor-on-newheights-and-travis-gq-autorag-openaioss

    Chat w/ the Taylor Swift New Heights podcast (and Travis Kelce's GQ article)

    Language:TypeScript3001
  • fcn06/swarm

    A Multi Agent Systems Framework, written in Rust. Domain Agents, specialists, can use tools. Workflow Agents can load or define a workflow and monitor execution. LLM as a Judge is used for evaluation. Discovery Service and Memory Service empower agent interactions.

    Language:Rust3011
  • org

    tjamescouch/org

    Your AI Development Team

    Language:TypeScript3131
  • everlastconsulting/gpt-oss-local-voice-agent-demo

    Demo-Repository (aus YouTube-Video) für einen lokalen Open-Source Sprachassistenten mit gpt-oss, Whisper & XTTS. Bereitstellung zur Inspiration, Weiterverwendung und Erweiterung gedacht.

    Language:JavaScript2001
  • Josephrp/SmolFactory

    finetune gpt-oss and smollm3 on your data easily and cheaply

    Language:Python2041
  • lalomorales22/oss-at-night

    Batch processing for overnight tasks with gpt-oss 20b

    Language:Python200
  • milisp/awesome-codex-cli

    A curated list of awesome resources, tools, and tutorials for OpenAI Codex CLI

  • milisp/awesome-gpt-oss

    A curated list of awesome GPT-OSS resources, tools, tutorials, and projects

  • nguyennampfiev/MCP-demo

    MCP with GPT-OSS

    Language:Python200
  • supermarsx/codex-cli-linker

    Helper script to enable using codex-cli with LM Studio, Ollama and other LLMs

    Language:Python200
  • abijeetraut1/ModelCube

    ModelCube is a comprehensive platform for exploring, managing, and optimizing AI models. It allows users to search and discover AI models, tune configurations for better performance, download and manage models for local use, and monitor their status and metrics.

    Language:TypeScript100
  • Alexyskoutnev/gpt-oss-tutorial

    Your fast track to GPT-OSS. Spin up the models, run inference, and build agentic workflows — all from clear, reproducible notebooks.

    Language:Jupyter Notebook100
  • sderosiaux/bifrost-ai

    🌈 Local LLM chat with conversation branching, mood-reactive UI, and time travel. Run GPT-OSS 20B locally with chain-of-thought reasoning.

    Language:TypeScript100
  • zer0int/GPT-OSS-20B-Windows-16GB-RTX4090

    No Hopper architecture (RTX 5090, etc.) required! <16 GB VRAM, Windows.

    Language:Python100