gemma

There are 168 repositories under the gemma topic.

  • ollama/ollama

    Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

    Language:Go104k6055.3k8.3k
  • mudler/LocalAI

    🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

    Language:Go27.3k1929042k
  • unslothai/unsloth

    Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

    Language:Python19.7k1341.2k1.4k
  • GaiZhenbiao/ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

    Language:Python15.3k847982.3k
  • yangjianxin1/Firefly

    Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

    Language:Python6k55281533
  • LostRuins/koboldcpp

    Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

    Language:C++5.9k68848378
  • xorbitsai/inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

    Language:Python5.8k421.5k477
  • google/gemma_pytorch

    The official PyTorch implementation of Google's Gemma models

    Language:Python5.3k3941514
  • darrenburns/elia

    A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

    Language:Python1.9k1143121
  • google/generative-ai-docs

    Documentation for Google's Gen AI site - including the Gemini API and Gemma

    Language:Jupyter Notebook1.8k63121639
  • google-gemini/gemma-cookbook

    A collection of guides and examples for the Gemma open models from Google.

    Language:Jupyter Notebook9002615158
  • jakobhoeg/nextjs-ollama-llm-ui

    Fully-featured, beautiful web interface for Ollama LLMs - built with NextJS. Deploy with a single click.

    Language:TypeScript8471343205
  • papersgpt/papersgpt-for-zotero

    Zotero chat PDF with GPT, ChatGPT, Claude, Gemini

    Language:JavaScript66015
  • magpie-align/magpie

    Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

    Language:Python53753157
  • mlc-ai/web-llm-chat

    Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

    Language:TypeScript53772370
  • sozercan/aikit

    🏗️ Fine-tune, build, and deploy open-source LLMs easily!

    Language:Go40784131
  • Beomi/InfiniTransformer

    Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

    Language:Python35072531
  • InternLM/InternEvo

    InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

    Language:Python321108655
  • AI-Hypercomputer/JetStream

    JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

    Language:Python258192732
  • inferflow/inferflow

    Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

    Language:C++23681724
  • Picovoice/picollm

    On-device LLM Inference Powered by X-Bit Quantization

    Language:Python1999118
  • KudoAI/googlegpt

    🤖 Adds AI to Google Search. Ask from any site. Powered by Google Gemma + GPT-4o!

    Language:JavaScript14721117
  • jorge-armando-navarro-flores/chat_with_your_docs

    Discover and converse with advanced AI models like Mistral, LLAMA2, and GPT-3.5 from leading sources like OLLAMA, Hugging Face, and OpenAI. Easily extract insights from PDFs, web pages, and YouTube videos with our intuitive interface. Unlock the power of knowledge with seamless chat interactions.

    Language:Python1424213
  • marklysze/LlamaIndex-RAG-WSL-CUDA

    Examples of RAG using LlamaIndex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B

    Language:Jupyter Notebook1212213
  • Upsonic/Client

    Self-Driven Autonomous Python Libraries

    Language:Python96235
  • adithya-s-k/YoloGemma

    Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.

    Language:Python78326
  • fly-apps/ollama-open-webui

    Self-host a ChatGPT-style web interface for Ollama 🦙

    Language:Shell688122
  • tanyuqian/redco

    NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

    Language:Python63317
  • johnbean393/Sidekick

    A native macOS app that lets you chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software.

    Language:Swift61418
  • Mobile-Artificial-Intelligence/maid_llm

    maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (maid)

    Language:Dart5551013
  • Beomi/Gemma-EasyLM

    Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

    Language:Python463111
  • QuantiusBenignus/BlahST

    Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline. Speak with local LLMs.

    Language:Shell46454
  • AI-Hypercomputer/jetstream-pytorch

    PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

    Language:Python42101416
  • luo-anthony/DeveloperGPT

    DeveloperGPT is an LLM-powered command line tool that converts natural language to terminal commands and provides in-terminal chat.

    Language:Python39285
  • ariya/gamal

    Research tool leveraging LLM for answers

    Language:JavaScript28102