gemma
There are 98 repositories under the gemma topic.
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
mudler/LocalAI
🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many more model architectures. It can generate text, audio, video, and images, and also offers voice cloning.
GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
unslothai/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
darrenburns/elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
jakobhoeg/nextjs-ollama-llm-ui
Fully-featured, beautiful web interface for Ollama LLMs - built with NextJS. Deploy with a single click.
awaescher/OllamaSharp
Ollama API bindings for .NET
Beomi/InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
sozercan/aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
inferflow/inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
google/JetStream
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in the future; PRs welcome).
marklysze/LlamaIndex-RAG-WSL-CUDA
Examples of RAG using LlamaIndex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
Picovoice/picollm
On-device LLM Inference Powered by X-Bit Quantization
Upsonic/Upsonic
Self-Driven Autonomous Python Libraries
adithya-s-k/YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.
mlc-ai/web-llm-chat
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
tanyuqian/redco
NAACL '24 (Demo) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Beomi/Gemma-EasyLM
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
fly-apps/ollama-open-webui
Deploy your very own ChatGPT-Style Web Interface for Ollama 🦙
luo-anthony/DeveloperGPT
DeveloperGPT is an LLM-powered command-line tool that converts natural language into terminal commands and supports in-terminal chat.
Mobile-Artificial-Intelligence/maid_llm
maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
albertstarfield/project-zephyrine
Introducing Project Zephyrine: elevating your interaction with plug-and-play setup and GPU acceleration within a modernized Automata local graphical user interface.
google/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
LucknowAI/Lucknow-LLM
Collecting data for Building Lucknow's first LLM
GetStream/meeting-summary-ollama-gemma
Create an AI-powered meeting summary tool with Python, Ollama, and Gemma
bhattbhavesh91/google-gemma-finetuning-n2sql
Finetuning Google's Gemma Model for Translating Natural Language into SQL
groovybits/rsllm
Rust LLM Stream Analyzer and Content Generator
PavlidisLab/gemma.R
An R wrapper for the Gemma RESTful API
MaxMLang/RAG-nificent
RAG-nificent is a state-of-the-art framework leveraging Retrieval-Augmented Generation (RAG) to provide instant answers and references from a curated directory of PDFs containing information on any given topic. Supports Llama3 and OpenAI Models via the Groq API.
joydeb28/llm-lab
LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS
marklysze/LlamaIndex-RAG-Linux-CUDA
Examples of RAG using LlamaIndex with local LLMs in Linux - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B