vividfog's Stars
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
HazyResearch/manifest
Prompt programming with FMs.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
npuichigo/openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
arcee-ai/DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
ShawhinT/YouTube-Blog
Codes to complement YouTube videos and blog posts on Medium.
FullStackRetrieval-com/RetrievalTutorials
gkamradt/QuickAgent
ParisNeo/ollama_proxy_server
A proxy server for multiple ollama instances with Key security
BatsResearch/bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
bigcode-project/starcoder2
Home of StarCoder2!
soulteary/amazing-openai-api
Convert different model APIs into the OpenAI API format out of the box.
jcosta33/client-llm-vite
freuk/iter
🔁 Code iteration tool running on Groq
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
parthsarthi03/raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
pnuu/fmiopendata
Python interface for FMI open data
vividfog/nordpool-predict-fi
A Python app and a Random Forest ML model that predicts spot prices for the Nordpool FI market.
CharlesMod/quantizeHFmodel
Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.
michaelfeil/infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
google-deepmind/graphcast
open-webui/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
ex3ndr/llama-coder
Replace Copilot local AI
RussellCanfield/wingman-ai
An open source AI assistant VSCode extension. Works with Ollama, HuggingFace and OpenAI
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
protectai/rebuff
LLM Prompt Injection Detector