gvzdv's Stars
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
chihfanhsu/gaze_correction
Correcting gaze by warping-based convolutional neural network in live video communication
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
kyrolabs/awesome-agents
🤖 Awesome list of AI Agents
ggerganov/llama.cpp
LLM inference in C/C++
khoj-ai/khoj
Your AI second brain, open and self-hostable. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3).
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Vaibhavs10/optimise-my-whisper
ragapp/ragapp
The easiest way to use Agentic RAG in any enterprise
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
mistralai/mistral-finetune
emcf/thepipe
Extract clean markdown from PDFs, URLs, Word docs, slides, videos, and more, ready for any LLM. ⚡
jacoblee93/fully-local-pdf-chatbot
Yes, it's another chat over documents implementation... but this one is entirely local!
fastdatascience/faststylometry
Stylometry library for Burrows' Delta method
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
netease-youdao/QAnything
Question and Answer based on Anything.
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
stas00/ml-engineering
Machine Learning Engineering Open Book
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
LizhenWangT/StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
BenMusch/transit-guessr
GeoGuessr for Subway Systems
cbh123/narrator
David Attenborough narrates your life
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
isafulf/inbox_cleaner
A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4.
gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
gitmylo/audio-webui
A webui for different audio related Neural Networks
xinntao/facexlib
FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.