teamclouday's Stars
ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
ggerganov/llama.cpp
LLM inference in C/C++
zed-industries/zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
zsh-users/zsh-autosuggestions
Fish-like autosuggestions for zsh
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
react-dnd/react-dnd
Drag and Drop for React
zsh-users/zsh-syntax-highlighting
Fish shell like syntax highlighting for Zsh.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
clap-rs/clap
A full featured, fast Command Line Argument Parser for Rust
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
huggingface/chat-ui
Open source codebase powering the HuggingChat app
LargeWorldModel/LWM
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
arcee-ai/mergekit
Tools for merging pretrained large language models.
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
pynamodb/PynamoDB
A pythonic interface to Amazon's DynamoDB
langroid/langroid
Harness LLMs with Multi-Agent Programming
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
maelfabien/Multimodal-Emotion-Recognition
A real time Multimodal Emotion Recognition web app for text, sound and video inputs
jonghwanhyeon/python-ffmpeg
A python binding for FFmpeg which provides sync and async APIs
Wordcab/wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
yagil/tokmon
CLI to monitor your program's OpenAI API token usage.