gasoved

gasoved's Stars

sharkdp/bat
A cat(1) clone with wings.
Language:Rust50.2k 207 1.4k1.3k
ultralytics/ultralytics
Ultralytics YOLO11 🚀
Language:Python34.3k 187 9.9k6.6k
microsoft/markitdown
Python tool for converting files and office documents to Markdown.
Language:Python26.2k 76 811k
ajeetdsouza/zoxide
A smarter cd command. Supports all major shells.
Language:Rust23.5k 48 599564
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Language:Python17.5k 167 1751.2k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.1k 134 1.1k1.4k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.9k 95 2.1k1k
vi/websocat
Command-line client for WebSockets, like netcat (or curl) for ws:// with advanced socat-like functions
Language:Rust7.2k 69 235280
gcanti/io-ts
Runtime type system for IO decoding/encoding
Language:TypeScript6.7k 54 441327
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.5k 73 1k800
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Language:Python6.5k 48 71481
django-oscar/django-oscar
Domain-driven e-commerce for Django
Language:Python6.3k 270 1.5k2.2k
pawelsalawa/sqlitestudio
A free, open source, multi-platform SQLite database manager.
Language:C5.5k 104 4.5k595
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language:Python5.1k 546 66736
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Language:Python4.8k 34 134264
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
Language:Python4k 71 423582
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language:Python2.7k 29 51185
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language:Python2.4k 25 87209
PintaProject/Pinta
Simple GTK# Paint Program
Language:C#1.9k 68 125278
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Language:Python1.7k 94 53200
Standard-Intelligence/hertz-dev
first base model for full-duplex conversational audio
Language:Python1.7k 19 26107
huggingface/smollm
Everything about the SmolLM & SmolLM2 family of models
Language:Python1.5k 17 471
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.3k 33 8791
edwko/OuteTTS
Interface for OuteTTS models.
Language:Python783 20 4161
lhl/voicechat2
Local SRT/LLM/TTS Voicechat
Language:Python572 7 1661
mozilla/mozilla-django-oidc
A django OpenID Connect library
Language:Python460 21 238170
huggingface/meshgen
A blender addon for generating meshes with AI
Language:Python379 8 622
digidem/leaflet-side-by-side
A Leaflet control to add a split screen to compare two map overlays
Language:JavaScript364 18 42111
ScalingIntelligence/Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
Language:Python149 2 09
eole-nlp/eole
Open language modeling toolkit based on PyTorch
Language:Python66 7 5012