aimerou's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
python-poetry/poetry
Python packaging and dependency management made easy
recommenders-team/recommenders
Best Practices on Recommendation Systems
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
lyst/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
keithito/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
google-research/url-nlp
google-research-datasets/cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
microsoft/fadtk
A simple library for Fréchet Audio Distance (FAD) calculation
ylacombe/finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
roudimit/whisper-flamingo
[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
GSNCodes/Image-Classification-Streamlit-TensorFlow
A basic web-app for image classification using Streamlit and Tensorflow
gauthelo/kallaama-speech-dataset
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
intron-innovation/AfriSpeech-TTS
African accented clinical and general domain TTS
Niger-Volta-LTI/yoruba-voice-speech-recorder
App for recording speech utterances dictated from text prompts. Speaker name, audio-recording path & prompt text are saved to a metadata file. Use it for building speech recognition and speech synthesis corpora
pbogden/framework-map
How to put a satellite image on MapLibre in Observable Framework