virgile-blg's Stars
wagoodman/dive
A tool for exploring each layer in a docker image
lllyasviel/Fooocus
Focus on prompting and generating
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
huggingface/candle
Minimalist ML framework for Rust
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Vaibhavs10/insanely-fast-whisper
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Audio-AGI/AudioSep
Official implementation of "Separate Anything You Describe"
declare-lab/tango
A family of diffusion models for text-to-audio generation.
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
jatinchowdhury18/RTNeural
Real-time neural network inferencing
ZFTurbo/Music-Source-Separation-Training
Repository for training models for music source separation.
zhvng/open-musiclm
Implementation of MusicLM, a text-to-music model published by Google Research, with a few modifications.
mir-aidj/all-in-one
All-In-One Music Structure Analyzer
QosmoInc/neutone_sdk
Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp
PINTO0309/tflite2tensorflow
Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite, ONNX, OpenVINO, Myriad Inference Engine blob and .pb from .tflite. Support for building environments with Docker. It is possible to directly access the host PC GUI and the camera to verify the operation. NVIDIA GPU (dGPU) support. Intel iHD GPU (iGPU) support. Supports inverse quantization of INT8 quantization model.
acids-ircam/flow_synthesizer
Universal audio synthesizer control learning with normalizing flows
SonyCSLParis/music-inpainting-ts
A collection of web interfaces for AI-assisted interactive music creation
Harmonai-org/oobleck
Open SoundStream-ish VAE codecs for downstream neural audio synthesis
Torsion-Audio/nn-inference-template
Neural network inference template for real-time critical audio environments - presented at ADC23
sony/hFT-Transformer
PyTorch implementation of an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
RickyDane/rdpFX
A simple, fast file explorer in rust-tauri
egrinstein/roomfuser
Acoustic impulse response generation using diffusion models
MWM-io/nansypp
Unofficial implementation of NANSY++ in PyTorch Lightning
ctrotz/stylizing-video
Stylizing Video by Example (Jamriska et al., 2019)