cvillela's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
pgvector/pgvector
Open-source vector similarity search for Postgres
dottxt-ai/outlines
Structured Text Generation
instructor-ai/instructor
Structured outputs for LLMs
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
Audio-AGI/AudioSep
Official implementation of "Separate Anything You Describe"
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
WenjieDu/PyPOTS
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values
Harmonai-org/sample-generator
Tools to train a generative model on arbitrary audio samples
kahst/BirdNET-Analyzer
BirdNET analyzer for scientific audio data processing.
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
marl/openl3
OpenL3: Open-source deep audio and image embeddings
RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
yizhilll/MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
JoinMusic/fish
YouTube video to chords, lyrics, beat and melody.
XinhaoMei/WavCaps
This repository contains metadata for the WavCaps dataset and code for downstream tasks.
Zeying-Gong/PatchMixer
Code release for "PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting"
SonyCSLParis/music2latent
Encode and decode audio samples to/from compressed latent representations!
Navidfoumani/ConvTran
This is a PyTorch implementation of ConvTran
BorgwardtLab/Set_Functions_for_Time_Series
Repository of the ICML 2020 paper "Set Functions for Time Series"
cwx-worst-one/EAT
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
mlds-lab/interp-net
Code for "Interpolation-Prediction Networks for Irregularly Sampled Time Series", ICLR 2019.
jaeyeonkim99/EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
NeuralNotW0rk/LoRAW
Flexible LoRA Implementation to use with stable-audio-tools
MCR-PEFT/C-MCR
maswang32/hearinganythinganywhere
Hearing Anything Anywhere Code Release
prompteus/audio-captioning
Audio captioning - DCASE challenge 2023 task 6a
Labbeti/conette-audio-captioning
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding