LittleFlyingSheep's Stars
Phuriches/GenRepASD
PyTorch implementation of Deep Generic Representations for Domain-Generalized Anomalous Sound Detection: https://arxiv.org/abs/2409.05035
Kota-Dohi/dcase2022_evaluator
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
ZhuShaoQiang/PapersInTime
Records papers by chronological order and citation relationships.
Audio-AGI/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
adapter-hub/adapters
A Unified Library for Parameter-Efficient and Modular Transfer Learning
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
haoheliu/SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.
SarthakYadav/audio-mamba-official
Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"
haidog-yaqub/DPMTSE
A Diffusion Probabilistic Model for Target Sound Extraction
frankenliu/LOAE
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
jaeyeonkim99/EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
Labbeti/dcase2024-task6-baseline
DCASE2024 Challenge Task 6 baseline system (Automated Audio Captioning)
Labbeti/conette-audio-captioning
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
boschresearch/acoustic-traffic-simulation-counting
Baseline code for DCASE 2024 Task 10 and the ICASSP 2024 paper
state-spaces/mamba
Mamba SSM architecture
lisiyao21/Bailando
Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
OptimusPrimus/dcase2023_task6b
CP-JKU's Task 6b submission to DCASE 2023
karolpiczak/ESC-50
ESC-50: Dataset for Environmental Sound Classification
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
microsoft/CLAP
Learning audio concepts from natural language supervision
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
LittleFlyingSheep/P-LocalAFT
This project corresponds to the paper "Local Information Assisted Attention-free Decoder for Audio Captioning", published in IEEE Signal Processing Letters.
thuhcsi/LightGrad
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
XinhaoMei/WavCaps
This repository contains metadata for the WavCaps dataset and code for downstream tasks.