ElisaCastelli
MSc graduate in Music and Acoustic Engineering | BSc graduate in Computer Science and Engineering at Politecnico di Milano
Politecnico di Milano, Como
ElisaCastelli's Stars
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
gcui-art/suno-api
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
freds0/data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
khoj-ai/khoj
Your AI second brain, open and self-hostable. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g., gpt4) or private, local LLMs (e.g., llama3).
kinggongzilla/DCASE2023_Task2
nttcslab/dcase2023_task2_baseline_ae
Audio-AGI/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
felixgontier/dcase-2023-baseline
craffel/mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
meta-llama/llama3
The official Meta Llama 3 GitHub site
theMoro/DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
mulab-mir/song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
RetroCirce/MusicLDM
The latent diffusion model for text-to-music generation.
mit-han-lab/mcunet
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
GauravSingh9356/J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
musikalkemist/generativemusicaicourse
Resources for the Generative Music AI Course on The Sound of AI YouTube channel.
walkerkq/musiclyrics
https://www.kaylinpavlik.com/50-years-of-pop-music/
aggittle/Genre-Classification-with-Spotify-API
Classifying song genres with audio features from the Spotify API to explore the possibilities of content-based music recommendations.
dmgutierrez/hitmusicnet
An end-to-end architecture for Music Popularity Prediction
fyang93/diffusion
Efficient Diffusion for Image Retrieval
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
brycedrennan/imaginAIry
Pythonic AI generation of images and videos
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch