ElisaCastelli

MSc graduate in Music and Acoustic Engineering | BSc graduate in Computer Science and Engineering at Politecnico di Milano

Politecnico di Milano · Como

ElisaCastelli's Stars

nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB709149
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language: Python · 5k · 334
gcui-art/suno-api
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
Language: TypeScript · 1.3k · 288
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language:Python92787
freds0/data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
Language:Python285
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
1.5k · 134
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language: Python · 12.1k · 849
khoj-ai/khoj
Your AI second brain, open and self-hostable. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3).
Language: Python · 12.7k · 648
kinggongzilla/DCASE2023_Task2
Language:Python173
nttcslab/dcase2023_task2_baseline_ae
Language:Python5315
Audio-AGI/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
Language:Python221
felixgontier/dcase-2023-baseline
Language:Python136
craffel/mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
Language:Python604112
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · 26.4k · 3k
theMoro/DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
Language:Python10
mulab-mir/song-describer-dataset
The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.
Language:Jupyter Notebook1355
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Language:Python51458
RetroCirce/MusicLDM
The latent diffusion model for text-to-music generation.
Language:Python1533
mit-han-lab/mcunet
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning
Language:Python46082
GauravSingh9356/J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
Language:Python839200
musikalkemist/generativemusicaicourse
Resources for the Generative Music AI Course on The Sound of AI YouTube channel.
Language:Jupyter Notebook14212
walkerkq/musiclyrics
https://www.kaylinpavlik.com/50-years-of-pop-music/
Language:R10941
aggittle/Genre-Classification-with-Spotify-API
Classifying song genres with audio features from the Spotify API to explore the possibilities of content-based music recommendations.
Language:Jupyter Notebook6
dmgutierrez/hitmusicnet
An end-to-end architecture for Music Popularity Prediction
Language:Python73
fyang93/diffusion
Efficient Diffusion for Image Retrieval
Language:Python22136
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Language: Python · 8k · 758
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language: Python · 11.1k · 1.1k
brycedrennan/imaginAIry
Pythonic AI generation of images and videos
Language: Python · 7.9k · 437
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language: Python · 25.4k · 5.3k
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language: Python · 8k · 1k
