andimarafioti

Machine Learning Research Engineer at Hugging Face.

Hugging FaceBern, Switzerland

Pinned Repositories

AIAMI
Automatic Identification of Acoustical Musical Instruments
Language:Python4 2 14
audioContextEncoder
A context encoder for audio inpainting
Language:Jupyter Notebook25 8 172
florence2-finetuning
Quick exploration into fine tuning florence 2
Language:Jupyter Notebook282 4 2426
GACELA
Generative adversarial context encoder for audio inpainting
Language:Jupyter Notebook25 5 33
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python43
phaseRetrievalEvaluation
Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters
Language:Jupyter Notebook9 4 02
speech-to-speech-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
Language:Python4 0 01
tifresi
STFT transforms suitable for use with PGHI (phase gradient heap integration)
Language:Python13 4 21
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.6k 45 92389
stftGAN
TiFGAN: Time Frequency Generative Adversarial Networks
Language:Jupyter Notebook115 7 513

andimarafioti's Repositories

andimarafioti/florence2-finetuning
Quick exploration into fine tuning florence 2
Language:Jupyter Notebook282 4 2426
andimarafioti/audioContextEncoder
A context encoder for audio inpainting
Language:Jupyter Notebook25 8 172
andimarafioti/GACELA
Generative adversarial context encoder for audio inpainting
Language:Jupyter Notebook25 5 33
andimarafioti/tifresi
STFT transforms suitable for use with PGHI (phase gradient heap integration)
Language:Python13 4 21
andimarafioti/phaseRetrievalEvaluation
Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters
Language:Jupyter Notebook9 4 02
andimarafioti/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python43
andimarafioti/speech-to-speech-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
Language:Python4 0 01
andimarafioti/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python24
andimarafioti/audioLIME
audioLIME: Listenable Explanations Using Source Separation
Language:Python1 1 00
andimarafioti/hpc-docs
Guides, tutorials and documentation about the central HPC resources
Language:Shell1 1 00
andimarafioti/inflated_convnets_pytorch
Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer
Language:Python1 1 00
andimarafioti/lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
Language:Python1
andimarafioti/sms-tools
Sound analysis/synthesis tools for music applications
Language:Python1 2 01
andimarafioti/AIChallengeOEAW2019
Tools and helpers for the challenge competition that will be taking place as part of the OEAW's AI summer school 2019.
Language:Jupyter Notebook1 0
andimarafioti/andimarafioti
1 0
andimarafioti/ConwaysGameOfLife
Language:Python2 0
andimarafioti/gantools
A set of tools to deal with GANs
Language:Python1 0
andimarafioti/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
andimarafioti/LSTMsForAudioInpainting
Language:Jupyter Notebook2 0
andimarafioti/LSTMsOnSpectrograms
Language:Jupyter Notebook3 0
andimarafioti/MetaCLIP
Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.
Language:Python0 0
andimarafioti/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Language:Python1
andimarafioti/sagan-models
Language:Python2 0
andimarafioti/scripts
Some scripts for easy sharing.
Language:Python1 0
andimarafioti/Self-Attention-GAN
Pytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)
Language:Python1 0
andimarafioti/smol-tools
Language:Python
andimarafioti/tifgan.github.io
Website
Language:CSS2 0
andimarafioti/UPD
[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
andimarafioti/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
andimarafioti/wavegan
WaveGAN: using GANs to synthesize raw audio
Language:Python3 0