Pinned Repositories
AIAMI
Automatic Identification of Acoustical Musical Instruments
audioContextEncoder
A context encoder for audio inpainting
florence2-finetuning
Quick exploration into fine tuning florence 2
GACELA
Generative adversarial context encoder for audio inpainting
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
phaseRetrievalEvaluation
Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters
speech-to-speech-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
tifresi
STFT transforms suitable for use with PGHI (phase gradient heap integration)
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
stftGAN
TiFGAN: Time Frequency Generative Adversarial Networks
andimarafioti's Repositories
andimarafioti/florence2-finetuning
Quick exploration into fine tuning florence 2
andimarafioti/audioContextEncoder
A context encoder for audio inpainting
andimarafioti/GACELA
Generative adversarial context encoder for audio inpainting
andimarafioti/tifresi
STFT transforms suitable for use with PGHI (phase gradient heap integration)
andimarafioti/phaseRetrievalEvaluation
Time-Frequency Phase Retrieval for Audio --- The Effect of Transform Parameters
andimarafioti/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
andimarafioti/speech-to-speech-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
andimarafioti/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
andimarafioti/audioLIME
audioLIME: Listenable Explanations Using Source Separation
andimarafioti/hpc-docs
Guides, tutorials and documentation about the central HPC resources
andimarafioti/inflated_convnets_pytorch
Inflate DenseNet and ResNet as per I3D with ImageNet weight transfer
andimarafioti/lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
andimarafioti/sms-tools
Sound analysis/synthesis tools for music applications
andimarafioti/AIChallengeOEAW2019
Tools and helpers for the challenge competition that will be taking place as part of the OEAW's AI summer school 2019.
andimarafioti/andimarafioti
andimarafioti/ConwaysGameOfLife
andimarafioti/gantools
A set of tools to deal with GANs
andimarafioti/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
andimarafioti/LSTMsForAudioInpainting
andimarafioti/LSTMsOnSpectrograms
andimarafioti/MetaCLIP
Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.
andimarafioti/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
andimarafioti/sagan-models
andimarafioti/scripts
Some scripts for easy sharing.
andimarafioti/Self-Attention-GAN
Pytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)
andimarafioti/smol-tools
andimarafioti/tifgan.github.io
Website
andimarafioti/UPD
[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
andimarafioti/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
andimarafioti/wavegan
WaveGAN: using GANs to synthesize raw audio