zfarrell13's Stars
IliaZenkov/sklearn-audio-classification
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Harmonai-org/sample-generator
Tools to train a generative model on arbitrary audio samples
Harmonai-org/audio-diffusion-pytorch-fork
Audio generation using diffusion models, in PyTorch.
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
AIHawk-FOSS/Auto_Jobs_Applier_AI_Agent
Auto_Jobs_Applier_AI_Agent by AIHawk is an AI Agent that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
jordipons/musicnn
Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
MusicLang/maidi
Work with symbolic music gen AI easily, based on midi manipulation.
JinhuaLiang/WavCraft
Official repo for WavCraft, an AI agent for audio creation and editing
ideoforms/pylive
Query and control Ableton Live from Python
ideoforms/AbletonOSC
Control Ableton Live 11 via Open Sound Control (OSC)
gluon/AbletonLive11_MIDIRemoteScripts
Sh4yy/personal-ai
sergree/matchering
🎚️ Open Source Audio Matching and Mastering
rabbitscam/rabbitr1
meta-llama/llama3
The official Meta Llama 3 GitHub site
firmai/financial-machine-learning
A curated list of practical financial machine learning tools and applications.
DamRsn/NeuralNote
Audio Plugin for Audio to MIDI transcription using deep learning.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
jskripchuk/Synth1GAN
A GAN to generate preset banks for famous and free VST plugin Synth1
vgel/repeng
A library for making RepE control vectors
xai-org/grok-1
Grok open release
openai/grok