atoultaro
Yu Shiu. AI Data Scientist, Machine Learning Engineer, Semi-pro wildlife conservationist.
Stamford, CT
atoultaro's Stars
ollama/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Vaibhavs10/insanely-fast-whisper
martinblech/xmltodict
Python module that makes working with XML feel like you are working with JSON
docker/genai-stack
Langchain + Docker + Neo4j + Ollama
iusztinpaul/hands-on-llms
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
magenta/ddsp
DDSP: Differentiable Digital Signal Processing
openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
cuthbertLab/music21
music21 is a Toolkit for Computational Musicology
Music-and-Culture-Technology-Lab/omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
vishnubob/python-midi
Python MIDI library
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
magenta/ddsp-vst
Realtime DDSP Neural Synthesizer and Effect
automl/NASLib
NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several state-of-the-art NAS search spaces and optimizers.
ytdl-org/ytdl-nightly
Nightly builds for youtube-dl.
VincentGranville/Large-Language-Models
Large language Models (LLM)
google/speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
linux08/machine-learning-books
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
axeldelafosse/stemgen
🎛 Stemgen is a Stem file generator. Convert any track into a Stem and have fun with Traktor.
magenta/note-seq
A serializable note sequence representation and utilities.
AIcrowd/music-demixing-challenge-starter-kit
Starter kit for getting started in the Music Demixing Challenge.
adefossez/mdx21_demucs
Reproduction repository for the MDX 2021 Hybrid Demucs model
vis-nlp/UniChart
atmos-python/atmos
An atmospheric sciences library for Python
nrkno/wave-bwf-rf64
Extension of Pythons Wave-library to support BWF and RF64
tamiminaser/llm-single-gpu
Training and Working with LLMs on a Single GPU
franzcrs/openpose-with-caffe-for-MacM1
Successfully build openpose and caffe in Mac M1 using python 3.9