weiuniverse

Focus on Machine Learning, Deep Learning for Audio, Computer Vision

San jose

weiuniverse's Stars

hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Language:Python43.9k 271 6306.4k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
31.8k 2.8k 01.7k
mem0ai/mem0
The Memory layer for AI Agents
Language:Python24.7k 129 7292.3k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python20.2k 172 1971.4k
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python14.1k 127 4071.5k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python13.9k 145 7791.5k
goldmansachs/gs-quant
Python toolkit for quantitative finance
Language:Jupyter Notebook8.4k 164 311.1k
anthropics/anthropic-quickstarts
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Language:TypeScript7.7k 71 1381.3k
lllyasviel/Omost
Your image is almost there!
Language:Python7.5k 46 82428
andrewyng/translation-agent
Language:Python5.2k 54 18621
Ceelog/DictionaryByGPT4
一本 GPT4 生成的单词书📚，超过 8000 个单词分析，涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事
Language:HTML4.1k 31 30269
kimiyoung/transformer-xl
Language:Python3.6k 84 133762
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.3k 46 400491
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Language:Python1.3k 27 141103
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Language:Python996 12 10791
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Language:Python942 35 2842
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Language:Python895 11 70132
lucidrains/rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
Language:Python634 10 3249
liusongxiang/StarGAN-Voice-Conversion
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Language:Python518 21 2192
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language:Cuda505 10 6095
facebookresearch/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
Language:Python488 19 1678
schmiph2/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language:Python405 9 1288
winninghealth/WiNGPT2
WiNGPT是一个基于GPT的医疗垂直领域大模型，旨在将专业的医学知识、医疗信息、数据融会贯通，为医疗行业提供智能化的医疗问答、诊断支持和医学知识等信息服务，提高诊疗效率和医疗服务质量。
Language:Python363 5 2120
yluo42/TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Language:Python265 6 1555
chibui191/bitcoin_volatility_forecasting
GARCH and Multivariate LSTM forecasting models for Bitcoin realized volatility with potential applications in crypto options trading, hedging, portfolio management, and risk management
Language:Jupyter Notebook241 9 672
wenet-e2e/wesep
Target Speaker Extraction Toolkit
Language:Python144 6 1216
KunZhou9646/Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
Language:Python86 4 1211
wotouteng/fens.me
83 7 328
chentuochao/Target-Conversation-Extraction
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
Language:Python45 2 14
Alec-Wright/OpenAmp
Language:Python17 2 01

weiuniverse

weiuniverse's Stars

hacksider/Deep-Live-Cam

karpathy/LLM101n

mem0ai/mem0

black-forest-labs/flux

KwaiVGI/LivePortrait

m-bain/whisperX

goldmansachs/gs-quant

anthropics/anthropic-quickstarts

lllyasviel/Omost

andrewyng/translation-agent

Ceelog/DictionaryByGPT4

kimiyoung/transformer-xl

s3prl/s3prl

xdit-project/xDiT

asteroid-team/torch-audiomentations

lucidrains/transfusion-pytorch

yeyupiaoling/VoiceprintRecognition-Pytorch

lucidrains/rotary-embedding-torch

liusongxiang/StarGAN-Voice-Conversion

DavidDiazGuerra/gpuRIR

facebookresearch/libri-light

schmiph2/pysepm

winninghealth/WiNGPT2

yluo42/TAC

chibui191/bitcoin_volatility_forecasting

wenet-e2e/wesep

KunZhou9646/Emovox

wotouteng/fens.me

chentuochao/Target-Conversation-Extraction

Alec-Wright/OpenAmp