opencvbaby

Pinned Repositories

asteroid
The PyTorch-based audio source separation toolkit for researchers || Current highlight : we got our WHAMR results check it out here !
Language:Python0 0 00
audfprint
Landmark-based audio fingerprinting
Language:Python0 1 00
audioset_tagging_cnn
Language:Python00
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
0 0 00
awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
0 0 00
berkeley-stat-157
Homepage for STAT 157 at UC Berkeley
Language:Jupyter Notebook0 0 00
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python0 0 00
chromaprint
C library for generating audio fingerprints used by AcoustID
Language:C++0 0 00
CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
Language:Python0 1 00
NumpyDL
Deep Learning Library. For education. Based on pure Numpy. Support CNN, RNN, LSTM, GRU etc.
Language:Python1 2 00

opencvbaby's Repositories

opencvbaby/audioset_tagging_cnn
Language:Python00
opencvbaby/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
0 0 00
opencvbaby/awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
0 0 00
opencvbaby/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python0 0 00
opencvbaby/HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets
opencvbaby/interview
📚 C/C++ 技术面试基础知识总结，包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
Language:C++1 0
opencvbaby/LatentSync
Taming Stable Diffusion for Lip Sync!
opencvbaby/Lichee
一个多模态内容理解算法框架，其中包含数据处理、预训练模型、常见模型以及模型加速等模块。
Language:Python0 0
opencvbaby/lightly
A python library for self-supervised learning on images.
Language:Python0 0
opencvbaby/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
opencvbaby/lipsync
Language:JavaScript0 0
opencvbaby/lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
Language:Python0 0
opencvbaby/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏好优化)。
Language:Python0 0
opencvbaby/neural-audio-fp
Language:Python0 0
opencvbaby/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
opencvbaby/python-audio-separator
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
opencvbaby/pytorchvideo
A deep learning library for video understanding research.
Language:Python0 0
opencvbaby/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Language:Python0 0
opencvbaby/simclr
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
Language:Jupyter Notebook0 0
opencvbaby/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python0 0
opencvbaby/syncnet_python
Out of time: automated lip sync in the wild
Language:Python1 0
opencvbaby/TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
opencvbaby/the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
0 0
opencvbaby/TransGPT
Language:Python0 0
opencvbaby/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python0 0
opencvbaby/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python0 0
opencvbaby/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python0 0
opencvbaby/Vicuna-LoRA-RLHF-PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
Language:Python0 0
opencvbaby/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Language:Python1 0
opencvbaby/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C0 0