Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers. Current highlight: WHAMR results are now available.
audfprint
Landmark-based audio fingerprinting
audioset_tagging_cnn
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
berkeley-stat-157
Homepage for STAT 157 at UC Berkeley
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
chromaprint
C library for generating audio fingerprints used by AcoustID
CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
NumpyDL
A deep learning library for education, based on pure NumPy. Supports CNN, RNN, LSTM, GRU, etc.
opencvbaby's Repositories
opencvbaby/asteroid
The PyTorch-based audio source separation toolkit for researchers. Current highlight: WHAMR results are now available.
opencvbaby/audioset_tagging_cnn
opencvbaby/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
opencvbaby/awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
opencvbaby/berkeley-stat-157
Homepage for STAT 157 at UC Berkeley
opencvbaby/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
opencvbaby/chromaprint
C library for generating audio fingerprints used by AcoustID
opencvbaby/DesignPattern
A full implementation of the 23 design patterns in C++11, with pointer usage.
opencvbaby/interview
📚 A summary of basic knowledge for C/C++ technical interviews, aimed at job seekers and beginners. Covers the language, libraries, data structures, algorithms, systems, networking, and linking and loading of libraries, plus interview experience, recruitment, and referral information.
opencvbaby/Lichee
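A multimodal content understanding algorithm framework, with modules for data processing, pre-trained models, common models, and model acceleration.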
opencvbaby/lightly
A Python library for self-supervised learning on images.
opencvbaby/lipsync
opencvbaby/lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
opencvbaby/Machine-Learning-Session
opencvbaby/MedicalGPT
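MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains large medical models, implementing incremental pre-training, supervised fine-tuning, RLHF (reward modeling and reinforcement learning training), and DPO (direct preference optimization).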
opencvbaby/neural-audio-fp
opencvbaby/pytorchvideo
A deep learning library for video understanding research.
opencvbaby/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
opencvbaby/simclr
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
opencvbaby/so-vits-svc
SoftVC VITS Singing Voice Conversion
opencvbaby/syncnet_python
Out of time: automated lip sync in the wild
opencvbaby/the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
opencvbaby/TransGPT
opencvbaby/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
opencvbaby/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
opencvbaby/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
opencvbaby/Vicuna-LoRA-RLHF-PyTorch
A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT, but with Vicuna.
opencvbaby/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
opencvbaby/whisper.cpp
Port of OpenAI's Whisper model in C/C++
opencvbaby/YoutubeDNN
Implementation of the paper "Deep Neural Networks for YouTube Recommendations"