chenwj1989's Stars
TheAlgorithms/C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
shouxieai/tensorRT_Pro
C++ library based on tensorrt integration
yangxy/GPEN
jeonsworld/ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
csteinmetz1/ai-audio-startups
Community list of startups working with AI in audio and music technology
mido/mido
MIDI Objects for Python
craigsapp/midifile
C++ classes for reading/writing Standard MIDI Files
google/visqol
Perceptual Quality Estimator for speech and audio
lonelygo/Shift-AI-models-to-real-world-products
Share some useful guides and references about how to shift AI models to real world products or projects.
xboot/libonnx
A lightweight, portable pure C99 onnx inference engine for embedded devices with hardware acceleration support.
google-research/leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
oneTaken/Awesome-Denoise
One-paper-one-short-contribution-summary of all latest image/burst/video Denoising papers with code & citation published in top conference and journal.
mushfiqulalam/isp
camera pipeline
lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
MaybeShewill-CV/bisenetv2-tensorflow
Unofficial tensorflow implementation of real-time scene image segmentation model "BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation"
aianaconda/TensorFlow_Engineering_Implementation
The source code and dataset about <Deep Learning - Best Practices on TensorFlow Engineering Implementation>
mthli/webrtc-tutorial
Learning WebRTC the Hard Way 👀
yangcaoai/CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
shjwudp/shu
中文书籍收录整理, Collection of Chinese Books
cchen-cc/MA-SAM
PyTorch implementation for MA-SAM
SCRN-VRC/SimpNet-Deep-Learning-in-a-Shader
A trainable convolutional neural network inside a fragment shader
nanahou/Awesome-Bandwidth-Extension
This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.
sergiogenilson/STMKF
A Real-Time Spatio-Temporal Vídeo Denoising Method with Kalman-based and Bilateral Filters Fusion
ConferencingSpeech/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
haasn/fsrcnn-mpv
FSRCNN, implemented as an mpv hook
runngezhang/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications