happyday630

Dolby Lab
Beijing, China

happyday630's Stars

NaiboWang/EasySpider
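A visual no-code/code-free web crawler/spider. 易采集 (EasySpider): a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.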
Language: JavaScript 36k 224 5304.4k
google-research/google-research
Google Research
Language: Jupyter Notebook 34.3k 752 1.3k7.9k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.3k 313 9274.8k
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language: Python 30.5k 390 3.5k7.5k
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
Language: JavaScript 28.1k 302 1.2k2.8k
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language: C++ 27.6k 511 5.2k5.2k
mattingalls/Soundflower
MacOS system extension that allows applications to pass audio to other applications. Soundflower works on macOS Catalina.
Language: Objective-C 8.9k 417 0613
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language: Python 8.3k 154 5431.1k
abhishekkrthakur/approachingalmost
Approaching (Almost) Any Machine Learning Problem
7.4k 144 661.1k
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language: Jupyter Notebook 4.8k 43 125503
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language: Python 3.9k 32 185334
Rikorose/DeepFilterNet
Noise suppression using deep filtering
Language: Python 2.5k 30 285235
vietanhdev/anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
Language: Python 2.4k 22 134246
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
Language: Python 1.8k 31 0102
google-ai-edge/mediapipe-samples
Language: Jupyter Notebook 1.6k 45 198419
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Language: Python 1.5k 21 40140
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language: Python 1.5k 41 234432
karolpiczak/ESC-50
ESC-50: Dataset for Environmental Sound Classification
Language: Python 1.4k 31 11290
jiawen-zhu/HQTrack
Tracking Anything in High Quality
Language: Python 744 13 1965
google/visqol
Perceptual Quality Estimator for speech and audio
Language: C++ 698 27 72125
google-research/sound-separation
Language: Python 651 27 16118
gordicarminkn/tvurls
250
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Language: HTML 210 21 2658
sigsep/sigsep-mus-eval
museval - source separation evaluation tools for python
Language: Python 202 5 3036
crlandsc/Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
Language: Python 144 5 013
junyuchen-cjy/DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
Language: Python 79 4 310
antonyharfield/tflite-models-audioset-yamnet
A TFLite-compatible fork of YAMNet from tensorflow/models
Language: Jupyter Notebook 29 3 212
farmaker47/Yamnet_classification_project
Language: Jupyter Notebook 25 3 33
msgwak/Speech-enhancement-zoom-phone
Language: Jupyter Notebook 44
balkce/demucstargetsel
Embedding- and location-based target selection strategies for the Demucs-Denoiser speech enhancement technique.
Language: Python 1 1 01
