happyday630's Stars
NaiboWang/EasySpider
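A visual no-code/code-free web crawler/spider. EasySpider (易采集): a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically, without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.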
google-research/google-research
Google Research
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
mattingalls/Soundflower
macOS system extension that allows applications to pass audio to other applications. Soundflower works on macOS Catalina.
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
abhishekkrthakur/approachingalmost
Approaching (Almost) Any Machine Learning Problem
ChaoningZhang/MobileSAM
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Rikorose/DeepFilterNet
Noise suppression using deep filtering
vietanhdev/anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
google-ai-edge/mediapipe-samples
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
karolpiczak/ESC-50
ESC-50: Dataset for Environmental Sound Classification
jiawen-zhu/HQTrack
Tracking Anything in High Quality
google/visqol
Perceptual Quality Estimator for speech and audio
google-research/sound-separation
gordicarminkn/tvurls
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
sigsep/sigsep-mus-eval
museval - source separation evaluation tools for python
crlandsc/Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
junyuchen-cjy/DTTNet-Pytorch
An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation
antonyharfield/tflite-models-audioset-yamnet
A TFLite-compatible fork of YAMNet from tensorflow/models
farmaker47/Yamnet_classification_project
msgwak/Speech-enhancement-zoom-phone
balkce/demucstargetsel
Embedding- and location-based target selection strategies for the Demucs-Denoiser speech enhancement technique.