zhangyhdgut
I'm a student at Dongguan University of Technology, passionate about developing artificial intelligence.
Dongguan University of Technology
zhangyhdgut's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
chenjianhao66/go-GB28181
A network video platform based on the GB28181-2016 standard, written in Go, implementing the SIP protocol and a signaling server.
webyang-male/vue3-mallManage
Vue3 + ElementPlus + Vite: a shopping-mall back-office management system built with Vue3
szad670401/HyperLPR
High Performance Chinese License Plate Recognition Framework based on deep learning.
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
alexw914/RK_VideoPipe
meta-llama/llama3
The official Meta Llama 3 GitHub site
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ultralytics/ultralytics
Ultralytics YOLO11 🚀
sherlockchou86/VideoPipe
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :)
Elegycloud/clash-for-linux-backup
A backup repository for Clash for Linux, built on Clash Core
frgfm/torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
openvinotoolkit/openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
ireader/media-server
RTSP/RTP/RTMP/FLV/HLS/MPEG-TS/MPEG-PS/MPEG-DASH/MP4/fMP4/MKV/WebM
NVIDIA/trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
VainF/Torch-Pruning
[CVPR 2023] DepGraph: Towards Any Structural Pruning
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Sanster/IOPaint
Image inpainting tool powered by SOTA AI models. Remove any unwanted objects, defects, or people from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
ossrs/srs
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.