Henryplay

Henryplay's Stars

OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
2.1k213
zacharywhitley/awesome-ocr
868108
WenmuZhou/PytorchOCR
基于Pytorch的OCR工具库，支持常用的文字检测和识别算法
Language:Python1.4k304
cvg/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Language:Python3.3k321
aleju/imgaug
Image augmentation for machine learning experiments.
Language:Python14.4k2.4k
microsoft/Web-Dev-For-Beginners
24 Lessons, 12 Weeks, Get Started as a Web Developer
Language:JavaScript83.2k12.3k
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
Language:Python7.4k693
CyC2018/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
175k50.8k
EbookFoundation/free-programming-books
:books: Freely available programming books
335k61.3k
Visualize-ML/Book4_Power-of-Matrix
Book_4_《矩阵力量》 | 鸢尾花书：从加减乘除到机器学习；上架！
Language:Python8.6k1.3k
ruanyf/weekly
科技爱好者周刊，每周五发布
46.8k2.8k
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.8k271
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。
Language:JavaScript48k9.6k
TapXWorld/ChinaTextbook
所有小初高、大学PDF教材。
Language:Roff1.8k436
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language:Python35.7k3.4k
lonePatient/BERT-NER-Pytorch
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
Language:Python2.1k424
DataTalksClub/mlops-zoomcamp
Free MLOps course from DataTalks.Club
Language:Jupyter Notebook11k2.1k
phodal/understand-prompt
【🔞🔞🔞 内含不适合未成年人阅读的图片】基于我擅长的编程、绘画、写作展开的 AI 探索和总结：StableDiffusion 是一种强大的图像生成模型，能够通过对一张图片进行演化来生成新的图片。ChatGPT 是一个基于 Transformer 的语言生成模型，它能够自动为输入的主题生成合适的文章。而 Github Copilot 是一个智能编程助手，能够加速日常编程活动。
Language:Jupyter Notebook4.2k360
gongminmin/awesome-aigc
A list of awesome AIGC works
54743
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
3k513
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
Language:Python51937
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.5k4.2k
myhub/tr
Free Offline OCR 离线的中文文本检测+识别SDK
Language:Python1.3k378
lym0302/paddlespeech_tts_cpp
PaddleSpeech TTS cpp
Language:Python3512
kslz/sound_dataset_tools2
一个快速制作语音数据集的可视化工具
Language:Python19117
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
Language:C++21020
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11k1.8k
rhasspy/piper
A fast, local neural text to speech system
Language:C++6k435
janvainer/speedyspeech
Language:Python25036
ttroy50/cmake-examples
Useful CMake Examples
Language:CMake12.3k2.5k