Henryplay's Stars
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
zacharywhitley/awesome-ocr
WenmuZhou/PytorchOCR
A PyTorch-based OCR toolkit supporting common text detection and recognition algorithms
cvg/LightGlue
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
aleju/imgaug
Image augmentation for machine learning experiments.
microsoft/Web-Dev-For-Beginners
24 Lessons, 12 Weeks, Get Started as a Web Developer
CASIA-IVA-Lab/FastSAM
Fast Segment Anything
CyC2018/CS-Notes
:books: Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, system design
EbookFoundation/free-programming-books
:books: Freely available programming books
Visualize-ML/Book4_Power-of-Matrix
Book 4, "Power of Matrix" | Iris Book series: from basic arithmetic to machine learning; now available!
ruanyf/weekly
Tech Enthusiast Weekly, published every Friday
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶 Nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci by 1,564 ci poets of the Song period.
TapXWorld/ChinaTextbook
PDF textbooks for all primary, middle, and high school levels, plus university.
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
lonePatient/BERT-NER-Pytorch
Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)
DataTalksClub/mlops-zoomcamp
Free MLOps course from DataTalks.Club
phodal/understand-prompt
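【🔞🔞🔞 Contains images unsuitable for minors】 AI explorations and summaries built around my strengths in programming, drawing, and writing: StableDiffusion is a powerful image generation model that can generate new images by evolving an existing one. ChatGPT is a Transformer-based language generation model that can automatically generate a suitable article for a given topic. GitHub Copilot is an intelligent programming assistant that speeds up everyday programming.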
gongminmin/awesome-aigc
A list of awesome AIGC works
zzw922cn/awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
myhub/tr
Free Offline OCR: an offline Chinese text detection and recognition SDK
lym0302/paddlespeech_tts_cpp
PaddleSpeech TTS cpp
kslz/sound_dataset_tools2
A visual tool for quickly building speech datasets
ZDisket/TensorVox
Desktop application for neural speech synthesis written in C++
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
rhasspy/piper
A fast, local neural text to speech system
janvainer/speedyspeech
ttroy50/cmake-examples
Useful CMake Examples