chenbin0522's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
lllyasviel/Fooocus
Focus on prompting and generating
lyswhut/lx-music-desktop
一个基于 electron 的音乐软件
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
ultralytics/ultralytics
Ultralytics YOLO11 🚀
2noise/ChatTTS
A generative speech model for daily dialogue.
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
TonnyL/Awesome_APIs
:octocat: A collection of APIs
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
jsvine/pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
JoeanAmier/XHS-Downloader
小红书链接提取/作品采集工具:提取账号发布、收藏、点赞、专辑作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书无水印作品文件!
python-openxml/python-docx
Create and modify Word documents with Python
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Aegisub/Aegisub
Cross-platform advanced subtitle editor
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
KoljaB/RealtimeTTS
Converts text to speech in realtime
aclap-dev/vdhcoapp
Companion application for Video DownloadHelper browser add-on
inlife/nexrender
📹 Data-driven render automation for After Effects
coqui-ai/xtts-streaming-server
110Art/a-bogus