tracyqan's Stars
itmorn/robot-mouse-track
随着互联网技术的发展,鼠标轨迹识别算法在很多人机交互产品中的需求日益增加,比如,一些网站为了防止被爬,增加了一些滑块验证码,但是一些软件已经可以模拟人的行为破解滑块验证码。本项目就是通过对鼠标轨迹的特征分析,判定是否是人的行为还是机器行为。常见应用场景:网站反爬虫、在线考试系统脚本刷题。文档:https://robot-mouse-track.readthedocs.io
cupy/cupy
NumPy & SciPy for GPU
fishaudio/fish-speech
Brand new TTS solution
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
tarasko/picows
Ultra-fast websocket client and server for asyncio
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
deepset-ai/haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
budtmo/docker-android
Android in docker solution with noVNC supported and video recording
FlareSolverr/FlareSolverr
Proxy server to bypass Cloudflare protection
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
RapidAI/RapidOCR
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle.
LLMBook-zh/LLMBook-zh.github.io
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
HaujetZhao/CapsWriter-Offline
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
xai-org/grok-1
Grok open release
ljp-cyber/autowx
基于autojs的聊天项目
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
netease-youdao/QAnything
Question and Answer based on Anything.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
PyO3/pyo3
Rust bindings for the Python interpreter
rust-lang/rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
tw93/Pake
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
ihmily/DouyinLiveRecorder
可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、pandatv、afreecatv、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、17Live、Twitch、Acfun、CHZZK等平台直播录制
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
mnotgod96/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥