iLoveBug's Stars
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
SchneeHertz/node-edge-tts
Use Microsoft Edge's TTS service on Node.js with support for proxy and subtitles.
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
zsxkib/cog-flux-dev-inpainting
🎨 Fill in masked parts of images with FLUX.1-dev 🖌️
Kwai-Kolors/Kolors
Kolors Team
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
cerredz/Tiktok-Image-Toolkit
A python script used to general a batch of text-to-image photos using AI for tiktok videos. Uses the Flux-Dev model and images can also be used for general purpose reasons.
hughescr/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
wnma3mz/wechat_articles_spider
微信公众号文章的爬虫
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
imputnet/cobalt
best way to save what you love
AutomaApp/automa
A browser extension for automating your browser by connecting blocks
bdambrosio/AllTheWorldAPlay
All the world is a play, we are but actors in it.
1c7/chinese-independent-developer
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻**独立开发者项目列表 -- 分享大家都在做什么
dxli94/WLASL
WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"
apple/corenet
CoreNet: A library for training deep neural networks
kinivi/hand-gesture-recognition-mediapipe
This is a sample program that recognizes hand signs and finger gestures with a simple MLP using the detected key points. Handpose is estimated using MediaPipe.
patlevin/face-detection-tflite
Face and iris detection for Python based on MediaPipe
Navi-Studio/Virtual-Human-for-Chatting
Live2D Virtual Human for Chatting based on Unity
hukenovs/hagrid
HAnd Gesture Recognition Image Dataset
Kazuhito00/mediapipe-python-sample
MediaPipeのPythonパッケージのサンプルです。2024/9/1時点でPython実装のある15機能について用意しています。
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
weijunext/indie-hacker-tools
收录独立开发者出海技术栈和工具
stdlib-js/stdlib
✨ Standard library for JavaScript and Node.js. ✨
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine