mnky4a6p's Stars
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
karpathy/LLM101n
LLM101n: Let's build a Storyteller
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
unclecode/crawl4ai
Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
KwaiVGI/LivePortrait
Bring portraits to life!
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
asg017/sqlite-vec
A vector search SQLite extension that runs anywhere!
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
decodingml/llm-twin-course
Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices: ~ source code + 12 hands-on lessons
AlexanderKoch-Koch/low_cost_robot
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
THUDM/CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
OpenAutoCoder/Agentless
Agentless: an agentless approach to automatically solve software development problems
skapadia3214/groq-moa
Mixture of Agents using Groq
apache/datafusion-comet
Apache DataFusion Comet Spark Accelerator
nasa-jpl/rosa
ROSA is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
virattt/financial-agent-ui
Financial agent + generative UI
jess-moss/koch-v1-1
A version 1.1 of the Alexander Koch low cost robot arm with some small changes.
threepointone/partyserver
PartyKit, for Workers
hustvl/EVF-SAM
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
deeptrust-ai/terifai
Terrify people
lalanikarim/webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
agentsea/robbie-g2
kaiaai/LDS
Arduino LiDAR library supporting YDLIDAR X2/X3/X4, RPLIDAR A1, Xiaomi LDS02RR, Neato XV11, LD14P, CAMSENSE X1, Delta-2A/2B/2G
RuiningLi/puppet-master
Official Implementation of Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
dalenguyen/pdfun
PDF services built with Angular & GCP