Pinned Repositories
AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
ai-in-sports
Source code for AI in Sports with Python
auto-dev
🧙AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
Awesome-AI-GPTs
Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
GoPose
GoPose人工智能运动分析软件可用于比赛、训练、科研等场景,其优势是无接触式测量和快速反馈。
objectdetection_script
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
paper-reading
深度学习经典、新论文逐段精读
mtrocky's Repositories
mtrocky/ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
mtrocky/auto-dev
🧙AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
mtrocky/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
mtrocky/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
mtrocky/objectdetection_script
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
mtrocky/boxmot
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
mtrocky/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
mtrocky/facefusion
Industry leading face manipulation platform
mtrocky/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
mtrocky/GVHMR
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
mtrocky/Kinovea
Video solution for sport analysis. Capture, inspect, compare, annotate and measure technical performances.
mtrocky/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
mtrocky/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
mtrocky/MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
mtrocky/Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
mtrocky/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
mtrocky/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone
mtrocky/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
mtrocky/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
mtrocky/ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
mtrocky/opensim-core
SimTK OpenSim C++ libraries and command-line applications, and Java/Python wrapping.
mtrocky/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
mtrocky/phidata
Build AI Assistants with memory, knowledge and tools.
mtrocky/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
mtrocky/pyskl
A toolbox for skeleton-based action recognition.
mtrocky/sapiens
High-resolution models for human tasks.
mtrocky/StableDiffusionOnDevice
本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。
mtrocky/tennis_analysis
This project analyzes Tennis players in a video to measure their speed, ball shot speed and number of shots. This project will detect players and the tennis ball using YOLO and also utilizes CNNs to extract court keypoints. This hands on project is perfect for polishing your machine learning, and computer vision skills.
mtrocky/xinference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
mtrocky/yolo11_suspicious_activity