Pinned Repositories
autogen
A programming framework for agentic AI 🤖
fk-visual-search
Flipkart's visual search and recommendation system
Flash3D
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.
llama.cpp
LLM inference in C/C++
LLMFarm
Run Llama and other large language models offline on iOS and macOS using the GGML library.
mlc-llm
Universal LLM Deployment Engine with ML Compilation
point_based_clothing
Official PyTorch code for the paper: "Point-Based Modeling of Human Clothing" (ICCV 2021)
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
VINS-AR
AR project based on "Monocular Visual-Inertial State Estimator on Mobile Phones"
GPTAlgoPro's Repositories
GPTAlgoPro/OpenScanner
Fast, reliable, and free document scanner app for iPhone
GPTAlgoPro/AL-Ref-SAM2
AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
GPTAlgoPro/apriltag
AprilTag is a visual fiducial system popular for robotics research.
GPTAlgoPro/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
GPTAlgoPro/ChatMLX
🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.
GPTAlgoPro/clash-verge-rev
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)
GPTAlgoPro/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image (uncensored)
GPTAlgoPro/DepthAnyVideo
Depth Any Video with Scalable Synthetic Data
GPTAlgoPro/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
GPTAlgoPro/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
GPTAlgoPro/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
GPTAlgoPro/labelU
A data annotation toolbox that supports image, audio, and video data.
GPTAlgoPro/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
GPTAlgoPro/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
GPTAlgoPro/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
GPTAlgoPro/MixedRealityToolkit-Unity
This repository holds the third generation of the Mixed Reality Toolkit for Unity. The latest version of the MRTK can be found here.
GPTAlgoPro/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
GPTAlgoPro/moshi
GPTAlgoPro/OpenXR-SDK-Source
Sources for OpenXR loader, basic API layers, and example code.
GPTAlgoPro/Pomelo
Pomelo is a Fork of Sudachi: a Nintendo Switch Emulator for iOS
GPTAlgoPro/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
GPTAlgoPro/Ryujinx
Experimental Nintendo Switch Emulator written in C#
GPTAlgoPro/shadPS4
PS4 emulator for Windows, Linux, and macOS
GPTAlgoPro/smplx
SMPL-X
GPTAlgoPro/Steel-LLM
Train a Chinese LLM from scratch as an individual
GPTAlgoPro/StereoKit
An easy-to-use XR engine for building AR and VR applications with C# and OpenXR!
GPTAlgoPro/SuperVINS
A robust real-time visual-inertial SLAM framework for challenging imaging conditions (integrated deep learning features)
GPTAlgoPro/T-MAC
Low-bit LLM inference on CPU with lookup table
GPTAlgoPro/TripoSR
GPTAlgoPro/TRLO