Pinned Repositories
AI-for-Urban
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
AIGC
Making big AI models cheaper, easier, and scalable
awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
chat-bot
GLM (General Language Model)
Computer-Vision-Nanodegree
Computer Vision Udacity Nanodegree which includes three projects: 1) Facial Key point detection. 2) Automatic Image - Captioning. 3) SLAM based Robot Localisation. The full course build in pytorch framework.
DAFL
A Pytorch implementation of "Data-Free Learning of Student Networks" (ICCV 2019).
GY-MCU90640-on-PC
The program is to take data and draw a thermal image from GY-MCU90640
lsmdc
A Joint Sequence Fusion Model for Video Question Answering and Retrieval. In ECCV 2018
tool_LLM
An open platform for training, serving, and evaluating large language model for tool learning.
yolo-V8
YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
autogyro's Repositories
autogyro/AI-for-Urban
[KDD'2024] "UrbanGPT: Spatio-Temporal Large Language Models"
autogyro/MoAI
Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks. (Under Review)
autogyro/saw-and-then-say
based on Qwen-VL (通义千问-VL) model proposed by Alibaba Cloud.
autogyro/YOLO-Arxiv-Daily
autogyro/active-hand
URDF Files and ROS 2 Description Package for the DexHand V2
autogyro/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
autogyro/awesome-LLMs-In-China
**大模型
autogyro/Bimanual-Manipulation-Model
Based on RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
autogyro/Chat_like_me
ChatTTS is a generative speech model for daily dialogue.
autogyro/D-FINE
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
autogyro/Domain-Generalized-Semantic-Segmentation
[CVPR 2024] Official implement of <Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation>
autogyro/Evolving-Neural-NetworksNeural-
Evolving Self-Assembling Neural Networks
autogyro/Eyes_of_Wukong
Image Resolution and Text Label Are Important Things for Large Multi-modal Models
autogyro/JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
autogyro/JQR-1
hardware BOM for rx1 humanoid robot
autogyro/LLM-VAD
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
autogyro/LLM_MultiAgents_Survey_Papers
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
autogyro/Multi-Objective-Optimization-with-Metaheuristics
A framework for single/multi-objective optimization with metaheuristics
autogyro/Network_Security_GPT
网络安全预训练大模型
autogyro/overesea_web_tools
收录独立开发者出海技术栈和工具
autogyro/Planning-Vision-Model
Dynamic VQA Dataset and Self-adaptive Planning Agent
autogyro/R-LLM-traffic-control
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement
autogyro/read_charactors_from_image
Ready-to-use OCR
autogyro/Robot-dog
A Quadruped Robot Easily Constructed through E-Commerce with Sheet Metal Welding and Machining
autogyro/see-at-edge
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
autogyro/self-trainning-video-tuning
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
autogyro/Swarm-Intelligence-Algorithm
Grouped Machine Learning Algorithm, including: Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman)
autogyro/top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
autogyro/Video-understand-LMM
This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"
autogyro/voice-chatbot
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit