yysu-888's Stars
apple/ml-mobileclip
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
sally-203/hand_eye_calibration
Hand eye calibration with two modes: eye in hand and eye to hand
yysu-888/Low-light-Image-Enhancement
Low-light-Image-Enhancement
AgibotTech/agibot_x1_train
The reinforcement learning training code for AgiBot X1.
Ewenwan/MVision
机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶
DefTruth/CUDA-Learn-Notes
📚 Tensor/CUDA Cores, 📖150+ CUDA Kernels, toy-hgemm library🔥(achieve the performance of cuBLAS 🎉🎉).
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
MichalZawalski/embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
yysu-888/yolov10-ncnn
arrayfire/arrayfire
ArrayFire: a general purpose GPU library.
luyh20/FGC-GraspNet
ICRA 2022 "Hybrid Physical Metric For 6-DoF Grasp Pose Detection"
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
sphericalcylinder/MetalCompute
A C++ wrapper for the Apple metal-cpp library to make it easier to run compute kernels on the GPU
BrokenSource/DepthFlow
🌊 Images to → 2.5D Parallax Effect Video. A Free and Open Source ImmersityAI alternative
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
pnnx/pnnx
PyTorch Neural Network eXchange
konrad-gajdus/miniMNIST-c
yysu-888/clip.cpp
CLIP model deploy in plain C/C++ using ggml machine learning library
microsoft/proxy
Proxy: Next Generation Polymorphism in C++
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
haosulab/MPlib
a Lightweight Motion Planning Package
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
facebookresearch/habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
BMPixel/moffee
moffee: Make Markdown Ready to Present
argmaxinc/DiffusionKit
On-device Inference of Diffusion Models for Apple Silicon
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
unum-cloud/usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
ARM-software/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
tensor-compiler/taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs