quan-luu's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
isl-org/ZoeDepth
Metric depth estimation from a single image
isaac-sim/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
luca-medeiros/lang-segment-anything
SAM with text prompt
cvxgrp/cvxpylayers
Differentiable convex optimization layers
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
uzh-rpg/flightmare
An Open Flexible Quadrotor Simulator
sofa-framework/sofa
Real-time multi-physics simulation with an emphasis on medical simulation.
acados/acados
Fast and embedded solvers for nonlinear optimal control
utiasDSL/safe-control-gym
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
Auromix/ROS-LLM
ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural language interactions and leverages Large Language Models (LLMs) for decision-making and robot control. With an easy configuration process, this framework allows for swift integration, enabling your robot to operate with it in as little as ten minutes.
CodexLabsLLC/Colosseum
Open source simulator for autonomous robotics built on Unreal Engine with support for Unity
torch-js/torch-js
Node.js binding for PyTorch.
linchangyi1/Awesome-Touch
Tactile Sensing and Simulation; Visual Tactile Manipulation; Open Source.
UrielCh/opencv4nodejs
ESM Nodejs bindings to OpenCV 3/4
uzh-rpg/rpg_event_representation_learning
Repo for learning event representations
uzh-rpg/snn_angular_velocity
Event-Based Angular Velocity Regression with Spiking Networks
isri-aist/RoboManipBaselines
Software that integrates various imitation learning methods and benchmark task environments to provide baselines for robot manipulation
raghavmecheri/pytorchjs
Torch and TorchVision, but for NodeJS.
ngc92/quadgym
OpenAI gym Environments for Quadrotor Control
Ho-lab-jaist/SimTacLS
Open simulation tool for large-scale vision-based tactile sensing devices, based on a proposed SOFA-GAZEBO-GAN framework.
Ho-lab-jaist/protac
Data, software for sensing/perception and controlling ProTac arm