quan-luu

quan-luu's Stars

openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Language: Jupyter Notebook, 27.4k 326 4103.4k
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Language: Python, 23.5k 345 1.5k6.4k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python, 21.5k 161 1.6k2.4k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook, 15.8k 116 3981.4k
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language: Python, 9.1k 92 207971
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Language: Python, 8.8k 36 3001.1k
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose-built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts, and generate videos.
Language: Python, 7.5k 84 87476
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Language: Python, 4.7k 74 245655
isl-org/ZoeDepth
Metric depth estimation from a single image
Language: Jupyter Notebook, 2.5k 35 120224
isaac-sim/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
Language: Python, 2.2k 37 216446
luca-medeiros/lang-segment-anything
SAM with text prompt
Language: Python, 2k 12 63218
cvxgrp/cvxpylayers
Differentiable convex optimization layers
Language: Python, 1.9k 58 114167
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language: Python, 1.8k 20 137197
microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
Language: C++, 1.3k 38 166351
uzh-rpg/flightmare
An Open Flexible Quadrotor Simulator
Language: C++, 1.1k 35 168357
sofa-framework/sofa
Real-time multi-physics simulation with an emphasis on medical simulation.
Language: C++, 967 56 867318
acados/acados
Fast and embedded solvers for nonlinear optimal control
Language: C, 907 24 332259
utiasDSL/safe-control-gym
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
Language: Python, 668 11 53135
Auromix/ROS-LLM
ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural language interactions and leverages Large Language Models (LLMs) for decision-making and robot control. With an easy configuration process, this framework allows for swift integration, enabling your robot to operate with it in as little as ten minutes.
Language: Python, 578 9 1373
CodexLabsLLC/Colosseum
Open source simulator for autonomous robotics built on Unreal Engine with support for Unity
Language: C++, 427 19 75135
torch-js/torch-js
Node.js binding for PyTorch.
Language: C++, 311 6 439
linchangyi1/Awesome-Touch
Tactile Sensing and Simulation; Visual Tactile Manipulation; Open Source.
291 8 1030
UrielCh/opencv4nodejs
ESM Nodejs bindings to OpenCV 3/4
Language: C++, 263 8 13451
uzh-rpg/rpg_event_representation_learning
Repo for learning event representations
Language: Python, 142 11 1227
uzh-rpg/snn_angular_velocity
Event-Based Angular Velocity Regression with Spiking Networks
Language: Python, 111 15 919
isri-aist/RoboManipBaselines
Software that integrates various imitation learning methods and benchmark task environments to provide baselines for robot manipulation
Language: Python, 79 7 015
raghavmecheri/pytorchjs
Torch and TorchVision, but for NodeJS.
Language: JavaScript, 36 2 133
ngc92/quadgym
OpenAI gym Environments for Quadrotor Control
Language: Python, 16 3 19
Ho-lab-jaist/SimTacLS
Open simulation tool for large-scale vision-based tactile sensing devices, based on a proposed SOFA-GAZEBO-GAN framework.
Language: Python, 3 1 01
Ho-lab-jaist/protac
Data and software for sensing/perception and control of the ProTac arm
Language: Python, 1 0 00