Max-Fu's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
gpakosz/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
paperswithcode/galai
Model API for GALACTICA
openai/consistencydecoder
Consistency Distilled Diff VAE
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
muskie82/MonoGS
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
takuseno/d3rlpy
An offline deep reinforcement learning library
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
IFL-CAMP/easy_handeye
Automated, hardware-independent Hand-Eye Calibration
apple/ARKitScenes
This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and process assets, and training code described in our paper.
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
real-stanford/scalingup
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
NVlabs/mimicgen
This code corresponds to simulation environments used as part of the MimicGen project.
kevinzakka/mujoco_scanned_objects
MuJoCo Models for Google's Scanned Objects Dataset
young-geng/scalax
A simple library for scaling up JAX programs
JeanElsner/panda-py
Python bindings for real-time control of Franka Emika robots.
TonyLianLong/CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
mc-lan/ClearCLIP
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
Max-Fu/tvl
Max-Fu/icrt
The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"
visiont3lab/photometric_stereo
BerkeleyAutomation/fog_x