gyhandy
Ph.D. Student at USC, interested in Computer Vision, Machine Learning, and AGI
University of Southern California, Los Angeles
gyhandy's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
meta-llama/llama
Inference code for Llama models
chenfei-wu/TaskMatrix
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
meta-llama/codellama
Inference code for CodeLlama models
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
mistralai/mistral-inference
Official inference library for Mistral models
NVlabs/imaginaire
NVIDIA's Deep Imagination Team's PyTorch Library
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
isl-org/ZoeDepth
Metric depth estimation from a single image
ScanNet/ScanNet
gradslam/gradslam
gradslam is an open source differentiable dense SLAM library for PyTorch
NVlabs/BundleSDF
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
zju3dv/OnePose
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
Jumpat/SegmentAnythingin3D
Segment Anything in 3D with NeRFs (NeurIPS 2023)
facebookresearch/omni3d
Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"
apple/ARKitScenes
This repo accompanies the research paper "ARKitScenes: A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data" and contains the data, scripts to visualize and process assets, and the training code described in the paper.
Vision-CAIR/ChatCaptioner
Official Repository of ChatCaptioner
url-kaist/dynaVINS
DynaVINS: A Visual-Inertial SLAM for Dynamic Environments
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
SamsungLabs/imvoxelnet
[WACV 2022] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection
crockwell/Cap3D
[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models
concept-fusion/concept-fusion
Code release for ConceptFusion [RSS 2023]
notmahi/clip-fields
Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields
Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
DavidMChan/caption-by-committee
Using LLMs and pre-trained caption models for super-human performance on image captioning.
gyhandy/Text2Image-for-Detection
DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection