gyhandy
Ph.D. Student in USC, interested in Computer Vision, Machine Learning, and AGI
University of Southern CaliforniaLos Angeles
gyhandy's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
facebookresearch/llama
Inference code for LLaMA models
moymix/TaskMatrix
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Stability-AI/generative-models
Generative Models by Stability AI
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
facebookresearch/codellama
Inference code for CodeLlama models
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
huggingface/trl
Train transformer language models with reinforcement learning.
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
NVlabs/imaginaire
NVIDIA's Deep Imagination Team's PyTorch Library
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
isl-org/ZoeDepth
Metric depth estimation from a single image
NVlabs/BundleSDF
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
zju3dv/OnePose
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
bytedance/MVDream
Multi-view Diffusion for 3D Generation
KovenYu/WonderJourney
Vision-CAIR/ChatCaptioner
Official Repository of ChatCaptioner
Zeqiang-Lai/Mini-DALLE3
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
OPPO-Mente-Lab/Subject-Diffusion
Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning
crockwell/Cap3D
[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models
Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
briannlongzhao/DreamDistribution
DavidMChan/caption-by-committee
Using LLMs and pre-trained caption models for super-human performance on image captioning.
gyhandy/3D-Copy-Paste
[NeurIPS 2023] 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection
fostiropoulos/ablator
Model Ablation Tool-Kit for Deep Learning Model
cardi/USCthesis
a LaTeX style for theses and dissertations at USC
gyhandy/Text2Image-for-Detection
DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection
AaronXu9/ISL