Pinned Repositories
2019_algorithm_intern_information
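Information table of companies offering 2020 algorithm internship and campus-recruiting positions (some entries include referral codes), common deep learning algorithm interview questions with answers, and summer computer vision internship interview experiences and summaries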
Analysis-on-the-Scale-of-Wildfire-Incident-in-California
artifact-directory-template
Template for specifying locations for all capstone project components
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
finetune_dolly
MQT-LLaVA
[NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models
Revisit_CLIP
BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
MRAG-Bench
Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
gordonhu608's Repositories
gordonhu608/MQT-LLaVA
[NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models
gordonhu608/finetune_dolly
gordonhu608/Revisit_CLIP
gordonhu608/artifact-directory-template
Template for specifying locations for all capstone project components
gordonhu608/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
gordonhu608/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
gordonhu608/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
gordonhu608/cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
gordonhu608/Data-Visualization-DSC106-
Data visualization coursework using JavaScript and HTML
gordonhu608/Deep-Learning-Projects-CSE151B-
Coursework for CSE151B. I strongly suggest reading the reports.
gordonhu608/peft_llama
Peft_BLIP_LLaMA
gordonhu608/Primal_Dual_RL
gordonhu608/SIIM-Detection
gordonhu608/DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
gordonhu608/EGO4D
gordonhu608/hand_eye_real_robot
gordonhu608/kaggel-llm-science-exam-2023
gordonhu608/llama-recipes
Examples and recipes for Llama 2 model
gordonhu608/LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
gordonhu608/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (Evaluation Pipeline)
gordonhu608/nasa-mars-imagery
gordonhu608/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
gordonhu608/pytorch_resnet_cifar10
Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.
gordonhu608/Scalable_Analytic_System_DSC102
Coursework for DSC102
gordonhu608/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
gordonhu608/Su_Lab_research_training
gordonhu608/transformers_llava
connect vision tower and projection to LLM
gordonhu608/VALOR
Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
gordonhu608/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
gordonhu608/yolov5
YOLOv5 in PyTorch > ONNX > CoreML > TFLite