Y445n9's Stars
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
schrodingercatss/tuning_playbook_zh_cn
一本系统地教你将深度学习模型的性能最大化的战术手册。
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
AGI-Edgerunners/LLM-Agents-Papers
A repo lists papers related to LLM based agent
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Ruixxxx/Awesome-Vision-Mamba-Models
[Official Repo] A Survey on Vision Mamba: Models, Applications and Challenges
zhuyiche/llava-phi
52CV/ICCV-2023-Papers
erkil1452/gaze360
Code for the Gaze360: Physically Unconstrained Gaze Estimation in the Wild Dataset
AdaptiveMotorControlLab/AmadeusGPT
We turn natural language descriptions of behaviors into machine-executable code
Kartik-3004/facexformer
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
OceannTwT/Tool-Planner
Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering
z-x-yang/DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
lxq1000/SwinFace
Official Pytorch Implementation of the paper, "SwinFace: A Multi-task Transformer for Face Recognition, Facial Expression Recognition, Age Estimation and Face Attribute Estimation"
kyotovision-public/dynamic-3d-gaze-from-afar
hustvl/ViTGaze
francescotonini/object-aware-gaze-target-detection
Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)
nkuhzx/GFIE
【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments
lxq1000/Faceptor
Official implementation of Faceptor: A Generalist Model for Face Perception.
guanxiongsun/vfe.pytorch
Video Feature Enhancement with PyTorch
caixin1998/UnReGA
Official code for the CVPR 2023 paper "Source-free Adaptive Gaze Estimation by Uncertainty Reduction".
francescotonini/human-gaze-target-detection-transformer
An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"
Azong-HQU/MMTrack
The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].
francescotonini/multimodal-across-domains-gaze-target-detection
Official repo of "Multimodal Across Domains Gaze Target Detection" @ ICMI 2022
csguoh/MGTR
[ACCV2022] An official implement of the paper "MGTR: End-to-end Mutual Gaze Detection with Transformer".
sangmin-git/MMSI
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
px39n/End-to-End-Human-Gaze-Target-Detection-with-Transformers
Unofficial Realization of paper