Y445n9

NO DESCRIPTION

East China Normal UniversityShanghai,China

Y445n9's Stars

Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.3k 220 4582.9k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
11.7k 269 108758
schrodingercatss/tuning_playbook_zh_cn
一本系统地教你将深度学习模型的性能最大化的战术手册。
2.4k 14 5216
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
1.7k 46 990
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
Language:Python1.5k 99 56156
AGI-Edgerunners/LLM-Agents-Papers
A repo lists papers related to LLM based agent
Language:Python973 33 971
liliu-avril/Awesome-Segment-Anything
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
765 19 750
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Language:Python565 13 10353
Ruixxxx/Awesome-Vision-Mamba-Models
[Official Repo] A Survey on Vision Mamba: Models, Applications and Challenges
416 10 128
zhuyiche/llava-phi
Language:Python356 27 2337
52CV/ICCV-2023-Papers
240 6 412
erkil1452/gaze360
Code for the Gaze360: Physically Unconstrained Gaze Estimation in the Wild Dataset
Language:Python226 9 5442
AdaptiveMotorControlLab/AmadeusGPT
We turn natural language descriptions of behaviors into machine-executable code
Language:Python199 2 68
Kartik-3004/facexformer
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis
Language:Python183 10 1618
OceannTwT/Tool-Planner
Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering
Language:Python116 2 33
z-x-yang/DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
Language:Jupyter Notebook70 6 45
lxq1000/SwinFace
Official Pytorch Implementation of the paper, "SwinFace: A Multi-task Transformer for Face Recognition, Facial Expression Recognition, Age Estimation and Face Attribute Estimation"
Language:Python65 3 97
kyotovision-public/dynamic-3d-gaze-from-afar
Language:Jupyter Notebook49 3 117
hustvl/ViTGaze
Language:Python32 3 24
francescotonini/object-aware-gaze-target-detection
Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)
Language:Python31 4 123
nkuhzx/GFIE
【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments
Language:Python26 1 53
lxq1000/Faceptor
Official implementation of Faceptor: A Generalist Model for Face Perception.
Language:Python220
guanxiongsun/vfe.pytorch
Video Feature Enhancement with PyTorch
Language:Python20 2 22
caixin1998/UnReGA
Official code for the CVPR 2023 paper "Source-free Adaptive Gaze Estimation by Uncertainty Reduction".
Language:Python19 3 21
francescotonini/human-gaze-target-detection-transformer
An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"
Language:Python14 2 52
Azong-HQU/MMTrack
The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].
Language:Python13 2 91
francescotonini/multimodal-across-domains-gaze-target-detection
Official repo of "Multimodal Across Domains Gaze Target Detection" @ ICMI 2022
Language:Python10 1 54
csguoh/MGTR
[ACCV2022] An official implement of the paper "MGTR: End-to-end Mutual Gaze Detection with Transformer".
Language:Python8 1 13
sangmin-git/MMSI
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
Language:Python80
px39n/End-to-End-Human-Gaze-Target-Detection-with-Transformers
Unofficial Realization of paper
Language:Python21

Y445n9

Y445n9's Stars

Vision-CAIR/MiniGPT-4

BradyFU/Awesome-Multimodal-Large-Language-Models

schrodingercatss/tuning_playbook_zh_cn

zjunlp/LLMAgentPapers

MasterBin-IIAU/UNINEXT

AGI-Edgerunners/LLM-Agents-Papers

liliu-avril/Awesome-Segment-Anything

TinyLLaVA/TinyLLaVA_Factory

Ruixxxx/Awesome-Vision-Mamba-Models

zhuyiche/llava-phi

52CV/ICCV-2023-Papers

erkil1452/gaze360

AdaptiveMotorControlLab/AmadeusGPT

Kartik-3004/facexformer

OceannTwT/Tool-Planner

z-x-yang/DoraemonGPT

lxq1000/SwinFace

kyotovision-public/dynamic-3d-gaze-from-afar

hustvl/ViTGaze

francescotonini/object-aware-gaze-target-detection

nkuhzx/GFIE

lxq1000/Faceptor

guanxiongsun/vfe.pytorch

caixin1998/UnReGA

francescotonini/human-gaze-target-detection-transformer

Azong-HQU/MMTrack

francescotonini/multimodal-across-domains-gaze-target-detection

csguoh/MGTR

sangmin-git/MMSI

px39n/End-to-End-Human-Gaze-Target-Detection-with-Transformers