Pinned Repositories
acapp
django web app
ACoLP
Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023
ai-web-app
Django Artificial Intelligence Web App for Facial Expression Recognition (FER)
ARL
IEEE Transactions on Affective Computing "Facial Action Unit Detection Using Attention and Relation Learning"
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
atlas
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)
attention_branch_network
Attention Branch Network (CIFAR100, ImageNet models)
AU-Net
Towards robust facial action units detection
Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".
awesome-visual-question-answering
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
vhzy's Repositories
vhzy/ACoLP
Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023
vhzy/attention_branch_network
Attention Branch Network (CIFAR100, ImageNet models)
vhzy/AU-Net
Towards robust facial action units detection
vhzy/ChatCaptioner
Official Repository of ChatCaptioner
vhzy/CIRL
[CVPR 2022 Oral] Code release for "Causality Inspired Representation Learning for Domain Generalization"
vhzy/CIS
implementation for "Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition" (AAAI 2022).
vhzy/ChineseNMT
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
vhzy/DL_GCN
手写了卷积神经网络内核,来处理图上的节点分类与链路预测任务,在三个数据集cora,citeseer,ppi上进行试验,并分析了自环、层数、DropEdge、PairNorm、激活函数等因素对模型的分类和预测性能的影响。
vhzy/Emotion-Investigator
An Exciting Deep Learning-based Flask web app that predicts the Facial Expressions of users and also does Graphical Visualization of the Expressions.
vhzy/EVA
EVA Series: Visual Representation Fantasies from BAAI
vhzy/face_action_unit
code for facial au detection
vhzy/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
vhzy/FAU_CVPR2021
vhzy/FiD
Fusion-in-Decoder
vhzy/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
vhzy/InvReg
Invariant Feature Regularization for Fair Face Recognition (ICCV'23)
vhzy/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
vhzy/markdown-notes
vhzy/ME-GraphAU
[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, Pytorch code
vhzy/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
vhzy/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
vhzy/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
vhzy/movie_knowledge_graph_app
电影知识图谱,主要包括实体识别、实体查询、关系查询以及智能问答等。movie knowledge graph(Entity identification, graph display, and intelligent question and answer)
vhzy/Notes
vhzy/prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
vhzy/PSVL
Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).
vhzy/PyTorch_Learning
vhzy/SeViLA
Self-Chained Image-Language Model for Video Localization and Question Answering
vhzy/StatisticalLearning_USTC
Statistical Learning course in USTC. 中科大统计学习(刘东)课程复习资料。
vhzy/Ultra-Fast-Lane-Detection
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)