vhzy

AI phd condidate

Pinned Repositories

acapp
django web app
Language:JavaScript0 1 00
ACoLP
Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023
Language:Python0 0 00
ai-web-app
Django Artificial Intelligence Web App for Facial Expression Recognition (FER)
Language:Jupyter Notebook00
ARL
IEEE Transactions on Affective Computing "Facial Action Unit Detection Using Attention and Relation Learning"
Language:C++00
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python0 0 00
atlas
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)
Language:Python00
attention_branch_network
Attention Branch Network (CIFAR100, ImageNet models)
Language:Python0 0 00
AU-Net
Towards robust facial action units detection
Language:Python0 0 00
Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".
Language:Python00
awesome-visual-question-answering
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
0 1 00

vhzy's Repositories

vhzy/ACoLP
Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023
Language:Python0 0 00
vhzy/attention_branch_network
Attention Branch Network (CIFAR100, ImageNet models)
Language:Python0 0 00
vhzy/AU-Net
Towards robust facial action units detection
Language:Python0 0 00
vhzy/ChatCaptioner
Official Repository of ChatCaptioner
Language:Jupyter Notebook0 0 00
vhzy/CIRL
[CVPR 2022 Oral] Code release for "Causality Inspired Representation Learning for Domain Generalization"
Language:Python0 0 00
vhzy/CIS
implementation for "Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition" (AAAI 2022).
Language:Python0 0 00
vhzy/ChineseNMT
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
Language:Python0 0
vhzy/DL_GCN
手写了卷积神经网络内核，来处理图上的节点分类与链路预测任务，在三个数据集cora,citeseer,ppi上进行试验，并分析了自环、层数、DropEdge、PairNorm、激活函数等因素对模型的分类和预测性能的影响。
Language:Python0 0
vhzy/Emotion-Investigator
An Exciting Deep Learning-based Flask web app that predicts the Facial Expressions of users and also does Graphical Visualization of the Expressions.
Language:Jupyter Notebook0 0
vhzy/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python0 0
vhzy/face_action_unit
code for facial au detection
1 0
vhzy/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
Language:Python0 0
vhzy/FAU_CVPR2021
Language:Python0 0
vhzy/FiD
Fusion-in-Decoder
vhzy/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Language:Python0 0
vhzy/InvReg
Invariant Feature Regularization for Fair Face Recognition (ICCV'23)
Language:Python0 0
vhzy/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook0 0
vhzy/markdown-notes
1 0
vhzy/ME-GraphAU
[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, Pytorch code
Language:Python0 0
vhzy/memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
vhzy/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Language:Python0 0
vhzy/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Language:Python0 0
vhzy/movie_knowledge_graph_app
电影知识图谱，主要包括实体识别、实体查询、关系查询以及智能问答等。movie knowledge graph(Entity identification, graph display, and intelligent question and answer)
Language:JavaScript0 0
vhzy/Notes
Language:JavaScript1 0
vhzy/prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
Language:Python0 0
vhzy/PSVL
Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).
Language:Python0 0
vhzy/PyTorch_Learning
1 0
vhzy/SeViLA
Self-Chained Image-Language Model for Video Localization and Question Answering
Language:Python0 0
vhzy/StatisticalLearning_USTC
Statistical Learning course in USTC. 中科大统计学习（刘东）课程复习资料。
Language:TeX0 0
vhzy/Ultra-Fast-Lane-Detection
Ultra Fast Structure-aware Deep Lane Detection (ECCV 2020)
Language:Python0 0