Yui010206

Ph.D. student

UNC, Chapel HillChapel Hill

Pinned Repositories

RACCooN
(arXiv.2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
Language:Python31 2 51
8086
Homework Codes in 8086 (Assembly Language) | HW from COA
Language:Assembly0 1 00
AIART
0 2 00
AIART_Website
an image style translatiton website
0 2 01
CREMA
☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Language:Python28 2 12
IVA-0
[MM24] Zero-Shot Controllable Image-to-Video Animation via Motion Decomposition
10
MoPRL
[TCSVT] Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Language:Python12 1 20
SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Language:Python182 3 2722
SJTU_SE_Groupwork
宿舍二手商品交易小组
Language:JavaScript2 1 06
VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
Language:Python89 2 103

Yui010206's Repositories

Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Language:Python182 3 2722
Yui010206/CREMA
☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Language:Python28 2 12
Yui010206/MoPRL
[TCSVT] Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
Language:Python12 1 20
Yui010206/IVA-0
[MM24] Zero-Shot Controllable Image-to-Video Animation via Motion Decomposition
10
Yui010206/AlphaPose
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Language:Python0 1 00
Yui010206/arunmallya.github.io
my public website
Language:JavaScript0 1 00
Yui010206/awesome-anomaly-detection
A curated list of awesome anomaly detection resources
1 0
Yui010206/awesome-vln
A curated list of research papers in Vision-Language Navigation (VLN)
1 0
Yui010206/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python1 0
Yui010206/grid-feats-vqa
Grid features pre-training code for visual question answering
Language:Python0 0
Yui010206/HOI-Learning-List
A list of Human-Object Interaction Learning.
1 0
Yui010206/just-ask
[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Language:Jupyter Notebook1 0
Yui010206/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Python0 0
Yui010206/MAC
2 0
Yui010206/magenta
Magenta: Music and Art Generation with Machine Intelligence
Language:Python1 0
Yui010206/merlot_reserve
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
Language:Python1 0
Yui010206/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python1 0
Yui010206/n2nmn
Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017
Language:SourcePawn1 0
Yui010206/Person-Search-with-Natural-Language-Description
Person Search with Natural Language Description
Language:Lua1 0
Yui010206/Research
novel deep learning research works with PaddlePaddle
Language:Python1 0
Yui010206/Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”
Language:Jupyter Notebook1 0
Yui010206/seg2vid
Video Generation from Single Semantic Label Map
Language:Python1 0
Yui010206/SJTUThesis
Shanghai Jiao Tong University XeLaTeX Thesis Template
Language:TeX1 0
Yui010206/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Language:Python1 0
Yui010206/transformer-time-series-prediction
proof of concept for a transformer-based time series prediction model
Language:Python1 0
Yui010206/VGT
Video Graph Transformer for Video Question Answering (ECCV'22)
Language:Python0 0
Yui010206/video-swin-transformer-pytorch
Video Swin Transformer - PyTorch
Language:Python1 0
Yui010206/video_feature_extractor
Easy to use video deep features extractor
Language:Python1 0
Yui010206/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Language:Python1 0
Yui010206/Yui010206.github.io
Language:SCSS2 0