Pinned Repositories
2D-Virtual-Data
BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
AnimateDiff
Official implementation of AnimateDiff.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
BEV-SAN
BEVDepth
Official code for BEVDepth.
M2Chat
UTMP
Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
litwellchi's Repositories
litwellchi/M2Chat
litwellchi/BEV-SAN
litwellchi/2D-Virtual-Data
BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios
litwellchi/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
litwellchi/AnimateDiff
Official implementation of AnimateDiff.
litwellchi/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
litwellchi/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
litwellchi/BEVDepth
Official code for BEVDepth.
litwellchi/UTMP
litwellchi/Category-6D-Pose
litwellchi/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
litwellchi/dynamic_grasping
litwellchi/LaVIT
LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
litwellchi/LitTools
litwellchi/litwellchi.github.io
litwellchi/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
litwellchi/llama-illusion
litwellchi/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
litwellchi/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
litwellchi/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
litwellchi/MISA
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
litwellchi/modulated_fusion_transformer
Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition
litwellchi/MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
litwellchi/Multimodal-Infomax
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.
litwellchi/QueryRCNN
litwellchi/SFA
Official Implementation of "Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers"
litwellchi/Volumetric-Aggregation-Transformer
Official Implementation of VAT
litwellchi/VTK-AR
litwellchi/xiaoweichi.com