litwellchi

Pinned Repositories

2D-Virtual-Data
BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios
Language:Python 0 0
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript 0 0
AnimateDiff
Official implementation of AnimateDiff.
Language:Python 0 0
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language:Python 0 0
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python 0 0
BEV-SAN
Language:Python 61
BEVDepth
Official code for BEVDepth.
Language:Python 0 0
M2Chat
Language:Python 29 3 30
UTMP
0 0
Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML 188 8 314

litwellchi's Repositories

litwellchi/M2Chat
Language:Python 29 3 30
litwellchi/BEV-SAN
Language:Python 61
litwellchi/2D-Virtual-Data
BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios
Language:Python 0 0
litwellchi/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript 0 0
litwellchi/AnimateDiff
Official implementation of AnimateDiff.
Language:Python 0 0
litwellchi/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Language:Python 0 0
litwellchi/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python 0 0
litwellchi/BEVDepth
Official code for BEVDepth.
Language:Python 0 0
litwellchi/UTMP
0 0
litwellchi/Category-6D-Pose
litwellchi/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
litwellchi/dynamic_grasping
Language:Python
litwellchi/LaVIT
LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
litwellchi/LitTools
Language:Python
litwellchi/litwellchi.github.io
Language:HTML
litwellchi/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Language:Python
litwellchi/llama-illusion
litwellchi/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
litwellchi/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Language:Python
litwellchi/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
litwellchi/MISA
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
Language:Python
litwellchi/modulated_fusion_transformer
Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition
Language:Python
litwellchi/MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
Language:Python
litwellchi/Multimodal-Infomax
This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.
Language:Python
litwellchi/QueryRCNN
Language:Python
litwellchi/SFA
Official Implementation of "Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers"
litwellchi/Volumetric-Aggregation-Transformer
Official Implementation of VAT
litwellchi/VTK-AR
Language:JavaScript
litwellchi/xiaoweichi.com
Language:HTML