Pinned Repositories
fish-speech
SOTA Open Source TTS
OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
sglang
SGLang is a fast serving framework for large language models and vision language models.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
LSAP
pytorchvideo
A deep learning library for video understanding research.
zhang-jr's Repositories
zhang-jr/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
zhang-jr/LSAP
zhang-jr/pytorchvideo
A deep learning library for video understanding research.