Pinned Repositories
audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
ContinuousFlowNLG
PyTorch version of Continuous Language Generative Flow (ACL 2021)
DeCEMBERT
PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
i-Code
MulAgentRef
Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
zinengtang.github.io
Personal Website
zinengtang's Repositories
zinengtang/TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
zinengtang/VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
zinengtang/Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
zinengtang/DeCEMBERT
PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
zinengtang/ContinuousFlowNLG
PyTorch version of Continuous Language Generative Flow (ACL 2021)
zinengtang/i-Code
zinengtang/MulAgentRef
zinengtang/zinengtang.github.io
Personal Website
zinengtang/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
zinengtang/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
zinengtang/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
zinengtang/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
zinengtang/UDOP
zinengtang/CS184_FinalProject
Computational Design of High-level Interlocking Puzzles (SIGGRAPH 2022 Journal Track Paper)
zinengtang/cs265-tasks
zinengtang/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
zinengtang/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
zinengtang/youtube-dl
Command-line program to download videos from YouTube.com and other video sites