poppynull's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
BilibiliVideoDownload/BilibiliVideoDownload
Cross-platform download bilibili video desktop software, support windows, macOS, Linux
TheNetAdmin/zjuthesis
Zhejiang University Graduation Thesis LaTeX Template
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
sanjib-sen/WebLaTex
A complete alternative for Overleaf with VSCode + Web + Git Integration + Copilot + Grammar & Spell Checker + Live Collaboration Support. Based on GitHub Codespace and Dev container.
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
danieljf24/awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
w4123/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
showlab/UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
TXH-mercury/VALOR
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
yawenzeng/Awesome-Cross-Modal-Video-Moment-Retrieval
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
wjun0830/QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
lucoiso/UEAzSpeech
This plugin integrates Azure Speech Cognitive Services in Unreal Engine.
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
rotten-work/vits-mandarin-windows
VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares
wjun0830/CGDETR
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
ga642381/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
zsyh/Thesis-SE-ZJU-LaTeX
浙大软院研究生毕业论文 Latex 模版(非官方)2021夏季
cnaigithub/Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
rishikksh20/NaturalSpeech2
willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
yangbang18/CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
callsys/TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
La-pa/bilibili-download-vedio
本脚本所使用Python代码,用于下载bilibili视频,并将视频转化为音频的形式。