poppynull

poppynull's Stars

facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.2k 428 4.2k6.4k
tuna/thuthesis
LaTeX Thesis Template for Tsinghua University
Language:TeX4.5k 88 6691.1k
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python4.4k 34 327451
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
Language:Python3.3k 39 258429
BilibiliVideoDownload/BilibiliVideoDownload
Cross-platform download bilibili video desktop software, support windows, macOS, Linux
Language:TypeScript3.1k 44 132394
TheNetAdmin/zjuthesis
Zhejiang University Graduation Thesis LaTeX Template
Language:TeX2.6k 15 308601
lifeiteng/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python2k 49 126319
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 170 467
sanjib-sen/WebLaTex
A complete alternative for Overleaf with VSCode + Web + Git Integration + Copilot + Grammar & Spell Checker + Live Collaboration Support. Based on GitHub Codespace and Dev container.
Language:TeX976 8 13277
ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language:Python852 12 110121
danieljf24/awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
583 19 266
w4123/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python471 6 269
ranchlai/mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
Language:Python462 8 39109
showlab/UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
Language:Python315 6 4628
TXH-mercury/VALOR
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Language:Python256 9 2215
yawenzeng/Awesome-Cross-Modal-Video-Moment-Retrieval
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
224 11 332
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
Language:Jupyter Notebook223 11 1251
wjun0830/QD-DETR
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Language:Python196 4 4415
lucoiso/UEAzSpeech
This plugin integrates Azure Speech Cognitive Services in Unreal Engine.
Language:C++192 8 14844
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Language:Python182 8 1048
rotten-work/vits-mandarin-windows
VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares
Language:Jupyter Notebook112 1 513
wjun0830/CGDETR
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
Language:Python110 5 1911
ga642381/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
Language:Python92 8 616
zsyh/Thesis-SE-ZJU-LaTeX
浙大软院研究生毕业论文 Latex 模版（非官方）2021夏季
Language:TeX80 1 031
cnaigithub/Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
Language:Python78 3 210
rishikksh20/NaturalSpeech2
Language:Python70 13 03
willyfh/awesome-video-text-datasets
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
28 2 03
yangbang18/CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
Language:Jupyter Notebook21 1 10
callsys/TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Language:Python19 2 30
La-pa/bilibili-download-vedio
本脚本所使用Python代码，用于下载bilibili视频，并将视频转化为音频的形式。
Language:Python3