NoFish-528
Undergraduate Student @ Xidian University; Research Intern @ Microsoft Research Asia
Xidian UniveristyBeijing
Pinned Repositories
CS-BAOYAN-2024
2024年保研经验贴和相关物料
CSBasicKnowledge
This repo will record some knowledge about computer science, artificial intelligence and EE
faster-git
a chinese tutorial of git
thorough-pytorch
PyTorch入门教程,在线阅读地址:https://datawhalechina.github.io/thorough-pytorch/
AI-research-tools
:hammer:AI 方向好用的科研工具
awesome-ai-tools
A curated list of Artificial Intelligence Top Tools
clip_zip
encodec-pytorch
unofficial implementation of the High Fidelity Neural Audio Compression
nlp-speech-2023-winter-learning
NoFish-528
NoFish-528's Repositories
NoFish-528/encodec-pytorch
unofficial implementation of the High Fidelity Neural Audio Compression
NoFish-528/NoFish-528
NoFish-528/nlp-speech-2023-winter-learning
NoFish-528/awesome-ai-tools
A curated list of Artificial Intelligence Top Tools
NoFish-528/download_dataset_scripts
NoFish-528/EnCodec_Trainer
NoFish-528/faster-git
NoFish-528/NoFish-528.github.io
NoFish-528/pre-train-dockerfile
An Intro to set up your Speech Docker environment and debug using VSCode
NoFish-528/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
NoFish-528/ASR-paper
:fire: ASR教程: https://dataxujing.github.io/ASR-paper/
NoFish-528/BertWithPretrained
An implementation of the BERT model and its related downstream tasks based on the PyTorch framework
NoFish-528/CS-BAOYAN-2024
NoFish-528/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
NoFish-528/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
NoFish-528/MaskedVectorQuantization
Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"
NoFish-528/MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
NoFish-528/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
NoFish-528/PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
NoFish-528/PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
NoFish-528/pytorch-template
To be the world's best PyTorch project template.
NoFish-528/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
NoFish-528/speech-language-model
A collection of papers related to speech language models
NoFish-528/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
NoFish-528/TempoTokens
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
NoFish-528/textlesslib
Library for Textless Spoken Language Processing
NoFish-528/USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
NoFish-528/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
NoFish-528/X-LLM
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
NoFish-528/XDU-WiKi
为了帮助在XDU就读的本科生们而创立的WiKi网站