open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python · MIT

Issues

[BUG]: 'NS2Trainer' object has no attribute '_count_parameters'
#182 opened 2 months ago by a897456
14
[Help]: Loss NaN occurred while training VALL-E during the second stage (the NAR decoder)
#211 opened 5 days ago by Ming-er
0
[Help]: Reproduced TTA Results
#209 opened 10 days ago by KunZhou9646
0
[Help]: Can use external pretrained codec for NeuralSpeech2
#208 opened 15 days ago by chazo1994
0
[BUG]: GPU memory leak during GAN-based vocoders training
#181 opened 12 days ago by vanIvan
2
[Help]: Data preprocessing for NaturalSpeech2 TTS
#179 opened 2 months ago by AvivSham
4
[Help]: NaturalSpeech2 training and data preprocess issue
#206 opened 20 days ago by CreepJoye
1
[Help]: Question of Data Preparation for TTA
#176 opened 2 months ago by jiusansan222
3
natural speech3 FACodec
#207 opened 20 days ago by wwfcnu
0
[Feature]: Audiocap dataset dev and test files
#186 opened a month ago by KunZhou9646
2
[Help]: Performance Inferior to Demo Showcase in terms of "FACodec: Voice Conversion Samples"
#203 opened 24 days ago by zyy-fc
1
[Help]: Something wrong with run.sh on Windows
#199 opened a month ago by rainbowjack
2
[Help]: Is there any loss that linearly correlates with the performance of the TTA autoencoder?
#184 opened 2 months ago by Jiang-Stan
3
[Help]: Questions about FACodec's Parameter
#194 opened a month ago by Pydataman
2
[Feature]: Replace hand-crafted hparams with dataclass and omegaconf framework
#198 opened a month ago by Nugine
1
[Feature]: Use ruff to improve code style
#195 opened a month ago by Nugine
1
[BUG]: Uid format in `preprocessors/popbutfy.py` may be incorrect
#196 opened a month ago by Nugine
1
[Help]: Infer AR solely for VALL-E?
#191 opened a month ago by caigun
1
[Help]: RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR
#193 opened a month ago by mysxs
2
[Help]: The difference between the FACodec pretrained models "FACodecEncoderV2" and "FACodecEncoder"
#192 opened a month ago by zyy-fc
1
[BUG]: The lengths of the features after FACodecEncoderV2 do not match
#188 opened a month ago by Mahaotian1
1
[Help]: May I ask what the difference is between TTA and TTM?
#190 opened a month ago by rainbowjack
2
[Help]: FACodec. How to recreate demo examples for voice conversion?
#161 opened 2 months ago by Allessyer
9
[Help]: Some questions about the SVC model
#187 opened a month ago by Yuki-zik
2
[Dataset]: Audiocap dev and test files
#185 opened a month ago by KunZhou9646
0
[BUG]: missing reference to `modules.generic.conv` in `modules/encoder/conv_encoder.py`
#178 opened 2 months ago by karljeon44
1
[Feature]: For Music - VALL-E transformer RAG (and other embedding solutions)
#170 opened 2 months ago by bennmann
1
[Help]: Latent
#174 opened 2 months ago by a897456
3
[Help]: How to do data processing for the TTA project?
#168 opened 2 months ago by spiralanch
2
[BUG]: ns2_dataset.py does not provide two fields, phones and num_frames, which are required by ns2_trainer.py
#171 opened 2 months ago by a897456
4
[Help]: When using the Valle_libritts pretrained model, the model fails to load correctly.
#169 opened 2 months ago by song201216
3
[Feature]: FACodec training
#156 opened 3 months ago by TechInterMezzo
1
[Help]: FileNotFoundError: [Errno 2] No such file or directory: 'data\\metadata\\libritts\\train-clean-100#1970#28415#1970_28415_000067_000000.pkl'
#172 opened 2 months ago by a897456
2
How to convert text to target audio (TTS) using ns3_codec (naturalspeech3)
#166 opened 2 months ago by aguang1201
6
[Help]: MultiGPU TTA training
#159 opened 3 months ago by fpicetti
2
[BUG]: FastSpeech2 train failed when under cuda_devices="1,2,3"
#167 opened 2 months ago by huangxu1991
3
[Help]: While training transformerSVC
#164 opened 2 months ago by suted2
4
[Help]: Inference with the VALL-E model reports an error
#134 opened 2 months ago by fangbingxiao
3
[BUG]: FACodec outputs noise
#173 opened 2 months ago by lifeiteng
1
[Help]: What server configuration is appropriate to run this project?
#146 opened 2 months ago by liuyuhualilith
2
[BUG]: Typos in TTA task
#158 opened 2 months ago by fpicetti
2
[BUG]: TTA ldm training loss
#162 opened 2 months ago by Sreyan88
0
length mismatch for FACodecDecoderV2
#160 opened 2 months ago by chenjiasheng
1
[BUG]: state_dict saved by "accelerator" cannot be loaded due to "shared tensors" problem
#149 opened 3 months ago by zyy-fc
1
[Help]: May I ask when naturalspeech3_facodec/resolve/main/ns3_facodec_encoder_v2.bin will be released?
#157 opened 3 months ago by wcr369
1
[Help]: Need a list of hardware configurations.
#151 opened 3 months ago by isKEKE
2
N
#150 opened 3 months ago by zyy-fc
0
[BUG]: NaturalSpeech2 data preprocessing & pitch loss
#148 opened 3 months ago by zyy-fc
2
[Help]: Inference
#142 opened 3 months ago by mysxs
5
[BUG]: G2P module fails to initialize when using `pypinyin` as backend
#138 opened 3 months ago by yuantuo666
0