This is to show the basic steps to submit data to ENA repository. WGS data with PE reads will be submitted. we have 44 samples with 826 PE fastq files.
The PE reads should always share the prefix of file.
It is recommanded to test the submissions using the Webin test service
In order to understand ENA accession numbers: How to understand ENA accession numbers
- Study x1 (Do it online interactively)
- Samples x44 (Do it online interactively)
- Data x143x2 (PE-reads) (Do it online interactively, prepare the speadsheet and md5 with the following scripts); Options:
FTP_upload.md
- make_md5.sh
- Run_make_md5.sh #Only needed when more than two PE reads files of one single sample are presented, and this script is simply used to run
make_md5.sh
- make_spreadlist_test.sh
NOTE More than two PE files of one single sample(multiple runs/libiaries) need to be submitted? The mulitple PE reads files of the same sample should be stored under one folder named with the sample name (structure: [Sample_name]/[Multiple_Reads_names] ). Then all the folder names(which equals the sample list) are listed in the file Example/Folders
- Samples x2 : in
Example/Folders
- Data 8x2 : PE reads (2 sample x 8 PE reads files of one single sample) prefix are in the 2nd columns of
Example/Folders_R1_test
, 1st column is sample name.