/ENA_Submission_ZW

Steps to submit data to ENA repository

Primary LanguageShell

ENA_Submission_test

Author: Zhou Wu

Date: 2020-08-06

This is to show the basic steps to submit data to ENA repository. WGS data with PE reads will be submitted. we have 44 samples with 826 PE fastq files.

The PE reads should always share the prefix of file.

It is recommanded to test the submissions using the Webin test service

In order to understand ENA accession numbers: How to understand ENA accession numbers

Steps for submission on ENA

  1. Study x1 (Do it online interactively)
  2. Samples x44 (Do it online interactively)
  3. Data x143x2 (PE-reads) (Do it online interactively, prepare the speadsheet and md5 with the following scripts); Options: FTP_upload.md

Scripts

  1. make_md5.sh
  2. Run_make_md5.sh #Only needed when more than two PE reads files of one single sample are presented, and this script is simply used to run make_md5.sh
  3. make_spreadlist_test.sh

NOTE More than two PE files of one single sample(multiple runs/libiaries) need to be submitted? The mulitple PE reads files of the same sample should be stored under one folder named with the sample name (structure: [Sample_name]/[Multiple_Reads_names] ). Then all the folder names(which equals the sample list) are listed in the file Example/Folders

Example files:

  1. Samples x2 : in Example/Folders
  2. Data 8x2 : PE reads (2 sample x 8 PE reads files of one single sample) prefix are in the 2nd columns of Example/Folders_R1_test, 1st column is sample name.