Shingo Miyauchi 7April20
Proteomic Information Navigated Genomic Outlook (PRINGO) and Transposon Identification Nominative Genome Overview (TINGO; Morin et al., 2019) are custom R scripts for data integration and visualisation. This demonstration includes the main Figures, which were used to support the core findings in Miyauchi et al. (2020) in Nature Communications.
PRINGO enables us to visualise combined output from the established tools and websites including BUSCO (Sima et al., 2015), Genome stats from Mycocosm (http://mycocosm.jgi.doe.gov), Secretome prediction (Pellegrin et al., 2015), identified transposable elements (Morin et al., 2019), the CAZyme repertoire (http://www.cazy.org/). TINGO generates visual bubble charts from identified transposable elements (Morin et al., 2019).
*Please note that PRINGO and TINGO do not produce bioinformatic results. They are visualisation tools for complex genomic data.
Please install, R Studio, R 3.5.1
Please install R packages, egg 0.4.0, dplyr 0.7.6, ggplot2 3.0.0, gtools 3.8.1, ggpubr 0.2, reshape2 1.4.3, RColorBrewer 1.1-2
- Download files into a folder
- Automatically start up R Studio by clicking an R script
- Make sure the working directory is in the folder so that R recognises the input file.
- Read instructions in the scripts and execute the code (line by line).
- Figures are generated as output in the same folder.
*Please note that raw figures generated from the scripts were adjusted with Adobe Illustator for the final reader-friendly version.
We visualised the combined results of; 1) calculated BUSCO (genome completeness); 2) Genome stats from Mycocosm; 3) identified transposable elements; 4) the number of theoretically secreted proteins from secretome prediction (see Supplementary Table 2).
Input file: 135_Fungi_GF_Step2_GenomicInfoCombined.csv
Output file: 135_Fungi_GF_Step3_GenomicFeatures.pdf
We visualised the combined results of the genome size and identified transposable elements by grouping 135 fungi based on their lifestyles.
Input file: 135_Fungi_TINGO_Step5_Taxo_Coverage_Stats.csv
Output file: 135_Fungi_TINGO_Step6.5_LifeStyle_Box_GenomeSizeVsTEs.pdf
We visualised the combined results of identified transposable elements for 135 fungi using TINGO pipeline (Morin et al., 2019). The script produces two plots - The TE coverage in the genomes and the copy number of TE families. Please note that we only used the TE coverage plot in the manuscript.
Input file: 135_Fungi_TINGO_Step5_Taxo_Coverage_Stats.csv
Output files: 135_Fungi_TINGO_Step6.4_LifeStyle_TE_Coverage.pdf 135_Fungi_TINGO_Step6.4_LifeStyle_TE_CopyNumber.pdf
We selected the secreted CAZymes of interest from the count of secretomes and used as an input file. The script makes a heatmap showing the count of genes and legends separately. We put them together using Adobe Illustrator.
Input file: 135_fungi_PCWDE_Step1_CountOfCAZymes.csv
Output files: 135_fungi_PCWDE_Step2_PCWDEs_Secreted_legend.pdf (legend only) 135_fungi_PCWDE_Step2_PCWDEs_Secreted.pdf (heatmap)
- Morin, E., Miyauchi, S., San Clemente, H., Chen, E.C.H., Pelin, A., de la, Providencia, I., Ndikumana, S., Beaudet, D., Hainaut, M., Drula, E., Kuo, A., Tang, N., Roy, S., Viala, J., Henrissat, B., Grigoriev, I.V., Corradi, N., Roux, C. and Martin, F.M. (2019), Comparative genomics of Rhizophagus irregularis, R. cerebriforme, R. diaphanus and Gigaspora rosea highlights specific genetic features in Glomeromycotina. New Phytol, 222: 1584-1598. doi:10.1111/nph.15687
- Pellegrin, C., Morin, E., Martin, F. M. & Veneault-Fourrey, C. Comparative analysis of secretomes from ectomycorrhizal fungi with an emphasis on small-secreted proteins. Front. Microbiol. 6, (2015).
- Sima, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., and Zdobnov, E. M. (2015). Genome analysis BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31,3210–3212. doi: 10.1093/bioinformatics/btv351