/seq_toolkit

Various scripts for bioinformatics analysis pipeline.

Primary LanguageJupyter Notebook

Description:

This repository contains various scripts for bioinformatics analysis pipeline.

  • ncbi_nucleotide_db_seq_data_downloader.py: used to download sequence fasta files from NCBI-Nucleotide.
  • chromosomes_counter.py: used to count the number of chromosomes in a sequencing file.
  • genes_variants_counter.ipynb: used to identify genes, transcript variants, and other stuff.
  • files_merger.py: used to merge multiple fasta files into one.
  • get_sra_data.sh: used to download sequence SRA and fastq files from NCBI-SRA database.