/gcp-for-bioinformatics

GCP Essentials for Bioinformatics Researchers

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Google Cloud Platform (GCP) for Bioinformatics

This repository shows how to use ☁️Google Cloud Platform public cloud services to scale bioinformatics data analysis tasks using cloud best practices for GCP. This use cases featured as exampled are called any and all of the following: genomic-scale data workflows, pipelines, analysis or batch jobs.

This content is intended for researchers - in particular, this guide is for those who are NEW to working with GCP.
You have a number of options on how to use the materials provided in this course. A summary is shown below left.

This Repo includes content you can read, watch or run:

  • πŸ“— READ - one page of this Repo (MD page)
  • πŸ“Ί WATCH - linked YouTube screencasts
  • πŸ“™ RUN - Jupyter Notebook example
  • :octocat: TRY - linked GitHub Repos
  • πŸ“˜ EXPAND - linked (external) resources
  • πŸ” SCAN - search a list in this Repo

NOTE: If you are NEW to bioinformatics and have a computational background, see my 'study Repo' which includes links to explanations of bioinformatics concepts, tools and platforms - link


πŸ“Ί Click below to WATCH 'Lynn's Welcome Video' (4 min) on YouTube

Welcome to GCP for Bioinformatics


Why would I choose to use a public cloud vendor for bioinformatics?

⭐️ SAVE MONEY run (and pay for) scalable analysis jobs only when you need to run them
⭐️ SAVE TIME use vendor-managed infrastructure & best-practice patterns for fast repeatable research
πŸ“— READ the FAQ for GCP bioinformatics for this Repo
πŸ“• READ Nature article: "Cloud computing for genomic data analysis and collaboration"
πŸ“— READ the top 4 most common use cases for using the public cloud for bioinformatics researchers


Contibutions

We love contributions! See this short style guide when making pull requests to this repo.