/Zero2Hero_DataScientist

In this repo I put together a curriculum for some one to go from zero background to employable data scientist

Primary LanguageCommon Workflow Language

Zero2Hero_DataScientist

https://medium.com/@eking_30347/blogging-with-medium-and-github-pages-2ff40c870053 In this repo I put together a curriculum for some one to go from zero background to employable data scientist

Lower Tier

The very basics of linear algebra forms the basis of most complex machine learning algorithms. So its important to know the basics right. So start with Khan Academy.

https://www.khanacademy.org/math/linear-algebra

Programming fundamentals https://github.com/ForrestKnight/open-source-cs-python

OOP

Python/R https://academic.oup.com/bioinformatics/article/21/16/3427/215530

JavaScript

Shell Scripting/UNIX

git/SCRUM/CI/CD/TDD

Middle Tier

Here we study bioinformatics platforms, software and tools and pipelines related to current omics

  1. CDL/WDL https://www.youtube.com/watch?v=4J6kiYFrqdA&feature=emb_rel_end

  2. AWS Certifications https://github.com/cjgunase/aws-ccp/blob/master/010-cloud-concepts.pdf

https://www.udemy.com/course/aws-certified-cloud-practitioner-new/learn/lecture/20053382#overview

https://aws.amazon.com/health/

aws application https://medium.com/datadriveninvestor/tutorial-launch-your-personal-genomics-cloud-app-in-15-min-aws-genetics-python-ml-b0d1540e6e70

  1. Genomics in the cloud

  2. Biostar handbook

  3. Fundamentals of Genetics, Genomics, Proteomics, metagenomics.

Upper Tier

Deep Learning http://iamtrask.github.io/2015/07/12/basic-python-network/

Mathematics

https://www.ibi.vu.nl/teaching/masters/bi_tools/2007/tools_lec13_2007_handout.pdf https://github.com/thatSaneKid/fourier/blob/master/Fourier%20Transform%20-%20A%20Visual%20Introduction.ipynb

Bayesian Statistics

HMM

https://github.com/luisguiserrano/hmm/blob/master/Simple%20HMM.ipynb

Bioinformatics programing http://rosalind.info/problems/ba10a/

Bioinformatics pipelines https://galaxyproject.github.io/training-material/topics/variant-analysis/tutorials/dip/tutorial.html https://www.youtube.com/watch?v=Tivdr-2zQz4

Bioinformatics Visualizations https://galaxyproject.github.io/training-material/topics/variant-analysis/tutorials/dip/tutorial.html