/ds-quickstart

Data science quickstart

Primary LanguageScala

ds-quickstart

Data science quickstart

Getting going with the right tools to help you get the job done.

Intro

Overcome the barriers to just getting started.

Broken down into a series of projects, each for a different tool.

Each project will contain the following.

  • Links to source / documentations
  • Environment setup instructions
  • "hello world"
  • Just the basics - examples of most common actions
  • Cheat sheet - a one-page reference for important commands

Assumed starting point: osx (High Sierra) Will use Homebrew package manager for installations:

/usr/bin/ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"

Sections

Planned & completed projects

  • Shell
    • vim
  • Git(hub)
  • Python
    • Pandas
    • scikit-learn
    • Jupyter
  • Scala
    • sbt
    • maven
    • gradle
      • groovy
  • Spark
    • SQL
    • MLlib
    • zeppelin (databricks)
  • regex
  • Documentation
    • Markdown
    • Latex
    • Dot