/algsdatasci

CSOR-4246 Algorithms for Data Science

Algorithms

Course page

Notes

09-08-2015

09-10-2015

09-15-2015

09-17-2015

09-22-2015

09-24-2015

09-29-2015

Class Information

Homeworks will use IPython

Topics

  • Asymptotics, searching, fast integer and matrix multiplication
  • Graph algorithms (BFS, DFS, shortest paths)
  • Data compression
  • Hashing, Bloom filters, count-min sketch
  • Dynamic programming
  • Network flows
  • Linear programming
  • NP-completeness
  • Approximation algorithms
  • The Web graph, Hubs & Authorities, Page Rank
  • SVD for PCA
  • Clustering
  • Streaming

Algorithms

Definition

  • transforms one set of values into a new set of values
  • efficiency in terms of time and space

Running Time

Definition: number of primitive computational steps performed For example,

  • arithmetic
    • add
    • subtract
    • multiply
    • divide
  • data movement
    • load
    • store
    • copy
  • control
    • branching
    • subroutine call
    • return