Materials for this semester's Internet Scale Applications

MIT License

CS4501 - Internet Scale Applications

This course will provide a survey of methods for building large-scale internet websites and mobile apps. The intent is to build upon prior classes by explaining how theory meets practice. Topics covered will include scaling, security, large team software engineering, etc. There will be a series of cumulative course projects resulting in students building a working marketplace website. Weekly readings from industry and academic sources will complement the weekly lecture.

Prerequisites

CS3240 (Advanced Software Development) or equivalent experience building non-trivial Python/Django web applications is required. An understanding of issues around persistence, databases, concurrency, networking, etc., along with Linux experience, will be extremely helpful in this class.

A course project will be developed using Python, Django, MySQL, and Docker containers. A basic familiarity with HTML and CSS is preferred but not strictly required.

Project Overview

You will work in a team of three students to build a working marketplace website. Marketplaces such as Airbnb, Uber, DonorsChoose, eBay, Etsy and Watsi are both highly profitable when they succeed and technically challenging to build. It's up to you to decide what kind of marketplace your team will build. Specific instructions for each project will be posted and linked below throughout the semester.

Due dates are listed below for each part of the project. These may be revised as the term goes on based on how everyone is doing. You will be graded as a team (unless unusual conditions warrant giving different grades). Grades will be based on: completeness of solution, correctness of solution, and being on time completing each assignment.

Course materials

Syllabus and project assignments will be here in GitHub. Feel free to fork and send PRs with corrections, additions or any other changes you think would help fellow/future students.

Lecture slides are in Google Drive at https://drive.google.com/folderview?id=0BzWAJQVnIIRYfk9JUmwtbUVKS1pqb0k0Q2ZYU3pPZ3gxV2VnVDctVU51VjFYTTVaR25xR3c&usp=sharing. Note that a prior semester's slides can be found there for future lectures, but they may be out of date or updated this semester. Generally, the further out a lecture is, the more likely it is that its slides have not been reworked for this semester.

Readings are listed for each week and should be completed BEFORE the week they are assigned so that we can discuss them in class. There is also a useful high-level summary of many of the topics in this course: https://github.com/donnemartin/system-design-primer

Course Topics (subject to revision as course progresses)

1. Anatomy of the modern internet, websites and mobile apps

2. Requirements and documentation

  • User stories
  • Product requirement documents
  • Design and architecture documents

3. Service based architecture, part I

4. Databases

5. Service based architecture, part II: Multi-screen development for mobile and desktop

  • Proliferation of channels for consuming apps
  • App and experience logic server side vs client side
  • Responsive web design vs building different apps for different devices
  • Consistent experiences on mobile native, mobile web, desktop web, tablet, email etc
  • Service composition

6. Security

7. Messaging and queuing

8. Search

9. Users and reputation

10. Speed

11. Testing and DevOps

12. TBD

13. Advertising

14. Parallel batch processing

15. A/B testing and analytics

16. Traffic—SEO, SEM, Social, Paid Marketing, Email

17. TBD

  • Quiz #2

Grading

Grades will be computed approximately as follows:

  • 50% two quizzes
  • 50% class project

Note, these are guidelines and teaching staff may deviate as they believe the situation requires.

Project grading policies

  • 10 points off for being late
  • 10 points off minimum if project doesn't work
  • 1-5 points off for not following "best practices", e.g. poor code design, lack of abstraction, etc. (TBD)

Each team may get ONE project extension. You must ask for it by emailing your primary TA and Tom at least 72 hours (3 days) before the project is due.
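As a rough illustration of how the deductions listed above could combine, here is a minimal Python sketch. It is not the teaching staff's actual grading script. The function name `project_score`, its parameters, the 100-point base and the floor at zero are all assumptions made for the example, and the staff may deviate from these guidelines.

```python
def project_score(base_points=100, late=False, works=True,
                  extra_broken_penalty=0, best_practice_penalty=0):
    """Apply the listed deductions to a single project's score.

    Assumptions (not stated in the policy): each project is scored out
    of 100 points and the result never drops below zero.
    """
    if not 0 <= best_practice_penalty <= 5:
        raise ValueError("best-practice deduction must be between 0 and 5")
    if extra_broken_penalty < 0:
        raise ValueError("extra_broken_penalty cannot be negative")

    score = base_points
    if late:
        score -= 10  # 10 points off for being late
    if not works:
        # 10 points is the minimum deduction for a non-working project;
        # extra_broken_penalty models anything beyond that minimum.
        score -= 10 + extra_broken_penalty
    score -= best_practice_penalty  # 1-5 points off for poor practices
    return max(score, 0)


if __name__ == "__main__":
    # A late project that works but has some design problems.
    print(project_score(late=True, best_practice_penalty=3))
```

Under these assumptions, a late but working project with a 3-point best-practice deduction ends up at 87.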

Academic Honor Code

All work products that you submit for this class (projects, quizzes, or anything else) are assumed to be your own creation UNLESS you specifically call out that you are including someone else's work, either in spirit or verbatim. Violating this honor code will result in severe disciplinary action, potentially including referral to the University Honor Code committee, which may recommend expulsion.

If there is reason for you to submit others' work in your GitHub repo, then you must cite them as the author of the work. If their work has a copyright notice on it, make sure that copyright notice is preserved and that your work contains a line saying "Portions of this work copyright DATE by AUTHOR". If the work has a license agreement associated with it, make sure you are complying with the terms of the license agreement. If there is no license or copyright, you must still explicitly list the author(s) whose work you are including. This should be in the form of a comment at the top of the source file that includes their work.

Finally, if you are using someone else's work as the substantial basis of your work (even if you're not literally copying it character by character) then you are still copying their work and you must cite them.
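To make the citation comment described above concrete, here is a small Python sketch that builds one. The helper `attribution_header` and its parameters are hypothetical, and the author, date and license values in the usage line are made-up placeholders. Only the "Portions of this work copyright DATE by AUTHOR" wording comes from the policy itself. This header does not replace the original copyright notice, which must still be kept alongside it.

```python
def attribution_header(author, date=None, license_name=None):
    """Build the comment lines to put at the top of a source file.

    Hypothetical helper: only the "Portions of this work copyright DATE
    by AUTHOR" wording is taken from the course policy.
    """
    lines = []
    if date is not None:
        # The included work carries a copyright notice with a date.
        lines.append(f"# Portions of this work copyright {date} by {author}")
    else:
        # No copyright notice: the author must still be listed explicitly.
        lines.append(f"# Portions of this work by {author}")
    if license_name is not None:
        lines.append(f"# Used under the terms of: {license_name}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder values, not a real citation.
    print(attribution_header("A. Author", date="2015",
                             license_name="MIT License"))
```

Paste the resulting lines above the first line of code in the file that contains the borrowed work.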

Office Hours

The teaching staff for this semester are:

Office Hours, Project Review, Help, and Demo Hours (led by TAs):

  • Sai 10-11:30am Weds (Rice 340) and Fri (Olsson 001)
  • William 12-1:30pm Mon and Weds (Rice 436)
  • Winston 11:30-12:30pm Weds (Rice 340)
  • Dion TBD
  • Tom 1:00-2:00pm Mon (location TBD). Please email/slack me first to reserve a time. If you don't make an appointment, I may not be available.

Help is also always available from the teaching staff and your fellow classmates through Slack at https://cs4501-isa.slack.com.

We will use Collab announcements when I need to email everyone in the class about announcements, project assignments, etc. I'll send a Welcome announcement after the first class. If you don't receive it, check with me or the TAs to figure out what's wrong. If you added the class after the first day, please make sure to review the archive of announcements on Collab.

Finally, it's always better to ask questions, bring up problems, etc. sooner rather than later. Please don't be shy. I enjoy people coming to office hours, asking questions on Slack, bringing things up in class, etc.