
Lecture notes for EC 607


Data science for economists

Lecture outline and quicklinks

  1. Introduction: Motivation, software installation, and data visualization [Slides]
  2. Version control with Git(Hub) [Slides]
  3. Learning to love the shell [Slides]
  4. R language basics [Slides]
  5. Data wrangling and tidying with the “Tidyverse” [Slides / Notebook]
  6. Webscraping: (1) Server-side and CSS [Notebook]
  7. Webscraping: (2) Client-side and APIs [Notebook]
  8. Regression analysis in R [Notebook]
  9. Spatial analysis in R [Notebook]
  10. Functions in R: (1) Introductory concepts [Notebook]
  11. Functions in R: (2) Advanced concepts [Notebook]
  12. Parallel programming [Notebook]
  13. Docker [rOpenSci tutorial]
  14. Cloud computing with Google Compute Engine [Notebook]
  15. High performance computing (UO Talapas cluster) [Slides from Nick Maggio guest lecture.]
  16. Databases: SQL(ite) and BigQuery [Notebook]
  17. Spark [Notebook]
  18. Machine learning: (1)
  19. Machine learning: (2)

Details

This is a graduate economics seminar taught by Grant McDermott at the University of Oregon.

Please read the syllabus before you go through any of the lectures. It details the software requirements and installation steps, and will give you a better sense of the aims and scope of the course. The syllabus also ends with an "FAQ" section that covers frequently asked questions (or, at least, potentially asked questions). Speaking of which, here follow answers to some questions that are more specifically related to this repo.

FAQ

How do I download this material and keep up to date with any changes?

Please note that this is a work in progress, with new material being added every week.

If you just want to read the lecture slides or HTML notebooks in your browser, then you should simply scroll up to the Lecture outline and quicklinks section at the top of this page. Completed lectures will be hyperlinked as soon as they have been added. Remember to check back in regularly to get any updates. Or, you can watch or star the repo to get notified automatically.

If you actually want to run the analysis and code on your own system (highly recommended), then you will need to download the material to your local machine. The best way to do this is to clone the repo via Git and then pull regularly to get updates. Please take a look at these slides if you are unfamiliar with Git or are unsure how to do any of that. Once that's done, you will find each lecture contained in a numbered folder (e.g. 01-intro). The lectures themselves are written in R Markdown and then exported to HTML format. Click on the HTML files if you just want to view the slides or notebooks.
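For anyone who prefers to see the clone-then-pull routine spelled out, here is a minimal sketch. The helper name `sync_lectures` and the default folder name `lectures` are my own illustrative choices, and the URL in the example call is a placeholder; use the address shown under the repo's "Code" button instead.

```shell
# Clone the repo on first use; on later runs, pull to pick up new lectures.
# Usage: sync_lectures <repo-url> [target-dir]
# NOTE: sync_lectures is a hypothetical helper, not part of the course material.
sync_lectures() {
  repo_url="$1"
  target_dir="${2:-lectures}"
  if [ -d "$target_dir/.git" ]; then
    # Already cloned: fast-forward the local copy to the latest upstream commit.
    git -C "$target_dir" pull --ff-only
  else
    # First time: copy the whole repository to the local machine.
    git clone "$repo_url" "$target_dir"
  fi
}

# Example call (placeholder URL, replace with the real one):
# sync_lectures "https://github.com/<owner>/lectures.git"
```

The `--ff-only` flag is a cautious choice: if you have edited files locally and your history has diverged from the upstream repo, the pull stops with an error instead of creating a merge commit behind your back.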

I've spotted a mistake or would like to contribute

Please open a new issue. Better yet, please fork the repo and submit an upstream pull request. I'm very grateful for any contributions, but may be slow to respond while this course is still being developed. Similarly, I am unlikely to help with software troubleshooting or conceptual difficulties for non-enrolled students. Others may feel free to jump in, though.

Can I use/adapt your material for a similar course that I'm teaching?

Sure. That's partly why I have made everything publicly available. I only ask two favours. 1) Please let me know (email/Twitter) if you do use material from this course, or have found it useful in other ways. 2) An acknowledgment somewhere in your own syllabus or notes would be much appreciated.

Are you willing to teach a (condensed) version of this course at my institution?

Possibly. Please contact me if you would like to discuss further.

Do you plan to turn these lecture notes into a book?

Depends on a lot of things and I'm too time-constrained right now... but I'm thinking about it. Preliminary working title: "Data science for economists (and other animals)".

License

The material in this repository is made available under the MIT license.