NOTICE: The master branch you are reading now is the WIP version 2. The chapter structure is different from version 1. If you want to refer to version 1, please follow this link: https://github.com/hupili/python-for-data-and-media-communication-gitbook/tree/v1
This is an open source textbook for communication students to learn Python and data skills. The purpose of this course is to motivate students to become T-shaped talents in the communications field. The course involves intensive Python training and a quest to solve practical problems. This open source book collects all the materials related to the lab exercises, covering basic Python, data scraping, table manipulation and data mining.
You can get started by reading the Week 00 note. This note requires no background in programming or technology. It walks you through our learning environment so you know how to find learning materials and seek help effectively. You are also welcome to contact the Fall 2018 team with any questions, suggestions or discussions. The best way to ask a question is to create an issue in this repo. Our teaching team stands by and regularly checks the issue tracker.
This repo is the textbook for Pili Hu's offering of COMM7780/JOUR7280 at HKBU.
- Notes: Week 00 - GitHub and markdown
- Notes: Week 01 - Terminal, Python, Jupyter Notebook
- Notes: Week 02 - Python as a powerful calculator: basics and arithmetic
- Notes: Week 03 - Control flow
- Notes: Week 04 - Data Structure
- Notes: Week 05 - Serialization: File, CSV, JSON
- Notes: Week 06 - API
- Notes: Week 07 - Get semi-structured data - Web scraping
- Notes: Week 08 - Advanced scraping: anti-crawler, browser emulation and other nitty gritty
- Notes: Week 09 - Work with table: data cleaning and pre-processing
- Notes: Week 10 - Work with table: 1D analysis and 2D analysis
- Notes: Week 11 - Present findings: data visualization and reproducible report
- Notes: Week 12 - Text data
- Notes: Week 13 - Datetime and Time Series
- Notes: Week 14 - Network data
- Notes: Week 15 - Geographical data
- Notes: Week 16 - High Dimensional Data
- Notes: Week 17 - Machine learning primer: clustering, classification, regression
- Notes: Week 18 - Python Engineering and Data Engineering
You can search through our whole repo, including all the notes and FAQs, using the built-in GitHub search function. For example, you can search for "encoding".
- Course Admin
- Grading Scheme
- Guide for Contributor
- Projects Guideline
- Set Up Python Environment on Windows and Mac
- Shell
- Python Language Basics
- Python 2 vs. Python 3
- Dataprep
- Pro Tips
- Resources
- Guide for contributor
- GitHub
- HTML
- Encoding
- pip
- Computational Thinking
- File I/O
- Leetcode
- module: geopy
- module: requests
- module: csv
- module: BeautifulSoup
- module: jupyter
- module: pandas
- module: seaborn
- module: matplotlib
- module: lxml
- module: python-twitter
- module: datetime
- module: selenium
- module: wordcloud
- module: geopandas
CC-BY-NC-ND