/dataviz-1

Support files for "Data Visualization: A Practical Introduction"

dataviz

Kieran Healy

This repository is outdated and not maintained. Please install and use the socviz library instead

This repository contains support files for Data Visualization: A Practical Introduction and courses taught from it. It is an RStudio project and contains a series of R Markdown files organized in parallel to the book's chapters. The R Markdown files contain code to reproduce almost all the figures in the book, along with space for your own notes. A more general note-taking template can be found in the template/ folder.

The contents of this repository---in updated and imporoved form---are included in the socviz library of data and functions that accompanies the book. They can be conveniently extracted from there using socviz's setup_course_notes() function. See the socviz library documentation for more details. This repository is no longer updated or maintained.

Data Visualization: A Practical Introduction teaches you data visualization using R and ggplot2 in a clear, sensible, and reproducible way. It is published by Princeton University Press.

You can purchase the book from Amazon, from Powell's, or from the Publisher.

Through a series of worked examples, the book shows you how to build plots piece by piece, beginning with scatterplots and summaries of single variables, then moving on to more complex graphics. Topics covered include plotting continuous and categorical variables, layering information on graphics; faceting grouped data to produce effective “small multiple” plots; transforming data to easily produce visual summaries on the graph such as trend lines, linear fits, error ranges, and boxplots; creating maps, and also some alternatives to maps worth considering when presenting country- or state-level data. Plotting estimates from statistical models and from complex survey designs are also covered. The book then explores the process of refining plots to accomplish common tasks such as highlighting key features of the data, labeling particular items of interest, annotating plots, and changing their overall appearance. Finally, it discusses some strategies for presenting graphical results in different formats, and to different sorts of audiences.

Learning how to visualize data effectively is more than just knowing how to write code that produces figures from data. This book will teach you how to do that. But it will also teach you how to think about the information you want to show, and how to consider the audience you are showing it to—including the most common case, when the audience is yourself.