Instructor
William S. Cleveland, Haas 222, wsc@purdue.edu
Time
TueThur 1:30 - 2:45
Prerequisites
Knowledge of basic probability; mathematics through
calculus and linear algebra; and basic statistics including least-squares
fitting of parametric functions to data. No previous knowledge of data
visualization is needed.
Primary Audience
Graduate students in university departments where data are analyzed.
Location
Lectures and Labs: Physics 26.
If you are not familiar with Physics, fastest route is to enter main entrance
on Northwestern, go down stairs and through glass doors, turn left, and it is
straight ahead.
Description
BACKGROUND. Visual displays allow us to explore data to see overall patterns and to see detailed behavior; no other approach can compete in revealing the structure of data so thoroughly. Analyses without visualization run the risk of using inappropriate methods and models for the data. They run the risk of missing important unexpected behavior. They do not preserve the information in the data.
CONTENT. The course content will focus fundamentally on how to analyze data. Through many case studies, it will present visualization methods, going through a number of standard numerical methods and models for statistical analysis, showing how visualization enhances these methods and models. This illustrates the use of the visualization methods, and demonstrates why they are essential to valid analyses that preserve the information in data. In addition, lectures will cover the lattice graphics system in R, which can be used to carry out all methods discussed in the course. To support this, a certain number of classes will consist of labs in which participants will use lattice.
Participant Responsibilities
Participants are expected to attend class and submet all homework assignments. Homework will consist of analyzing data in R, using lattice graphics to carry out visualization methods presented in the course. There will be no tests for final exam.
Lectures on Visualization Methods
Visualization Methods and Their Application
Book: Visualizing Data . Provided by instructor.
Trellis Display for Modeling Data from Designed Experiments
R and Lattice Graphics
An Easy-Going Introduction to Lattice Graphics
Easy-Going Introduction to Lattice Graphics
http://ml.stat.purdue.edu/stat695t/writings/sarkar.lattice.book
Deepayan Sarkar: Lattice: Multivariate Data Visualization in R
A book on trellis display in R with Lattice Graphics.
lattice.Rdata
An R data file with R dataset objects and R functions that make most of the
graphs in the book Visualizing Data.
psl.rawsold.RData
An R data frame for Chicago Bears Permanent Seat Licenses (PSLs) sold from 2007 to 2012.
book functions in text
A .txt file with all book functions. Each is defined. In R one can use source()
to read the functions into R.