We offer a series of data analysis and programming workshops to members of the Harvard and MIT communities. If you are not a Harvard or MIT affiliate, or if your schedule does not permit you to attend these workshops, you can still use our workshop materials for self-study.

# Workshop Materials

### Python Introduction

### Description

Materials for the "Python Introduction" workshop. This hands-on workshop introduces the basic elements of Python by working through an example of how to analyze text data. Python is a general-purpose programming language commonly used for data cleaning, analysis, visualization, and other applications. Note that the focus of the workshop is on **how** to use Python rather than **why** you might want to use Python over other software. This workshop is appropriate for those with little or no prior experience with Python.
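For a sense of what analyzing text data in Python can look like, here is a minimal sketch. It is not taken from the workshop materials; the function name `word_counts` and the `sample` text are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter


def word_counts(text):
    """Return a Counter mapping each lowercase word in text to its frequency."""
    # Illustrative helper, not part of the workshop materials.
    # Lowercase the text, then pull out runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


# Made-up sample text for demonstration purposes only.
sample = "The cat sat on the mat. The mat was flat."
counts = word_counts(sample)

# Show the two most frequent words and their counts.
print(counts.most_common(2))
```

The sketch uses only the Python standard library, so it should run on any recent Python 3 installation without extra setup.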

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### Python Web Scraping

### Description

Materials for the "Python Web Scraping" workshop. This hands-on workshop introduces basic techniques for web scraping using popular Python libraries. This is a somewhat challenging, intermediate-level workshop appropriate for those who have been using Python for at least a few months. You should be familiar with all of the material in the Python Introduction workshop and have used these skills in your own projects to the point where you are comfortable with them.
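As a rough illustration of one basic scraping step, extracting links from HTML, here is a minimal sketch. It is an assumption-laden example rather than workshop content: it uses the standard library's `html.parser` instead of whichever libraries the workshop covers, it parses a made-up HTML string rather than fetching a live page, and the class name `LinkCollector` is hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser.

    Illustrative class, not part of the workshop materials.
    """

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag's attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# Made-up HTML used in place of a downloaded page.
page = (
    '<html><body><a href="/one">One</a>'
    "<p>text</p>"
    '<a href="/two">Two</a></body></html>'
)

collector = LinkCollector()
collector.feed(page)
print(collector.links)
```

Working from a fixed string keeps the example self-contained; a real scraper would also need to download pages and respect each site's terms of use.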

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### R Introduction

### Description

Materials for the "R Introduction" workshop. This hands-on workshop will teach you how to use R to import and manipulate data, install and manage packages, conduct basic statistical analyses, and create common graphical displays. Note that the focus of the workshop is on **how** to use R rather than **why** you might want to use R over other software. This workshop is appropriate for those with little or no prior experience with R.

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### R Regression models

### Description

Materials for the "R Regression models" workshop. This hands-on workshop demonstrates how to apply a variety of statistical procedures using R, including multiple regression, modeling with categorical variables, and model diagnostics and comparison. Note that the focus of the workshop is on **how** to use R to fit models: **we do not teach the theory behind the models and assume that you already have a solid background in statistical modeling and want to apply this in R**. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familiar with all of the material in the R Introduction workshop and have used these skills in your own projects.

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### R graphics

### Description

Materials for the "R graphics" workshop. This hands-on workshop provides an introduction to the popular ggplot2 R graphics package. It covers how to create a wide variety of graphical displays in R, using techniques such as layering, mapping variables to aesthetics, working with scales, faceting, and themes. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familiar with all of the material in the R Introduction workshop and have used these skills in your own projects.

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### R data wrangling

### Description

Materials for the "R data wrangling" workshop. This hands-on workshop prepares you for dealing with messy data by walking you through a real-life example. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familiar with all of the material in the R Introduction workshop and have used these skills in your own projects.

### Materials and resources

- Setup instructions for installing software.
- All workshop materials as a zip compressed archive.
- Static workshop notes with code examples and results.

### Stata Introduction

### Description

Materials for the "Stata Introduction" workshop. This hands-on workshop provides an introduction to Stata, including how to import and manipulate data, as well as calculate descriptive statistics. This workshop is appropriate for those with little or no prior experience with Stata.

### Materials and resources

- Stata software downloads for Harvard FAS affiliates.
- Static workshop notes with code examples and results.
- All workshop materials as a zip compressed archive.

### Introduction to Programming

### Description

Materials for the "Introduction to Programming for Researchers" workshop.

### Materials and resources

- All workshop data on Dataverse.
- Workshop code on GitHub.

### Data Science Tools

### Description

Materials for the "Data Science Tools" workshop. In this workshop we look at the kinds of tools that data scientists use (programming languages, statistics packages, version control, text editors and integrated development environments) and discuss the pros and cons of popular alternatives.

### Materials and resources

- All workshop materials as a zip compressed archive.
- Workshop notes with code examples and results.

Materials from discontinued workshops are unmaintained, but remain available here.