Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using the popular Python libraries BeautifulSoup and Requests. Participants will practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for social scientists who are new to web scraping, but have some familiarity with Python or have attended the Introduction to Python workshop.
Computers with Python pre-loaded are available on a first-come, first-served basis. If you with to use your own laptop, please install the Anaconda distribution of Python 3.6 from https://www.continuum.io/downloads
If you are having trouble installing this version of Python, please contact data science services at firstname.lastname@example.org .
This workshop is free for Harvard and MIT affiliates. Click here to sign up!
This workshop will introduce you to the Unix shell, a power tool that allows people to do complex things with just a few keystrokes, combine existing programs in new ways, and automate repetitive tasks.
This introduction to the popular ggplot2 R graphics package will show you how to create a wide variety of graphical displays in R, using techniques such as layering, mapping variables to aesthetics, working withr scales, faceting, and themes.
This class will discuss the fundamentals of structured data, introduce you to using SQLite (a lightweight database available on all most computing platforms), and teach you the basics of querying and summarizing data with SQL