Python: Let's scrape a website (repeat)

  • Event: 2019 CAR Conference
  • Speaker: Mike Stucka of Gannett/USA TODAY Network
  • Date/Time: Saturday, Mar. 9 at 9:00am
  • Location: Diamond Cove - conference level
  • Audio file: No audio file available.

This hands-on training will illustrate how Python can be used to grab a lot of data from a website at once, whether by pulling content from a page or interacting with forms. You’ll want to be comfortable writing loops in Python, though you won’t necessarily need to be able to write a function from scratch!

This session is good for people who feel comfortable with Python’s data types and control flow (if/else, loops). Experience with HTML is a plus but not necessary. Note: It would be useful to attend the session "How it Works: The Internet" in advance if you’re not familiar with the topic already.

Speaker Bios

  • Mike Stucka has been the self-described data dork at The Palm Beach Post since early 2016. Before that, he used data on the beat at places like The Salem (Mass.) News and The Telegraph of Macon, Ga. He started with data at his college paper, when he still had hair. He is a graduate of Northeastern University, Loyola University Chicago, and a great IRE bootcamp a decade ago. @mikestucka

Related Tipsheets

No tipsheets have yet been uploaded for this event.