Setting up a reproducible data analysis workflow in R

  • Event: 2018 CAR Conference
  • Speaker: Andrew Tran of The Washington Post
  • Date/Time: Saturday, Mar. 10 at 3:30pm
  • Location: Northwestern/Ohio St
  • Audio file: No audio file available.

You will save time, produce better results, create more trusted analyses, reduce the risk of errors and encourage collaboration by implementing reproducible data analysis workflow techniques for data journalism. We will go over R Notebooks and R Markdown, which weave together narrative text and code to produce elegantly formatted PDF and HTML reports for sharing. We will walk through hosting these reports and raw data files on GitHub Pages, and discuss best practices for structuring your projects and repos. If there is time, you will learn how to turn specific scripts into generalized functions for use in future analyses.

This session is good for: R users interested in improving their workflow

Speaker Bios

  • Andrew Ba Tran is an investigative data reporter on The Washington Post's rapid response team. He was previously a data editor in Connecticut, where he investigated racial profiling by police, and before that a data producer in Boston. Other East Coast newsrooms he's made his way through include The Virginian-Pilot and the Sun-Sentinel. He's a Metpro Fellow and a Chips Quinn Scholar, too.
