How and why to make your data analysis reproducible
You understand how you processed your data. Does your editor? Your reader? You, in six months? Without a replicable approach to extracting, transforming and loading data, we are often frustrated in our efforts to share or update our work. Join us for a panel discussion of reproducible data workflows. We’ll talk about why we use standardized processes for collecting, cleaning and analyzing data, and share practices that work for us. We’ll also discuss strategies for smart human intervention (i.e. reporting, logging and documentation) in automated workflows.
Hannah Cushman Garland is a wayward journalist turned software developer. She cut her teeth on public life in mid-Missouri, covering municipal economic development and elections. An alumna of the Missouri School of Journalism and a veteran of the Associated Press, Hannah remains deeply interested in how information is consumed, shared, and acted upon. https://hancush.github.io
Ryann Grochowski Jones is the data editor at ProPublica. Previously, she was deputy editor for data at ProPublica and a data reporter at inewsource in San Diego. She received her master's degree from the University of Missouri School of Journalism, where she was a data librarian for IRE/NICAR. Ryann began her career as a municipal beat reporter for her hometown newspaper in Wilkes-Barre, Pennsylvania. @ryanngro
Jeremy Singer-Vine is the data editor at BuzzFeed News. He also publishes Data Is Plural, a weekly newsletter of useful/curious datasets. Website: jsvine.com
Andrew Ba Tran is an investigative data reporter on The Washington Post's rapid response team. He previously was a data editor in Connecticut, where he investigated racial profiling by police, and before that was a data producer in Boston. Other East Coast newsrooms he's made his way through include The Virginian-Pilot and the Sun-Sentinel. He’s a Metpro Fellow and a Chips Quinn Scholar, too.