Source:1494Jeff Ernsthausen, Derek Eder, Eric van Zanten, Forest Gregg
Affiliation:The Atlanta Journal-Constitution, DataMade
Tired of popping the same data set into OpenRefine every time you want to answer basic questions like “who gave the most money to politicians in Idaho this year?” Sure you are. We all are. Then come see a demonstration of Dedupe, a tool that uses machine learning to identify unique individuals, organizations and other entities in the kinds of messy datasets that journalists encounter the most. We’ll go over the basics of how the tool works and give a demonstration of how to use it to find the unique entities in your datasets.