Fuzzy matching is a process for linking up names that are similar but not quite the same. It has become an increasingly important part of data-led investigations as a way to identify connections between public figures, key people and companies that are relevant to a story. This class will cover how fuzzy matching typically fits into the investigative process, with some story examples.
Max Harlow, who developed the CSV Match command line tool, will show you how to run some of the different types of fuzzy matching on some real datasets, including the pros and cons of each.
This session is good for: People who feel comfortable using the command line.