SQL CARwash: Cleaning dirty data
Spend enough time around databases and inevitably you’ll come across one that has an obnoxious number of variations on city names: New York City. New York. NYC. NY. And yes, even NY City. If you’re not sure how to handle that, this session is for you. We’ll cover how to deal with multiple spellings and misspellings, strange date formats and category codes, as well as a few other tricks and tips for using SQL to clean data.
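As a rough illustration of the city-name problem described above, here is a minimal sketch that uses Python's built-in sqlite3 module to run the SQL. The table name `donors`, the column `city_clean`, and the sample rows are all hypothetical, and this is not necessarily the approach taught in the session. It also assumes that a bare "NY" in a city column means New York City, which you would want to verify against your own data before updating anything.

```python
import sqlite3

# Hypothetical sample data: one city spelled five different ways.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE donors (id INTEGER PRIMARY KEY, city TEXT)")
conn.executemany(
    "INSERT INTO donors (city) VALUES (?)",
    [("New York City",), ("New York",), ("NYC",), (" ny ",), ("NY City",), ("Dallas",)],
)

# Step 1: survey the variants. TRIM and UPPER collapse differences in
# whitespace and capitalization so only real spelling variants remain.
variants = conn.execute(
    """
    SELECT UPPER(TRIM(city)) AS city_key, COUNT(*) AS n
    FROM donors
    GROUP BY city_key
    ORDER BY city_key
    """
).fetchall()

# Step 2: write the cleaned value to a new column so the original
# column is preserved and the cleanup can be audited or redone.
conn.execute("ALTER TABLE donors ADD COLUMN city_clean TEXT")
conn.execute(
    """
    UPDATE donors
    SET city_clean = CASE
        -- Assumption: every one of these variants means New York City.
        WHEN UPPER(TRIM(city)) IN
            ('NEW YORK CITY', 'NEW YORK', 'NYC', 'NY', 'NY CITY')
            THEN 'New York'
        ELSE TRIM(city)
    END
    """
)

# Step 3: confirm the variants now group together.
cleaned = conn.execute(
    "SELECT city_clean, COUNT(*) FROM donors GROUP BY city_clean ORDER BY city_clean"
).fetchall()
print(cleaned)
```

Keeping the raw `city` column and writing to a separate `city_clean` column means a wrong guess about a variant can be corrected later without reloading the data.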
This session will be most useful if you are familiar with basic SQL statements.
Madi Alexander is a computational journalist at The Dallas Morning News. She was previously a data reporter at Bloomberg Government in Washington, D.C. Madi has a master's degree in journalism from the University of Missouri. @MadiLAlexander