SQL CARwash: Cleaning dirty data
Spend enough time around databases and inevitably you’ll come across one that has an obnoxious number of variations on city names: New York City. New York. NYC. NY. And yes, even NY City. If you’re not sure how to handle that, this session is for you. We’ll cover how to deal with multiple spellings and misspellings, strange date formats and category codes, as well as a few other tricks and tips for using SQL to clean data.
This session will be most useful if you are familiar with basic SQL statements.
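As a rough illustration of the kind of cleanup described above, here is a minimal sketch that runs plain SQL through Python's built-in sqlite3 module. It is not taken from the session materials. The table name visits, its columns, and the sample rows are all hypothetical, and treating a bare "NY" as the city rather than the state is an assumption you would want to check against your own data.

```python
import sqlite3

# In-memory database with a hypothetical table of messy records.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE visits (id INTEGER PRIMARY KEY, city TEXT, visit_date TEXT)"
)
conn.executemany(
    "INSERT INTO visits (city, visit_date) VALUES (?, ?)",
    [
        ("New York City", "03/15/2019"),
        (" new york ", "12/01/2018"),
        ("NYC", "07/04/2019"),
        ("NY", "01/09/2020"),
        ("NY City", "10/31/2019"),
        ("Dallas", "02/14/2020"),
    ],
)

# Collapse known spelling variants into one canonical value.
# TRIM and UPPER make the match ignore stray spaces and letter case.
# Assumption: a bare 'NY' means the city, not the state.
conn.execute(
    """
    UPDATE visits
    SET city = 'New York'
    WHERE UPPER(TRIM(city)) IN
        ('NEW YORK CITY', 'NEW YORK', 'NYC', 'NY', 'NY CITY')
    """
)

# Rewrite MM/DD/YYYY text dates as sortable YYYY-MM-DD strings.
# The LIKE pattern limits the update to rows in exactly that shape.
conn.execute(
    """
    UPDATE visits
    SET visit_date = SUBSTR(visit_date, 7, 4) || '-' ||
                     SUBSTR(visit_date, 1, 2) || '-' ||
                     SUBSTR(visit_date, 4, 2)
    WHERE visit_date LIKE '__/__/____'
    """
)

# Check the result: one row per distinct city after cleaning.
city_counts = conn.execute(
    "SELECT city, COUNT(*) FROM visits GROUP BY city ORDER BY city"
).fetchall()
dates = [row[0] for row in conn.execute(
    "SELECT visit_date FROM visits ORDER BY id"
)]
print(city_counts)
print(dates)
```

Before running an UPDATE like this on real data, it usually helps to run a SELECT with GROUP BY on the raw column first, so you can see every variant that exists and decide which ones truly belong together.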
Madi Alexander is a computational journalist at The Dallas Morning News. She was previously a data reporter at Bloomberg Government in Washington, D.C. Madi has a master's degree in journalism from the University of Missouri. She volunteers for the Dallas puppy-raising chapter of Guide Dogs for the Blind. Madi also keeps a database of all the national parks and historical sites she has visited — a total of 40 thus far. @MadiLAlexander
No tipsheets have yet been uploaded for this event.