PDF 2: Using OCR to extract data from PDFs
This class will cover basic approaches for getting text out of PDF documents using powerful and freely available tools. Participants will be introduced to basic concepts and walk through tackling common challenges encountered with tricky PDF documents.
This session is good for: People who are unfamiliar with the PDF-to-text tools or would like to learn how optical character recognition (OCR) tools can be used for extracting difficult text from images embedded in PDF document.
Chad Day is a reporter on the AP's investigative team in Washington. He covers the Mueller investigation and President Donald Trump. He previously reported on campaign finance, the Trump campaign, juvenile justice and child maltreatment. @ChadSDay
No tipsheets have yet been uploaded for this event.