Cart 0 $0.00
IRE favicon


Resource ID: #30216
Source: Emily Le Coz
Affiliation: USA TODAY
Date: 2023-06-22



Slides for "Extracting data from PDFs," a session at the IRE 2023 conference in Orlando on Thursday, June 22, 2023.

Session description: Join this class to learn how to liberate data trapped inside PDFs. This class will cover some basic approaches for getting text out of PDF documents using powerful and freely available tools. Participants will be introduced to basic concepts and some common challenges encountered when working with tricky PDF documents.

This session is good for: People who are unfamiliar with PDF-to-text tools or would like to learn how these tools can be used for extracting difficult text from images embedded in a PDF document.

It's encouraged, but not required, to get access ahead of the workshop to Google Pinpoint and its Beta Data Extraction Tool. Sign up for Google Pinpoint HERE and register for its the Beta Data Extraction Tool HERE.

109 Lee Hills Hall, Missouri School of Journalism   |   221 S. Eighth St., Columbia, MO 65201   |   573-882-2042   |   |   Privacy Policy
apartmentpenciluserscalendar-fullcrossmenu linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram
My cart
Your cart is empty.

Looks like you haven't made a choice yet.