

Resource ID: #6081
Subject: 2022 NICAR Conference Tip Sheets
Source: Chad Day
Affiliation: The Wall Street Journal
Date: 2022-03-04



This class seeks to help you solve a common problem in journalism: data stored in a computer-generated PDF or, worse, an image PDF. We'll first walk through how to extract text from a computer-generated PDF using a command-line tool. Then we'll step up to Optical Character Recognition, or OCR, to work on image files.

This session is good for: People with experience using their computer's command-line interface.

109 Lee Hills Hall, Missouri School of Journalism | 221 S. Eighth St., Columbia, MO 65201 | 573-882-2042