Free/inexpensive OCR software to convert PDF to searchable PDF or DOC?

geepondy

I'm not sure where on the internet to ask this, so I'll throw it out to the cafe. I have a scanned PDF file. Is there any free or inexpensive OCR software that can read the file and convert it to either a searchable PDF or a DOC file? ABBYY FineReader seems to be the holy grail, but it's also $300. I found one free program, "simple OCR", but it only reads image files, not PDFs.
 
The newer versions of Office (2003 onward, IIRC) include a pretty decent OCR package.

It will only import MDI or TIFF files, but you can easily sort that out by printing the PDF with the Microsoft Office Document Image Writer driver.

PM me if you haven't got the software and want me to have a stab at it.
 
This might do the trick. It's a PDF-to-Word converter. It has worked fine for me. Just click on "get keycode" and they will send it to your email. Simple.
 
There are tons of free programs that convert to/from PDF/Word, including Google's.

However, the OP is asking about OCR specifically. I don't see where Google does OCR conversion, but I didn't read all their details.
 
From a little searching... If you put the PDF on a web server and link to it from a publicly available web page, the Google bots will OCR it automatically.

I'd assume Google has poured a fair amount of resources into OCR, since they've now digitized and OCR'ed nearly every book ever written.
 
Good follow-up. Knowing they did that with books meant they had the technology, but I didn't see it mentioned or built into Google Docs as a feature. It's an interesting way around what appears to be a missing feature. I imagine there is a reason they don't include it outright.
 