The Google Code Blog announced that Google has "re-released" the Tesseract OCR software to the open source community. OCR, optical character recognition, is the technology for converting text on a physical paper into computer based text. So if you have a ton of papers you typed up in your college days and you want them stored in digital format, you can use OCR to translate those documents for you.
Tesseract was originally developed by HP between the years of 1985 and 1995. In 2005 HP and University of Nevada in Las Vegas opened it to the community. Google claims that Tesseract OCR is "far more accurate than any other Open Source OCR package out there." Some more detail at Computing.co.uk.
The Original Search Marketing Event is Back!
SES Denver (Oct 16) offers an intense day of learning all the critical aspects of search engine optimization (SEO) and paid search advertising (PPC). The mission of SES remains the same as it did from the start - to help you master being found on search engines. Register today!