The Google Code Blog announced that Google has "re-released" the Tesseract OCR software to the open source community. OCR, optical character recognition, is the technology for converting text on a physical paper into computer based text. So if you have a ton of papers you typed up in your college days and you want them stored in digital format, you can use OCR to translate those documents for you.
Tesseract was originally developed by HP between the years of 1985 and 1995. In 2005 HP and University of Nevada in Las Vegas opened it to the community. Google claims that Tesseract OCR is "far more accurate than any other Open Source OCR package out there." Some more detail at Computing.co.uk.
Meet Your Favorite Search Engine Watch Contributors
Many of SEW's leading expert contributors will be at ClickZ Live, the new online and digital marketing event kicking off in New York (March 31-April 3). Hear from the likes of: Thom Craver, Josh Braaten, Lisa Barone, Simon Heseltine, Josh McCoy, Lisa Raehsler, Greg Jarboe, Dan Cristo, Joseph Kerschbaum, John Gagnon, Eric Enge and more!