OCR on Linux: great news
Willem van der Walt
wvdwalt at csir.co.za
Fri Sep 29 02:57:17 EDT 2006
Hi,
I have compiled and tested tesseract.
IMHO it is better than gocr and ocrad, but it is finicky about the images
it handles.
It does not OCR just any TIFF file; the image has to be at a specific
resolution, among other requirements.
There is a script that prepares the image for it.
With ImageMagick one can convert almost anything into the required .tif.
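
The conversion step above could be scripted roughly as follows. This is only
a minimal sketch in Python, not the preparation script that ships with
tesseract. The 300 DPI default, the grayscale and uncompressed settings, and
the function names are assumptions made for illustration; check what your
tesseract build actually expects. It relies on ImageMagick's `convert` and the
`tesseract` binary being on the PATH.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(src, dst, dpi=300):
    """Build an ImageMagick command that writes src as a grayscale, uncompressed TIFF.

    The dpi default and the output settings are assumptions, not documented
    tesseract requirements.
    """
    return [
        "convert",
        "-density", str(dpi),    # rasterization resolution for vector input such as PDF
        str(src),
        "-colorspace", "Gray",   # drop color information
        "-depth", "8",           # 8 bits per sample
        "-compress", "none",     # write an uncompressed TIFF
        str(dst),
    ]


def build_tesseract_command(tif, output_base):
    """Build the tesseract call; tesseract writes its text to output_base + '.txt'."""
    return ["tesseract", str(tif), str(output_base)]


def ocr_image(src, dpi=300):
    """Convert src to TIFF, run tesseract on it, and return the path of the text file."""
    for tool in ("convert", "tesseract"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    src = Path(src)
    tif = src.with_name(src.stem + "_ocr.tif")
    output_base = src.with_name(src.stem)
    subprocess.run(build_convert_command(src, tif, dpi), check=True)
    subprocess.run(build_tesseract_command(tif, output_base), check=True)
    return output_base.with_name(output_base.name + ".txt")
```

Calling `ocr_image("scan.png")` would, under these assumptions, leave the
recognized text in `scan.txt` next to the original image.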
I had a funny experience with this program on my Fedora 3 box.
It compiled, but hung when I ran it. Under Debian, it worked.
I suspect some gcc version issue.
It is good news in general, and if I had to do a lot of OCR now, I would
use it as is.
I would say it is about on par with the old Easy Scan software from Arkenstone.
It is also rather slow.
Regards, Willem
On Thu, 28 Sep 2006, Michael Whapples wrote:
> By the way, the URL you gave is wrong; it should be www.sf.net/projects/tesseract-ocr
>
> From
> Michael Whapples
> ----- Original Message -----
> From: "Lorenzo Taylor" <lorenzo at taylor.homelinux.net>
> To: "speakup" <speakup at braille.uwo.ca>
> Sent: Thursday, September 28, 2006 10:04 PM
> Subject: OCR on Linux: great news
>
>
> > I just read a review on linux.com about a new open source OCR engine for
> > Linux. Well, it's not exactly new, but it has recently been open
> > sourced by none other than Google, HP and UNLV. According to the
> > review, it's about 97.74% accurate, although it only recognizes tif files
> > and doesn't yet correctly understand page layouts with more than 1
> > column. However, these are planned features in future releases. This
> > app is called tesseract, and can be found at
> >
> > http://sourceforge.net/projects/tesseract
> >
> > HTH somebody,
> > Lorenzo
> > - --
> > I've always found anomalies to be very relaxing. It's a curse.
> > - --Jadzia Dax: Star Trek Deep Space Nine (The Assignment)
>