I wonder how hard it would be to run a couple pages of dense print (though in a ...

avian · on Oct 5, 2017

On several separate occasions I tried to use Tesseract to OCR base64 and similar text that was printed in a normal monospace font (i.e. not a special OCR font) and scanned.

I never got close to useful results. I tried limiting the alphabet and disabling the language models as far as I could, and at best I got a few recognizable character sequences out of the whole page. My impression is that Tesseract depends heavily on being able to split text into English words and on having easily separated paragraphs.
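For anyone who wants to repeat the experiment, here is a rough sketch of what "limiting the alphabet and disabling language models" can look like with the Tesseract command line. It is not the exact setup used above. The helper names `build_tesseract_command` and `ocr_base64_page` are made up for illustration; `tessedit_char_whitelist`, `load_system_dawg`, `load_freq_dawg` and `--psm` are real Tesseract options, but how much they help varies with the Tesseract version and engine.

```python
import shutil
import string
import subprocess

# Standard base64 alphabet (RFC 4648), including the "=" padding character.
BASE64_ALPHABET = (
    string.ascii_uppercase + string.ascii_lowercase + string.digits + "+/="
)


def build_tesseract_command(image_path, whitelist=BASE64_ALPHABET):
    """Return the argv list for a whitelisted, dictionary-free Tesseract run.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    return [
        "tesseract", image_path, "stdout",
        "--psm", "6",                                  # one uniform block of text
        "-c", f"tessedit_char_whitelist={whitelist}",  # restrict the alphabet
        "-c", "load_system_dawg=0",                    # skip the main dictionary
        "-c", "load_freq_dawg=0",                      # skip the frequent-words list
    ]


def ocr_base64_page(image_path):
    """Run Tesseract on a scanned page and return the text without whitespace.

    Assumes the tesseract binary is installed and on PATH.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract binary not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path),
        capture_output=True, text=True, check=True,
    )
    # Remove line breaks and spaces so the result can be fed to a base64 decoder.
    return "".join(result.stdout.split())
```

Calling `ocr_base64_page("page.png")` on a scan would return the recognized characters as one string. Whether that string decodes cleanly is a separate question, and the experience described above suggests it often will not.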

Karunamon · on Oct 5, 2017

Well, crap. Looks like QR codes are the better idea, then. Thanks for saving me a boatload of time!

philsnow · on Oct 5, 2017

There are fonts that were designed specifically to be OCRed easily and with high reliability. Look for "OCR-A".