Am I the last to find this? command line ocr tesseract won't directly support .pdf but pdftocairo produces .jpg among others which tesseract will read. May not do well with collumns but not too bad. Is there anything better? Thanks tom Fowle