tesseract

OCR (Optical Character Recognition) engine.

Recognize text in an image and save it to `output.txt` (the `.txt` extension is added automatically):

tesseract {image.png} {output}

Specify a custom language (default is English) with an ISO 639-2 code (e.g. deu = Deutsch = German):

tesseract -l deu {image.png} {output}

tesseract --list-langs

tesseract -psm {0_to_10} {image.png} {output}

tesseract --help-psm

Copyright © 2014—present the tldr-pages team and contributors.

This work is licensed under the Creative Commons Attribution 4.0 International License (CC-BY).