What license is Tesseract distributed under?

The code in this repository is licensed under the Apache License, Version 2.0.

Which library does Tesseract use for image opening?

Tesseract uses the Leptonica library for opening input images.

Tesseract OCR

Tesseract OCR | Find AI List

Overview

Tesseract OCR is an open-source engine used for optical character recognition, capable of converting images containing text into machine-readable text. Originally developed at Hewlett-Packard, it is now maintained by Google and a community of contributors. Tesseract 4 introduced a new neural net (LSTM) based OCR engine focused on line recognition, while still supporting the legacy Tesseract OCR engine. It's compatible with various image formats like PNG, JPEG, and TIFF and supports multiple output formats including plain text, hOCR (HTML), PDF, TSV, ALTO, and PAGE. Developers can integrate it into applications using the C or C++ API. It relies on the Leptonica library for image handling, offering a flexible solution for text extraction from images. It's designed to be trained for recognizing different languages and customized character sets.

Common tasks

Optical Character Recognition Text Extraction Image to Text Conversion

FAQ

View all

What image formats does Tesseract support?

Tesseract supports various image formats including PNG, JPEG, and TIFF.

How can I improve the OCR results?

In many cases, improving the quality of the image you are giving Tesseract will yield better OCR results. Pre-processing steps can significantly improve accuracy.

Does Tesseract have a GUI application?

No, Tesseract does not include a GUI application. You can find third-party applications that provide a GUI in the 3rdParty documentation.

Can Tesseract be trained to recognize other languages?

Yes, Tesseract can be trained to recognize other languages. See Tesseract Training for more information.

FAQ+