Perform OCR using free Tesseract SDK


PDF Focus .Net can work with the Tesseract OCR engine. To perform OCR, we'll use the free Tesseract OCR library (https://github.com/tesseract-ocr/tesseract).
The library is open source and can be used under the Apache-2.0 license.

The official documentation site of the Tesseract Open Source OCR Engine: https://tesseract-ocr.github.io/



If you want to know more about the Tesseract engine and its main features, please read the manual: https://tesseract-ocr.github.io/tessdoc/#introduction

Notice: Please make sure that you have read the manual carefully. Also, when creating your solution, you must place the "tessdata" folder in the directory of your solution. Check that it contains the language data files you need, for example eng.traineddata (English), deu.traineddata (German), etc.
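
Before running OCR, it can help to verify the "tessdata" folder programmatically. The following is a minimal sketch that uses only System.IO and no PDF Focus .Net calls. It assumes Tesseract's usual naming for language data, <code>.traineddata (for example eng.traineddata). The class name TessdataCheck and the method MissingLanguages are hypothetical helpers introduced for illustration only.

```csharp
using System;
using System.IO;

// Hypothetical helper (not part of PDF Focus .Net or Tesseract):
// checks that a tessdata folder contains the expected language files.
public static class TessdataCheck
{
    // Returns the language codes that have no matching
    // "<code>.traineddata" file inside tessdataDir.
    // If the folder itself is missing, every requested code is returned.
    public static string[] MissingLanguages(string tessdataDir, params string[] languages)
    {
        if (!Directory.Exists(tessdataDir))
        {
            return languages;
        }

        return Array.FindAll(
            languages,
            lang => !File.Exists(Path.Combine(tessdataDir, lang + ".traineddata")));
    }
}
```

A possible use is to call TessdataCheck.MissingLanguages(Path.Combine(AppContext.BaseDirectory, "tessdata"), "eng", "deu") at startup and report any codes it returns. Using AppContext.BaseDirectory as the location is an assumption here; adjust the path to wherever your solution actually keeps the "tessdata" folder.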

You can download the full C# code sample directly from NuGet: The link

You can download the full VB.NET code sample directly from NuGet: The link


If you need a new code example or have a question, email us at support@sautinsoft.com.



Questions and suggestions from you are always welcome!

We have been developing .NET components since 2002. We know the PDF, DOCX, RTF, HTML, XLSX and image formats. If you need any assistance with creating, modifying or converting documents in various formats, we can help you. We will write any code example for you absolutely free.