IronOcr 2020.11.2

IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET.

It provides Tesseract OCR on macOS, Windows, Linux, Azure and Docker for:
* .NET Framework 4.0+
* .NET Standard 2.0+
* .NET Core 2.0+
* .NET 5
* Mono
* Xamarin

IronOCR reads text, barcodes and QR codes from all major image and PDF formats using the latest Tesseract 5 engine. The library adds OCR functionality to desktop, console and web applications in minutes.

IronOCR's Unique Features:
* Pure .NET OCR API
* All OCR tasks run locally (no SaaS)
* 125 languages
* Barcode & QR code reading
* Corrects low-quality, noisy and distorted scans
* Performance tuned above and beyond any other known build of Tesseract OCR
* Reads PDFs
* Reads multi-page TIFFs
* Can save any OCR scan to a searchable PDF document or XHTML

Data output options include: Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.

Language Support:
125 languages, including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian and Spanish. Custom language packs can also be created.

Licensing and support are available for commercial deployments. Email: developers@ironsoftware.com

For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/

Package Manager:
    Install-Package IronOcr -Version 2020.11.2
.NET CLI:
    dotnet add package IronOcr --version 2020.11.2
PackageReference:
    <PackageReference Include="IronOcr" Version="2020.11.2" />
    For projects that support PackageReference, copy this XML node into the project file to reference the package.
Paket CLI:
    paket add IronOcr --version 2020.11.2
    The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Release Notes

- Tesseract 5 & 4 Support
- 125 Language Packs
- .NET Core Support
- .NET Standard Support
- .NET Framework 4.0+ Support
- Linux Support
- macOS Support
- Mono Support
- Xamarin Mac Support

NuGet packages (128)

Showing the top 5 NuGet packages that depend on IronOcr:

Package Downloads
IronOcr.Languages.Hebrew
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads barcodes and QR codes.
OCR dictionaries in this package:
* Hebrew
* HebrewBest
* HebrewFast
* HebrewAlphabet
* HebrewAlphabetBest
* HebrewAlphabetFast
Hebrew-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images and PDFs to text. C# and VB examples: https://ironsoftware.com/csharp/ocr/languages/
This package installs IronOCR and also Hebrew support including:
* Hebrew (also known as עברית) OCR for screenshots, cameras, image files, TIFFs and PDFs in .NET
* Custom OCR that can significantly out-perform Tesseract CLI on real world documents
* Can read scans with distortion, skewing, low resolution & contrast, and digital noise
* Also supports Tesseract 3, 4 and 5 in Hebrew
* Support for 125 total international languages available
Additional features include:
* Barcode & QR reading
* Output of searchable, search-engine indexable PDF documents
* Inspect fonts, headings, paragraphs, lines, words, and characters as structured data
Supports:
* .NET Framework (4.5+)
* .NET Core (2.0+)
* .NET Standard (2.0+)
Works on:
* Windows
* macOS
* Linux
* Docker
* Azure and other Cloud hosting platforms
* Web, Console, WinForms, WPF and Services
Reads:
- Images
- TIFFs
- PDFs
- Screenshots
- Scans
- Barcodes
- QR codes
Commercial support available. Email: developers@ironsoftware.com
C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/
IronOcr.Languages.ChineseSimplified
Simplified Chinese language pack for the IronOCR C# & VB.NET library. The OCR engine adds OCR functionality to Desktop, Console and Web applications. IronOCR reads barcodes and QR codes. IronOCR supports Console Applications, ASP.NET Web Applications, MVC, and Desktop Applications written in all .NET languages. The library preprocesses images to help read scans with low resolution & contrast, distortion, and heavy background noise. Output can be in plain text or through the advanced object model to extract headings, paragraphs, lines, words, and characters from a page's content. Other language packs and C# / VB.NET code examples are available at http://ironsoftware.com/csharp/ocr/ For product and licensing support, email developers@ironsoftware.com
IronOcr.Languages.Arabic
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads barcodes and QR codes.
OCR dictionaries in this package:
* Arabic
* ArabicBest
* ArabicFast
* ArabicAlphabet
* ArabicAlphabetBest
* ArabicAlphabetFast
Arabic-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images and PDFs to text. C# and VB examples: https://ironsoftware.com/csharp/ocr/languages/
This package installs IronOCR and also Arabic support including:
* Arabic (also known as العربية) OCR for screenshots, cameras, image files, TIFFs and PDFs in .NET
* Custom OCR that can significantly out-perform Tesseract CLI on real world documents
* Can read scans with distortion, skewing, low resolution & contrast, and digital noise
* Also supports Tesseract 3, 4 and 5 in Arabic
* Support for 125 total international languages available
Additional features include:
* Barcode & QR reading
* Output of searchable, search-engine indexable PDF documents
* Inspect fonts, headings, paragraphs, lines, words, and characters as structured data
Supports:
* .NET Framework (4.5+)
* .NET Core (2.0+)
* .NET Standard (2.0+)
Works on:
* Windows
* macOS
* Linux
* Docker
* Azure and other Cloud hosting platforms
* Web, Console, WinForms, WPF and Services
Reads:
- Images
- TIFFs
- PDFs
- Screenshots
- Scans
- Barcodes
- QR codes
Commercial support available. Email: developers@ironsoftware.com
C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/
IronOcr.Languages.Portuguese
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads barcodes and QR codes.
OCR dictionaries in this package:
* Portuguese
* PortugueseBest
* PortugueseFast
Portuguese-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images and PDFs to text. C# and VB examples: https://ironsoftware.com/csharp/ocr/languages/
This package installs IronOCR and also Portuguese support including:
* Portuguese (also known as Português) OCR for screenshots, cameras, image files, TIFFs and PDFs in .NET
* Custom OCR that can significantly out-perform Tesseract CLI on real world documents
* Can read scans with distortion, skewing, low resolution & contrast, and digital noise
* Also supports Tesseract 3, 4 and 5 in Portuguese
* Support for 125 total international languages available
Additional features include:
* Barcode & QR reading
* Output of searchable, search-engine indexable PDF documents
* Inspect fonts, headings, paragraphs, lines, words, and characters as structured data
Supports:
* .NET Framework (4.5+)
* .NET Core (2.0+)
* .NET Standard (2.0+)
Works on:
* Windows
* macOS
* Linux
* Docker
* Azure and other Cloud hosting platforms
* Web, Console, WinForms, WPF and Services
Reads:
- Images
- TIFFs
- PDFs
- Screenshots
- Scans
- Barcodes
- QR codes
Commercial support available. Email: developers@ironsoftware.com
C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/
IronOcr.Languages.German
The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads barcodes and QR codes.
OCR dictionaries in this package:
* German
* GermanBest
* GermanFast
* GermanFraktur
German-language OCR in C# & .NET. Optimized C# Tesseract 5 OCR in a standalone .NET OCR API. Converts scanner documents, images and PDFs to text. C# and VB examples: https://ironsoftware.com/csharp/ocr/languages/
This package installs IronOCR and also German support including:
* German (also known as Deutsch) OCR for screenshots, cameras, image files, TIFFs and PDFs in .NET
* Custom OCR that can significantly out-perform Tesseract CLI on real world documents
* Can read scans with distortion, skewing, low resolution & contrast, and digital noise
* Also supports Tesseract 3, 4 and 5 in German
* Support for 125 total international languages available
Additional features include:
* Barcode & QR reading
* Output of searchable, search-engine indexable PDF documents
* Inspect fonts, headings, paragraphs, lines, words, and characters as structured data
Supports:
* .NET Framework (4.5+)
* .NET Core (2.0+)
* .NET Standard (2.0+)
Works on:
* Windows
* macOS
* Linux
* Docker
* Azure and other Cloud hosting platforms
* Web, Console, WinForms, WPF and Services
Reads:
- Images
- TIFFs
- PDFs
- Screenshots
- Scans
- Barcodes
- QR codes
Commercial support available. Email: developers@ironsoftware.com
C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/

GitHub repositories

This package is not used by any popular GitHub repositories.

Version History

Version     Downloads   Last updated
2020.11.2   5,508       11/13/2020
4.4.0       117,247     6/21/2018
4.3.0.1     13,059      4/9/2018
4.2.2.51    2,642       1/22/2018
4.2.2.5     2,881       1/19/2018
4.2.2.3     827         1/15/2018
4.2.2.1     1,338       12/1/2017
4.2.2       1,035       12/1/2017
4.2.1.5     2,284       9/9/2017
4.2.1.2     449         9/8/2017
4.2.1.1     454         9/6/2017
4.2.0       614         9/5/2017
4.1.1       1,967       8/4/2017
4.1.0       455         8/2/2017
4.0.10      1,367       1/12/2017
4.0.9       705         12/20/2016