Bytescout.PDFExtractor 11.3.0.3983

Bytescout PDF Extractor SDK for .NET, ASP.NET, ActiveX - extract data from PDF documents

Install-Package Bytescout.PDFExtractor -Version 11.3.0.3983
dotnet add package Bytescout.PDFExtractor --version 11.3.0.3983
<PackageReference Include="Bytescout.PDFExtractor" Version="11.3.0.3983" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Bytescout.PDFExtractor --version 11.3.0.3983
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Release Notes

Bytescout PDF Extractor SDK for .NET, ASP.NET, ActiveX.

ByteScout, Inc. (c) 2008-2020.

Compatibility: .NET Framework 2.0 or later; .NET Core 2.0 or later.
Works with: .NET, ASP.NET, ActiveX, Visual Basic 6, Classic ASP, Delphi and others.

Features:

- Extracts data from PDF files in TXT, CSV, XML, XLS, XLSX, JSON formats;
- Extracts embedded images, files and attachments from PDF files;
- Splits and merges PDF files, extracts a single page or range of pages;
- Extracts data from whole document page or specified rectangular region;
- Extracts PDF document information (author, subject, producer etc);
- Detects tables;
- Searches text inside document with regex support;
- Extracts data from PDF forms;
- Reads text from scanned PDF documents using OCR (Optical Character Recognition);
- Provides ActiveX interface to use from legacy programming languages (Visual Basic 6, Delphi) and scripting (VBscript, JScript and others);
- And much more...

History of changes:

11.3.0.3983 (October 26, 2020)
==============================
+ DocumentSplitter: Added support for regions with inverted page numbers. For example, "!1" means "the last page", "!1-!3" or "!3-" means "last three pages".
+ DocumentSplitter: Added support for "*" split range that means "split every single page".
+ Added 'InfoExtractor.Metadata' property that gets XMP metadata from the document.
= Improved joining of multi-line cells in tables without borders ('LineGroupingMode.JoinOrphanedRows' mode).
= Improved detection of OCR language file versions.
= Improved .NET Core 2.0 compatibility.
= Improved unwrapping of multi-line cell text.
- Fixed issue when invisible vector drawings were causing unwanted separation of text objects.
- Fixed extraction from area when running OCR against image file (not PDF!).
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

11.2.0.3919 (June 20, 2020)
===========================
+ 'MultimediaExtractor' now supports extraction of 3D-animation objects.
- 'TextExtractor.Find()' now keeps original font names in found object information.
= Improved column detection in 'ColumnDetectionMode.Borders' mode.
- 'SearchablePDFMaker' did not process vector-only pages. Fixed now.
= Improved regex text search in 'TextExtractor'.
+ Added 'DetectUnderlineTextStyle' and 'DetectStrikeoutTextStyle' properties to 'JSONExtractor' and 'XMLExtractor'.
+ Added 'OCRWhiteList' and 'OCRBlackList' properties to extractors.
+ Added 'Invert' OCR preprocessing filter.
+ Added 'Scale' OCR preprocessing filter.
= Improved joining of multi-line cells in tables without borders ('LineGroupingMode.JoinOrphanedRows' mode).
= Improved performance of 'ImageExtractor'.
+ Added page rectangles to 'InfoExtractor'.
= Improved 'OCRAnalyzer'.
= Improved automatic deletion of duplicated text objects during the extraction.
- Fixed extraction issues in .NET Core version.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

11.1.0.3845 (March 19, 2020)
============================
+ Added 'OCROverallConfidence' property in all extractors that.
+ SearchablePDFMaker: Added 'KeepOriginalRotation' property.
- SearchablePDFMaker: fixed crash on mixed English-Arabic text recognition.
+ PDF Multitool: Added "Developer Tools" sub-menu to the context menu.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

11.0.0.3805 (February 11, 2020)
===============================
+ Added support for new revision of PDF encryption (ISO 32000-2:2017 compliance).
+ Added 'LicenseInfo' property providing detailed information about your license.
+ Added 'Grayscale' filter to OCRImagePreprocessingFilters.
= Dramatically improved column extraction for multiple tables on a page. Works only in 'ColumnDetectionMode.Borders' mode for tables with borders between columns and rows.
= Greatly improved 'ColumnDetectionMode.BorderedTables'. As in the table detection, it now uses optical recognition to detect bordered tables and their columns on scanned documents.
= Improved 'InfoExtractor' to return the encrypted and password-protected states without asking a password or throwing an exception.
= Added document permissions information to 'InfoExtractor'.
= DocumentSplitter: added zero-padding to page numbers in generated file names.
= Improved extraction of duplicated text (shadow-like effect).
= Improved 'MultimediaExtractor'.
- Fixed text search issues on some documents.
- Fixed bug that damaged extracted text only during multi-thread processing.
- Fixed crash on subsequent extractions with different OCR modes.
- Fixed .NET Core compatibility issue.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

10.8.0.3732 (December 4, 2019)
==============================
+ Remover2: Added 'MaskColor' property that allows to change color of masking rectangle.
- Remover and Remover2: Fixed incomplete removal of the text in some cases.
- XMLExtractor and XFDFExtractor: fixed missing control types.
- Fixed parsing of combobox items that consist of value+label pairs.
= Improved handling of Arabic fonts and charsets.
= Improved handling of CJK fonts and charsets.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

...

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version History

Version Downloads Last updated
11.3.0.3983 318 10/26/2020
11.2.1.3959 470 9/1/2020
11.2.1.3929 480 7/14/2020
11.2.1.3926 163 7/9/2020
11.2.0.3919 236 6/30/2020
11.1.0.3869 2,339 4/10/2020
11.1.0.3864 323 4/4/2020
11.1.0.3849 380 3/27/2020
11.1.0.3845 359 3/19/2020
11.0.0.3834 482 3/6/2020
11.0.0.3832 231 3/4/2020
11.0.0.3830 212 3/4/2020
11.0.0.3815 404 2/21/2020
11.0.0.3805 465 2/11/2020
10.8.0.3758 1,159 12/19/2019
10.8.0.3750 294 12/17/2019
10.8.0.3744 250 12/12/2019
10.8.0.3741 196 12/10/2019
10.8.0.3736 317 12/6/2019
10.8.0.3732 265 12/4/2019
10.7.2.3710 598 11/13/2019
10.7.1.3705 245 11/11/2019
10.7.0.3697 368 11/2/2019
10.6.0.3666 1,029 10/1/2019
10.5.0.3637 995 9/2/2019
10.4.0.3618 686 8/15/2019
10.4.0.3613 314 8/13/2019
10.4.0.3602 368 8/7/2019
10.3.0.3566 884 7/2/2019
10.2.0.3548 925 6/13/2019
10.2.0.3534 277 6/11/2019
10.2.0.3525 299 6/7/2019
10.2.0.3514 335 5/28/2019
10.1.0.3444 739 4/5/2019
10.1.0.3439 322 4/4/2019
10.0.0.3429 386 3/25/2019
10.0.0.3427 295 3/25/2019
10.0.0.3424 309 3/23/2019
10.0.0.3423 291 3/23/2019
10.0.0.3422 294 3/23/2019
10.0.0.3421 340 3/21/2019
9.4.0.3398 413 3/12/2019
9.3.0.3366 719 2/12/2019
9.3.0.3357 401 2/4/2019
9.3.0.3354 303 1/31/2019
9.2.0.3293 1,258 11/20/2018
9.2.0.3262 624 10/24/2018
9.2.0.3259 361 10/24/2018
9.1.0.3170 983 7/26/2018
9.1.0.3167 536 7/18/2018
9.1.0.3165 437 7/18/2018
9.1.0.3163 491 7/18/2018
9.0.0.3095 1,613 4/23/2018
9.0.0.3087 738 4/13/2018
9.0.0.3080 546 4/11/2018
8.8.1.3046 961 2/20/2018
8.8.1.3025 1,175 1/29/2018
8.8.0.3021 577 1/23/2018
8.7.0.2981 2,198 11/8/2017
8.6.0.2917 1,530 8/2/2017
8.6.0.2912 510 8/1/2017
8.5.0.2863 739 6/9/2017
8.5.0.2861 594 6/8/2017
8.5.0.2856 600 6/1/2017
8.4.1.2829 4,814 4/12/2017
8.4.0.2821 588 3/29/2017
8.3.0.2809 917 3/13/2017
8.3.0.2806 522 3/12/2017
8.3.0.2803 531 3/6/2017
8.3.0.2801 504 3/6/2017
8.3.0.2800 513 3/6/2017
8.3.0.2798 493 3/6/2017
8.3.0.2796 515 3/6/2017
8.3.0.2794 511 3/6/2017
8.2.0.2699 916 1/11/2017
8.1.1.2606 1,391 10/25/2016
8.1.0.2600 582 10/21/2016
8.0.0.2542 776 9/1/2016
8.0.0.2541 562 9/1/2016
8.0.0.2528 603 8/23/2016
8.0.0.2523 558 8/19/2016
7.0.0.2493 24,157 6/27/2016
7.0.0.2489 510 6/27/2016
7.0.0.2480 1,202 6/10/2016
7.0.0.2474 864 5/26/2016
6.30.0.2421 750 3/24/2016
6.20.0.2354 770 1/20/2016
6.12.0.2239 3,524 9/22/2015
5.20.0.1871 1,260 2/5/2015
5.0.0.1626 1,290 8/14/2014
4.0.0.1487 805 5/31/2014
3.40.0.1349 934 3/11/2014
3.20.0.1092 943 8/5/2013
3.20.0.1075 1,637 7/12/2013
3.10.0.1051 816 6/29/2013
3.0.0.839 894 3/26/2013
2.50.0.769 898 2/25/2013