EvoPdf Next PDF to Text Converter for .NET
The PDF to Text Converter is a component of the EvoPdf Next Library for .NET that enables text extraction from PDF documents
in the original layout or optimized for reading, as well as text search in PDF that returns the exact positions of the matches.
The library targets .NET Standard 2.0 and can be used in .NET Core and .NET Framework applications that you
can deploy on Windows and Linux platforms, including Azure App Service and Functions or Docker.
|
|
|
|
|
| EvoPdf Next PDF to Text Converter
|
| for .NET |
|
|
|
|
The PDF to Text Converter is a component of the EvoPdf Next Library for .NET that enables text extraction from PDF documents in the original layout or optimized for reading,
as well as text search in PDF that returns the exact positions of the matches.
The PDF to Text Converter is distributed as part of the
EvoPdf Next PDF Processor for .NET,
which also includes functionality for converting PDF pages to images and extracting images from PDF documents.
The library targets .NET Standard 2.0 and can be used in .NET Core and .NET Framework applications that you
can deploy on Windows and Linux platforms, including Azure App Service and Functions or Docker.
|
|
|
|
|
|
|
 |
|
Main Features
|
|
|
-
Convert PDF documents from memory, stream or file to text
-
Search text in PDF documents, returning text positions
-
Convert or search password-protected PDF documents
-
Convert or search the entire PDF or a range of PDF pages
|
|
-
Extract text with the original layout or optimized for reading
-
Search text with a case-sensitive option
-
Search text with a whole-word option
-
Asynchronous methods that can be used with async and await
|
|
|
 |
|
Compatibility
|
|
|
-
Windows 10, 11 and Windows Server 2016 to 2025
-
Linux 64-bit distributions
-
.NET 10.0, 9.0, 8.0, 7.0, 6.0, .NET Standard 2.0
-
.NET Framework 4.6.2 to 4.8.1
|
|
-
Azure App Service and Azure Functions
-
Azure Cloud Services and Virtual Machines
-
Web, console and desktop applications
-
Docker containers for Windows and Linux
|
|
|
 |
|
Getting Started
|
|
|
|
You can quickly get started with the ASP.NET demo application available for download, or you can integrate the library into your own project.
The online documentation,
contains detailed instructions on how to run an application using EvoPdf Next Library for .NET on Windows and Linux machines, Azure App Service
and Azure Functions for Windows and Linux.
You can view the current capabilities of the library by checking the
online demo
application and the API reference in the online documentation.
|
|
|
|
Download Demo Application
|
|
|
|
The ZIP package available for download from the link below includes an ASP.NET demo application project with complete C# source code covering all major library features.
|
|
|
|
|
|
Running the samples in the demo application that involve HTML to PDF conversion features on Linux platforms might require installing some dependency packages. The documentation includes an entire section dedicated to building, publishing and running the demo application on multiple platforms.
|
|
 |
|
NuGet Packages
|
|
|
|
The PDF to Text Converter is distributed as part of the
EvoPdf.Next.PdfProcessor.Windows
NuGet package when targeting Windows and as part of the
EvoPdf.Next.PdfProcessor.Linux
NuGet package when targeting Linux.
The Windows package is referenced by the
EvoPdf.Next.Windows
metapackage for all components and the Linux package is referenced by the
EvoPdf.Next.Linux
metapackage for all components.
There are also multiplatform metapackages that reference both the Windows and Linux packages:
EvoPdf.Next.PdfProcessor
for the PDF Processor functionality and
EvoPdf.Next
for the entire EvoPdf Next library.
|
|
 |
|
Installation
|
|
|
|
The PDF Processor component generally does not require the installation of additional dependencies,
either on Windows or on Linux.
|
|
 |
|
EvoPdf.Next Namespace
|
|
|
|
All components of the EvoPdf Next for .NET library share the same
EvoPdf.Next
namespace and can be used together in the same application.
To use the library in your own code, add the using directive at the top of your C# source file, as shown below.
|
|
// add this using statement at the top of your C# file
using EvoPdf.Next;
|
|
 |
|
C# Code Sample |
|
|
You can use the sample code below to convert a PDF document to a string and search for text in the PDF.
|
|
// Create the PDF to Text converter instance with default options
PdfToTextConverter pdfToTextConverter = new PdfToTextConverter();
// Extract text from the specified PDF file
string extractedText = pdfToTextConverter.ConvertToText(pdfFilePath);
// Search text in PDF
bool caseSensitive = false;
bool wholeWord = false;
FindTextLocation[] findTextLocations = pdfToTextConverter.FindText(pdfFilePath, textToFind, caseSensitive, wholeWord);
|
|