How to write a function to convert PDF to Excel in C# / .Net Core for 15 minutes?

PDF Focus .Net

.Net assembly which gives API to convert PDF to All: DOCX, RTF, HTML, XML, Text, Excel, Images in .Net and C#.
PDF to Excel scheme

PDF Focus .Net

How to write a function to convert PDF to Excel in C# / .Net Core for 15 minutes?
PDF to Excel scheme

Introduction

Imagine, that in 15 minutes after reading this text you will be able to provide API for your application to read tables from a PDF and convert them in Excel-spreadsheet, using C# or VB.Net.

           SautinSoft.PdfFocus f = new PdfFocus();
            f.OpenPdf(@"d:\Invoice.pdf");

            if (f.PageCount > 0)
                f.ToExcel(@"d:\Invoice.xls");          

PDF Focus .Net is standalone .Net assembly. It provides you a full set of API to convert PDF documents into Excel workbooks.

Another point of interest is that PDF Focus .Net can understand and recreate real tables with rows and cells. Maybe it sounds obvious, but PDF documents don't have real tables. If you see a table inside a PDF, it is in fact a set of lines. To further accentuate this point, download PDF Focus .Net, 69.3 Mb and evaluate it now!

Your application will be able to recognize PDF documents with tables even with invisible borders and create from them an Excel spreadsheet.


Download

To see this functionality firsthand, download the freshest «PDF Focus .Net» with code examples, 69.3 Mb.

Limitations

PDF Focus .Net The limitations of the free version are: The trial notice "Created by unlicensed version of PDF Focus .Net" and the random addition of the word "TRIAL".


In these screenshots you can see input PDF document and output XLS with extracted table:

PDF to Excel, PDF to XLS, PDF to XLSX, PDF to tables, .Net, library, C#, VB.Net, component, convert, pdf, asp.net

Requirements and Technical Information

Requires .NET Framework 4.0 or higher. Our product is compatible with all .NET languages and supports all Operating Systems where .NET Framework and .NET Core can be used. Note that PDF Focus .Net is entirely written in managed C#, which makes it absolutely standalone and an independent library.

.Net Framework 4.0 and higher and .Net Core 2.0 and higher

.NET Framework 4.5, 4.6.1 and higher.The old version for old .NET 2.0 can be found here

.NET Standard 2.0

.NET Core, .NET 5.0 and higher.


Multi-platform component, runs on:


Our component has proven itself on cloud platforms and services:

  • Microsoft Azure
  • Amazon Web Services (AWS)
  • Google Cloud Platform
  • SharePoint
  • Docker
  • Xamarin Forms
  • etc.