PRODUCTS   DOWNLOAD PURCHASE SUPPORT BLOG  COMPANY  
 
Key Features
PDF to Word, RTF
PDF to Word, DOCX
PDF to Images: JPEG, TIFF
PDF to HTML
PDF to Excel new!
>>PDF to XML new!
PDF to Text
Download trial
Purchase
Online-demo
Documentation
What's New
Tutorials & Videos
How to convert PDF to JPG with high Quality in C# and .Net.
How to convert PDF to Word in C#.
How to convert PDF to Jpeg in C#.
Extract Text from PDF in C#.
How to convert PDF to Multipage TIFF in C#.
How to convert PDF to Black-White Multipage TIFF in C#.
Delphi.Net – How to convert PDF to Word.
How to convert a password protected PDF document.

"Convert PDF to XML in C# using PDF Focus .Net library"

Convert PDF to XML in C# using PDF Focus .Net library

Introduction

     Let's see how to add "PDF to XML feature" into any .Net application. First of all, to give your .Net application ability to convert PDF documents to XML, add a reference to the "SautinSoft.PdfFocus.dll" assembly. You may download it here, 16.4 Mb.

Let's take a look to a very straightforward example in C#:

            SautinSoft.PdfFocus f = new SautinSoft.PdfFocus();
            f.XmlOptions.ConvertNonTabularDataToSpreadsheet = false;
            f.OpenPdf(@"c:\Table.pdf");
            f.ToXml(@"c:\Table.xml");
          

After launching this code you will get XML-document produced from Table.pdf. Since we have set the property "ConvertNonTabularDataToSpreadsheet" to false, all textual data will be skipped. In other words, only tables will be converted to XML.

<document>
  <page index="1">
    <table>
      <tgroup cols="5">
        <row>
          <entry rowspan="2">September</entry>
          <entry colspan="2">October</entry>
          <entry colspan="2">November</entry>
        </row>
      </tgroup>
    </table>
  </page>
</document>

Thus, you may adjust the component to get such XML document as you wish.

Download PDF Focus .Net Online Demo


Have a Question?

You have a question, need a support or a code example: ask our support team at support@sautinsoft.com. We will help you quickly and with pleasure.



What PDF Focus .Net gives me:

  1. Save time and cost of the product development. The writing of own PDF converter in C# requires a lot of days of hard work and takes approx 100 000 C# code lines. PDF Focus .Net will save a lot of my time and I'll need only to add the reference to "SautinSoft.PdfFocus.dll" and write 3-4 C# code lines.

  2. The one product provides me by set of API to convert PDF at various formats at once: Word, DOCX, RTF, Excel, HTML, Text, Multipage TIFF, Jpeg, PNG.

  3. Because of this is absolutely standalone PDF library, my clients don't need to have Adobe Acrobat, MS Word, Excel or other software.

  4. My software will have the one of the fastest API to convert PDFs inside, you may check this.

  5. I can select the any of product editions: Total, Office, HTML and Picasso. This saves my money. Let's us say, that I'm looking for only the API to convert PDF to HTML and don't need other features. Given this point, I can purchase the "HTML" edtion of PDF Focus .Net. It's much cheaper than competitor's price.

  6. The one time price. I pay a one time and can use «PDF Focus .Net» for a whole life without any additional fees. My license will never expires.

  7. As bonus, I get unlimited technical support during a whole life for free and independently of the date of my purchasing.

Some examples to convert PDF to XML in C# and VB.Net

1. Convert PDF file to XML file in C#:

            
            string pathToPdf = @"c:\Table.pdf";
            string pathToXml = Path.ChangeExtension(pathToPdf, ".xml");

            // Convert PDF file to XML file.
            SautinSoft.PdfFocus f = new SautinSoft.PdfFocus();

	    	// This property is necessary only for registered version.
		    //f.Serial = "XXXXXXXXXXX";

            // Let's convert only tables to XML and skip all textual data.
            f.XmlOptions.ConvertNonTabularDataToSpreadsheet = false;

            f.OpenPdf(pathToPdf);

            if (f.PageCount > 0)
            {
                int result = f.ToXml(pathToXml);
                
                //Show HTML document in browser
                if (result==0)
                {
                    System.Diagnostics.Process.Start(pathToXml);
                }
            }
      
2. Convert PDF file to XML file in VB.Net:
            
        Dim pathToPdf As String = @"c:\Table.pdf"
        Dim pathToXml As String = Path.ChangeExtension(pathToPdf, ".xml")

        ' Convert PDF file to XML file.
        Dim f As New SautinSoft.PdfFocus()

        ' This property is necessary only for registered version.
        'f.Serial = "XXXXXXXXXXX";

        ' Let's convert only tables to XML and skip all textual data.
        f.XmlOptions.ConvertNonTabularDataToSpreadsheet = False

        f.OpenPdf(pathToPdf)

        If f.PageCount > 0 Then
            Dim result As Integer = f.ToXml(pathToXml)

            'Show HTML document in browser
            If result = 0 Then
                System.Diagnostics.Process.Start(pathToXml)
            End If
        End If
      
If anyone needs a code sample in C#, VB.Net, ASP.Net etc "How to convert PDF to XML", email us at support@sautinsoft.com or ask at Online Chat (right-bottom corner of this page). We'll help you certainly!


Requrements and Technical Information

     Requires only .Net 4.0 or higher. Our product is compatible with all .Net languages and supports all Operating Systems where .Net Framework can be used. Note that PDF Focus .Net is entirely written in managed C#, which makes it absolutely standalone and an independent library.



Reasonable prices

     The library offers developers the high quality conversion at reasonably prices. The main point of interest is that PDF Focus .Net is less expensive than all of its more or less popular counterparts. For example, a single license for PDF Focus .Net - HTML Edition cost $399 and the Total Edition starts from $578.


Download

To see this functionality firsthand, download the freshest «PDF Focus .Net» with code examples, 16.4 Mb.

 
 
Products Downloads Support Company
Buy now
Follow us on Google Plus SautinSoft on Facebook SautinSoft on Twitter SautinSoft on Youtube Our Blog
Copyright © 2002 - 2017 SautinSoft (started from sautin.com). All rights reserved.