How to convert selected pages in HTML using C# and .NET


In today's world of web application and automation system development, converting data from one format to another is a crucial task. The need to convert documents containing HTML markup, which can be divided into individual pages or sections, is particularly pressing. In this article, we'll explain how to convert selected HTML pages in C# and .NET using the powerful component RTF TO HTML .NET from SautinSoft library.

In most scenarios when working with HTML content, it's necessary to extract or convert only specific parts or pages of a document.
For example:

  • From a web page divided into pages using pagination.
  • From large HTML documents containing multiple sections that require separate versions.
  • To automate the extraction and conversion of selected content segments.

Using the library allows you to select the desired pages or document segments and convert them to HTML—without having to process the entire document, significantly increasing efficiency and speeding up workflow.

The implementation is not just a function call, but a document processing strategy that includes:

  • Splitting the document into sections or pages, often based on markup, tags, or numbers.
  • Using library methods to extract the desired segments.
  • Converting the selected sections to HTML.

Important!
Please note that the `ConvertToHtmlFile` function converts the entire document by default. Therefore, for a targeted approach, it is recommended to use the library's capabilities for extracting individual pages, for example, through API settings or by pre-splitting the document.

Input file:

SelectedPages html input

Output result:

SelectedPages html output

Complete code

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.IO;
using SautinSoft;

namespace Example
{
    class Program
    {
        static void Main(string[] args)
        {
            PageIndexAndCount();
        }
        /// <summary>
        /// Let's convert 1st and 3rd pages.
        /// </summary>
        static void PageIndexAndCount()
        {
			// Get your free key here:   
            // https://sautinsoft.com/start-for-free/
			
            // If you need more information about "RTF to HTML .Net" 
            // Email us at: support@sautinsoft.com.
			
            string inpFile = @"..\..\..\example.docx";
            string outFile = @"Result.html";

            RtfToHtml r = new RtfToHtml();

            // Let's convert pages: 1, 3.
            RtfToHtml.HtmlFixedSaveOptions opt = new RtfToHtml.HtmlFixedSaveOptions()
            {
                Title = "Pages 1, 3",
                SelectedPages = new int[] {0, 2}
            };

            try
            {
                r.Convert(inpFile, outFile, opt);
            }
            catch (Exception ex)
            {
                Console.WriteLine($"Conversion failed! {ex.Message}");
            }

            // Open the result.
            System.Diagnostics.Process.Start(new System.Diagnostics.ProcessStartInfo(outFile) { UseShellExecute = true });
        }
    }
}

Download

Imports System
Imports System.Collections.Generic
Imports System.Linq
Imports System.Text
Imports System.IO
Imports SautinSoft

Namespace Example
	Friend Class Program
		Shared Sub Main(ByVal args() As String)
			PageIndexAndCount()
		End Sub
		''' <summary>
		''' Let's convert 1st and 3rd pages.
		''' </summary>
		Private Shared Sub PageIndexAndCount()
				    ' Get your free key here:   
            ' https://sautinsoft.com/start-for-free/
			
            ' If you need more information about "RTF to HTML .Net" 
            ' Email us at: support@sautinsoft.com.
			Dim inpFile As String = "..\..\..\example.docx"
			Dim outFile As String = "Result.html"

			Dim r As New RtfToHtml()

			' Let's convert pages: 1, 3.
			Dim opt As new RtfToHtml.HtmlFixedSaveOptions() With {
				.Title = "Pages 1, 3",
				.SelectedPages = New Integer() {0, 2}
			}

			Try
				r.Convert(inpFile, outFile, opt)
			Catch ex As Exception
				Console.WriteLine($"Conversion failed! {ex.Message}")
			End Try

			' Open the result.
			System.Diagnostics.Process.Start(New System.Diagnostics.ProcessStartInfo(outFile) With {.UseShellExecute = True})
		End Sub
	End Class
End Namespace

Download


If you need a new code example or have a question: email us at support@sautinsoft.com or ask at Online Chat (right-bottom corner of this page) or use the Form below:


Captcha

Questions and suggestions from you are always welcome!

We are developing .Net components since 2002. We know PDF, DOCX, RTF, HTML, XLSX and Images formats. If you need any assistance with creating, modifying or converting documents in various formats, we can help you. We will write any code example for you absolutely free.