How to work with SpecialCharacters in a document using C# and .NET

  1. Add SautinSoft.Document from Nuget.
  2. Load a DOCX document.
  3. Find a special character and remove it.

In this code example, we remove all page breaks in the document.

Complete code

using System.Linq;
using SautinSoft.Document;

namespace Example
{
    class Program
    {
        static void Main(string[] args)
        {
            // Get your free 30-day key here:   
            // https://sautinsoft.com/start-for-free/

            DeletePageBreak();
        }
        /// <summary>
        /// Working with special characters in a document. How delete all page breaks in DOCX.
        /// </summary>
        /// <remarks>
        /// Details: https://sautinsoft.com/products/document/help/net/developer-guide/special-character-text-in-docx-document-net-csharp-vb.php
        /// </remarks>
        static void DeletePageBreak()
        {
            string filePath = @"..\..\..\example.docx";
            string fileResult = @"Result.docx";
            DocumentCore dc = DocumentCore.Load(filePath);
            foreach (SpecialCharacter sc in dc.GetChildElements(true, ElementType.SpecialCharacter).Reverse())
            {
                if (sc.CharacterType == SpecialCharacterType.PageBreak)
                    sc.Parent.Content.Delete();
            }
            dc.Save(fileResult);
            System.Diagnostics.Process.Start(new System.Diagnostics.ProcessStartInfo(filePath) { UseShellExecute = true });
            System.Diagnostics.Process.Start(new System.Diagnostics.ProcessStartInfo(fileResult) { UseShellExecute = true });
        }
    }
}

Download

Imports System
Imports System.IO
Imports System.Linq
Imports SautinSoft.Document

Module Sample
    Sub Main()
        DeletePageBreak()
    End Sub
    ''' Get your free 30-day key here:   
    ''' https://sautinsoft.com/start-for-free/
    ''' <summary>
    ''' Working with special characters in a document. How delete all page breaks in DOCX.
    ''' </summary>
    ''' <remarks>
    ''' Details: https://sautinsoft.com/products/document/help/net/developer-guide/special-character-text-in-docx-document-net-csharp-vb.php
    ''' </remarks>
    Sub DeletePageBreak()
        Dim filePath As String = "..\..\..\example.docx"
        Dim fileResult As String = "Result.docx"
        Dim dc As DocumentCore = DocumentCore.Load(filePath)
        For Each sc As SpecialCharacter In dc.GetChildElements(True, ElementType.SpecialCharacter).Reverse()
            If sc.CharacterType = SpecialCharacterType.PageBreak Then
                sc.Parent.Content.Delete()
            End If
        Next sc
        dc.Save(fileResult)
        System.Diagnostics.Process.Start(New System.Diagnostics.ProcessStartInfo(filePath) With {.UseShellExecute = True})
        System.Diagnostics.Process.Start(New System.Diagnostics.ProcessStartInfo(fileResult) With {.UseShellExecute = True})
    End Sub
End Module

Download


If you need a new code example or have a question: email us at support@sautinsoft.com or ask at Online Chat (right-bottom corner of this page) or use the Form below:



Questions and suggestions from you are always welcome!

We are developing .Net components since 2002. We know PDF, DOCX, RTF, HTML, XLSX and Images formats. If you need any assistance with creating, modifying or converting documents in various formats, we can help you. We will write any code example for you absolutely free.