Overview HTML to RTF .Net


Introduction

HTML to RTF .Net is a cross-platform .NET library that lets your applications convert HTML documents to Word-compatible RTF and DOCX, and to plain text. After adding a reference to "SautinSoft.HtmlToRtf.dll", you can call the API from your application with 3-4 lines of C#.
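As a minimal sketch of what those few lines might look like, the console program below converts an HTML file to DOCX. The class name follows the DLL name given above. The method names `OpenHtml` and `ToDocx` and both file names are assumptions that this overview does not confirm. Other releases of the library may expose a different API, so check the reference for the version you install.

```csharp
using System;

class Program
{
    static void Main()
    {
        // Hypothetical input and output paths, for illustration only.
        string inputFile = "input.html";
        string outputFile = "result.docx";

        // The class name matches SautinSoft.HtmlToRtf.dll.
        // ASSUMPTION: OpenHtml and ToDocx are taken to exist with these
        // signatures; verify them against the installed version.
        SautinSoft.HtmlToRtf h = new SautinSoft.HtmlToRtf();

        if (h.OpenHtml(inputFile))
        {
            // Write the parsed document out as DOCX.
            bool ok = h.ToDocx(outputFile);
            Console.WriteLine(ok ? "Converted to " + outputFile : "Conversion failed");
        }
        else
        {
            Console.WriteLine("Could not open " + inputFile);
        }
    }
}
```

The sketch needs a project reference to "SautinSoft.HtmlToRtf.dll" in order to compile.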


Input formats

HTML documents. The component has its own HTML parser, which reads HTML 3.2, HTML 4.01, HTML 5 with CSS, and XHTML. It does not require a particular version of HTML and works with any of them.

Scripting languages (such as JavaScript, including libraries like jQuery) are not supported. To convert HTML that relies heavily on JavaScript and complex CSS to PDF, use another of our components, PDF Vision .Net.

The current version of «HTML to RTF .Net» supports the following CSS:

[Image: css html]

Output formats

  • DOCX. The component creates DOCX documents in accordance with Office Open XML (ECMA-376, 4th edition). The component has its own DOCX renderer. You can convert HTML documents to DOCX as a MemoryStream, string, file, URI, or byte array.
  • RTF. The component creates documents in RTF 1.8 format. The component has its own RTF renderer. You can convert HTML documents to RTF as a MemoryStream, string, file, URI, or byte array.
  • TXT. The component can also produce plain text from HTML. Unicode characters are supported.

Advanced Features

  • Support for any language: English, Japanese, Chinese, Russian, and others
  • The resulting RTF files fully preserve:
    • HTML content structure
    • Text display
    • Font, color, and text style
    • Tables
    • Images in JPEG, BMP, GIF, and PNG formats