DavidBenko / MHTML-to-HTML-Decoding-in-C-Sharp

Licence: MIT license
Converts MHTML to HTML

MHTML to HTML Decoding in C#

MHTML (short for MIME HTML) is a web archive format that stores a web page’s HTML and its (normally remote) resources in a single file. It is structured like an HTML email, using the content type ‘multipart/related’. The data is split into parts, and each part is base64 encoded.
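To make the layout concrete, here is a minimal sketch of splitting a multipart document on its boundary and decoding one base64 part. It is not the project's code: the class and method names (MhtmlSketch, SplitParts, DecodeBase64Body) are hypothetical, the boundary is assumed to be known already, and every part is assumed to be base64-encoded UTF-8 text.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper for illustration only; not part of MHTMLParser.
public static class MhtmlSketch
{
    // Splits the document on "--boundary" and returns each raw part
    // (part headers plus encoded body), skipping the top-level headers.
    public static List<string> SplitParts(string mhtml, string boundary)
    {
        var parts = new List<string>();
        string[] chunks = mhtml.Split(new[] { "--" + boundary }, StringSplitOptions.None);

        // chunks[0] holds the top-level headers, so start at index 1.
        for (int i = 1; i < chunks.Length; i++)
        {
            // The closing delimiter is "--boundary--", which leaves a chunk starting with "--".
            if (chunks[i].StartsWith("--", StringComparison.Ordinal))
            {
                break;
            }
            parts.Add(chunks[i].Trim());
        }
        return parts;
    }

    // Decodes the body of one part, assuming Content-Transfer-Encoding: base64
    // and that the decoded bytes are UTF-8 text.
    public static string DecodeBase64Body(string part)
    {
        // Part headers and body are separated by the first blank line.
        string normalized = part.Replace("\r\n", "\n");
        int split = normalized.IndexOf("\n\n", StringComparison.Ordinal);
        string body = split >= 0 ? normalized.Substring(split + 2) : normalized;

        // Base64 bodies are wrapped across lines; join them before decoding.
        byte[] bytes = Convert.FromBase64String(body.Replace("\n", ""));
        return Encoding.UTF8.GetString(bytes);
    }
}
```

A real file declares its boundary in the top-level Content-Type header, so a complete reader would parse that header first instead of taking the boundary as a parameter.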

Although this code reads both .mht and .mhtml files, in its current state it only decodes parts that use the base64 content-transfer encoding. It has been tested on .mhtml files exported from SQL Server Reporting Services (SSRS). It features its own logging and a way to return valid HTML (with images).

The decompression step returns a List<string[]>. Each List element is one section of the MHTML, and each string[] is laid out as follows:

string[0] is the Content-Type
string[1] is the Content-Name
string[2] is the converted data
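As an illustration of that layout, the sketch below walks a List<string[]> shaped as described and summarizes each section. The class and method names (SectionSummary, Describe) are hypothetical, and the list is passed in as a parameter because the name of the parser method that produces it is not stated here.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper for illustration only; not part of MHTMLParser.
public static class SectionSummary
{
    // Builds one summary line per section, reading the three slots
    // in the documented order: Content-Type, Content-Name, converted data.
    public static List<string> Describe(List<string[]> sections)
    {
        var lines = new List<string>();
        foreach (string[] section in sections)
        {
            string contentType = section[0];
            string contentName = section[1];
            string data = section[2];
            lines.Add(contentType + " | " + contentName + " | " + data.Length + " chars");
        }
        return lines;
    }
}
```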

The getHTMLText() method returns the full HTML, replacing each cid: reference with the corresponding base64 image data (inline data of this kind is valid in newer browsers).
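The cid: substitution can be pictured roughly as follows. This is a sketch of the general technique (rewriting cid: references as base64 data URIs), not the actual body of getHTMLText(). The names CidInliner, ToDataUri and Inline are hypothetical, and the mapping from content ID to data URI is assumed to have been built already from the decoded sections.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration only; not part of MHTMLParser.
public static class CidInliner
{
    // Builds a data URI such as "data:image/png;base64,AAAA".
    public static string ToDataUri(string contentType, string base64Data)
    {
        return "data:" + contentType + ";base64," + base64Data;
    }

    // Replaces every "cid:<id>" reference that has a known data URI.
    // Unknown content IDs are left untouched.
    public static string Inline(string html, IDictionary<string, string> dataUrisByCid)
    {
        // The content ID runs until a quote, whitespace, ">" or ")".
        return Regex.Replace(html, @"cid:([^""'\s>)]+)", match =>
        {
            string uri;
            return dataUrisByCid.TryGetValue(match.Groups[1].Value, out uri)
                ? uri
                : match.Value;
        });
    }
}
```

Matching the whole content ID with a regular expression, rather than calling string.Replace once per ID, avoids rewriting only the prefix of a longer ID (for example, img1 inside img10).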

Here is how to use it:

string mhtml = "This is your MHTML string"; // Make sure the string is in UTF-8 encoding
MHTMLParser parser = new MHTMLParser(mhtml);
string html = parser.getHTMLText(); // This is the converted HTML
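In practice the MHTML usually comes from a file rather than a literal string. The sketch below reads the file as UTF-8, hands the text to a converter, and writes the result. MhtmlFileHelper and ConvertFile are hypothetical names, and the converter is passed in as a delegate so the example does not depend on the parser class itself.

```csharp
using System;
using System.IO;
using System.Text;

// Hypothetical helper for illustration only; not part of MHTMLParser.
public static class MhtmlFileHelper
{
    // Reads inputPath as UTF-8, applies the supplied conversion,
    // and writes the resulting HTML to outputPath as UTF-8 without a BOM.
    public static void ConvertFile(string inputPath, string outputPath, Func<string, string> convert)
    {
        string mhtml = File.ReadAllText(inputPath, Encoding.UTF8);
        string html = convert(mhtml);
        File.WriteAllText(outputPath, html, new UTF8Encoding(false));
    }
}
```

With the parser shown above, the call might look like MhtmlFileHelper.ConvertFile("report.mhtml", "report.html", m => new MHTMLParser(m).getHTMLText()); where both file names are placeholders.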