Copyright © 2024 contributors of Mutaforma • Github

Home

What Mutaforma does

Mutaforma is a document conversion tool that currently supports the conversion of DOCX files to HTML.

Mutaforma's conversion process relies heavily on the semantic information present in the source document. For documents created with word processors like Microsoft Word or LibreOffice Writer, this semantic information is typically applied through the use of styles, which convey meaning and structure to the content.

For example, when converting a DOCX document to HTML using Mutaforma, all paragraphs with the applied style "Normal" will be transformed into p elements and all paragraphs with the style "Heading 1" to h1 elements.

Screenshot of an example Word document
<!DOCTYPE html>
<html>
  <head>...</head>
  <body>
    <h1>Heading with applied style "Heading 1"</h1>
    <p>Paragraph with applied style "Normal"</p>
  </body>
</html>

What Mutaforma does not

No preservation of stylistic aspects

Mutaforma does not preserve the visual styling of the source document, including font families, font sizes, colors, margins, paddings, and other layout-specific details.

No roundtrip conversion

Once a document has been converted, it cannot be reverted to its original format. This is because Mutaforma prioritizes preserving the document's semantic information, rather than its original layout or formatting.