HTML Cleaner

Paste HTML — strip out Microsoft Word and Figma export junk. Returns clean, semantic HTML you can paste into a CMS.

What this tool removes

Microsoft Word junk

  • class="MsoNormal", MsoListParagraph, etc. — all classes starting with Mso
  • style="mso-pagination: …; mso-ascii-font-family: …" — all mso-* CSS declarations
  • <o:p>, <w:wordDocument>, <v:shapetype>, <m:r> — all Office-namespaced tags
  • <!--[if gte mso 9]>…<![endif]--> — Microsoft conditional comments
  • xmlns:o=…, xmlns:w=…, xmlns:v=… on the root tag

Figma export junk

Most Figma "Export to HTML" plugins generate classes that match the layer names from your design file. They have no place in production HTML.

  • Frame12, frame-3, Group34, Rectangle1, Vector, Ellipse, Layer5
  • auto-layout-vertical, auto-layout-horizontal, auto-layout-grid
  • figma-frame, node-1234abcd (hash-based generated names)
  • Properties1, Property1Default — variant property labels
  • Type=Primary, State=Default, Size=Medium — Figma variant names

What it keeps

The cleaner is conservative — it only strips things that are definitely noise:

  • All semantic structure — elements, attributes, content text, links
  • Legitimate inline stylescolor, background, font-size, etc. (only mso-* declarations are dropped)
  • Non-Microsoft, non-Figma classes — your own utility classes, framework classes, etc.
  • Scripts, stylesheets, images — not touched

How to use it

  1. Copy HTML from Word / Figma / any source with bloat
  2. Paste it above and hit "Clean it"
  3. Review the summary of what was removed
  4. Copy or download the cleaned HTML — it's ready to paste into your CMS