Remove the inert document from the HTML fragment parsing algorithm #11970

foolip · 2025-11-28T18:55:20Z

Remove the inert document from the HTML fragment parser
Add a target argument to the HTML/XML fragment parsing algorithms

At least two implementers are interested (and none opposed):
- …
- …
Tests are written and can be reviewed and commented upon at:
- …
Implementation bugs are filed:
- Chromium: …
- Gecko: …
- WebKit: …
- Deno (only for timers, structured clone, base64 utils, channel messaging, module resolution, web workers, and web storage): …
- Node.js (only for timers, structured clone, base64 utils, channel messaging, and module resolution): …
Corresponding HTML AAM & ARIA in HTML issues & PRs:
MDN issue is filed: …
The top of this comment includes a clear commit message to use.

(See WHATWG Working Mode: Changes for more details.)

/dynamic-markup-insertion.html ( diff )
/parsing.html ( diff )
/xhtml.html ( diff )

foolip · 2025-11-28T19:04:03Z

This is speculative editing following #11669 (comment) to see what it would mean to remove the inert document from the HTML fragment parsing algorithm.

There are two questions, one small and one big.

Small: Is it necessary to put something on the stack of open elements to avoid violating assumptions elsewhere? At least Chromium and WebKit put a DocumentFragment on the stack of open elements, but that's not an element. In a quick survey of uses of "stack of open elements" I couldn't find anything that would break if it were empty. If there is something, perhaps the context element, or a shallow copy of it, could be placed on the stack instead.

Big: What were the side effects of using an inert document that implementations might have achieved in some other way, and that also need to be spec'd?

The main reason for exploring this is to pave the way for streamHTMLUnsafe() to simply insert directly into the target node. It's not strictly necessary, though: the inert document could be kept in the definition of existing APIs if it's too risky to change.

cc @zcorpan

annevk

I think I agree this is what we need to do, but I don't want to lose sight of the requirements for streamHTML() (and setHTML()) while we do this. For those cases we do still want to create in a separate document (and then maybe mutate) before moving things over.

annevk · 2025-12-01T07:42:48Z

source

-   data-x="concept-tree-child">children</span>, in <span>tree order</span>.</p></li>
+   <li><p><span data-x="concept-node-append">Append</span> the resulting <code>Document</code>
+   node's <span>document element</span>'s <span data-x="concept-tree-child">children</span> to
+   <var>target</var>, in <span>tree order</span>.</p></li>


This will create the wrong mutation records.
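
One plausible reading of this concern (an assumption, not confirmed in the thread): in the DOM Standard, inserting a DocumentFragment queues a single tree mutation record on the parent that lists every inserted node, whereas appending nodes one at a time queues one record per node. The sketch below is a toy Python model of that difference, not a real DOM implementation; `Node`, `Fragment`, `append`, and the tuple-shaped records are hypothetical names made up for illustration.

```python
class Node:
    """Toy stand-in for a DOM node (hypothetical, illustration only)."""

    def __init__(self, name):
        self.name = name
        self.parent = None
        self.children = []


class Fragment(Node):
    """Toy stand-in for a DocumentFragment."""


def append(node, parent, log):
    """Append node to parent, logging (target, added, removed) records."""
    if isinstance(node, Fragment):
        # A fragment's children move as one batch; the fragment itself stays out.
        nodes = list(node.children)
        node.children.clear()
        if nodes:
            log.append((node.name, [], [n.name for n in nodes]))
    else:
        # A node that already has a parent is removed from it first.
        if node.parent is not None:
            old_parent = node.parent
            old_parent.children.remove(node)
            log.append((old_parent.name, [], [node.name]))
        nodes = [node]
    for n in nodes:
        n.parent = parent
        parent.children.append(n)
    # One record on the new parent per append() call, listing all added nodes.
    log.append((parent.name, [n.name for n in nodes], []))


def parsed_root():
    """Model the parser's root element with three parsed children."""
    root = Node("html")
    for name in ("a", "b", "c"):
        append(Node(name), root, [])
    return root


# Case 1: append the root's children to the target one at a time.
root, target_1, log_1 = parsed_root(), Node("target"), []
for child in list(root.children):
    append(child, target_1, log_1)
one_by_one = [r for r in log_1 if r[0] == "target"]

# Case 2: collect the children in a fragment first, then append the fragment.
root, target_2, fragment = parsed_root(), Node("target"), Fragment("fragment")
for child in list(root.children):
    append(child, fragment, [])
log_2 = []
append(fragment, target_2, log_2)
batched = [r for r in log_2 if r[0] == "target"]

print(len(one_by_one), "records when appending one at a time")
print(len(batched), "record when appending via a fragment")
```

Both cases leave the target with the same children in the same order; only the records an observer on the target would see differ.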

annevk · 2025-12-01T07:44:48Z

source

-   <li><p>Let <var>document</var> be a <code>Document</code> node whose <span
-   data-x="concept-document-type">type</span> is "<code data-x="">html</code>".</p></li>
+   <li><p>Let <var>parser</var> be a new <span>HTML parser</span> associated with
+   <var>context</var>'s <span>node document</span>.</p></li>


This seems wrong, as it would mean the parser could potentially manipulate that document, though exactly how this is layered today is unclear.

Here are the bits that I could find beyond inserting nodes:

https://html.spec.whatwg.org/multipage/parsing.html#the-initial-insertion-mode (setting the quirks mode)

https://html.spec.whatwg.org/multipage/parsing.html#create-an-element-for-the-token (increment and decrement document's throw-on-dynamic-markup-insertion counter)

https://html.spec.whatwg.org/multipage/parsing.html#the-end (Update the current document readiness to "interactive" and lots of other things)

I'll have to check how implementations deal with these cases.

foolip · 2025-12-01T08:53:43Z

> I think I agree this is what we need to do, but I don't want to lose sight of the requirements for streamHTML() (and setHTML()) while we do this. For those cases we do still want to create in a separate document (and then maybe mutate) before moving things over.

Do you mean for the sanitizer, or are there other reasons to use an intermediate document? My thinking was that we'd integrate the sanitizer into the parser so that it can stream, in order to support streamHTML(), and then setHTML() could probably just use the same setup.

foolip added 2 commits November 28, 2025 19:12

Remove the inert document from the HTML fragment parser

57a1873

Add a target argument to the HTML/XML fragment parsing algorithms

46d394f

foolip changed the title ~~foolip/fragment parser no inert doc~~ Remove the inert document from the HTML fragment parser Nov 28, 2025

foolip changed the title ~~Remove the inert document from the HTML fragment parser~~ Remove the inert document from the HTML fragment parsing algorithm Nov 28, 2025

annevk reviewed Dec 1, 2025
