Remove parser function output parsing when unneeded #9

Derugon · 2024-11-29T11:48:57Z

Motivation

Parser functions and tags output a string, which may be unparsed wikitext, semi-parsed wikitext (on which replaceVariables was called), raw string, or a number (as a string).

Currently, most of these functions/tags treat their output as unparsed wikitext, and ask the parser to parse the output into semi-parsed wikitext:

parser functions set the noparse => false option on the result (parsing in a child frame),
tags call $parser->replaceVariables (parsing in the same frame).

When the output is not unparsed wikitext, 1 or 2 pre-processor nodes are created (then expanded), which takes a slight additional parsing time, without affecting the result.

Proposed changes

Remove noparse options or replaceVariables calls when we know the output does not contain any unparsed wikitext.

Note: This PR does not change whether unescaped wikitext is parsed or not. In practice, if we unescape braces or angle brackets, then we have to parse the output, otherwise we would not have to (need further testing?). This would require changing how the ParserPower::unescape function works, so I left it out of this proposal.

Derugon · 2024-11-29T16:37:58Z

It seems list functions that return unescaped wikitext parse it twice, so I'm gonna work on it a little more.

RheingoldRiver · 2024-11-29T19:28:07Z

Sounds good, thanks so much for your contributions already!!

Derugon · 2024-11-29T20:26:10Z

Well, thank you all for still maintaining it, and for taking the time to review these PRs. :)

Derugon · 2024-12-04T16:50:51Z

I wrote this had no impact on code, but it is not true.

The base issue

Using the changes from this PR with templates from an existing wiki it caused a (subtle) change, a beneficial one in my case, but still a breaking change: generated wikitext with transcluded wikitext syntax will have it evaluated.

For example, {{#trim: {{(}}{{(}}!{{)}}{{)}} }} yields |, while I would expect it to produce the same result as {{(}}{{(}}!{{)}}{{)}}, i.e. {{!}}.

This means, this PR (as of now) is a breaking change for the #trim and #or parser functions (and only these 2).

The issue with unescaping

This is a side-effect we have to deal with when unescaping: after the text is unescaped, we need to parse it again, so we parse {{#uesc: \{\{!\}\} }} the same way as {{#uesc: {{(}}{{(}}!{{)}}{{)}} }}, and both yield |.

Changing it would mean variables can no longer contribute reliably to unescaping, e.g. {{#uesc: {{X}} }} would yield {{!}} (with Template:X containing \{\{!\}\}), which would completely break the purpose of unescaping.

So I'll double down on what I said last week, and not suggest to remove the extra parsing from unescaped text in this PR.

…ensions-ParserPower into no-noparse

Mistakenly rolled back some lines in previous branch merge

…ensions-ParserPower into no-noparse

Remove parser function output parsing when unneeded

4dcf14b

Derugon marked this pull request as draft November 29, 2024 16:35

Derugon marked this pull request as ready for review December 4, 2024 16:51

Derugon added 5 commits December 4, 2024 17:56

Merge branch 'master' into no-noparse

7a09ff2

Merge branch 'master' of https://github.com/wiki-gg-oss/mediawiki-ext…

5a5b8fa

…ensions-ParserPower into no-noparse

Fix linkpage/linktext output

c31a77b

Mistakenly rolled back some lines in previous branch merge

argmap

59584ee

Merge branch 'master' of https://github.com/wiki-gg-oss/mediawiki-ext…

b08083a

…ensions-ParserPower into no-noparse

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove parser function output parsing when unneeded #9

Remove parser function output parsing when unneeded #9

Derugon commented Nov 29, 2024

Derugon commented Nov 29, 2024 •

edited

Loading

RheingoldRiver commented Nov 29, 2024

Derugon commented Nov 29, 2024

Derugon commented Dec 4, 2024 •

edited

Loading

Remove parser function output parsing when unneeded #9

Are you sure you want to change the base?

Remove parser function output parsing when unneeded #9

Conversation

Derugon commented Nov 29, 2024

Motivation

Proposed changes

Derugon commented Nov 29, 2024 • edited Loading

RheingoldRiver commented Nov 29, 2024

Derugon commented Nov 29, 2024

Derugon commented Dec 4, 2024 • edited Loading

The base issue

The issue with unescaping

Derugon commented Nov 29, 2024 •

edited

Loading

Derugon commented Dec 4, 2024 •

edited

Loading