Truncate diff of very long texts if not in verbose mode (fix #12406) #12634

devdanzin · 2024-07-20T13:54:16Z

The _diff_text_ function might try to calculate diffs of huge texts, taking a very long time, even if that diff is going to be truncated in non-verbose mode. This PR adds truncation of the texts before calculating the diff in that case. New tests added, old tests pass.

I'm not sure the code is correct for when equal trailing characters aren't skipped.

Happy to address any reviews or suggestions.

Closes #12406.

for more information, see https://pre-commit.ci

src/_pytest/assertion/util.py

…xt().

for more information, see https://pre-commit.ci

devdanzin · 2024-07-22T16:37:51Z

The logic for when equal leading characters aren't skipped is now sound.

nicoddemus

Hey @devdanzin!

Really sorry about the delay on this one, it fell through the cracks.

I'm afraid it needs further updates since #12766 landed, see my comments.

Also it is understandable if you would prefer to drop this instead, as it has not been given the attention it deserved and the implementation will probably be a bit more complicated to handle the new limit options.

nicoddemus · 2024-10-04T09:53:32Z

src/_pytest/assertion/util.py

@@ -308,6 +311,21 @@ def _diff_text(left: str, right: str, verbose: int = 0) -> list[str]:
                ]
                left = left[:-i]
                right = right[:-i]
+        shortest = min(left, right, key=lambda x: len(x))
+        lines = j = start = 0
+        if shortest.count("\n") >= DEFAULT_MAX_LINES:


I'm afraid this PR got outdated after #12766.

I think the path forward would be for _diff_text() to receive both "max lines" and "max chars" by parameter instead (they being int | None, with None meaning "no limits").

nicoddemus · 2024-10-04T09:54:34Z

src/_pytest/assertion/util.py

@@ -308,6 +311,21 @@ def _diff_text(left: str, right: str, verbose: int = 0) -> list[str]:
                ]
                left = left[:-i]
                right = right[:-i]
+        shortest = min(left, right, key=lambda x: len(x))
+        lines = j = start = 0
+        if shortest.count("\n") >= DEFAULT_MAX_LINES:


I'm under the impression the code can be made simpler by skipping max chars first, then dealing with max lines... 🤔

devdanzin added 3 commits July 20, 2024 10:37

Avoid creating a diff of very long texts if not in verbose mode.

9a3c384

Add changelog entry.

5a239d7

Add myself to AUTHORS.

c06185e

psf-chronographer bot added the bot:chronographer:provided (automation) changelog entry is part of PR label Jul 20, 2024

[pre-commit.ci] auto fixes from pre-commit.com hooks

c17e498

for more information, see https://pre-commit.ci

Pierre-Sassoulas reviewed Jul 20, 2024

View reviewed changes

src/_pytest/assertion/util.py Outdated Show resolved Hide resolved

devdanzin and others added 3 commits July 20, 2024 11:38

Use constants from truncate.py instead of magical numbers in _diff_te…

6e22c94

…xt().

Merge.

a580e6f

[pre-commit.ci] auto fixes from pre-commit.com hooks

57e0c78

for more information, see https://pre-commit.ci

Pierre-Sassoulas added the type: performance performance or memory problem/improvement label Jul 20, 2024

devdanzin added 3 commits July 22, 2024 11:47

Merge branch 'main' into huge_text_diff

8f96cf3

Fix the logic in _diff_text, add a test and update others.

83164fb

Formatting.

a48c40d

Merge branch 'pytest-dev:main' into huge_text_diff

e41e39a

nicoddemus requested changes Oct 4, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Truncate diff of very long texts if not in verbose mode (fix #12406) #12634

Truncate diff of very long texts if not in verbose mode (fix #12406) #12634

devdanzin commented Jul 20, 2024

devdanzin commented Jul 22, 2024

nicoddemus left a comment

nicoddemus Oct 4, 2024

nicoddemus Oct 4, 2024

Truncate diff of very long texts if not in verbose mode (fix #12406) #12634

Are you sure you want to change the base?

Truncate diff of very long texts if not in verbose mode (fix #12406) #12634

Conversation

devdanzin commented Jul 20, 2024

devdanzin commented Jul 22, 2024

nicoddemus left a comment

Choose a reason for hiding this comment

nicoddemus Oct 4, 2024

Choose a reason for hiding this comment

nicoddemus Oct 4, 2024

Choose a reason for hiding this comment