feat(#3744): Human-Readable Message for Prohibited Comment #3802

volodya-lombrozo · 2025-01-10T13:24:17Z

In this PR I've added supportive grammar rule that is used to catch prohibited comments and report them to a user:

prohibitedComment
    : comment
    ;

Closes: #3744.

volodya-lombrozo · 2025-01-10T14:02:52Z

@maxonfjvipon Could you have a look, please?

maxonfjvipon · 2025-01-10T14:22:34Z

@volodya-lombrozo it's a very interesting idea, but I'm not sure if it's "right" to put wrong scenarios to grammar to detect errors. It seems that grammar is supposed to contain only "possible" scenarios. Did you see anywhere else such approach when obviously incorrect scenarios are put into grammar to detect syntax errors?

volodya-lombrozo · 2025-01-10T14:57:52Z

@maxonfjvipon I share your concern. Actually, using this approach was a last resort. I didn’t want to use it initially.
As for examples, yes, there are plenty of them:

The Definitive ANTLR 4 Reference 2nd Edition: Section 9.4 Error Alternatives
The real example is the Groovy lang. They use this technique a lot (at least in Lexer). For example for UNEXPECTED_CHAR
Also some sites mention this approach as well

yegor256 · 2025-01-10T16:12:17Z

@volodya-lombrozo maybe we should use a two-phases compilation pipeline for the .g4 file. First, we take the original .g4 file and inject such additional constructs into it, producing new (very large) .g4 file. Then, we let ANTLR compile this large file. WDYT?

volodya-lombrozo · 2025-01-13T07:59:02Z

@volodya-lombrozo maybe we should use a two-phases compilation pipeline for the .g4 file. First, we take the original .g4 file and inject such additional constructs into it, producing new (very large) .g4 file. Then, we let ANTLR compile this large file. WDYT?

@yegor256 I believe this solution would be overly complicated, to be honest. Originally, the problem with error handling was related to the EO ANTLR grammar. The grammar is created in such a way that if one of the comments is prohibited (by the grammar), then almost the entire AST becomes incorrect (see the picture below).

The perfect solution would be to rework the grammar in a way to avoid such situations. But it might require more dramatic changes that I would like to avoid.

maxonfjvipon · 2025-01-13T08:48:05Z

@volodya-lombrozo could you please describe how would you change the grammar so it would be more convenient for catching error? Maybe I could figure it out

yegor256 · 2025-01-13T09:05:41Z

I believe this solution would be overly complicated, to be honest.

@volodya-lombrozo I do like this PR, but I don't understand how many of such PRs we will have in the future, with similar grammar tricks. If it's only one, that's fine. However, if there will be others, maybe we should think about something less intrusive -- something that doesn't pollute our .g4 file with error-specific rules. WDYT?

volodya-lombrozo · 2025-01-13T10:10:34Z

@volodya-lombrozo could you please describe how would you change the grammar so it would be more convenient for catching error? Maybe I could figure it out

@maxonfjvipon I can't say for now, it requires some additional deep investigation. ANTLR can't recognise either object, or objects sub-rule (object EOL?)* and immediately throws an error. It seems, that it can't decide whether object is a master or a slave:

object
    : master
    | slave
    ;

All the available recovery strategies don't help either.
As I understand, the idea here is to make ANTLR recognise deeper nodes. In order to achieve this we need to reduce collisions between rules. I guess.

volodya-lombrozo · 2025-01-13T10:20:52Z

I believe this solution would be overly complicated, to be honest.

@volodya-lombrozo I do like this PR, but I don't understand how many of such PRs we will have in the future, with similar grammar tricks. If it's only one, that's fine. However, if there will be others, maybe we should think about something less intrusive -- something that doesn't pollute our .g4 file with error-specific rules. WDYT?

@yegor256 I can't agree more here. However, I can't say for now how much of such PR's we will have in the future. I need to move forward and try to solve similar issues:

#3745
#3746

If they will require same "tricks", it will mean that we need to solve this problem somehow else. And we will be able to discuss it in the next PR.
What do you think?

yegor256 · 2025-01-13T10:27:54Z

@volodya-lombrozo maybe we should try to implement our own "Error Strategy": https://www.antlr.org/api/Java/org/antlr/v4/runtime/ANTLRErrorStrategy.html Also, check this: https://stackoverflow.com/questions/26675254/antlr-error-strategy-to-skip-tokens-until-rule-matches-again

volodya-lombrozo · 2025-01-20T11:34:50Z

We solved this issue by using different approach. See #3818

volodya-lombrozo added 6 commits December 27, 2024 15:07

feat(objectionary#3744): state the desired behaviour

cda7690

feat(objectionary#3744): add prohibitedComment rule

89e8fcb

feat(objectionary#3744): terminal error

dcc8dc3

feat(objectionary#3744): fix the grammar to catch interesting comments

d19f985

feat(objectionary#3744): verify all error messages

40c0c4d

feat(objectionary#3744): fix some code offences

70faa33

github-actions bot added the core label Jan 10, 2025

volodya-lombrozo marked this pull request as draft January 10, 2025 13:26

volodya-lombrozo added 2 commits January 10, 2025 16:31

Merge branch 'master' into 3744_wrong_position

3223143

feat(objectionary#3744): happy 2025 year

7148349

volodya-lombrozo marked this pull request as ready for review January 10, 2025 14:02

volodya-lombrozo requested a review from maxonfjvipon January 10, 2025 14:02

volodya-lombrozo mentioned this pull request Jan 15, 2025

feat(#3744): Human-Readable Error Message for Prohibited Comment in an Object Definition #3818

Merged

volodya-lombrozo closed this Jan 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(#3744): Human-Readable Message for Prohibited Comment #3802

feat(#3744): Human-Readable Message for Prohibited Comment #3802

volodya-lombrozo commented Jan 10, 2025

volodya-lombrozo commented Jan 10, 2025

maxonfjvipon commented Jan 10, 2025

volodya-lombrozo commented Jan 10, 2025 •

edited

Loading

yegor256 commented Jan 10, 2025

volodya-lombrozo commented Jan 13, 2025

maxonfjvipon commented Jan 13, 2025

yegor256 commented Jan 13, 2025

volodya-lombrozo commented Jan 13, 2025

volodya-lombrozo commented Jan 13, 2025

yegor256 commented Jan 13, 2025

volodya-lombrozo commented Jan 20, 2025

feat(#3744): Human-Readable Message for Prohibited Comment #3802

feat(#3744): Human-Readable Message for Prohibited Comment #3802

Conversation

volodya-lombrozo commented Jan 10, 2025

volodya-lombrozo commented Jan 10, 2025

maxonfjvipon commented Jan 10, 2025

volodya-lombrozo commented Jan 10, 2025 • edited Loading

yegor256 commented Jan 10, 2025

volodya-lombrozo commented Jan 13, 2025

maxonfjvipon commented Jan 13, 2025

yegor256 commented Jan 13, 2025

volodya-lombrozo commented Jan 13, 2025

volodya-lombrozo commented Jan 13, 2025

yegor256 commented Jan 13, 2025

volodya-lombrozo commented Jan 20, 2025

volodya-lombrozo commented Jan 10, 2025 •

edited

Loading