How do I start enforcing schema only after a keyword is hit during inference? #120

accupham · 2024-07-11T21:44:06Z

For example, sometimes during chain of reasoning, it is helpful to have the model reason about the answer in free-form, then output the JSON. So perhaps inference could start without enforcement, then when “‘’’\n{“ is encountered, then start enforcement.

noamgat · 2024-07-13T05:44:10Z

Interesting. It should be possible to chain several parsers, for example something like (this is pseudocode for the LM Format Enforcer)

SequenceParser
      element 1: Regex Parser <regex for anything except ```\n{>```\n{    # any string that ends with your json prefix
      element 2: JsonSchemaParser (schema).add_character('{')  # adding the { character to start at the json parsing state in which the regex already reached

RegexParser, SequenceParser and JsonSchemaParser are all existing classes.
I did not try this, but in theory it should work.
However, In reality, I would expect better results in a multiturn scenario:

User: Please do xxxxx. Share your chain of thought reasoning as well.
Assistant: .....
User: Based on the arugments above, output your answer in JSON in the following schema: <schema>
Assistant: <LMFE Json Schema Parser active here>

As in this way, it is mandatory to start the json output at a specific point, where it makes sense conversation wise. In the first scenario, the LLM might want to end the response, and LMFE won't let it (because it didn't output json yet), causing hallucinations.

accupham · 2024-07-13T13:54:29Z

With regards to the first scenario, what if you did something like this:

SequenceParser:
    element 1:
        UnionParser:
             element 1: RegexParser
             element 2: ForceStopParser
    element 2:
        JSONSchemaParser

I think I would have to modify Sequence parser can_end so it could stop on any, instead of all here:

lm-format-enforcer/lmformatenforcer/characterlevelparser.py

Line 165 in f1dd75b

return all([parser.can_end() for parser in self.parsers])

But then now the LLM can end the conversation if it encounters an EOS or stopword before JSON is emitted if the conversation is appropriate, avoiding nasty hallucinations.

I would like to open a PR by adding a new parser that does effectively this with one-shot. Is this the right direction or am I overcomplicating things?

noamgat · 2024-07-16T17:45:51Z

I'm not sure it warrants a PR, its OK if your code has classes from the LMFE hierarchy. Maybe it could be a sample.
Alternatively, create a UnionParser from the two options (one with the json and one without).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How do I start enforcing schema only after a keyword is hit during inference? #120

How do I start enforcing schema only after a keyword is hit during inference? #120

accupham commented Jul 11, 2024

noamgat commented Jul 13, 2024

accupham commented Jul 13, 2024 •

edited

Loading

noamgat commented Jul 16, 2024

How do I start enforcing schema only after a keyword is hit during inference? #120

How do I start enforcing schema only after a keyword is hit during inference? #120

Comments

accupham commented Jul 11, 2024

noamgat commented Jul 13, 2024

accupham commented Jul 13, 2024 • edited Loading

noamgat commented Jul 16, 2024

accupham commented Jul 13, 2024 •

edited

Loading