Call Analyst to perform validation #187

sfc-gh-cnivera · 2024-10-18T23:41:47Z

We currently maintain copies of the validation logic in both the internal Analyst codepaths as well as this OSS app. Often, the OSS app can become out of date. Instead of performing validation locally, we will simply call Analyst with the current YAML string, as it performs validation at inference time. Any error returned is shown to the user.

The diff for this PR seems big but it's mostly deleting unnecessary code + tests.

sfc-gh-cnivera · 2024-10-18T23:44:49Z

app_utils/chat.py

+
+
+@st.cache_data(ttl=60, show_spinner=False)
+def send_message(


I just moved send_message out of iteration.py and into chat.py so that we could share it with the validation code.

Most of the code is the same except for how we handle errors; instead of preprocessing it into a "Failed request with status" message, we attempt to pick out the message field from the error payload and render it, which hopefully shows a bit nicer in the UI.

sfc-gh-ybsat · 2024-10-21T20:05:24Z

semantic_model_generator/validate_model.py

-        logger.info(f"Checking logical table: {table.name}")
-        try:
-            validate_all_cols(table)
-            sqls = generate_select(table, 1)


Logan recently added an "explain select" PR in orchestrator which does this generate select behavior.
It is implemented as part of the parallel task flow (so further away from the validation path). Can we test the behavior and see if we also get a useful error message displayed to the user if the physical columns dont exist on the table?

If the physical column doesn't exist (e.g. i just renamed one of the dimensions to something fake), it seems that the validation error isn't thrown. Looking through the orchestrator logs I do see that there is an error being warned but it doesn't look like it is propagated as a fatal error, I've asked Logan some questions on which warnings are returned from Analyst.

The warning is temporary (like baby sit it in prod) but i think he plans to return error eventually. Good to confirm with him

Logan confirmed what you mentioned - for now it just logs the error but doesn't return yet, so we don't get a validation error yet. Once that's changed on the orchestrator side I think we'll see errors here properly

sfc-gh-ybsat · 2024-10-21T20:07:13Z

semantic_model_generator/validate_model.py

-
-    logger.info("Successfully validated!")
+    dummy_request = [
+        {"role": "user", "content": [{"type": "text", "text": "SMG app validation"}]}


do you plan to handle this special string of "SMG app validation" on orchestrator side?

the one part to confirm is whether we will reliable get the "explain select" behavior in parallel task flow, or whether, if the classification module returns first (which is unlikely) we would not get the right error message back to user

Ah interesting, thanks for calling that out. I do think we see the explain select behavior, in the orchestrator logs I can see Begin validating semantic context and End validating semantic context with this dummy question.

I don't think I want to do any special casing for this question in the orchestrator inference paths (besides from perhaps preventing logs from being emitted, or something). Is there a generic question that'd work better to make sure we hit the relevant validation codepaths?

Ok this is good enough if you saw it working.
I like your sample question now because it is basically guaranted to be non-sql (and thus not block on sqlgen latency)

sfc-gh-cnivera added 2 commits October 18, 2024 16:40

call analyst for validation

b4e3ec2

better error handling

5bd070c

sfc-gh-cnivera linked an issue Oct 18, 2024 that may be closed by this pull request

Migrate local validation to Analyst call #183

Closed

sfc-gh-cnivera commented Oct 18, 2024

View reviewed changes

sfc-gh-cnivera marked this pull request as ready for review October 21, 2024 17:48

sfc-gh-cnivera requested review from sfc-gh-rehuang, sfc-gh-jsummer and sfc-gh-twhite as code owners October 21, 2024 17:48

sfc-gh-cnivera requested a review from sfc-gh-ybsat October 21, 2024 19:51

sfc-gh-ybsat reviewed Oct 21, 2024

View reviewed changes

sfc-gh-cnivera requested a review from sfc-gh-ybsat October 22, 2024 18:34

sfc-gh-ybsat approved these changes Oct 23, 2024

View reviewed changes

sfc-gh-cnivera merged commit 2f1f675 into main Oct 23, 2024
2 checks passed

sfc-gh-cnivera deleted the cnivera/remote-validation branch October 23, 2024 22:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Call Analyst to perform validation #187

Call Analyst to perform validation #187

sfc-gh-cnivera commented Oct 18, 2024 •

edited

Loading

sfc-gh-cnivera Oct 18, 2024 •

edited

Loading

sfc-gh-ybsat Oct 21, 2024

sfc-gh-cnivera Oct 21, 2024

sfc-gh-ybsat Oct 21, 2024

sfc-gh-cnivera Oct 22, 2024 •

edited

Loading

sfc-gh-ybsat Oct 21, 2024

sfc-gh-cnivera Oct 21, 2024 •

edited

Loading

sfc-gh-ybsat Oct 21, 2024



		@st.cache_data(ttl=60, show_spinner=False)
		def send_message(

Call Analyst to perform validation #187

Call Analyst to perform validation #187

Conversation

sfc-gh-cnivera commented Oct 18, 2024 • edited Loading

sfc-gh-cnivera Oct 18, 2024 • edited Loading

Choose a reason for hiding this comment

sfc-gh-ybsat Oct 21, 2024

Choose a reason for hiding this comment

sfc-gh-cnivera Oct 21, 2024

Choose a reason for hiding this comment

sfc-gh-ybsat Oct 21, 2024

Choose a reason for hiding this comment

sfc-gh-cnivera Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

sfc-gh-ybsat Oct 21, 2024

Choose a reason for hiding this comment

sfc-gh-cnivera Oct 21, 2024 • edited Loading

Choose a reason for hiding this comment

sfc-gh-ybsat Oct 21, 2024

Choose a reason for hiding this comment

sfc-gh-cnivera commented Oct 18, 2024 •

edited

Loading

sfc-gh-cnivera Oct 18, 2024 •

edited

Loading

sfc-gh-cnivera Oct 22, 2024 •

edited

Loading

sfc-gh-cnivera Oct 21, 2024 •

edited

Loading