What's the right langchain API for binding output parsing vs native structured generation vs tool calling. (Target models - GPT4o & Gemini 1.5 pro) #6851

AnishPimpley · 2024-09-19T18:08:38Z

AnishPimpley
Sep 19, 2024

Checked other resources

I added a very descriptive title to this question.
I searched the LangChain documentation with the integrated search.
I used the GitHub search to find a similar question and didn't find it.

Commit to Help

I commit to help with one of those options 👆

Example Code

from typing import Optional

from langchain_core.pydantic_v1 import BaseModel, Field


# Pydantic
class Joke(BaseModel):
    """Joke to tell user."""

    setup: str = Field(description="The setup of the joke")
    punchline: str = Field(description="The punchline to the joke")
    rating: Optional[int] = Field(
        default=None, description="How funny the joke is, from 1 to 10"
    )


structured_llm = llm.with_structured_output(Joke)

structured_llm.invoke("Tell me a joke about cats")


Is this the right syntax for native structured generation ?

Description

My usecase:

To generate long form text (blogs) with structed output generation. The current structured generation workflow in langchain has ~4 ways of generating structured output. I am confused about which is which.

I'm outlining all the types of structured text generation in langchain below. They are in order of preference.

Native structured outputs - https://openai.com/index/introducing-structured-outputs-in-the-api/

This is where the LLM provider takes care of structured output generation. The LLM provider ingests a pydantic schema and guarantees structured output. The provider's implementation is opaque, but the intuition is that the LLM provider does "normal generation + constrained decoding" on their servers. There may be some extra tuning or prompt engineering work done on their end, but it is black-box. This is most desirable, as the implementation is native with minimal langchain interference.

Json mode - https://python.langchain.com/docs/integrations/chat/openai/#model-features

The precursor to structured outputs. It guarantees JSON but doesn't take in schema as parameter. Schema must be passed as part of the prompt. It is also natively supported by the LLM provider. Likely also uses "normal generation + constrained decoding". Must be paired with a 3rd party (langchain) output parser to guarantee that the JSON follows some target pydantic schema.

Plain generation + output parsing - https://python.langchain.com/v0.1/docs/modules/model_io/output_parsers/types/pydantic/

This is full YOLO mode. The model generates text. No guarantees for it to be valid JSON or follow a schema. But, most times the model still succeeds. All guarantees are relegated to the post processing output parser. This is not ideal, but is still better than function calling

Native Tool calling - https://python.langchain.com/docs/integrations/chat/openai/#tool-calling

Tool calling has the same handshake, but llm providers use a separate fine tuned models (openai functions) for tool calling. Tool calling finetunes make the models concise, and unsuitable for long form text generation. I do not want tool calling, it creates tiny outputs and is strongly biased to write like a programmer, not a creative individual.

My question:

I want to ensure I'm using native structured outputs for Azure ChatOpenAI (GPT 4o latest) and ChatVertexAI (gemini 1.5 pro).

What is the right python syntax for it ?

Is this the right syntax - https://python.langchain.com/docs/how_to/structured_output/#pydantic-class ?

System Info

platform - Linux (wsl)
package - python

@dosu · 2024-09-19T18:10:28Z

dosubot[bot]
bot Sep 19, 2024

To ensure you're using native structured outputs for Azure ChatOpenAI (GPT 4o latest) and ChatVertexAI (Gemini 1.5 pro), you can use the with_structured_output method. Here is the correct syntax for implementing this in TypeScript:

Using a Zod Schema:

import { z } from "zod";

const joke = z.object({
  setup: z.string().describe("The setup of the joke"),
  punchline: z.string().describe("The punchline to the joke"),
  rating: z.number().optional().describe("How funny the joke is, from 1 to 10"),
});

const structuredLlm = model.withStructuredOutput(joke);

await structuredLlm.invoke("Tell me a joke about cats");

Passing a Schema Name for Additional Context:

const structuredLlm = model.withStructuredOutput(joke, { name: "joke" });

await structuredLlm.invoke("Tell me a joke about cats");

Using an OpenAI-style JSON Schema Dict:

const structuredLlm = model.withStructuredOutput(
  {
    "name": "joke",
    "description": "Joke to tell user.",
    "parameters": {
      "title": "Joke",
      "type": "object",
      "properties": {
        "setup": {"type": "string", "description": "The setup for the joke"},
        "punchline": {"type": "string", "description": "The joke's punchline"},
      },
      "required": ["setup", "punchline"],
    },
  }
);

await structuredLlm.invoke("Tell me a joke about cats", { name: "joke" });

Specifying the Output Method:

const structuredLlm = model.withStructuredOutput(joke, {
  method: "json_mode",
  name: "joke",
});

await structuredLlm.invoke(
  "Tell me a joke about cats, respond in JSON with `setup` and `punchline` keys"
);

These examples ensure that the output adheres to the defined schema, providing structured data generation for your use case with Azure ChatOpenAI and ChatVertexAI ^[1]^[2].

To continue talking to Dosu, mention @dosu.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's the right langchain API for binding output parsing vs native structured generation vs tool calling. (Target models - GPT4o & Gemini 1.5 pro) #6851

{{title}}

Replies: 1 comment

{{title}}

Select a reply

What's the right langchain API for binding output parsing vs native structured generation vs tool calling. (Target models - GPT4o & Gemini 1.5 pro) #6851

AnishPimpley Sep 19, 2024

Checked other resources

Commit to Help

Example Code

Description

System Info

Replies: 1 comment

dosubot[bot] bot Sep 19, 2024

AnishPimpley
Sep 19, 2024

dosubot[bot]
bot Sep 19, 2024