From 8ec48d8a38b881e05cc2ae39a3168ceed40566ac Mon Sep 17 00:00:00 2001 From: RizaFarheen Date: Tue, 25 Jun 2024 14:29:08 +0400 Subject: [PATCH] Optional flag for LLM tasks --- docs/reference-docs/ai-tasks/llm-chat-complete.md | 1 + docs/reference-docs/ai-tasks/llm-generate-embeddings.md | 1 + docs/reference-docs/ai-tasks/llm-get-embeddings.md | 1 + docs/reference-docs/ai-tasks/llm-index-document.md | 1 + docs/reference-docs/ai-tasks/llm-index-text.md | 1 + docs/reference-docs/ai-tasks/llm-search-index.md | 1 + docs/reference-docs/ai-tasks/llm-store-embeddings.md | 1 + docs/reference-docs/ai-tasks/llm-text-complete.md | 1 + 8 files changed, 8 insertions(+) diff --git a/docs/reference-docs/ai-tasks/llm-chat-complete.md b/docs/reference-docs/ai-tasks/llm-chat-complete.md index 63460db5..d597af38 100644 --- a/docs/reference-docs/ai-tasks/llm-chat-complete.md +++ b/docs/reference-docs/ai-tasks/llm-chat-complete.md @@ -45,6 +45,7 @@ A system task to complete the chat query. It can be used to instruct the model's | stopWords | Provide the stop words to be omitted during the text generation process.

In LLM, stop words may be filtered out or given less importance during the text generation process to ensure that the generated text is coherent and contextually relevant. | | topP | Another parameter to control the randomness of the model’s output. This parameter defines a probability threshold and then chooses tokens whose cumulative probability exceeds this threshold.

For example: Imagine you want to complete the sentence: “She walked into the room and saw a ______.” Now, the top 4 words the LLM model would consider based on the highest probabilities would be:If you set the top-p parameter to 0.70, the AI will consider tokens until their cumulative probability reaches or exceeds 70%. Here's how it works:At this point, the cumulative probability is 75%, exceeding the set top-p value of 70%. Therefore, the AI will randomly select one of the tokens from the list of "Cat," "Dog," and "Book" to complete the sentence because these tokens collectively account for approximately 75% of the likelihood. | | maxTokens | The maximum number of tokens to be generated by the LLM and returned as part of the result. A token should be approximately 4 characters. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Output Parameters diff --git a/docs/reference-docs/ai-tasks/llm-generate-embeddings.md b/docs/reference-docs/ai-tasks/llm-generate-embeddings.md index ec6fa203..d55eabe6 100644 --- a/docs/reference-docs/ai-tasks/llm-generate-embeddings.md +++ b/docs/reference-docs/ai-tasks/llm-generate-embeddings.md @@ -34,6 +34,7 @@ A system task to generate embeddings from the input data provided. Embeddings ar | llmProvider | Choose the required LLM provider. You can only choose providers to which you have access for at least one model from that provider.

**Note**:If you haven’t configured your AI / LLM provider on your Orkes console, navigate to the **Integrations** tab and configure your required provider. Refer to this doc on [how to integrate the LLM providers with Orkes console and provide access to required groups](/content/category/integrations/ai-llm).| | model | Choose from the available language model for the chosen LLM provider. You can only choose models for which you have access.

For example, If your LLM provider is Azure Open AI & you’ve configured *text-davinci-003* as the language model, you can choose it under this field. | | text | Provide the text to be converted and stored as a vector. The text can also be [passed as parameters to the workflow](https://orkes.io/content/developer-guides/passing-inputs-to-task-in-conductor).| +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Output Parameters diff --git a/docs/reference-docs/ai-tasks/llm-get-embeddings.md b/docs/reference-docs/ai-tasks/llm-get-embeddings.md index 894adfb0..3c214f29 100644 --- a/docs/reference-docs/ai-tasks/llm-get-embeddings.md +++ b/docs/reference-docs/ai-tasks/llm-get-embeddings.md @@ -32,6 +32,7 @@ A system task to get the numerical vector representations of words, phrases, sen | namespace | Choose from the available namespace configured within the chosen vector database.

Namespaces are separate isolated environments within the database to manage and organize vector data effectively.

**Note**:Namespace field is applicable only for Pinecone integration and is not applicable to Weaviate integration.| | index | Choose the index in your vector database where indexed text or data was stored.

**Note:**For Weaviate integration, this field refers to the class name, while in Pinecone integration, it denotes the index name itself.| | embeddings | Choose the embeddings from which the stored data is to be retrieved. It needs to be from the same embedding model that was used to create the other embeddings that are stored in the same index. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Output Parameters diff --git a/docs/reference-docs/ai-tasks/llm-index-document.md b/docs/reference-docs/ai-tasks/llm-index-document.md index cfd80213..a5350bb9 100644 --- a/docs/reference-docs/ai-tasks/llm-index-document.md +++ b/docs/reference-docs/ai-tasks/llm-index-document.md @@ -42,6 +42,7 @@ A system task to index the provided document into a vector database that can be | mediaType | Select the media type of the file to be indexed. Currently, supported media types include: | | chunkSize | Specifies how long each segment of the input text should be when it’s divided for processing by the LLM.

For example, if your article contains 2000 words and you specify the chunk size of 500, then the document would be divided into four chunks for processing. | | chunkOverlap | Specifies the overlap quantity between the adjacent chunks.

For example, if the chunk overlap is specified as 100, then the first 100 words of each chunk would overlap with the last 100 words of the previous chunk. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Examples diff --git a/docs/reference-docs/ai-tasks/llm-index-text.md b/docs/reference-docs/ai-tasks/llm-index-text.md index 00b58995..a05cee37 100644 --- a/docs/reference-docs/ai-tasks/llm-index-text.md +++ b/docs/reference-docs/ai-tasks/llm-index-text.md @@ -38,6 +38,7 @@ A system task to index the provided text into a vector space that can be efficie | embeddingModel | Choose from the available language model for the chosen LLM provider. | | text | Provide the text to be indexed. | | docId | Provide the ID of the document where you need to store the indexed text. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Examples diff --git a/docs/reference-docs/ai-tasks/llm-search-index.md b/docs/reference-docs/ai-tasks/llm-search-index.md index 1db00448..1e4de348 100644 --- a/docs/reference-docs/ai-tasks/llm-search-index.md +++ b/docs/reference-docs/ai-tasks/llm-search-index.md @@ -38,6 +38,7 @@ For example, in a recommendation system, a user might issue a query to find prod | llmProvider | Choose the required LLM provider configured.

**Note:**If you haven’t configured your AI / LLM provider on your Orkes console, navigate to the Integrations tab and configure your required provider. Refer to this doc on [how to integrate the LLM providers with Orkes console](/content/category/integrations/ai-llm).| | model | Choose from the available language model configured for the chosen LLM provider.

For example, If your LLM provider is Azure Open AI & you’ve configured _text-davinci-003_ as the language model, you can choose it under this field. | | query | Provide your search query. A query typically refers to a question, statement, or request made in natural language that is used to search, retrieve, or manipulate data stored in a database. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Output Parameters diff --git a/docs/reference-docs/ai-tasks/llm-store-embeddings.md b/docs/reference-docs/ai-tasks/llm-store-embeddings.md index d054af82..995c8f7e 100644 --- a/docs/reference-docs/ai-tasks/llm-store-embeddings.md +++ b/docs/reference-docs/ai-tasks/llm-store-embeddings.md @@ -36,6 +36,7 @@ A system task responsible for storing the generated embeddings produced by the [ | embeddingModelProvider | Choose the required LLM provider for embedding.

**Note**:If you haven’t configured your AI / LLM provider on your Orkes console, navigate to the **_Integrations_** tab and configure your required provider. Refer to this doc on [how to integrate the LLM providers with Orkes console](https://orkes.io/content/category/integrations/ai-llm). | | embeddingModel | Choose from the available language model for the chosen LLM provider. | | Id | Optional field to provide the vector ID. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Examples diff --git a/docs/reference-docs/ai-tasks/llm-text-complete.md b/docs/reference-docs/ai-tasks/llm-text-complete.md index f0214cb9..697e4f81 100644 --- a/docs/reference-docs/ai-tasks/llm-text-complete.md +++ b/docs/reference-docs/ai-tasks/llm-text-complete.md @@ -47,6 +47,7 @@ A system task to predict or generate the next phrase or words in a given text ba | stopWords | Provide the stop words to be omitted during the text generation process.

In LLM, stop words may be filtered out or given less importance during the text generation process to ensure that the generated text is coherent and contextually relevant. | topP | Another parameter to control the randomness of the model’s output. This parameter defines a probability threshold and then chooses tokens whose cumulative probability exceeds this threshold.

For example: Imagine you want to complete the sentence: “She walked into the room and saw a ______.” Now, the top 4 words the LLM model would consider based on the highest probabilities would be:If you set the top-p parameter to 0.70, the AI will consider tokens until their cumulative probability reaches or exceeds 70%. Here's how it works:At this point, the cumulative probability is 75%, exceeding the set top-p value of 70%. Therefore, the AI will randomly select one of the tokens from the list of "Cat," "Dog," and "Book" to complete the sentence because these tokens collectively account for approximately 75% of the likelihood.| | maxTokens | The maximum number of tokens to be generated by the LLM and returned as part of the result. A token should be approximately 4 characters. | +| optional | Enabling this option renders the task optional. The workflow continues unaffected by the task's outcome, whether it fails or remains incomplete. | ## Output Parameters