📦 Release: v0.1.0-rc.1 #280

roma-glushko · 2024-06-24T09:52:44Z

The first major update with breaking changes to the language chat schemas
and begging of work on instrumenting the gateway with OpenTelemetry.

- Initing a new type of workflow, a streaming (async) routing workflow using the Streaming Chat API as an example - Updated the Bruno collection - Updated the LanguageModel API to include `ChatStream()` and `SupportChatStream()` methods - Get the streaming router working - Implemented SSE event parsing to be able to work with OpenAI streaming chat API - Integrated OpenAI chat streaming into the Glide's streaming chat API - Covered the happy workflow by tests

* 🔒 Upgraded the crypto lib * ⬆️ Upgrade Go to 1.22.1 * 🔒 Fiber to v2.52.2

…tStream (#166) - Separated sync and streaming chat schemas - Extracted assumptions on where to find latency from routing strategies to a separate `LatencyGetters` that can be different for different models/workflows - Elaborated the client provider `chatStream()` interface. Clients now expose a response channel instead of being provided with by caller - Connected the stream chat workflow to latency & health tracking - Refined the `chatStream()` method of clients to return a stream struct - Separated latency tracking of the streaming workflow from the sync chat workflow - defined a new `HealthTracker` to incorporate all health tracking logic

Improve general coverage of the codebase: - covered a few configs by tests - file content expansion in configurations

- Separated chat & chat stream request schemas - introduced a new finish reason field - added metadata to stream chat response - allow to attach some metadata to a chat stream request and then attach it to each chat stream chunk - adjusted error message schema to include request ID and metadata

Handle a wrong API key case to make the model as unavailable permanently

…age (#184) - Fixed the header where Anthropic API key is passed - Started propagating token usage of Anthropic requests - Corrected the TokenUsage interface by changing count field to integers from floats

* #173: add streaming * #173: update header and test data * #173: Update test and schema * #173: lint --------- Co-authored-by: Max <[email protected]>

* #171: support streaming * #171: add tests & lint * #171: update chat.go * #171: lint * #171: update test --------- Co-authored-by: Max <[email protected]>

…penAI, Azure and Cohere (#194) - text length bound passed in request params - content moderation/toxicity - Cohere streaming workflow doesn't seem to be working as errMapper was not really initialized. I have fixed that in this PR - Cohere now ignores stream chunk types that Glide doesn't support like citation related stuff - Cohere stream chunks are not set with the correct model name (e.g. some placeholder was used before)

Rendering Durations as strings rather than nanosecond integers

…re chat streams correctly (#201) - implementing a custom stream reader to correctly handle Cohere streams - Start handling the stream-start event to propagate generationID to all following chunks

…n case of some errors (#203) - Passed RouterID and ModelID information in the chat stream messages - Introduced a new ChatStreamMessage type that joins both chunk and error messages. Removed unneeded context from provider chatStream structs - defined a set of possible error codes during chat streaming - started simplifying logging by using context-based loggers - Introduced finish_reason on the error schema

- Fixed validation of nested arrays, so it can now reach all structures including provider params - Removed ChatHistory & ConversationID fields from the params - Added a bunch of other params like max_tokens, penalties, k, p, etc. - Added validations to some params

…g swagger.yaml file (#211) This change fixes panics like "./docs/swagger.yaml is not found"

# Conflicts: # README.md # docs/docs.go # docs/swagger.json # docs/swagger.yaml # go.mod # go.sum # pkg/api/http/handlers.go # pkg/api/http/server.go # pkg/api/schemas/chat_stream.go # pkg/gateway.go # pkg/providers/azureopenai/chat_stream.go # pkg/providers/azureopenai/client.go # pkg/providers/cohere/chat.go # pkg/providers/cohere/chat_stream.go # pkg/providers/cohere/chat_stream_test.go # pkg/providers/cohere/client.go # pkg/providers/cohere/config.go # pkg/providers/cohere/schemas.go # pkg/providers/cohere/testdata/chat_stream.success.txt # pkg/providers/lang.go # pkg/providers/openai/chat.go # pkg/providers/openai/chat_stream.go # pkg/providers/openai/client.go # pkg/providers/provider.go # pkg/providers/testing/lang.go # pkg/providers/testing/models.go # pkg/routers/config.go # pkg/routers/router.go # pkg/routers/router_test.go

…rics and OTEL Collector (#225)

We use go.opentelemetry.io/contrib/exporters/autoexport for standard loading exporter configurations via env variables.

… secrets in a secure way (#244)

…251)

…s to give clients more context around the error (#236) - Introduced a new error type to hold useful context like HTTP response status, error name, message - If all providers are unavailable, we are not throwing 500 error anymore but 503 - Start throwing unknown_error with 500 status on unexpected exceptions - Predefined all static HTTP errors instead of creating them every time they occur - Introduced the name field on the error schema - Changed the req/response schema to snake_case (hopefully, to stick with it forever) - Removed Bruno collections (it doesn't cover all our needs like websocket or gRPC protocol) - Moved all schemas to `api/schema` package - Made router list API opaque - Changed the field name for overrides not to clash with defined statements in some languages

…er configs (#260)

Removing omitempty fields from chat response per request

- Introduced a new concept/struct `ChatParams` that contains all param overrides for the specific modelName/modelID - Adjusted the LangModel interface to rely on `ChatParams` rather than the original request schema for both sync and stream chat API - Standardize on the chat message structure with two fields. Removed all duplicated structures - Fixed Ollama's broken/half-backed tests

Basic implementation of connection pooling for chat functionality.

codecov · 2024-06-24T10:36:12Z

Codecov Report

Attention: Patch coverage is 67.04805% with 144 lines in your changes missing coverage. Please review.

Project coverage is 65.98%. Comparing base (754af34) to head (59aafab).

Files	Patch %	Lines
pkg/api/http/handlers.go	0.00%	23 Missing ⚠️
pkg/api/schemas/errors.go	10.52%	17 Missing ⚠️
pkg/api/schemas/pool.go	0.00%	16 Missing ⚠️
pkg/providers/bedrock/chat.go	54.16%	11 Missing ⚠️
pkg/gateway.go	0.00%	9 Missing ⚠️
pkg/telemetry/telemetry.go	86.20%	6 Missing and 2 partials ⚠️
pkg/api/schemas/chat_stream.go	0.00%	6 Missing ⚠️
pkg/providers/anthropic/chat.go	75.00%	3 Missing and 1 partial ⚠️
pkg/providers/ollama/chat.go	75.00%	3 Missing and 1 partial ⚠️
pkg/api/schemas/chat.go	88.88%	2 Missing and 1 partial ⚠️
... and 23 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #280      +/-   ##
==========================================
- Coverage   66.98%   65.98%   -1.00%     
==========================================
  Files          78       83       +5     
  Lines        3577     3634      +57     
==========================================
+ Hits         2396     2398       +2     
- Misses       1054     1114      +60     
+ Partials      127      122       -5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

roma-glushko and others added 30 commits March 3, 2024 00:12

🐛 Update README.md to fix helm chart location (#167)

ec690f4

🔓 #148 update crypto lib, golang, fiber (#168)

c480cfe

* 🔒 Upgraded the crypto lib * ⬆️ Upgrade Go to 1.22.1 * 🔒 Fiber to v2.52.2

🔧 updated .go-version

1add6e3

✅ #146: Covered the telemetry by tests (#176)

3742035

Improve general coverage of the codebase: - covered a few configs by tests - file content expansion in configurations

✨ #170 Handle unauthorized error in health tracker (#180)

369ddea

Handle a wrong API key case to make the model as unavailable permanently

✨🐛 #183: Fix Anthropic API key header and start counting its token us…

e37b0f8

…age (#184) - Fixed the header where Anthropic API key is passed - Started propagating token usage of Anthropic requests - Corrected the TokenUsage interface by changing count field to integers from floats

📝 Separate and list all supported capabilities per provider (#190)

bc1a665

✨ #173: Add Streaming Support for Azure OpenAI (#188)

428c467

* #173: add streaming * #173: update header and test data * #173: Update test and schema * #173: lint --------- Co-authored-by: Max <[email protected]>

✨ #171: Cohere Streaming Chat Support (#189)

f76eb86

* #171: support streaming * #171: add tests & lint * #171: update chat.go * #171: lint * #171: update test --------- Co-authored-by: Max <[email protected]>

📝 Made Glide's changelog compatible with the keepchangelog.com format

f27e889

📝 Kept Python & NodeJS clients in SDK section

12df04f

🔧 #186: Rendering Durations in a human-friendly way (#202)

45d9aa1

Rendering Durations as strings rather than nanosecond integers

🐛 #200: Implemented a custom json per line stream reader to read Cohe…

9a0ee6d

…re chat streams correctly (#201) - implementing a custom stream reader to correctly handle Cohere streams - Start handling the stream-start event to propagate generationID to all following chunks

📝 Updated providers that support chat streaming

de3677e

🐛 #209: Embed Swagger specs into binary to fix panic caused by missin…

8e77c63

…g swagger.yaml file (#211) This change fixes panics like "./docs/swagger.yaml is not found"

📝 Added 0.0.3-rc.2 changelog

75c2df3

📝 Marked 0,03-rc.2 as stable version 0.0.3

1b9e9f6

Merge branch 'main' into develop

8f4a8be

📝 Fixed typo

cad92f5

🐛 #217: Set build info correctly in Glide images (#218)

bc5d95f

👷 #219: Setup local telemetry stack with Jaeger, Grafana, VictoriaMet…

cf137da

…rics and OTEL Collector (#225)

🔧 Use github.com/EinStack/glide as module name to support go install cmd

2db285c

📝 Defined a way to manage EinStack Glide project (#234)

66c5f5b

roma-glushko and others added 19 commits May 5, 2024 18:32

👷‍♂️ Added a new GH action to watch for glide activity stream (#239)

d453ca6

✨🔧 Setup Open Telemetry Metrics and Traces (#237)

602f53e

We use go.opentelemetry.io/contrib/exporters/autoexport for standard loading exporter configurations via env variables.

🐛 Running the notification action from the base repo context to share…

8b420f7

… secrets in a secure way (#244)

🔧 #221 Add B3 propagator (#242)

a54eb57

🔧 #241 Support overriding OTEL resource attributes (#243)

ab244b2

🔧 #164 Make client connection pool configurable across all providers (#…

bd03442

…251)

🔧 #248 Disable span and metrics by default (#254)

321b002

🔧 #220 Instrument API server (#255)

3b37d0b

💥 Convert all camelCase config fields to the snake_case in the provid…

70f1207

…er configs (#260)

🔧 Instrument gateway process (#256)

8986edb

🔧 #238 Implements human readable durations in config (#253)

03f1805

🔧 #266: removing omitempty from response definition (#267)

da72edb

Removing omitempty fields from chat response per request

🔧 #262: adding connection pool for chat request and response (#271)

0bb4878

Basic implementation of connection pooling for chat functionality.

✨ Switched to the new docs

27b98af

🔒 Updated golang to 1.22.4 to address CVE-2024-24790 (#276)

b7e7db4

🔧 Adds support for Air live reloading (#270)

08ee414

🔧 #240: Automatically install air (#277)

765f874

roma-glushko added the type:release label Jun 24, 2024

roma-glushko self-assigned this Jun 24, 2024

#67: Collected changelog for 0.1.0-rc.1

7faf386

roma-glushko changed the title ~~📦 Release: v0.1.0~~ 📦 Release: v0.1.0-rc.1 Jun 24, 2024

Merge branch 'main' into develop

59aafab

roma-glushko merged commit 1e2b16f into main Jun 24, 2024
15 of 16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

📦 Release: v0.1.0-rc.1 #280

📦 Release: v0.1.0-rc.1 #280

roma-glushko commented Jun 24, 2024 •

edited

Loading

codecov bot commented Jun 24, 2024

📦 Release: v0.1.0-rc.1 #280

📦 Release: v0.1.0-rc.1 #280

Conversation

roma-glushko commented Jun 24, 2024 • edited Loading

Added

Changed

Breaking Changes

Fixed

Security

Miscellaneous

codecov bot commented Jun 24, 2024

Codecov Report

roma-glushko commented Jun 24, 2024 •

edited

Loading