You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When launch the DocSum example using the docker compose and/or helm with the latest images built from source, sending the curl request to the DocSum mega gateway service results the following errors:
$ curl http://${host_ip}:8888/v1/docsum -H "Content-Type: multipart/form-data" -F "messages=Text Embeddings Inference (TEI) is a toolkit for deploying and serving open source text embeddings and sequence classification models. TEI enables high-performance extraction for the most popular models, including FlagEmbedding, Ember, GTE and E5." -F "max_tokens=32" -F "language=en" -F "stream=false"
Internal Server Error
Reproduce steps
Follow the DocSum Xeon Readme:
export host_ip=
source ../../../set_env.sh
docker compose up -d
curl http://${host_ip}:8888/v1/docsum -H "Content-Type: multipart/form-data" -F "messages=Text Embeddings Inference (TEI) is a toolkit for deploying and serving open source text embeddings and sequence classification models. TEI enables high-performance extraction for the most popular models, including FlagEmbedding, Ember, GTE and E5." -F "max_tokens=32" -F "language=en" -F "stream=false"
Raw log
$ docker compose logs docsum-xeon-backend-server
WARN[0000] The "https_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "HUGGINGFACEHUB_API_TOKEN" variable is not set. Defaulting to a blank string.
WARN[0000] The "no_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "http_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "no_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "http_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "https_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "HUGGINGFACEHUB_API_TOKEN" variable is not set. Defaulting to a blank string.
WARN[0000] The "no_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "https_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "http_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "no_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "https_proxy" variable is not set. Defaulting to a blank string.
WARN[0000] The "http_proxy" variable is not set. Defaulting to a blank string.
docsum-xeon-backend-server | /usr/local/lib/python3.11/site-packages/pydantic/_internal/_fields.py:132: UserWarning: Field "model_name_or_path"in Audio2TextDoc has conflict with protected namespace "model_".
docsum-xeon-backend-server |
docsum-xeon-backend-server | You may be able to resolve this warning by setting `model_config['protected_namespaces'] = ()`.
docsum-xeon-backend-server | warnings.warn(
docsum-xeon-backend-server | [2024-11-14 02:37:39,377] [ INFO] - Base service - CORS is enabled.
docsum-xeon-backend-server | [2024-11-14 02:37:39,378] [ INFO] - Base service - Setting up HTTP server
docsum-xeon-backend-server | [2024-11-14 02:37:39,379] [ INFO] - Base service - Uvicorn server setup on port 8888
docsum-xeon-backend-server | INFO: Waiting for application startup.
docsum-xeon-backend-server | INFO: Application startup complete.
docsum-xeon-backend-server | INFO: Uvicorn running on http://0.0.0.0:8888 (Press CTRL+C to quit)
docsum-xeon-backend-server | [2024-11-14 02:37:39,392] [ INFO] - Base service - HTTP server setup successful
docsum-xeon-backend-server | INFO: 100.80.243.121:59802 - "POST /v1/docsum HTTP/1.1" 500 Internal Server Error
docsum-xeon-backend-server | ERROR: Exception in ASGI application
docsum-xeon-backend-server | Traceback (most recent call last):
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/uvicorn/protocols/http/h11_impl.py", line 406, in run_asgi
docsum-xeon-backend-server | result = await app( # type: ignore[func-returns-value]
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/uvicorn/middleware/proxy_headers.py", line 60, in __call__
docsum-xeon-backend-server |return await self.app(scope, receive, send)
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in __call__
docsum-xeon-backend-server | await super().__call__(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/applications.py", line 113, in __call__
docsum-xeon-backend-server | await self.middleware_stack(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 187, in __call__
docsum-xeon-backend-server | raise exc
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 165, in __call__
docsum-xeon-backend-server | await self.app(scope, receive, _send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/prometheus_fastapi_instrumentator/middleware.py", line 174, in __call__
docsum-xeon-backend-server | raise exc
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/prometheus_fastapi_instrumentator/middleware.py", line 172, in __call__
docsum-xeon-backend-server | await self.app(scope, receive, send_wrapper)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/middleware/cors.py", line 85, in __call__
docsum-xeon-backend-server | await self.app(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 62, in __call__
docsum-xeon-backend-server | await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
docsum-xeon-backend-server | raise exc
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
docsum-xeon-backend-server | await app(scope, receive, sender)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 715, in __call__
docsum-xeon-backend-server | await self.middleware_stack(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 735, in app
docsum-xeon-backend-server | await route.handle(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 288, in handle
docsum-xeon-backend-server | await self.app(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 76, in app
docsum-xeon-backend-server | await wrap_app_handling_exceptions(app, request)(scope, receive, send)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
docsum-xeon-backend-server | raise exc
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
docsum-xeon-backend-server | await app(scope, receive, sender)
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 73, in app
docsum-xeon-backend-server | response = await f(request)
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 301, in app
docsum-xeon-backend-server | raw_response = await run_endpoint_function(
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 212, in run_endpoint_function
docsum-xeon-backend-server |return await dependant.call(**values)
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/home/user/GenAIComps/comps/cores/mega/gateway.py", line 423, in handle_request
docsum-xeon-backend-server | data = await request.json()
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/site-packages/starlette/requests.py", line 249, in json
docsum-xeon-backend-server | self._json = json.loads(body)
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/json/__init__.py", line 346, in loads
docsum-xeon-backend-server |return _default_decoder.decode(s)
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/json/decoder.py", line 337, in decode
docsum-xeon-backend-server | obj, end = self.raw_decode(s, idx=_w(s, 0).end())
docsum-xeon-backend-server | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
docsum-xeon-backend-server | File "/usr/local/lib/python3.11/json/decoder.py", line 355, in raw_decode
docsum-xeon-backend-server | raise JSONDecodeError("Expecting value", s, err.value) from None
docsum-xeon-backend-server | json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
The text was updated successfully, but these errors were encountered:
Priority
P1-Stopper
OS type
Ubuntu
Hardware type
Xeon-SPR
Installation method
Deploy method
Running nodes
Single Node
What's the version?
73879d3
Description
When launch the DocSum example using the docker compose and/or helm with the latest images built from source, sending the curl request to the DocSum mega gateway service results the following errors:
Reproduce steps
Follow the DocSum Xeon Readme:
Raw log
The text was updated successfully, but these errors were encountered: