You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#0 0x00007fc64625c520 in onnxruntime::common::Status onnxruntime::contrib::cuda::QkvToContext<__half>(cudaDeviceProp const&, cublasContext*&, onnxruntime::Stream*, onnxruntime::contrib::AttentionParameters&, onnxruntime::contrib::cuda::AttentionData<__half>&) () from libonnxruntime_providers_cuda.so
#1 0x00007fc6461acc7b in onnxruntime::contrib::cuda::Attention<onnxruntime::MLFloat16>::ComputeInternal(onnxruntime::OpKernelContext*) const () from libonnxruntime_providers_cuda.so
#2 0x00007fc645c35b86 in onnxruntime::cuda::CudaKernel::Compute(onnxruntime::OpKernelContext*) const () from libonnxruntime_providers_cuda.so
#3 0x00007fc671c4dd0d in ?? () from /libonnxruntime.so
#4 0x00007fc671c42ea9 in ?? () from /libonnxruntime.so
#5 0x00007fc671c51a4d in ?? () from /libonnxruntime.so
To reproduce
Install ONNX runtime onto AWS EKS container.
Load BAAI/bge-base-en-v1.5 model.
Run inference using inference session.
Urgency
This is blocking us from deploying to GPU in production.
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.16.3
ONNX Runtime API
C++
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
CUDA 11.6
The text was updated successfully, but these errors were encountered:
This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.
Describe the issue
The ONNX runtime produces a segmentation fault when used with the CUDA execution provider. GDB inspect returns:
To reproduce
Urgency
This is blocking us from deploying to GPU in production.
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.16.3
ONNX Runtime API
C++
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
CUDA 11.6
The text was updated successfully, but these errors were encountered: