SafeIntOnOverflow() Integer overflow error when inferencing on too many samples with Python #18905

jacktwosense · 2023-12-21T15:28:22Z

Describe the issue

When inferencing with too many samples in Python, I get this error on Ubuntu:
[E:onnxruntime:Default, allocator.cc:36 operator()] /onnxruntime_src/onnxruntime/core/common/safeint.h:17 static void SafeIntExceptionHandler<onnxruntime::OnnxRuntimeException>::SafeIntOnOverflow() Integer overflow

When running the same script on an M1 Mac, it appears as a SIGABRT with libc++abi terminating message.

I'm able to mitigate this by inferencing in batches and concatenating the results.

To reproduce

Inference with a large number of samples (low millions?). Maybe > 32767 is sufficient, since that's the max for int16, but I haven't verified that.

Urgency

No response

Platform

Linux

OS Version

Ubuntu

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.16.3

ONNX Runtime API

Python

Architecture

X64

Execution Provider

Default CPU

Execution Provider Library Version

No response

The text was updated successfully, but these errors were encountered:

skottmckay · 2023-12-26T22:39:19Z

Are you able to share a model that this occurs with?

…y return a nullptr. This is inconsistent as an actual memory allocation failure throws. An overflow would typically be due to bad input so an exception makes more sense given that. Change to throw so code using MakeUniquePtr* and AllocArray* doesn't need to check for nullptr. Add some extra info to the log message to help debugging. Should help with #18905 by avoiding the invalid attempted usage of a nullptr from the allocation. Extra info _might_ help with figuring out where the overflow is coming from which is the real issue.

…#18941) ### Description  If we fail to calculate the buffer size (due to overflow) we currently return a nullptr. This is inconsistent as an actual memory allocation failure throws. An overflow would typically be due to bad input so an exception makes more sense given that. Change to throw so code using MakeUniquePtr* and AllocArray* doesn't need to check for nullptr. Add some extra info to the log message to help debugging. ### Motivation and Context  Should help with #18905 by avoiding the invalid attempted usage of a nullptr from the allocation. Extra info _might_ help with figuring out where the overflow is coming from which is the real issue.

yf711 added the core runtime issues related to core runtime label Dec 22, 2023

skottmckay mentioned this issue Dec 27, 2023

Throw if unique_ptr or array allocation fails due to SafeInt overflow #18941

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SafeIntOnOverflow() Integer overflow error when inferencing on too many samples with Python #18905

SafeIntOnOverflow() Integer overflow error when inferencing on too many samples with Python #18905

jacktwosense commented Dec 21, 2023

skottmckay commented Dec 26, 2023

SafeIntOnOverflow() Integer overflow error when inferencing on too many samples with Python #18905

SafeIntOnOverflow() Integer overflow error when inferencing on too many samples with Python #18905

Comments

jacktwosense commented Dec 21, 2023

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

skottmckay commented Dec 26, 2023