Introduce reused_buffer_index_per_stream in allocation planner which will be reset after computing the reuse buffer for each stream #19515
Azure Pipelines / Big Models (Llama2_ONNX_FP16 Llama2_ONNX_FP16)
succeeded
Feb 22, 2024 in 14m 2s
Llama2_ONNX_FP16 Llama2_ONNX_FP16 succeeded
Loading