Extend OrtAllocator API to get Allocator statistics #24785

toothache · 2025-05-16T04:45:37Z

Description

Extend IAllocator to get Allocator statistics:

Add OrtAllocator::GetStats and AllocatorGetStats C-API.
Add Ort::Allocator::GetStats Cxx API to parse the string and return as map.
Add UT.

Motivation and Context

Our system integrates multiple models for inference, each with varying memory demands. Providing a mechanism to retrieve detailed memory statistics would be useful for analyzing memory usage across models and devices more effectively.

yuslepukhin · 2025-05-16T18:32:19Z

I will take a look at it on Monday

onnxruntime/core/framework/allocator_stats.h

include/onnxruntime/core/session/onnxruntime_c_api.h

include/onnxruntime/core/session/onnxruntime_cxx_inline.h

onnxruntime/core/session/allocator_adapters.cc

tianleiwu · 2025-05-20T03:29:56Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-05-20T03:30:17Z

Azure Pipelines successfully started running 5 pipeline(s).

include/onnxruntime/core/session/onnxruntime_c_api.h

onnxruntime/core/session/allocator_adapters.cc

include/onnxruntime/core/framework/allocator.h

include/onnxruntime/core/session/onnxruntime_c_api.h

include/onnxruntime/core/session/onnxruntime_cxx_api.h

include/onnxruntime/core/session/onnxruntime_cxx_inline.h

onnxruntime/test/shared_lib/test_allocator.cc

include/onnxruntime/core/session/onnxruntime_cxx_inline.h

yuslepukhin · 2025-05-27T16:35:20Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,ONNX Runtime Web CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline

yuslepukhin · 2025-05-27T16:35:21Z

/azp run Linux QNN CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2025-05-27T16:35:37Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-05-27T16:35:39Z

Azure Pipelines successfully started running 3 pipeline(s).

onnxruntime/core/session/allocator_adapters.cc

include/onnxruntime/core/session/onnxruntime_c_api.h

include/onnxruntime/core/session/onnxruntime_cxx_api.h

onnxruntime/core/session/onnxruntime_c_api.cc

onnxruntime/core/session/default_cpu_allocator_c_api.cc

include/onnxruntime/core/session/onnxruntime_c_api.h

include/onnxruntime/core/session/onnxruntime_cxx_api.h

include/onnxruntime/core/session/onnxruntime_c_api.h

include/onnxruntime/core/framework/allocator.h

onnxruntime/core/session/allocator_adapters.cc

yuslepukhin

toothache · 2025-06-03T23:17:00Z

@yuslepukhin @skottmckay can anyone help complete the PR? Thanks!

tianleiwu · 2025-06-03T23:35:39Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-06-03T23:35:58Z

Azure Pipelines successfully started running 5 pipeline(s).

toothache changed the title ~~Extend IAllocator API to get Allocator statistics~~ Extend OrtAllocator API to get Allocator statistics May 16, 2025

hanbitmyths requested a review from yuslepukhin May 16, 2025 05:05

Add api to get allocator stats.

343f920

toothache force-pushed the alloc_stats branch from ccf3954 to 343f920 Compare May 19, 2025 11:37

yuslepukhin requested changes May 19, 2025

View reviewed changes

toothache added 3 commits May 20, 2025 07:24

Address the comments.

c08b129

Fix UT.

f591045

Raise not implemented exception in default IAllocator::GetStats.

0e9c691

toothache marked this pull request as ready for review May 20, 2025 01:47

Polish the code.

6911375