feat: add --ep-options to winml perf (and other commands) for runtime EP provider options

## Motivation

When benchmarking or running inference on QNN NPU (and other EPs), the runtime EP provider options can significantly affect latency — independently of the build-time quantization config. For example, on QNN HTP:

| EP option | Affects compile | Affects runtime | Values |
|---|---|---|---|
| \htp_performance_mode\ | ❌ (no-op) | ✅ clock governor | \urst\, \high_performance\, \alanced\, \low_power\, \default\ |
| \htp_graph_finalization_optimization_mode\ | ✅ changes compiled graph | ✅ | \

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add --ep-options to winml perf (and other commands) for runtime EP provider options #865

Motivation

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

EP option	Affects compile	Affects runtime	Values
\htp_performance_mode\	❌ (no-op)	✅ clock governor	\�urst, \high_performance, \�alanced, \low_power, \default\
\htp_graph_finalization_optimization_mode\	✅ changes compiled graph	✅	\

feat: add --ep-options to winml perf (and other commands) for runtime EP provider options #865

Description

Motivation

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions