onnx_benchmark.py

source/modelling/profiling

Benchmarks the inference speed of the exported ONNX model.

Metrics

  • Latency: average wall-clock time per inference.
  • Throughput: inferences completed per second.
  • CPU vs GPU: compares execution providers when more than one is available.
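The core measurement loop can be sketched as below. This is a minimal, self-contained illustration, not the script's actual implementation: it times an arbitrary single-inference callable after a warmup phase and derives latency and throughput from the timings. The `dummy_infer` stand-in and the `benchmark` helper are hypothetical names; in real use the callable would wrap an `onnxruntime.InferenceSession.run` call, as indicated in the comments.

```python
import statistics
import time


def benchmark(run_once, warmup=3, iters=20):
    """Time a single-inference callable.

    Returns (average latency in seconds, throughput in inferences/sec).
    """
    # Warmup runs let caches, JITs, and lazy initialisation settle
    # before measurement begins.
    for _ in range(warmup):
        run_once()
    times = []
    for _ in range(iters):
        t0 = time.perf_counter()
        run_once()
        times.append(time.perf_counter() - t0)
    latency = statistics.mean(times)
    return latency, 1.0 / latency


# Stand-in for a real inference call. With onnxruntime this would
# look roughly like (assumed model path and input name):
#   import onnxruntime as ort
#   sess = ort.InferenceSession("model.onnx",
#                               providers=["CPUExecutionProvider"])
#   name = sess.get_inputs()[0].name
#   run_once = lambda: sess.run(None, {name: x})
def dummy_infer():
    time.sleep(0.001)  # simulate ~1 ms of inference work


latency, throughput = benchmark(dummy_infer)
print(f"latency={latency * 1000:.2f} ms  throughput={throughput:.1f}/s")
```

Comparing execution providers then amounts to building one `run_once` per provider (e.g. CPU vs CUDA) and calling `benchmark` on each.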

Usage

make onnx_benchmark