Runtime Backends¶

OpenDetect uses ONNX Runtime execution providers (EPs). Provider selection is automatic and ordered by capability.

Selection Rules¶

Provider resolution currently follows this order:

You control behavior with:

hardware_acceleration: default True, set False to force CPU
tensor_rt: default False, set True to allow TensorRT providers
mixed_precision: default False at CLI, can be enabled for FP16/TF32-friendly paths

pip install "opendetect[tensorrt]" is not sufficient by itself.

You must also install a compatible TensorRT system stack:

Verification:

import onnxruntime as ort
print(ort.get_available_providers())

You should see TensorrtExecutionProvider (or NvTensorRtRtxExecutionProvider) and CUDAExecutionProvider.

CLI usage:

opendetect-infer --image input.jpg --model-id rfdetr-m --tensor-rt --mixed-precision

Known limitation:

YOLOX m/l/x are not currently supported on TensorRT due to export/runtime compatibility issues.

OpenDetect already includes selection logic for:

Current status:

If acceleration is not used, print available providers and verify expected EP names.
Confirm your ONNX Runtime build matches the desired hardware backend.
Fall back to --no-hardware-acceleration to confirm functional CPU inference first.