Ampere’s inference acceleration engine is fully integrated with the ONNX Runtime framework. ONNX models and software written against the ONNX Runtime API run as-is, without modification.
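As a rough illustration of what "as-is" means, the sketch below uses only the standard ONNX Runtime Python API (`InferenceSession`, `get_inputs`, `run`) and contains nothing specific to Ampere. The helper names `concrete_shape` and `run_once`, the path `model.onnx`, and the assumption that every model input is a float32 tensor are hypothetical choices made for this example, not taken from the product documentation.

```python
def concrete_shape(shape, default_dim=1):
    """Replace symbolic or unknown dimensions (strings or None) with default_dim."""
    return [d if isinstance(d, int) and d > 0 else default_dim for d in shape]


def run_once(model_path):
    """Load an ONNX model and run one inference on random input data."""
    # Imported here so concrete_shape() stays usable without the runtime installed.
    import numpy as np
    import onnxruntime as ort

    # No execution provider is named; this relies on the installed build's default.
    # Builds that ship several providers require an explicit providers=[...] argument.
    session = ort.InferenceSession(model_path)

    feed = {}
    for inp in session.get_inputs():
        # Assumption: every input is a float32 tensor.
        shape = concrete_shape(inp.shape)
        feed[inp.name] = np.random.rand(*shape).astype(np.float32)

    # Passing None as the first argument requests all model outputs.
    return session.run(None, feed)


if __name__ == "__main__":
    outputs = run_once("model.onnx")  # hypothetical model path
    for out in outputs:
        print(out.shape)
```

If the integration works as described, a script like this should need no changes beyond pointing it at a real model file.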
Overview
Benchmarks