Good evening. Is there a resource that details the quantization method ORCA applies when compiling an ONNX classifier (e.g. mobilenet_v2)? I am using the device as one of several candidate deployment hardware platforms for a research project, and I need to explain the quantization process for each of them. For example, HAILO covers this in its relevant documentation. Is there something similar for ORCA? Thank you.
ORCA uses TensorFlow's full integer post-training quantization, without per-axis (per-channel) quantization. You can see the details here: Post-training quantization | Google AI Edge | Google AI for Developers
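For intuition, the scheme referenced above is affine quantization with a single scale and zero-point per tensor (this is what "without per-axis quantization" means: one scale for the whole tensor rather than one per output channel). The sketch below is illustrative NumPy, not ORCA's or TensorFlow's actual implementation; function names are made up for the example.

```python
import numpy as np

def quantize_per_tensor(w, num_bits=8):
    """Affine per-tensor quantization: a single scale and zero-point for the
    whole tensor (as opposed to per-axis, where each output channel would get
    its own scale). Illustrative sketch only."""
    qmin, qmax = -2 ** (num_bits - 1), 2 ** (num_bits - 1) - 1  # int8: [-128, 127]
    # The represented range must include 0.0 so that zero maps to an integer.
    w_min, w_max = min(float(w.min()), 0.0), max(float(w.max()), 0.0)
    scale = (w_max - w_min) / (qmax - qmin)
    zero_point = int(round(qmin - w_min / scale))
    q = np.clip(np.round(w / scale) + zero_point, qmin, qmax).astype(np.int8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Map int8 values back to float: w_hat = scale * (q - zero_point)."""
    return scale * (q.astype(np.float32) - zero_point)

# Example: quantize a small weight tensor and inspect the round-trip error,
# which is on the order of one quantization step (scale).
rng = np.random.default_rng(0)
w = rng.uniform(-1.0, 1.0, size=(3, 3)).astype(np.float32)
q, s, z = quantize_per_tensor(w)
w_hat = dequantize(q, s, z)
print(q.dtype, s, z)
print(np.abs(w - w_hat).max())
```

Per-tensor quantization is simpler but can lose accuracy on layers whose channels have very different weight ranges, which is why some toolchains (e.g. HAILO's, or TensorFlow with per-axis enabled) quantize convolution weights per output channel instead.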