Instead of fully quantizing the model to `INT8`, you can use mixed precision quantization. This approach keeps unsupported layers such as `Depthwise Conv2D` in `FP32` (float32) while quantizing the rest of the model to `INT8`.
For `TensorFlow Lite`, you can allow an `FP32` fallback for layers that have no `INT8` kernel. Here is how you can adjust your conversion script:
```
import tensorflow as tf

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
# Full INT8 quantization needs a calibration generator; see the sketch below.
converter.representative_dataset = representative_data_gen
converter.target_spec.supported_ops = [
    tf.lite.OpsSet.TFLITE_BUILTINS_INT8,  # INT8 quantized ops
    tf.lite.OpsSet.TFLITE_BUILTINS,       # FP32 fallback for unsupported layers
]
converter.inference_input_type = tf.int8   # Input quantized as int8
converter.inference_output_type = tf.int8  # Output quantized as int8
tflite_model = converter.convert()
```
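The `representative_data_gen` above is something you supply yourself: the converter runs a handful of real input samples through the model to calibrate the INT8 scales. Here is a minimal sketch, assuming `calibration_images` is a NumPy array of preprocessed inputs matching your model's input shape (the name and slice size are just illustrative):

```
import numpy as np

def representative_data_gen():
    # Yield a small number of real, preprocessed samples one at a time.
    # `calibration_images` is assumed to be an (N, H, W, C) float32 array.
    for sample in calibration_images[:100]:
        yield [np.expand_dims(sample.astype(np.float32), axis=0)]
```

After conversion you can sanity-check the result by loading it with `tf.lite.Interpreter(model_content=tflite_model)` and confirming that `interpreter.get_input_details()[0]["dtype"]` is `int8`; any layers left in FP32 will simply run with the built-in float kernels.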