How to Run Audio and Vision AI Algorithms at Ultra-Low Power
Presenter:
Deepak Mital
Sr. Director, Architecture
Synaptics Incorporated
Problem statement
• Many IoT applications do not require “continuous maximum” compute
• Continuous monitoring results in battery drain
• Examples:
  • Security camera: Turn on main processing for actual detection only when confirmed necessary
  • Human presence detection (HPD) and identification to turn device on: Run the HPD and identification algorithms only when “potential” presence is detected
  • Predictive maintenance: Enable advanced detection only when initial metrics are met
  • Shoplifting prevention: Enable detailed analytics only when a “potential” threat is detected
© 2024 Synaptics Inc
Solution
• Multistage hardware: Capable of running audio and video AI algorithms
• Highly efficient AI models with different KPIs for each stage
• Tight orchestration of software to invoke each stage
[Figure: SoC block diagram. Labels: Always-on domain; High efficiency; High performance; Power management; System memories; Security; USB / serial / MIPI; U55 NPU; Cortex-M55; μNPU; Cortex-M4; Vision AI pipeline; JPEG; Audio; VAD; ISP, encoders; Sensing logic; Deep sleep: GPIO (Wake), internal clock; Reset]
Solution – Stage 1
• Ultra-low power: Microwatt-level hardware, always on
• Sound detection
• Image change detection
• Critical model requirements are for very few false negatives
• False negatives will render the device unresponsive
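As a rough illustration of the Stage 1 trade-off, the sketch below treats image change detection as a mean absolute difference between two small grayscale frames, with a deliberately low trigger threshold so that real changes are rarely missed. The function names, frame format, and threshold value are assumptions made for illustration; the slides do not describe how the always-on hardware implements this.

```python
def mean_abs_diff(prev_frame, curr_frame):
    """Mean absolute per-pixel difference between two equal-size grayscale frames."""
    total = 0
    count = 0
    for prev_row, curr_row in zip(prev_frame, curr_frame):
        for p, c in zip(prev_row, curr_row):
            total += abs(p - c)
            count += 1
    return total / count


def stage1_should_wake(prev_frame, curr_frame, threshold=2.0):
    """Return True when the scene changed enough to wake Stage 2.

    The threshold is hypothetical and kept low on purpose: a missed event
    (false negative) leaves the device unresponsive, while a spurious wake
    only costs a short Stage 2 run.
    """
    return mean_abs_diff(prev_frame, curr_frame) >= threshold


static = [[10, 10, 10], [10, 10, 10]]
noisy = [[10, 11, 10], [9, 10, 10]]   # sensor noise only
moved = [[10, 10, 80], [10, 90, 10]]  # an object entered the frame

print(stage1_should_wake(static, noisy))  # small difference: stay asleep
print(stage1_should_wake(static, moved))  # large difference: wake Stage 2
```

Lowering the threshold trades extra Stage 2 wake-ups for fewer missed events, which matches the Stage 1 priority of avoiding false negatives.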
Solution – Stage 2
• Mid- to low power – hardware in the tens of microwatts, activated by Stage 1 via software
• AI algorithms (example):
  • Wake-word detection
  • Human presence detection
• Critical model requirements are for very few false negatives and false positives
• False negatives will render the device unresponsive
• False positives will increase power consumption
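To make the cost of false positives concrete, the following sketch models average power as the always-on Stage 1 draw plus the duty-cycled draw of Stages 2 and 3. Every number in it (stage powers, wake rates, run times) is a hypothetical placeholder, not a Synaptics figure; only the structure of the calculation is the point.

```python
def average_power_uw(p1_uw, p2_uw, p3_uw,
                     stage2_wakes_per_hour, stage2_run_s,
                     stage3_wakes_per_hour, stage3_run_s):
    """Average power in microwatts for a three-stage cascade.

    Stage 1 is always on; Stages 2 and 3 only draw power while awake.
    """
    duty2 = stage2_wakes_per_hour * stage2_run_s / 3600.0
    duty3 = stage3_wakes_per_hour * stage3_run_s / 3600.0
    return p1_uw + duty2 * p2_uw + duty3 * p3_uw


# Hypothetical operating point: 2 genuine events per hour reach Stage 3.
common = dict(p1_uw=5.0, p2_uw=50.0, p3_uw=50_000.0,
              stage2_wakes_per_hour=60, stage2_run_s=1.0,
              stage3_run_s=0.5)

baseline = average_power_uw(stage3_wakes_per_hour=2, **common)
# Same system, but Stage 2 also raises 10 false positives per hour.
with_false_positives = average_power_uw(stage3_wakes_per_hour=12, **common)

print(f"baseline: {baseline:.1f} uW")
print(f"with Stage 2 false positives: {with_false_positives:.1f} uW")
```

Under these assumed numbers, the extra Stage 3 wake-ups dominate the power budget, which is why Stage 2 models are tuned for both few false negatives and few false positives.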
Solution – Stage 3
• High performance, activated by Stage 2 via software
• AI algorithms (example):
  • Person identification
  • Object detection
• Critical model requirements are for very high performance at low power
• Slow run times will increase power consumption
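The three stage descriptions imply a simple escalation policy: each stage wakes the next one only on a positive result. The sketch below expresses that policy as a small state machine. The names `Stage`, `next_stage`, and `run_cascade` are hypothetical, and the rule that the system returns to Stage 1 after Stage 3 finishes is an assumption; the slides only state which stage activates which.

```python
from enum import Enum


class Stage(Enum):
    STAGE1 = 1  # always-on sensing (sound or image change detection)
    STAGE2 = 2  # mid- to low-power AI (e.g., wake word, human presence)
    STAGE3 = 3  # high-performance AI (e.g., person identification)


def next_stage(current, detected):
    """Escalate one stage on a positive result, otherwise drop back to Stage 1."""
    if not detected:
        return Stage.STAGE1
    if current is Stage.STAGE1:
        return Stage.STAGE2
    if current is Stage.STAGE2:
        return Stage.STAGE3
    # Assumption: once Stage 3 has finished, return to the always-on stage.
    return Stage.STAGE1


def run_cascade(results):
    """Replay per-step detection results and return the stages visited."""
    stage = Stage.STAGE1
    visited = [stage]
    for detected in results:
        stage = next_stage(stage, detected)
        visited.append(stage)
    return visited


# Stage 1 triggers, Stage 2 rejects: Stage 3 is never powered up.
print([s.name for s in run_cascade([True, False])])
# Stage 1 and Stage 2 both trigger: Stage 3 runs, then the system sleeps again.
print([s.name for s in run_cascade([True, True, True])])
```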
AI models
• Different requirements for AI models at each stage
• Need AI models optimized for different KPIs: accuracy, performance, and size
• NAS (neural architecture search)-based model generation architecture where the models are purpose-built for the constrained silicon
• Primary factors affecting inference KPIs:
  • Model architecture design
  • Model quantization
• Approach: Jointly optimize model architecture and quantization under memory constraints
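One way to picture the memory constraint in this joint optimization is as a weight-storage check applied to each candidate model. The sketch below computes the weight footprint for per-layer parameter counts and per-layer bit widths, then tests it against a budget. The layer sizes and the 80 KiB budget are hypothetical, and the check ignores activations and runtime buffers, which a real flow would also have to account for.

```python
def weight_bytes(layer_params, layer_bits):
    """Total weight storage in bytes for per-layer parameter counts and bit widths."""
    if len(layer_params) != len(layer_bits):
        raise ValueError("one bit width is required per layer")
    total_bits = sum(n * b for n, b in zip(layer_params, layer_bits))
    return (total_bits + 7) // 8  # round up to whole bytes


def fits_budget(layer_params, layer_bits, budget_bytes):
    """True when the quantized weights fit in the given memory budget."""
    return weight_bytes(layer_params, layer_bits) <= budget_bytes


layers = [2_000, 20_000, 80_000]  # hypothetical per-layer parameter counts
budget = 80 * 1024                # hypothetical 80 KiB weight budget

all_8bit = [8, 8, 8]
mixed = [8, 6, 4]                 # fewer bits on the largest layers

print(weight_bytes(layers, all_8bit), fits_budget(layers, all_8bit, budget))
print(weight_bytes(layers, mixed), fits_budget(layers, mixed, budget))
```

In this made-up case the same architecture misses the budget at uniform 8-bit precision but fits with mixed precision, which is the kind of interaction between architecture and quantization that the joint search is meant to exploit.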
Multi-precision NAS search range for classification
• Resolution – [28x28 – 32x32]
• Kernel size – [3x3, 5x5, 7x7]
• Depth – [2, 3, 4]
• Width (channel expansion factor) – [2, 3, 4]
• Mixed-precision quantization parameters – [4 bit, 6 bit, 8 bit]
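The search range above can be written down as a small configuration space. The sketch below enumerates it exhaustively to show its size. Two simplifications are assumed: the resolution range is sampled at 28, 30, and 32 pixels, and each choice is applied to the whole network rather than per layer or per block. A real NAS would typically make per-layer choices and sample the space instead of enumerating it.

```python
from itertools import product

SEARCH_SPACE = {
    "resolution": [28, 30, 32],  # assumed sampling of the 28x28 to 32x32 range
    "kernel_size": [3, 5, 7],
    "depth": [2, 3, 4],
    "width": [2, 3, 4],          # channel expansion factor
    "weight_bits": [4, 6, 8],    # quantization precision
}


def enumerate_candidates(space):
    """Return every combination of the search-space choices as a dict."""
    keys = list(space)
    return [dict(zip(keys, values))
            for values in product(*(space[k] for k in keys))]


candidates = enumerate_candidates(SEARCH_SPACE)
print(len(candidates))  # 3**5 = 243 network-wide configurations
print(candidates[0])
```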
CIFAR-10 classification – Mixed vs 8- or 4-bit precision
CIFAR-10 classification comparison
Object detection dataset
• Resolution – [320x240 – 640x480]
• Kernel size – [3x3, 5x5, 7x7]
• Depth – [2, 3, 4]
• Width (channel expansion factor) – [2, 3, 4]
• Mixed-precision quantization parameters – [4 bit, 6 bit, 8 bit]
COCO person detection – Mixed vs 8- or 4-bit precision
COCO person detection comparison
Segmentation run on Stage 3
• Model development stage KPI:
  • COCO instance mask mAP: 0.636
  • Latency: 92.19 ms
  • Resolution: 480x640 (VGA)
  • Weights: 1.57 M parameters
• Model run on hardware:
  • Inference time: 96 ms
  • Total frame time: 120 ms
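The reported timings can be turned into a few derived figures with simple arithmetic, sketched below. The frame rate, non-inference overhead, and inference share follow directly from the 96 ms and 120 ms values on the slide. The weight-storage estimate is an assumption, since the slide gives the parameter count but not the bit width used.

```python
INFERENCE_MS = 96.0   # inference time on hardware (from the slide)
FRAME_MS = 120.0      # total frame time on hardware (from the slide)
PARAMS = 1_570_000    # 1.57 M weights (from the slide)


def frames_per_second(frame_ms):
    """Sustained frame rate when each frame takes frame_ms milliseconds."""
    return 1000.0 / frame_ms


def weight_storage_bytes(params, bits_per_weight):
    """Weight storage for a uniform bit width (hypothetical; not given on the slide)."""
    return params * bits_per_weight // 8


fps = frames_per_second(FRAME_MS)
overhead_ms = FRAME_MS - INFERENCE_MS
inference_share = INFERENCE_MS / FRAME_MS

print(f"frame rate: {fps:.2f} fps")
print(f"non-inference overhead: {overhead_ms:.0f} ms per frame")
print(f"inference share of frame time: {inference_share:.0%}")
print(f"weights at 8 bit: {weight_storage_bytes(PARAMS, 8)} bytes")
print(f"weights at 4 bit: {weight_storage_bytes(PARAMS, 4)} bytes")
```

The measured inference time of 96 ms is also about 4% above the 92.19 ms latency quoted for the model development stage.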
Summary
• Building full applications running at ultra-low power requires high levels of integration of hardware and software
• Multiple levels of processing are needed to wake up silicon components as needed
• Stage 2 and Stage 3 come out of deep sleep based on results from the previous stage
• The low-power orchestration demands tight software integration
• Each stage requires AI models with different KPIs on accuracy, model size, and speed
• NAS-based model generation/training software is needed to enable the complete solution
• Solution enables battery-powered devices that are AI capable and can run for many months/years
Resources
Synaptics Astra embedded processors
https://www.synaptics.com/products/embedded-processors
Synaptics Astra evaluation kit
https://synacsm.atlassian.net/servicedesk/customer/portal/543/group/563/create/6387
Synaptics Astra software
https://github.com/synaptics-astra

 
Hire a private investigator to get cell phone records
Hire a private investigator to get cell phone recordsHire a private investigator to get cell phone records
Hire a private investigator to get cell phone records
 
Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1
 
Coordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar SlidesCoordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar Slides
 
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & SolutionsMYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
 
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - MydbopsScaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
 
20240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 202420240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 2024
 
What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
What's Next Web Development Trends to Watch.pdf
What's Next Web Development Trends to Watch.pdfWhat's Next Web Development Trends to Watch.pdf
What's Next Web Development Trends to Watch.pdf
 
AI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AIAI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AI
 
Lessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien RiouxLessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien Rioux
 
Observability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryObservability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetry
 

“How to Run Audio and Vision AI Algorithms at Ultra-low Power,” a Presentation from Synaptics

  • 1. How to Run Audio and Vision AI Algorithms at Ultra-Low Power
      Presenter: Deepak Mital, Sr. Director, Architecture, Synaptics Incorporated
  • 2. Problem statement
      • Many IoT applications do not require “continuous maximum” compute
      • Continuous monitoring results in battery drain
      • Examples:
          • Security camera: Turn on main processing for actual detection only when confirmed necessary
          • Human presence detection (HPD) and identification to turn device on: Run the HPD detection and identification algorithm only when “potential” presence is detected
          • Predictive maintenance: Enable advanced detection only when initial metrics are met
          • Shoplift prevention: Enable detailed analytics only when a “potential” threat is detected
  • 3. Solution
      • Multistage hardware: Capable of running audio and video AI algorithms
      • Highly efficient AI models with different KPIs for each stage
      • Tight orchestration of software to invoke each stage
      [Block diagram: always-on, high-efficiency, and high-performance domains. Labeled blocks: power management, system memories, security, USB / serial / MIPI, U55 NPU, Cortex-M55, μNPU, Cortex-M4, vision AI pipeline, JPEG, audio, VAD, ISP, encoders, sensing logic. Deep sleep: GPIO (wake), internal clock, reset.]
  • 4. Solution – Stage 1
      • Ultra-low power: Microwatts hardware, always on
      • Sound detection
      • Image change detection
      • Critical model requirements are for very few false negatives
          • False negatives will render device unresponsive
  • 5. Solution – Stage 2
      • Mid- to low power: tens of microwatts hardware, activated by Stage 1 via software
      • AI algorithms (example):
          • Wake-word detection
          • Human presence detection
      • Critical model requirements are for very few false negatives and false positives
          • False negatives will render device unresponsive
          • False positives will increase power consumption
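The cost of Stage 2 false positives can be illustrated with a simple duty-cycle model. This is only a sketch under stated assumptions, not a power model from the presentation: the function name `average_power_uw` and every power figure and wake rate below are hypothetical placeholders.

```python
def average_power_uw(always_on_uw, woken_stage_uw, wakes_per_hour, active_s_per_wake):
    """Average power of an always-on stage plus a duty-cycled higher stage.

    All inputs are hypothetical; the higher stage is assumed to draw
    `woken_stage_uw` only while it is awake and nothing otherwise.
    """
    duty = min(1.0, wakes_per_hour * active_s_per_wake / 3600.0)
    return always_on_uw + duty * woken_stage_uw


# Hypothetical figures: same hardware, but ten times more (false) wake-ups.
few_wakes = average_power_uw(5.0, 20000.0, wakes_per_hour=10, active_s_per_wake=2.0)
many_wakes = average_power_uw(5.0, 20000.0, wakes_per_hour=100, active_s_per_wake=2.0)

print(f"few wake-ups:  {few_wakes:.1f} uW")
print(f"many wake-ups: {many_wakes:.1f} uW")
```

In this model the woken stage dominates the average as soon as it is active for more than a small fraction of the time, which is why each unnecessary wake-up matters.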
  • 6. Solution – Stage 3
      • High performance, activated by Stage 2 via software
      • AI algorithms (example):
          • Person identification
          • Object detection
      • Critical model requirements are for very high performance at low power
          • Slow run times will increase power consumption
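The three-stage wake-up described on slides 4 to 6 can be sketched as a cascade in which each stage runs only if the previous one fired. This is a minimal illustration, not Synaptics' orchestration software: the `Stage` class, the `run_cascade` function, the sample fields, and all thresholds are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Stage:
    name: str
    detect: Callable[[Dict[str, float]], bool]  # True means "wake the next stage"


def run_cascade(stages: List[Stage], sample: Dict[str, float]) -> List[str]:
    """Run stages in order and stop at the first one that does not fire.

    Returns the names of the stages that actually ran.
    """
    ran = []
    for stage in stages:
        ran.append(stage.name)
        if not stage.detect(sample):
            break  # later stages stay asleep
    return ran


# Hypothetical detectors standing in for the real models.
STAGES = [
    Stage("stage1_sound_or_image_change", lambda s: s["activity"] > 0.1),
    Stage("stage2_presence_or_wake_word", lambda s: s["presence_score"] > 0.5),
    Stage("stage3_identification", lambda s: s["id_score"] > 0.8),
]

quiet = {"activity": 0.02, "presence_score": 0.0, "id_score": 0.0}
false_alarm = {"activity": 0.60, "presence_score": 0.2, "id_score": 0.0}
person = {"activity": 0.70, "presence_score": 0.9, "id_score": 0.95}

for label, sample in [("quiet", quiet), ("false_alarm", false_alarm), ("person", person)]:
    print(label, run_cascade(STAGES, sample))
```

A false negative at an early stage means the later stages never run at all, which matches the slides' point that false negatives make the device unresponsive.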
  • 7. AI models
      • Different requirements for AI models at each stage
      • Need AI models optimized for different KPIs: accuracy, performance, and size
      • NAS-based model generation architecture where the models are purpose built for the constrained silicon
      • Primary factors affecting inference KPI
          • Model architecture design
          • Model quantization
      • Approach: Jointly optimize model architecture and quantization under memory constraints
  • 8. Multi-precision NAS search range for classification
      • Resolution – [28x28 – 32x32]
      • Kernel size – [3x3, 5x5, 7x7]
      • Depth – [2, 3, 4]
      • Width (channel expansion factor) – [2, 3, 4]
      • Mixed-precision quantization parameters – [4 bit, 6 bit, 8 bit]
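To give a feel for searching this range under a memory constraint, the sketch below enumerates a simplified version of the space and keeps only candidates whose weights fit a budget. It is an illustration under several assumptions, not the NAS procedure used in the presentation: only the two endpoint resolutions are sampled, one bit width is chosen per model rather than per layer, and `weight_bytes`, `BASE_CHANNELS`, and the 64 KiB budget are hypothetical.

```python
from itertools import product

RESOLUTIONS = [28, 32]   # assumption: only the endpoints of the stated range
KERNELS = [3, 5, 7]
DEPTHS = [2, 3, 4]
WIDTHS = [2, 3, 4]       # channel expansion factor
BITS = [4, 6, 8]         # assumption: one bit width per model, not per layer
BASE_CHANNELS = 8        # hypothetical base channel count


def weight_bytes(kernel, depth, width, bits):
    """Toy weight-memory estimate for `depth` square conv layers.

    Each layer is assumed to map `BASE_CHANNELS * width` channels to the
    same number of channels; biases and other layers are ignored.
    """
    channels = BASE_CHANNELS * width
    params = depth * kernel * kernel * channels * channels
    return params * bits / 8


def feasible_candidates(budget_bytes):
    """Yield every configuration whose estimated weights fit the budget."""
    for res, k, d, w, b in product(RESOLUTIONS, KERNELS, DEPTHS, WIDTHS, BITS):
        if weight_bytes(k, d, w, b) <= budget_bytes:
            yield {"resolution": res, "kernel": k, "depth": d, "width": w, "bits": b}


total = len(RESOLUTIONS) * len(KERNELS) * len(DEPTHS) * len(WIDTHS) * len(BITS)
feasible = list(feasible_candidates(budget_bytes=64 * 1024))
print(f"{len(feasible)} of {total} configurations fit the hypothetical budget")
```

A real search would score each surviving candidate on accuracy and latency; here the filter only shows how lower bit widths let larger architectures fit the same memory.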
  • 9. CIFAR-10 classification – Mixed vs 8- or 4-bit precision
  • 11. Object detection dataset
      • Resolution – [320x240 – 640x480]
      • Kernel size – [3x3, 5x5, 7x7]
      • Depth – [2, 3, 4]
      • Width (channel expansion factor) – [2, 3, 4]
      • Mixed-precision quantization parameters – [4 bit, 6 bit, 8 bit]
  • 12. COCO person detection – Mixed vs 8- or 4-bit precision
  • 13. COCO person detection comparison
  • 14. Segmentation run on Stage 3
      • Model development stage KPI:
          • COCO Instance Mask mAP: 0.636
          • Latency: 92.19 ms
          • Resolution: 480x640 (VGA)
          • Weights: 1.57 M parameters
      • Model run on hardware:
          • Inference time: 96 ms
          • Total frame time: 120 ms
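A few derived figures follow directly from the numbers on slide 14. The short calculation below uses only the stated values. The weight-footprint lines are a what-if, since the slide does not say which bit width this model uses, and reading the 24 ms difference as non-inference time per frame is an interpretation.

```python
INFERENCE_MS = 96.0      # measured on hardware (slide 14)
FRAME_MS = 120.0         # total frame time on hardware (slide 14)
DEV_LATENCY_MS = 92.19   # latency at model development stage (slide 14)
PARAMS = 1.57e6          # weight count (slide 14)

frames_per_second = 1000.0 / FRAME_MS
non_inference_ms = FRAME_MS - INFERENCE_MS
latency_gap_pct = (INFERENCE_MS - DEV_LATENCY_MS) / DEV_LATENCY_MS * 100.0

# What-if only: weight storage if every weight used the same bit width.
weight_mb = {bits: PARAMS * bits / 8 / 1e6 for bits in (4, 6, 8)}

print(f"frame rate:          {frames_per_second:.2f} fps")
print(f"non-inference time:  {non_inference_ms:.0f} ms per frame")
print(f"hardware vs. dev:    +{latency_gap_pct:.1f}% latency")
for bits, mb in weight_mb.items():
    print(f"weights at {bits}-bit:   {mb:.3f} MB")
```

The measured inference time is within about 4% of the development-stage latency figure, and inference accounts for 80% of the total frame time.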
  • 15. Summary
      • Building full applications running at ultra-low power requires high levels of integration of hardware and software
      • Multiple levels of processing are needed to wake up silicon components as needed
          • Stage 2 and Stage 3 come out of deep sleep based on results from the previous stage
          • The low-power orchestration demands tight software integration
      • Each stage requires AI models with different KPIs on accuracy, model size, and speed
          • NAS-based model generation/training software is needed to enable the complete solution
      • Solution enables battery-powered devices that are AI capable and can run for many months or years
  • 16. Resources
      • Synaptics Astra embedded processors: https://www.synaptics.com/products/embedded-processors
      • Synaptics Astra evaluation kit: https://synacsm.atlassian.net/servicedesk/customer/portal/543/group/563/create/6387
      • Synaptics Astra software: https://github.com/synaptics-astra