(Go: >> BACK << -|- >> HOME <<)

SlideShare a Scribd company logo
Muhammad Waqar Azhar, Pedro Trancoso
Chalmers University of Technology
20. June 2022
Co-design of DL Accelerators in
VEDLIoT
2
Which one is the best?
Which one is the best for me?
DL Accelerators
3
DL Acceleration
Hardware Software
CPU GPU
TPU
NPU FPGA
Models
Quantization Pruning
4
DL Accelerator Co-Design
Design for…
Map to…
Software
Hardware
5
DL Accelerator Co-Design
Design for…
Map to…
Software
Hardware
6
DL Accelerator Co-Design
Design for…
Map to…
Software
Hardware
7
DL Accelerator Co-Design
Design for…
Map to…
Software
Hardware
8
▪ Model case-study: MobileNet
▪ Observation: generic HW not efficient
▪ Challenge: Depthwise convolution
Co-Design Example - Motivation
● Heterogeneity at different levels:
○ Model layers of different type (e.g. depthwise and pointwise convolution)
○ Within same layer type (e.g. activation and filter sizes and shapes)
○ Determines: buffer sizes, reuse, parallelism
Layer-specific hardware to capture heterogeneity!
9
Co-Design Example: Open Questions & Approaches
Approach A:
one-HW-for-all
DOG
Approach B:
one-HW-per-layer-type
Approach C:
one-HW-per-layer
+ Runs any model
- Suboptimal efficiency
+ Matches layer types
- Suboptimal utilization
+ Best efficiency
- Resource-hungry
10
Co-Design Example: Open Questions & Approaches
Approach A:
one-HW-for-all
DOG
Approach B:
one-HW-per-layer-type
Approach C:
one-HW-per-layer
TVM-VTA
PYNQ-Z2
Unique Kernels
ZCU102
Xilinx FINN
ZCU102
• MobileNet requires aggressive
quantization (4b)
• Performance: 35 GOPS and 68
GOPS using MobileNetsV1 1x
and 0.5x
• Su, Jiang, et al. "Redundancy-
reduced mobilenet acceleration on
reconfigurable logic for imagenet
classification."
• Performance: approx 90 GOPS
• Resnet34
• Performance: approx 8 GOPS
• (DPU performance > 20 GOPS)
11
Proposed Solution
Co-design:
▪ Approach B:
▪ Mapping is good but throughput is below threshold…
▪ Approach C:
▪ FINN requires large HW to support original model -> more aggressive quantization
▪ Quantized model fits in HW but accuracy is below threshold…
?
B + C
12
▪ Current situation:
▪ Zoo of DNN models
▪ Zoo of HW accelerators
▪ Heterogeneity in the model -> Heterogeneity in the hardware
The need for Co-Design!
Co-Design with both generic and layer-specific HW modules
Conclusions
13
Thank you for your attention.

More Related Content

Similar to HiPEAC2022-DL4IoT workshop_ Muhammad Waqar Azhar

Thinking in parallel ab tuladev
Thinking in parallel ab tuladevThinking in parallel ab tuladev
Thinking in parallel ab tuladev
Pavel Tsukanov
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
jsvetter
 
Thoughts on Cybersecurity
Thoughts on CybersecurityThoughts on Cybersecurity
Thoughts on Cybersecurity
Frank Wuerthwein
 
Parallelformers
ParallelformersParallelformers
Parallelformers
gohyunwoong
 
Feature-Oriented Software Evolution
Feature-Oriented Software EvolutionFeature-Oriented Software Evolution
Feature-Oriented Software Evolution
Leonardo Passos
 
Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote
Ceph Community
 
Deep Learning on ARM Platforms - SFO17-509
Deep Learning on ARM Platforms - SFO17-509Deep Learning on ARM Platforms - SFO17-509
Deep Learning on ARM Platforms - SFO17-509
Linaro
 
Comparing Orchestration
Comparing OrchestrationComparing Orchestration
Comparing Orchestration
Knoldus Inc.
 
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
chiportal
 
Presentation
PresentationPresentation
Presentation
butest
 
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
MLconf
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
Xavier Amatriain
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf
Xavier Amatriain
 
Are High Level Programming Languages for Multicore and Safety Critical Conver...
Are High Level Programming Languages for Multicore and Safety Critical Conver...Are High Level Programming Languages for Multicore and Safety Critical Conver...
Are High Level Programming Languages for Multicore and Safety Critical Conver...
InfinIT - Innovationsnetværket for it
 
London Ceph Day Keynote: Building Tomorrow's Ceph
London Ceph Day Keynote: Building Tomorrow's Ceph London Ceph Day Keynote: Building Tomorrow's Ceph
London Ceph Day Keynote: Building Tomorrow's Ceph
Ceph Community
 
Icpc16.ppt
Icpc16.pptIcpc16.ppt
Icpc16.ppt
Ptidej Team
 
Icpc16.ppt
Icpc16.pptIcpc16.ppt
IoT Tech Expo 2023_Pedro Trancoso presentation
IoT Tech Expo 2023_Pedro Trancoso presentationIoT Tech Expo 2023_Pedro Trancoso presentation
IoT Tech Expo 2023_Pedro Trancoso presentation
VEDLIoT Project
 
Ceph: A decade in the making and still going strong
Ceph: A decade in the making and still going strongCeph: A decade in the making and still going strong
Ceph: A decade in the making and still going strong
Patrick McGarry
 
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Community
 

Similar to HiPEAC2022-DL4IoT workshop_ Muhammad Waqar Azhar (20)

Thinking in parallel ab tuladev
Thinking in parallel ab tuladevThinking in parallel ab tuladev
Thinking in parallel ab tuladev
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
 
Thoughts on Cybersecurity
Thoughts on CybersecurityThoughts on Cybersecurity
Thoughts on Cybersecurity
 
Parallelformers
ParallelformersParallelformers
Parallelformers
 
Feature-Oriented Software Evolution
Feature-Oriented Software EvolutionFeature-Oriented Software Evolution
Feature-Oriented Software Evolution
 
Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote Ceph Day SF 2015 - Keynote
Ceph Day SF 2015 - Keynote
 
Deep Learning on ARM Platforms - SFO17-509
Deep Learning on ARM Platforms - SFO17-509Deep Learning on ARM Platforms - SFO17-509
Deep Learning on ARM Platforms - SFO17-509
 
Comparing Orchestration
Comparing OrchestrationComparing Orchestration
Comparing Orchestration
 
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
TRACK F: OpenCL for ALTERA FPGAs, Accelerating performance and design product...
 
Presentation
PresentationPresentation
Presentation
 
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
Xavier Amatriain, VP of Engineering, Quora at MLconf SF - 11/13/15
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf
 
Are High Level Programming Languages for Multicore and Safety Critical Conver...
Are High Level Programming Languages for Multicore and Safety Critical Conver...Are High Level Programming Languages for Multicore and Safety Critical Conver...
Are High Level Programming Languages for Multicore and Safety Critical Conver...
 
London Ceph Day Keynote: Building Tomorrow's Ceph
London Ceph Day Keynote: Building Tomorrow's Ceph London Ceph Day Keynote: Building Tomorrow's Ceph
London Ceph Day Keynote: Building Tomorrow's Ceph
 
Icpc16.ppt
Icpc16.pptIcpc16.ppt
Icpc16.ppt
 
Icpc16.ppt
Icpc16.pptIcpc16.ppt
Icpc16.ppt
 
IoT Tech Expo 2023_Pedro Trancoso presentation
IoT Tech Expo 2023_Pedro Trancoso presentationIoT Tech Expo 2023_Pedro Trancoso presentation
IoT Tech Expo 2023_Pedro Trancoso presentation
 
Ceph: A decade in the making and still going strong
Ceph: A decade in the making and still going strongCeph: A decade in the making and still going strong
Ceph: A decade in the making and still going strong
 
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
Ceph Day Santa Clara: Keynote: Building Tomorrow's Ceph
 

More from VEDLIoT Project

IoT Tech Expo 2023_Micha vor dem Berge presentation
IoT Tech Expo 2023_Micha vor dem Berge presentationIoT Tech Expo 2023_Micha vor dem Berge presentation
IoT Tech Expo 2023_Micha vor dem Berge presentation
VEDLIoT Project
 
Computing Frontiers 2023_Pedro Trancoso presentation
Computing Frontiers 2023_Pedro Trancoso presentationComputing Frontiers 2023_Pedro Trancoso presentation
Computing Frontiers 2023_Pedro Trancoso presentation
VEDLIoT Project
 
HiPEAC-CSW 2022_Pedro Trancoso presentation
HiPEAC-CSW 2022_Pedro Trancoso presentationHiPEAC-CSW 2022_Pedro Trancoso presentation
HiPEAC-CSW 2022_Pedro Trancoso presentation
VEDLIoT Project
 
IoT Week 2022-NGIoT session_Micha vor dem Berge presentation
IoT Week 2022-NGIoT session_Micha vor dem Berge presentationIoT Week 2022-NGIoT session_Micha vor dem Berge presentation
IoT Week 2022-NGIoT session_Micha vor dem Berge presentation
VEDLIoT Project
 
Next Generation IoT Architectures_Hans Salomonsson
Next Generation IoT Architectures_Hans SalomonssonNext Generation IoT Architectures_Hans Salomonsson
Next Generation IoT Architectures_Hans Salomonsson
VEDLIoT Project
 
CONASENSE 2022_Jens Hagemeyer presentation
CONASENSE 2022_Jens Hagemeyer presentationCONASENSE 2022_Jens Hagemeyer presentation
CONASENSE 2022_Jens Hagemeyer presentation
VEDLIoT Project
 
NGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentationNGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentation
VEDLIoT Project
 
HiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentationHiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentation
VEDLIoT Project
 
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentationHiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
VEDLIoT Project
 
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentationSS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
VEDLIoT Project
 
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentationHiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
VEDLIoT Project
 
IoT Week 2021_Jens Hagemeyer presentation
IoT Week 2021_Jens Hagemeyer presentationIoT Week 2021_Jens Hagemeyer presentation
IoT Week 2021_Jens Hagemeyer presentation
VEDLIoT Project
 
HiPEAC 2022_Marcelo Pasin presentation
HiPEAC 2022_Marcelo Pasin presentationHiPEAC 2022_Marcelo Pasin presentation
HiPEAC 2022_Marcelo Pasin presentation
VEDLIoT Project
 
IoT Tech Expo 2023_Marcelo Pasin presentation
IoT Tech Expo 2023_Marcelo Pasin presentationIoT Tech Expo 2023_Marcelo Pasin presentation
IoT Tech Expo 2023_Marcelo Pasin presentation
VEDLIoT Project
 
IoT Tech Expo 2023_Hans-Martin Heyn presentation
IoT Tech Expo 2023_Hans-Martin Heyn presentationIoT Tech Expo 2023_Hans-Martin Heyn presentation
IoT Tech Expo 2023_Hans-Martin Heyn presentation
VEDLIoT Project
 
HiPEAC 2022_Marco Tassemeier presentation
HiPEAC 2022_Marco Tassemeier presentationHiPEAC 2022_Marco Tassemeier presentation
HiPEAC 2022_Marco Tassemeier presentation
VEDLIoT Project
 
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
HiPEAC Computing Systems Week 2022_Mario Porrmann presentationHiPEAC Computing Systems Week 2022_Mario Porrmann presentation
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
VEDLIoT Project
 
HiPEAC2022_António Casimiro presentation
HiPEAC2022_António Casimiro presentationHiPEAC2022_António Casimiro presentation
HiPEAC2022_António Casimiro presentation
VEDLIoT Project
 
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentationNGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
VEDLIoT Project
 
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
VEDLIoT Project
 

More from VEDLIoT Project (20)

IoT Tech Expo 2023_Micha vor dem Berge presentation
IoT Tech Expo 2023_Micha vor dem Berge presentationIoT Tech Expo 2023_Micha vor dem Berge presentation
IoT Tech Expo 2023_Micha vor dem Berge presentation
 
Computing Frontiers 2023_Pedro Trancoso presentation
Computing Frontiers 2023_Pedro Trancoso presentationComputing Frontiers 2023_Pedro Trancoso presentation
Computing Frontiers 2023_Pedro Trancoso presentation
 
HiPEAC-CSW 2022_Pedro Trancoso presentation
HiPEAC-CSW 2022_Pedro Trancoso presentationHiPEAC-CSW 2022_Pedro Trancoso presentation
HiPEAC-CSW 2022_Pedro Trancoso presentation
 
IoT Week 2022-NGIoT session_Micha vor dem Berge presentation
IoT Week 2022-NGIoT session_Micha vor dem Berge presentationIoT Week 2022-NGIoT session_Micha vor dem Berge presentation
IoT Week 2022-NGIoT session_Micha vor dem Berge presentation
 
Next Generation IoT Architectures_Hans Salomonsson
Next Generation IoT Architectures_Hans SalomonssonNext Generation IoT Architectures_Hans Salomonsson
Next Generation IoT Architectures_Hans Salomonsson
 
CONASENSE 2022_Jens Hagemeyer presentation
CONASENSE 2022_Jens Hagemeyer presentationCONASENSE 2022_Jens Hagemeyer presentation
CONASENSE 2022_Jens Hagemeyer presentation
 
NGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentationNGIoT standardisation workshops_Jens Hagemeyer presentation
NGIoT standardisation workshops_Jens Hagemeyer presentation
 
HiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentationHiPEAC-CSW 2022_Kevin Mika presentation
HiPEAC-CSW 2022_Kevin Mika presentation
 
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentationHiPEAC 2022-DL4IoT workshop_René Griessl presentation
HiPEAC 2022-DL4IoT workshop_René Griessl presentation
 
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentationSS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
 
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentationHiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
HiPEAC2023-DL4IoT Workshop_Jean Hagemeyer presentation
 
IoT Week 2021_Jens Hagemeyer presentation
IoT Week 2021_Jens Hagemeyer presentationIoT Week 2021_Jens Hagemeyer presentation
IoT Week 2021_Jens Hagemeyer presentation
 
HiPEAC 2022_Marcelo Pasin presentation
HiPEAC 2022_Marcelo Pasin presentationHiPEAC 2022_Marcelo Pasin presentation
HiPEAC 2022_Marcelo Pasin presentation
 
IoT Tech Expo 2023_Marcelo Pasin presentation
IoT Tech Expo 2023_Marcelo Pasin presentationIoT Tech Expo 2023_Marcelo Pasin presentation
IoT Tech Expo 2023_Marcelo Pasin presentation
 
IoT Tech Expo 2023_Hans-Martin Heyn presentation
IoT Tech Expo 2023_Hans-Martin Heyn presentationIoT Tech Expo 2023_Hans-Martin Heyn presentation
IoT Tech Expo 2023_Hans-Martin Heyn presentation
 
HiPEAC 2022_Marco Tassemeier presentation
HiPEAC 2022_Marco Tassemeier presentationHiPEAC 2022_Marco Tassemeier presentation
HiPEAC 2022_Marco Tassemeier presentation
 
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
HiPEAC Computing Systems Week 2022_Mario Porrmann presentationHiPEAC Computing Systems Week 2022_Mario Porrmann presentation
HiPEAC Computing Systems Week 2022_Mario Porrmann presentation
 
HiPEAC2022_António Casimiro presentation
HiPEAC2022_António Casimiro presentationHiPEAC2022_António Casimiro presentation
HiPEAC2022_António Casimiro presentation
 
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentationNGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
NGIoT Sustainability Workshop 2023_ Hans-Martin Heyn presentation
 
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
EU-IoT Training Workshops Series: AIoT and Edge Machine Learning 2021_Jens Ha...
 

Recently uploaded

ThrombUS+ Project Presentation - June 2024
ThrombUS+ Project Presentation - June 2024ThrombUS+ Project Presentation - June 2024
ThrombUS+ Project Presentation - June 2024
elenikaldoudi1
 
Electrostatic force class 8 ncert. .pptx
Electrostatic force class 8 ncert. .pptxElectrostatic force class 8 ncert. .pptx
Electrostatic force class 8 ncert. .pptx
yokeswarikannan123
 
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
Steffi Friedrichs
 
ALTERNATIVE ANIMAL TOXICITY STUDY .pptx
ALTERNATIVE ANIMAL TOXICITY STUDY  .pptxALTERNATIVE ANIMAL TOXICITY STUDY  .pptx
ALTERNATIVE ANIMAL TOXICITY STUDY .pptx
SAMIR PANDA
 
SPERM FUNCTION TEST IN EMBRYOLOGY .pptx
SPERM FUNCTION TEST  IN EMBRYOLOGY .pptxSPERM FUNCTION TEST  IN EMBRYOLOGY .pptx
SPERM FUNCTION TEST IN EMBRYOLOGY .pptx
SRI AUROBINDO UNIVERSITY
 
Types of Garden (Mughal and Buddhist style)
Types of Garden (Mughal and Buddhist style)Types of Garden (Mughal and Buddhist style)
Types of Garden (Mughal and Buddhist style)
saloniswain225
 
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
Sérgio Sacani
 
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
Thane Heins
 
ThrombUS+ Project Overview - Horizon Europe Networking Event June 2024
ThrombUS+ Project Overview - Horizon Europe Networking Event  June 2024ThrombUS+ Project Overview - Horizon Europe Networking Event  June 2024
ThrombUS+ Project Overview - Horizon Europe Networking Event June 2024
ThrombUS+ Project
 
History & overview of Bioprocess Technology.pptx
History & overview of Bioprocess Technology.pptxHistory & overview of Bioprocess Technology.pptx
History & overview of Bioprocess Technology.pptx
berciyalgolda1
 
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptxSPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
SRI AUROBINDO UNIVERSITY
 
degree Certificate of Aston University
degree Certificate of Aston Universitydegree Certificate of Aston University
degree Certificate of Aston University
ebgyz
 
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDYSUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
Dr Kirpa Ram Jangra
 
Antigen_ppt(_RANJITHA_SL)_.presentation.
Antigen_ppt(_RANJITHA_SL)_.presentation.Antigen_ppt(_RANJITHA_SL)_.presentation.
Antigen_ppt(_RANJITHA_SL)_.presentation.
RanjithaSL
 
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
ThrombUS+ Project
 
AZoNetwork eBook Production Cover Examples
AZoNetwork eBook Production Cover ExamplesAZoNetwork eBook Production Cover Examples
AZoNetwork eBook Production Cover Examples
SaraLopez160298
 
ThrombUS+ Project Overview - January 2024
ThrombUS+ Project Overview - January 2024ThrombUS+ Project Overview - January 2024
ThrombUS+ Project Overview - January 2024
ThrombUS+ Project
 
ScieNCE grade 08 Lesson 1 and 2 NLC.pptx
ScieNCE grade 08 Lesson 1 and 2 NLC.pptxScieNCE grade 08 Lesson 1 and 2 NLC.pptx
ScieNCE grade 08 Lesson 1 and 2 NLC.pptx
JoanaBanasen1
 
Liver & Gall Bladder 23098463278654387654328765439875.pptx
Liver & Gall Bladder 23098463278654387654328765439875.pptxLiver & Gall Bladder 23098463278654387654328765439875.pptx
Liver & Gall Bladder 23098463278654387654328765439875.pptx
muralinath2
 
Dalghren, Thorne and Stebbins System of Classification of Angiosperms
Dalghren, Thorne and Stebbins System of Classification of AngiospermsDalghren, Thorne and Stebbins System of Classification of Angiosperms
Dalghren, Thorne and Stebbins System of Classification of Angiosperms
Gurjant Singh
 

Recently uploaded (20)

ThrombUS+ Project Presentation - June 2024
ThrombUS+ Project Presentation - June 2024ThrombUS+ Project Presentation - June 2024
ThrombUS+ Project Presentation - June 2024
 
Electrostatic force class 8 ncert. .pptx
Electrostatic force class 8 ncert. .pptxElectrostatic force class 8 ncert. .pptx
Electrostatic force class 8 ncert. .pptx
 
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
MACRAMÉ-ChiPs: Patchwork Project Family & Sibling Projects (24th Meeting of t...
 
ALTERNATIVE ANIMAL TOXICITY STUDY .pptx
ALTERNATIVE ANIMAL TOXICITY STUDY  .pptxALTERNATIVE ANIMAL TOXICITY STUDY  .pptx
ALTERNATIVE ANIMAL TOXICITY STUDY .pptx
 
SPERM FUNCTION TEST IN EMBRYOLOGY .pptx
SPERM FUNCTION TEST  IN EMBRYOLOGY .pptxSPERM FUNCTION TEST  IN EMBRYOLOGY .pptx
SPERM FUNCTION TEST IN EMBRYOLOGY .pptx
 
Types of Garden (Mughal and Buddhist style)
Types of Garden (Mughal and Buddhist style)Types of Garden (Mughal and Buddhist style)
Types of Garden (Mughal and Buddhist style)
 
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
The extremotolerant desert moss Syntrichia caninervis is a promising pioneer ...
 
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
PART 1 The New Natural Principles of Electromagnetism and Electromagnetic Fie...
 
ThrombUS+ Project Overview - Horizon Europe Networking Event June 2024
ThrombUS+ Project Overview - Horizon Europe Networking Event  June 2024ThrombUS+ Project Overview - Horizon Europe Networking Event  June 2024
ThrombUS+ Project Overview - Horizon Europe Networking Event June 2024
 
History & overview of Bioprocess Technology.pptx
History & overview of Bioprocess Technology.pptxHistory & overview of Bioprocess Technology.pptx
History & overview of Bioprocess Technology.pptx
 
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptxSPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
SPERM DNA DAMAGE/SPERM DNA FRAGMENTATION.pptx
 
degree Certificate of Aston University
degree Certificate of Aston Universitydegree Certificate of Aston University
degree Certificate of Aston University
 
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDYSUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
SUBJECT SPECIFIC ETHICAL ISSUES IN STUDY
 
Antigen_ppt(_RANJITHA_SL)_.presentation.
Antigen_ppt(_RANJITHA_SL)_.presentation.Antigen_ppt(_RANJITHA_SL)_.presentation.
Antigen_ppt(_RANJITHA_SL)_.presentation.
 
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
Towards Wearable Continuous Point-of-Care Monitoring for Deep Vein Thrombosis...
 
AZoNetwork eBook Production Cover Examples
AZoNetwork eBook Production Cover ExamplesAZoNetwork eBook Production Cover Examples
AZoNetwork eBook Production Cover Examples
 
ThrombUS+ Project Overview - January 2024
ThrombUS+ Project Overview - January 2024ThrombUS+ Project Overview - January 2024
ThrombUS+ Project Overview - January 2024
 
ScieNCE grade 08 Lesson 1 and 2 NLC.pptx
ScieNCE grade 08 Lesson 1 and 2 NLC.pptxScieNCE grade 08 Lesson 1 and 2 NLC.pptx
ScieNCE grade 08 Lesson 1 and 2 NLC.pptx
 
Liver & Gall Bladder 23098463278654387654328765439875.pptx
Liver & Gall Bladder 23098463278654387654328765439875.pptxLiver & Gall Bladder 23098463278654387654328765439875.pptx
Liver & Gall Bladder 23098463278654387654328765439875.pptx
 
Dalghren, Thorne and Stebbins System of Classification of Angiosperms
Dalghren, Thorne and Stebbins System of Classification of AngiospermsDalghren, Thorne and Stebbins System of Classification of Angiosperms
Dalghren, Thorne and Stebbins System of Classification of Angiosperms
 

HiPEAC2022-DL4IoT workshop_ Muhammad Waqar Azhar

  • 1. Muhammad Waqar Azhar, Pedro Trancoso Chalmers University of Technology 20. June 2022 Co-design of DL Accelerators in VEDLIoT
  • 2. 2 Which one is the best? Which one is the best for me? DL Accelerators
  • 3. 3 DL Acceleration Hardware Software CPU GPU TPU NPU FPGA Models Quantization Pruning
  • 4. 4 DL Accelerator Co-Design Design for… Map to… Software Hardware
  • 5. 5 DL Accelerator Co-Design Design for… Map to… Software Hardware
  • 6. 6 DL Accelerator Co-Design Design for… Map to… Software Hardware
  • 7. 7 DL Accelerator Co-Design Design for… Map to… Software Hardware
  • 8. 8 ▪ Model case-study: MobileNet ▪ Observation: generic HW not efficient ▪ Challenge: Depthwise convolution Co-Design Example - Motivation ● Heterogeneity at different levels: ○ Model layers of different type (e.g. depthwise and pointwise convolution) ○ Within same layer type (e.g. activation and filter sizes and shapes) ○ Determines: buffer sizes, reuse, parallelism Layer-specific hardware to capture heterogeneity!
  • 9. 9 Co-Design Example: Open Questions & Approaches Approach A: one-HW-for-all DOG Approach B: one-HW-per-layer-type Approach C: one-HW-per-layer + Runs any model - Suboptimal efficiency + Matches layer types - Suboptimal utilization + Best efficiency - Resource-hungry
  • 10. 10 Co-Design Example: Open Questions & Approaches Approach A: one-HW-for-all DOG Approach B: one-HW-per-layer-type Approach C: one-HW-per-layer TVM-VTA PYNQ-Z2 Unique Kernels ZCU102 Xilinx FINN ZCU102 • MobileNet requires aggressive quantization (4b) • Performance: 35 GOPS and 68 GOPS using MobileNetsV1 1x and 0.5x • Su, Jiang, et al. "Redundancy- reduced mobilenet acceleration on reconfigurable logic for imagenet classification." • Performance: approx 90 GOPS • Resnet34 • Performance: approx 8 GOPS • (DPU performance > 20 GOPS)
  • 11. 11 Proposed Solution Co-design: ▪ Approach B: ▪ Mapping is good but throughput is below threshold… ▪ Approach C: ▪ FINN requires large HW to support original model -> more aggressive quantization ▪ Quantized model fits in HW but accuracy is below threshold… ? B + C
  • 12. 12 ▪ Current situation: ▪ Zoo of DNN models ▪ Zoo of HW accelerators ▪ Heterogeneity in the model -> Heterogeneity in the hardware The need for Co-Design! Co-Design with both generic and layer-specific HW modules Conclusions
  • 13. 13 Thank you for your attention.