(Go: >> BACK << -|- >> HOME <<)

SlideShare a Scribd company logo
K U L A
You only look once:
Unified, Real-time Object Detection
by Joseph Redmon, Santosh Divvala,
Ross Girshick, Ali Farhadi (CVPR 2016)
K U L A
from deepsystems.io
Pascal VOC2007 test sample results.
K U L A
Main Concept
* Object Detection
* Regression problem
* YOLO
* Only One Feedforward
* Global context
* Unified (Real-time detection)
* YOLO: 45 FPS
* Fast YOLO: 155 FPS
* General representation
* Robust on various background
* Other domain
K U L A
Previous Works: Repurpose classifier to perform detectio
Deformable Parts Models (DPM)
• Sliding window
R-CNN based methods
1) generate potential bounding boxes.
2) run classifiers on these proposed
boxes
3) post-processing (refinement,
elimination, rescore)
K U L A
Object detection as Regression Problem
YOLO: Single Regression Problem
Image → bounding box coordinate and class probability.
* Extremely Fast
* Global reasoning
* Generalizable representation
K U L A
Unified Detection
• All BBox, All classes
1) Image → S x S grids
2) Grid cell
→ B: BBoxes and Confidence score
x, y, w, h, confidence
→ C: class probabilities w.r.t #classes
K U L A
Unified Detection
• Predict one set of class
probabilities per grid cell,
regardless of the number of
boxes B.
• At test time,
individual box confidence
prediction
K U L A
Network Design
• Modified GoogLeNet
• 1x1 reduction layer (“Network in Network”)
K U L A
How it works?
from deepsystems.io
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
from deepsystems.io
How it works?
K U L A
How it works?
from deepsystems.io
Total :
7*7*2 = 98 boxes
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Look at detection procedure
from deepsystems.io
K U L A
Limitation of YOLO
from deepsystems.io
• Group of small objects
• Unusual aspect ratios
• Coarse feature
• Localization error of bounding box
K U L A
Comparison to other Real-Time Systems
from deepsystems.io
K U L A
VOC Error
from deepsystems.io
K U L A
Combining Fast R-CNN and YOLO
from deepsystems.io
K U L A
VOC 2012 Leaderboard
from deepsystems.io
K U L A
Generalizability : Person Detection in Artwork
from deepsystems.io
K U L A
Generalizability : Person Detection in Artwork
from deepsystems.io
K U L A
Key Points
from deepsystems.io
1.Fast: YOLO - 45 fps, YOLO-tiny - 155 fps.
2.End-to-end training.
3.Makes more localization errors but is less likely to
predict false positives on background
4.Performance is lower than the current state of the art.
5.Combined Fast R-CNN + YOLO model is one of the
highest performing detection
6.methods.
7.Learns very general representations of objects: it
outperforms other detection methods,
8.including DPM and R-CNN, when generalizing from
natural images to other domains
K U L A
Appendix : Loss Function (sum-squared error)
from deepsystems.io
K U L A
from deepsystems.io
Appendix : Loss Function (sum-squared error)
K U L A
from deepsystems.io
Appendix : Loss Function (sum-squared error)
K U L A
from deepsystems.io
Appendix : Intersection over Union (IoU)
• IoU(pred, truth)=[0, 1]
K U L A
from deepsystems.io
Appendix : Sum-Squared Error (SSE)
sum of squared errors of prediction (SSE), is the sum of the squares of
residuals (deviations predicted from actual empirical values of data). It is a
measure of the discrepancy between the data and an estimation model. A
small RSS indicates a tight fit of the model to the data. It is used as an
optimality criterion in parameter selection and model selection.

More Related Content

What's hot

Yolov5
Yolov5 Yolov5
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
Brodmann17
 
YOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection reviewYOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection review
LEE HOSEONG
 
You only look once
You only look onceYou only look once
You only look once
Gin Kyeng Lee
 
Yolo
YoloYolo
PR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementPR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental Improvement
Jinwon Lee
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi Kerola
Preferred Networks
 
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Universitat Politècnica de Catalunya
 
Yolo
YoloYolo
YOLO
YOLOYOLO
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
Sushant Shrivastava
 
Yolo releases gianmaria
Yolo releases gianmariaYolo releases gianmaria
Yolo releases gianmaria
Deep Learning Italia
 
Anatomy of YOLO - v1
Anatomy of YOLO - v1Anatomy of YOLO - v1
Anatomy of YOLO - v1
Jihoon Song
 
Yolov3
Yolov3Yolov3
Yolov3
VincentWu105
 
Object detection
Object detectionObject detection
Object detection
ROUSHAN RAJ KUMAR
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
Universitat Politècnica de Catalunya
 
YOLO v1
YOLO v1YOLO v1
YOLO v1
오 혜린
 
Yol ov2
Yol ov2Yol ov2
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
Brodmann17
 
Object detection
Object detectionObject detection
Object detection
Somesh Vyas
 

What's hot (20)

Yolov5
Yolov5 Yolov5
Yolov5
 
Introduction to object detection
Introduction to object detectionIntroduction to object detection
Introduction to object detection
 
YOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection reviewYOLOv4: optimal speed and accuracy of object detection review
YOLOv4: optimal speed and accuracy of object detection review
 
You only look once
You only look onceYou only look once
You only look once
 
Yolo
YoloYolo
Yolo
 
PR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental ImprovementPR-207: YOLOv3: An Incremental Improvement
PR-207: YOLOv3: An Incremental Improvement
 
A Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi KerolaA Brief History of Object Detection / Tommi Kerola
A Brief History of Object Detection / Tommi Kerola
 
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)
 
Yolo
YoloYolo
Yolo
 
YOLO
YOLOYOLO
YOLO
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
 
Yolo releases gianmaria
Yolo releases gianmariaYolo releases gianmaria
Yolo releases gianmaria
 
Anatomy of YOLO - v1
Anatomy of YOLO - v1Anatomy of YOLO - v1
Anatomy of YOLO - v1
 
Yolov3
Yolov3Yolov3
Yolov3
 
Object detection
Object detectionObject detection
Object detection
 
Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...Faster R-CNN: Towards real-time object detection with region proposal network...
Faster R-CNN: Towards real-time object detection with region proposal network...
 
YOLO v1
YOLO v1YOLO v1
YOLO v1
 
Yol ov2
Yol ov2Yol ov2
Yol ov2
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
Object detection
Object detectionObject detection
Object detection
 

Similar to You Only Look Once: Unified, Real-Time Object Detection

ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
vikaspandey0702
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
MonicaDommaraju
 
ppt - Copy for projects will help you further
ppt - Copy for projects will help you furtherppt - Copy for projects will help you further
ppt - Copy for projects will help you further
vikaspandey0702
 
A-13 Iomp-1.pptx
A-13 Iomp-1.pptxA-13 Iomp-1.pptx
A-13 Iomp-1.pptx
Jayendranath3
 
Review: You Only Look One-level Feature
Review: You Only Look One-level FeatureReview: You Only Look One-level Feature
Review: You Only Look One-level Feature
Dongmin Choi
 
Polymorphism 9
Polymorphism 9Polymorphism 9
Polymorphism 9
Fajar Baskoro
 
Polymorphism 9
Polymorphism 9Polymorphism 9
Polymorphism 9
Fajar Baskoro
 
Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
ISSEL
 
Python metaprogramming in linear time language for automated runtime verifica...
Python metaprogramming in linear time language for automated runtime verifica...Python metaprogramming in linear time language for automated runtime verifica...
Python metaprogramming in linear time language for automated runtime verifica...
ISSEL
 
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
ISSEL
 
Learning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physicsLearning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physics
Balázs Kégl
 
20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing
Jui-Hsin (Larry) Lai
 
Deep Learning Hardware: Past, Present, & Future
Deep Learning Hardware: Past, Present, & FutureDeep Learning Hardware: Past, Present, & Future
Deep Learning Hardware: Past, Present, & Future
Rouyun Pan
 
YOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection SystemYOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection System
IRJET Journal
 
Real Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A ReviewReal Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A Review
Springer
 
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
Edge AI and Vision Alliance
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
CHENHuiMei
 
Convolutional Neural Networks CNN
Convolutional Neural Networks CNNConvolutional Neural Networks CNN
Convolutional Neural Networks CNN
Abdullah al Mamun
 

Similar to You Only Look Once: Unified, Real-Time Object Detection (20)

ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
 
ppt - Copy for projects will help you further
ppt - Copy for projects will help you furtherppt - Copy for projects will help you further
ppt - Copy for projects will help you further
 
A-13 Iomp-1.pptx
A-13 Iomp-1.pptxA-13 Iomp-1.pptx
A-13 Iomp-1.pptx
 
Review: You Only Look One-level Feature
Review: You Only Look One-level FeatureReview: You Only Look One-level Feature
Review: You Only Look One-level Feature
 
Polymorphism 9
Polymorphism 9Polymorphism 9
Polymorphism 9
 
Polymorphism 9
Polymorphism 9Polymorphism 9
Polymorphism 9
 
Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018Object Detection - Míriam Bellver - UPC Barcelona 2018
Object Detection - Míriam Bellver - UPC Barcelona 2018
 
Lec11 object-re-id
Lec11 object-re-idLec11 object-re-id
Lec11 object-re-id
 
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
 
Python metaprogramming in linear time language for automated runtime verifica...
Python metaprogramming in linear time language for automated runtime verifica...Python metaprogramming in linear time language for automated runtime verifica...
Python metaprogramming in linear time language for automated runtime verifica...
 
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
Μεταπρογραµµατισµός κώδικα Python σε γλώσσα γραµµικού χρόνου για αυτόµατη επα...
 
Learning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physicsLearning do discover: machine learning in high-energy physics
Learning do discover: machine learning in high-energy physics
 
20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing20211118 AI+ Remote Sensing
20211118 AI+ Remote Sensing
 
Deep Learning Hardware: Past, Present, & Future
Deep Learning Hardware: Past, Present, & FutureDeep Learning Hardware: Past, Present, & Future
Deep Learning Hardware: Past, Present, & Future
 
YOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection SystemYOLOv4: A Face Mask Detection System
YOLOv4: A Face Mask Detection System
 
Real Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A ReviewReal Time Object Detection System with YOLO and CNN Models: A Review
Real Time Object Detection System with YOLO and CNN Models: A Review
 
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
“Robust Object Detection Under Dataset Shifts,” a Presentation from Arm
 
物件偵測與辨識技術
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
 
Convolutional Neural Networks CNN
Convolutional Neural Networks CNNConvolutional Neural Networks CNN
Convolutional Neural Networks CNN
 

Recently uploaded

Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - MydbopsScaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Mydbops
 
Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024
The Digital Insurer
 
AC Atlassian Coimbatore Session Slides( 22/06/2024)
AC Atlassian Coimbatore Session Slides( 22/06/2024)AC Atlassian Coimbatore Session Slides( 22/06/2024)
AC Atlassian Coimbatore Session Slides( 22/06/2024)
apoorva2579
 
Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1
FellyciaHikmahwarani
 
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
uuuot
 
K2G - Insurtech Innovation EMEA Award 2024
K2G - Insurtech Innovation EMEA Award 2024K2G - Insurtech Innovation EMEA Award 2024
K2G - Insurtech Innovation EMEA Award 2024
The Digital Insurer
 
STKI Israeli Market Study 2024 final v1
STKI Israeli Market Study 2024 final  v1STKI Israeli Market Study 2024 final  v1
STKI Israeli Market Study 2024 final v1
Dr. Jimmy Schwarzkopf
 
Coordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar SlidesCoordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar Slides
Safe Software
 
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
Edge AI and Vision Alliance
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 
What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)
Margaret Fero
 
Lessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien RiouxLessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien Rioux
crioux1
 
AI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AIAI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AI
Raphaël Semeteys
 
Quality Patents: Patents That Stand the Test of Time
Quality Patents: Patents That Stand the Test of TimeQuality Patents: Patents That Stand the Test of Time
Quality Patents: Patents That Stand the Test of Time
Aurora Consulting
 
20240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 202420240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 2024
Matthew Sinclair
 
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum ThreatsNavigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
anupriti
 
Data Protection in a Connected World: Sovereignty and Cyber Security
Data Protection in a Connected World: Sovereignty and Cyber SecurityData Protection in a Connected World: Sovereignty and Cyber Security
Data Protection in a Connected World: Sovereignty and Cyber Security
anupriti
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
ishalveerrandhawa1
 
Quantum Communications Q&A with Gemini LLM
Quantum Communications Q&A with Gemini LLMQuantum Communications Q&A with Gemini LLM
Quantum Communications Q&A with Gemini LLM
Vijayananda Mohire
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
SynapseIndia
 

Recently uploaded (20)

Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - MydbopsScaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
Scaling Connections in PostgreSQL Postgres Bangalore(PGBLR) Meetup-2 - Mydbops
 
Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024
 
AC Atlassian Coimbatore Session Slides( 22/06/2024)
AC Atlassian Coimbatore Session Slides( 22/06/2024)AC Atlassian Coimbatore Session Slides( 22/06/2024)
AC Atlassian Coimbatore Session Slides( 22/06/2024)
 
Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1Why do You Have to Redesign?_Redesign Challenge Day 1
Why do You Have to Redesign?_Redesign Challenge Day 1
 
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
一比一原版(msvu毕业证书)圣文森山大学毕业证如何办理
 
K2G - Insurtech Innovation EMEA Award 2024
K2G - Insurtech Innovation EMEA Award 2024K2G - Insurtech Innovation EMEA Award 2024
K2G - Insurtech Innovation EMEA Award 2024
 
STKI Israeli Market Study 2024 final v1
STKI Israeli Market Study 2024 final  v1STKI Israeli Market Study 2024 final  v1
STKI Israeli Market Study 2024 final v1
 
Coordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar SlidesCoordinate Systems in FME 101 - Webinar Slides
Coordinate Systems in FME 101 - Webinar Slides
 
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
“Intel’s Approach to Operationalizing AI in the Manufacturing Sector,” a Pres...
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)What Not to Document and Why_ (North Bay Python 2024)
What Not to Document and Why_ (North Bay Python 2024)
 
Lessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien RiouxLessons Of Binary Analysis - Christien Rioux
Lessons Of Binary Analysis - Christien Rioux
 
AI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AIAI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AI
 
Quality Patents: Patents That Stand the Test of Time
Quality Patents: Patents That Stand the Test of TimeQuality Patents: Patents That Stand the Test of Time
Quality Patents: Patents That Stand the Test of Time
 
20240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 202420240702 QFM021 Machine Intelligence Reading List June 2024
20240702 QFM021 Machine Intelligence Reading List June 2024
 
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum ThreatsNavigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
Navigating Post-Quantum Blockchain: Resilient Cryptography in Quantum Threats
 
Data Protection in a Connected World: Sovereignty and Cyber Security
Data Protection in a Connected World: Sovereignty and Cyber SecurityData Protection in a Connected World: Sovereignty and Cyber Security
Data Protection in a Connected World: Sovereignty and Cyber Security
 
Calgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptxCalgary MuleSoft Meetup APM and IDP .pptx
Calgary MuleSoft Meetup APM and IDP .pptx
 
Quantum Communications Q&A with Gemini LLM
Quantum Communications Q&A with Gemini LLMQuantum Communications Q&A with Gemini LLM
Quantum Communications Q&A with Gemini LLM
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
 

You Only Look Once: Unified, Real-Time Object Detection

  • 1. K U L A You only look once: Unified, Real-time Object Detection by Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi (CVPR 2016)
  • 2. K U L A from deepsystems.io Pascal VOC2007 test sample results.
  • 3. K U L A Main Concept * Object Detection * Regression problem * YOLO * Only One Feedforward * Global context * Unified (Real-time detection) * YOLO: 45 FPS * Fast YOLO: 155 FPS * General representation * Robust on various background * Other domain
  • 4. K U L A Previous Works: Repurpose classifier to perform detectio Deformable Parts Models (DPM) • Sliding window R-CNN based methods 1) generate potential bounding boxes. 2) run classifiers on these proposed boxes 3) post-processing (refinement, elimination, rescore)
  • 5. K U L A Object detection as Regression Problem YOLO: Single Regression Problem Image → bounding box coordinate and class probability. * Extremely Fast * Global reasoning * Generalizable representation
  • 6. K U L A Unified Detection • All BBox, All classes 1) Image → S x S grids 2) Grid cell → B: BBoxes and Confidence score x, y, w, h, confidence → C: class probabilities w.r.t #classes
  • 7. K U L A Unified Detection • Predict one set of class probabilities per grid cell, regardless of the number of boxes B. • At test time, individual box confidence prediction
  • 8. K U L A Network Design • Modified GoogLeNet • 1x1 reduction layer (“Network in Network”)
  • 9. K U L A How it works? from deepsystems.io
  • 10. K U L A from deepsystems.io How it works?
  • 11. K U L A from deepsystems.io How it works?
  • 12. K U L A from deepsystems.io How it works?
  • 13. K U L A from deepsystems.io How it works?
  • 14. K U L A from deepsystems.io How it works?
  • 15. K U L A from deepsystems.io How it works?
  • 16. K U L A from deepsystems.io How it works?
  • 17. K U L A from deepsystems.io How it works?
  • 18. K U L A from deepsystems.io How it works?
  • 19. K U L A How it works? from deepsystems.io Total : 7*7*2 = 98 boxes
  • 20. K U L A Look at detection procedure from deepsystems.io
  • 21. K U L A Look at detection procedure from deepsystems.io
  • 22. K U L A Look at detection procedure from deepsystems.io
  • 23. K U L A Look at detection procedure from deepsystems.io
  • 24. K U L A Look at detection procedure from deepsystems.io
  • 25. K U L A Look at detection procedure from deepsystems.io
  • 26. K U L A Look at detection procedure from deepsystems.io
  • 27. K U L A Look at detection procedure from deepsystems.io
  • 28. K U L A Look at detection procedure from deepsystems.io
  • 29. K U L A Look at detection procedure from deepsystems.io
  • 30. K U L A Look at detection procedure from deepsystems.io
  • 31. K U L A Look at detection procedure from deepsystems.io
  • 32. K U L A Look at detection procedure from deepsystems.io
  • 33. K U L A Look at detection procedure from deepsystems.io
  • 34. K U L A Look at detection procedure from deepsystems.io
  • 35. K U L A Look at detection procedure from deepsystems.io
  • 36. K U L A Look at detection procedure from deepsystems.io
  • 37. K U L A Look at detection procedure from deepsystems.io
  • 38. K U L A Look at detection procedure from deepsystems.io
  • 39. K U L A Limitation of YOLO from deepsystems.io • Group of small objects • Unusual aspect ratios • Coarse feature • Localization error of bounding box
  • 40. K U L A Comparison to other Real-Time Systems from deepsystems.io
  • 41. K U L A VOC Error from deepsystems.io
  • 42. K U L A Combining Fast R-CNN and YOLO from deepsystems.io
  • 43. K U L A VOC 2012 Leaderboard from deepsystems.io
  • 44. K U L A Generalizability : Person Detection in Artwork from deepsystems.io
  • 45. K U L A Generalizability : Person Detection in Artwork from deepsystems.io
  • 46. K U L A Key Points from deepsystems.io 1.Fast: YOLO - 45 fps, YOLO-tiny - 155 fps. 2.End-to-end training. 3.Makes more localization errors but is less likely to predict false positives on background 4.Performance is lower than the current state of the art. 5.Combined Fast R-CNN + YOLO model is one of the highest performing detection 6.methods. 7.Learns very general representations of objects: it outperforms other detection methods, 8.including DPM and R-CNN, when generalizing from natural images to other domains
  • 47. K U L A Appendix : Loss Function (sum-squared error) from deepsystems.io
  • 48. K U L A from deepsystems.io Appendix : Loss Function (sum-squared error)
  • 49. K U L A from deepsystems.io Appendix : Loss Function (sum-squared error)
  • 50. K U L A from deepsystems.io Appendix : Intersection over Union (IoU) • IoU(pred, truth)=[0, 1]
  • 51. K U L A from deepsystems.io Appendix : Sum-Squared Error (SSE) sum of squared errors of prediction (SSE), is the sum of the squares of residuals (deviations predicted from actual empirical values of data). It is a measure of the discrepancy between the data and an estimation model. A small RSS indicates a tight fit of the model to the data. It is used as an optimality criterion in parameter selection and model selection.