SlideShare a Scribd company logo
DEEP LEARNING IN PRACTICE
Tess Ferrandez – Microsoft - @TessFerrandez
TIM
MITYA
YANA
VITO
CLAUS
TESS
SHOTS ON GOAL
DETECTING CANCER
SHOPLIFTING
DEEP LEARNING
What’s so magical about it
Deep learning and computer vision
Deep learning and computer vision
int EstimatePrice(...){
price = 100000 +
67000 * area_in_sqm +
200000 * has_pool +
100000 * new_kitchen +
5000 * neighborhood_quality;
return price;
}
Price = b + w1*area_in_sqm + w2*has_pool + ...
[LINEAR REGRESSION]
Deep learning and computer vision
Deep learning and computer vision
GIVEN ENOUGH SAMPLES,
A NEURAL NETWORK WILL
FIND THE PATTERN
GIVEN ENOUGH SAMPLES,
A NEURAL NETWORK WILL
FIND THE PATTERN
SEQUENCESPACE
CONVOLUTIONAL NN RECURRENT NNDENSE NN
Deep learning and computer vision
Deep learning and computer vision
[ 0.01949719, 0.09399229, -0.01618082, -0.00876935, 0.03146157, 0.06853894, 0.00096175, -0.06854118, -0.04771797, -0.05296798, 0.02119147, 0.00511259, 0.13726683
INTERMEDIATE REPRESENTATIONEMBEDDINGSECRET CODE
RECOMMEND A BOOK
-1.0 1.0
ADULTYOUTH
ADULT
FICTION
(-0.6, 0.4) *
(0.7, 0.9) *
FICTION ADULT
MATH REFERENCES
US-CENTRIC
CHICK LIT
FUNNY
SCI-FI
LAWYERS
WOULD BRAD PITT PLAY A
CHARACTER IN THE MOVIE?
EMBEDDING
[ 0.01949719, 0.09399229, -0.01618082, -0.00876935, 0.03146157, 0.06853894, 0.00096175, -0.06854118, -0.04771797, -0.05296798, 0.02119147, 0.00511259, 0.1372668
WORD
EMBEDDINGS
[ 0.01949719, 0.09399229, -0.01618082, -0.00876935, 0.03146157, 0.06853894, 0.00096175, -0.06854118, -0.04771797, -0.05296798, 0.02119147, 0.00511259, 0.1372668
[FACENET]
T-SNE
Projection of
128D to 2D
FACE RECOGNITION DEMO
SEGMENTATION
encoder decoder
encoder decoder
encoder decoder
IN PRACTICE
The secrets behind the magic
Deep learning and computer vision
Deep learning and computer vision
Deep learning and computer vision
Deep learning and computer vision
Deep learning and computer vision
Deep learning and computer vision
Time for the Epoch
Training data
Validation data
MODEL LOSS ACCURACY
BASIC 0.2507 91.05%
Deep learning and computer vision
OOPSIE DOOPSIE!
We’re overfitting
Chihuahua the movie
[DATA AUGMENTATION]
Deep learning and computer vision
[DROPOUT]
Deep learning and computer vision
MODEL LOSS ACCURACY
BASIC 0.2507 91.05%
AUGMENTATION 0.1988 93.68%
Deep learning and computer vision
Deep learning and computer vision
Deep learning and computer vision
MODEL LOSS ACCURACY
BASIC 0.2507 91.05%
AUGMENTATION 0.1988 93.68%
TRANSFER LEARN 0.01253 99.47%
Deep learning and computer vision
APPLIED MACHINE LEARNING
When the magic is gone, and we’re left with Software Engineering
UNDERSTAND THE BUSINESS NEEDS
WHAT IS THE PROBLEM?
HOW WILL THE MODEL BE
USED? / REQUIREMENTS
HOW IS IT DONE TODAY?
IS IT FEASIBLE?
ETHICAL CONCERNS
UNDERSTAND THE BUSINESS NEEDS
UNDERSTAND THE
BUSINESS NEEDS
MINE CLEAN EXPLORE
Deep learning and computer vision
UNDERSTAND THE
BUSINESS NEEDS
MINE CLEAN EXPLORE
ENGINEER MODEL DEPLOY
LOTS OF LABLED SAMPLES and
NO CONSEQUENTIAL DECISIONS
SHOTS ON GOAL
LOUD CROWD
GOAL VISIBLE
SPEED/DIRECTION PLAYER DENSITY
PLAYER POSES
SCENE CHANGES
GOAL IN VIEW
NEGATIVE SAMPLING
5S VIDEOS - AROUND ACTION
NEGATIVE SAMPLES FROM ATTACKS
VGG EMBEDDINGS
90+%
MODEL
ACCURACY
GRASS?
GOAL?
SCENE CHANGE
ShotNoShot
https://github.com/tyiannak/pyAudioAnalysis
AUDIO
PEOPLE CLUSTERS - SIZES
https://github.com/fizyr/keras-retinanet
model_path =
'c:/Tess/source/vision_samples/models/resnet50_coco_best_v2.1.0.h5'
model = models.load_model(model_path, backbone_name='resnet50’)
image_path = 'C:/Tess/source/vision_samples/data/images/basket_image.jpg'
image = read_image_bgr(image_path)
image = preprocess_image(image)
image, scale = resize_image(image)
# process image
boxes, scores, labels = model.predict_on_batch(np.expand_dims(image,
axis=0))
from keras_retinanet import models
from keras_retinanet.utils.image import read_image_bgr, preprocess_image,
resize_image
GOAL / NO GOAL
SCENE CHANGE DETECTION
FOCUSED OPTICAL FLOW ON PLAYERS
DETECTING CANCER
VERY FEW POSITIVE SAMPLES
EXTREME ACCURACY NEEDS
POTENTIAL FOR BIAS
HARD TO DIFFERENTIATE
ONLY PARTIALLY LABLED
EXTREMELY LARGE IMAGES
COLOR SEGMENTATION
CONVEX HULL
SHOPLIFTING
VERY FEW POSITIVE SAMPLES
VERY FEW SAMPLES PER ACTION TYPE
VERY SENSITIVE TO BIAS
COVERED FACES
ALONE
MEN 20-40 HOODIES
SHOPLIFTING POSES
MEN 20-40 HOODIESCOVERED FACES
ALONE
12:32:00CHRISTMAS
FISH EYE
DETECT not PREDICT HUD ARTIFACTS
NEGATIVE SAMPLES FROM SAME VIDEOS
PEOPLE SHOPPING
POSE DETECTION
BACKGROUND SUBTRACTION
CLASSIFICATION AT THE BOX LEVEL
A LITTLE
DOMAIN KNOWLEDGE
GOES A LONG WAY
KISSKeep it Simple …
Deep learning and computer vision
DEEP LEARNING IN PRACTICE
Tess Ferrandez – Microsoft - @TessFerrandez

More Related Content

PPTX
A developers guide to machine learning
PPTX
A practical guide to deep learning
PDF
Notes from Coursera Deep Learning courses by Andrew Ng
PDF
Creating awesome value proposition using Value Proposition Canvas
PDF
Conception d’une plateforme web d’e-Commerce au sein d’une entreprise commerc...
PPTX
NLP - Prédictions de tags sur les questions Stackoverflow
PDF
Data Visualization in Data Science
A developers guide to machine learning
A practical guide to deep learning
Notes from Coursera Deep Learning courses by Andrew Ng
Creating awesome value proposition using Value Proposition Canvas
Conception d’une plateforme web d’e-Commerce au sein d’une entreprise commerc...
NLP - Prédictions de tags sur les questions Stackoverflow
Data Visualization in Data Science

What's hot (20)

PDF
AI in Business: Opportunities & Challenges
PPTX
Facenet - Paper Review
PPTX
Introduction to Machine Learning
PDF
AI, Creativity and Generative Art
PPTX
Machine Learning
PPTX
ppt on machine learning to deep learning (1).pptx
PDF
History of AI
PPTX
Notes of AI for everyone - by Andrew Ng
PPTX
Generative AI Use cases for Enterprise - Second Session
PPTX
Artificial Intelligence - Machine Learning Vs Deep Learning
PDF
The age of GANs
PDF
Conversational AI and Chatbot Integrations
PPT
Ai presentation
PPTX
Generative AI, WiDS 2023.pptx
PPTX
Lesson 2 ai in industry
PDF
Introduction to LLMs
PDF
Artificial Intelligence Career In 2023
PPTX
artificial intelligence ppt.pptx
PPTX
ARTIFICIAL INTELLIGENCE
PDF
Time series deep learning
AI in Business: Opportunities & Challenges
Facenet - Paper Review
Introduction to Machine Learning
AI, Creativity and Generative Art
Machine Learning
ppt on machine learning to deep learning (1).pptx
History of AI
Notes of AI for everyone - by Andrew Ng
Generative AI Use cases for Enterprise - Second Session
Artificial Intelligence - Machine Learning Vs Deep Learning
The age of GANs
Conversational AI and Chatbot Integrations
Ai presentation
Generative AI, WiDS 2023.pptx
Lesson 2 ai in industry
Introduction to LLMs
Artificial Intelligence Career In 2023
artificial intelligence ppt.pptx
ARTIFICIAL INTELLIGENCE
Time series deep learning
Ad

Similar to Deep learning and computer vision (20)

PPTX
An Introduction to Deep Learning with Apache MXNet (November 2017)
PPTX
Alessandro Ferrari - Smart City, Mixed Reality, Self-Driving Cars & Neural Co...
PDF
The Art of CTF
PDF
The secret of successful software teams
PDF
The Secret Sauce of Successful Teams
PDF
AlphaZero and beyond: Polygames
PDF
Deep Learning for Developers (October 2017)
PPTX
An introduction to Deep Learning with Apache MXNet (November 2017)
PPTX
Deep Learning with Apache MXNet (September 2017)
PDF
Cisco Connect Ottawa 2018 sixty to zero
PPTX
Semantic Analysis to Compute Personality Traits from Social Media Posts
PDF
Ai business innovator v001
PPTX
machine learning in the age of big data: new approaches and business applicat...
PDF
Finding Bugs, Fixing Bugs, Preventing Bugs - Exploiting Automated Tests to In...
PDF
Announcing NamSorML : AI classifiers for race, ethnicity and migration studies
PPTX
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
PDF
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
PDF
Belfast JUG, SIMD (Vectorial) Operations
PDF
Cisco Connect Toronto 2018 sixty to zero
PPTX
Chaos engineering - The art of breaking stuff in production on purpose
An Introduction to Deep Learning with Apache MXNet (November 2017)
Alessandro Ferrari - Smart City, Mixed Reality, Self-Driving Cars & Neural Co...
The Art of CTF
The secret of successful software teams
The Secret Sauce of Successful Teams
AlphaZero and beyond: Polygames
Deep Learning for Developers (October 2017)
An introduction to Deep Learning with Apache MXNet (November 2017)
Deep Learning with Apache MXNet (September 2017)
Cisco Connect Ottawa 2018 sixty to zero
Semantic Analysis to Compute Personality Traits from Social Media Posts
Ai business innovator v001
machine learning in the age of big data: new approaches and business applicat...
Finding Bugs, Fixing Bugs, Preventing Bugs - Exploiting Automated Tests to In...
Announcing NamSorML : AI classifiers for race, ethnicity and migration studies
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
Belfast JUG, SIMD (Vectorial) Operations
Cisco Connect Toronto 2018 sixty to zero
Chaos engineering - The art of breaking stuff in production on purpose
Ad

More from Tess Ferrandez (11)

PPTX
funwithalgorithms.pptx
PPTX
Debugging .NET apps
PPTX
CSI .net core - debugging .net applications
PPTX
Debugging performance issues, memory issues and crashes in .net applications rev
PPTX
Common asp.net production issues rev
PPTX
Perf by design
PPT
Fun421 stephens
PPTX
C# to python
PPTX
AI and Ethics - We are the guardians of our future
PPTX
My bot has a personality disorder
PPTX
.Net debugging 2017
funwithalgorithms.pptx
Debugging .NET apps
CSI .net core - debugging .net applications
Debugging performance issues, memory issues and crashes in .net applications rev
Common asp.net production issues rev
Perf by design
Fun421 stephens
C# to python
AI and Ethics - We are the guardians of our future
My bot has a personality disorder
.Net debugging 2017

Recently uploaded (20)

PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
additive manufacturing of ss316l using mig welding
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPT
introduction to datamining and warehousing
PPTX
Construction Project Organization Group 2.pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Geodesy 1.pptx...............................................
PDF
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PPT on Performance Review to get promotions
PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PDF
737-MAX_SRG.pdf student reference guides
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
additive manufacturing of ss316l using mig welding
Embodied AI: Ushering in the Next Era of Intelligent Systems
introduction to datamining and warehousing
Construction Project Organization Group 2.pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Foundation to blockchain - A guide to Blockchain Tech
Geodesy 1.pptx...............................................
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Automation-in-Manufacturing-Chapter-Introduction.pdf
Internet of Things (IOT) - A guide to understanding
PPT on Performance Review to get promotions
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
737-MAX_SRG.pdf student reference guides
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf

Deep learning and computer vision