SlideShare a Scribd company logo
Sky is Not the Limit: Semantic-
Aware Sky Replacement
Yi-Hsuan Tsai Xiaohui Shen Zhe Lin Ming-Hsuan YangKalyan Sunkavalli
ACM Transactions on Graphics (SIGGRAPH), 2016
Motivation
Goal: automatically segment and replace with different styles of the sky
Example Results
Example Results
Challenges
• Manually edit sky using Photoshop
5 mins 30 mins
We need a good segmentation algorithm!
Input Image
Reference
Challenges
• Manually edit sky using Photoshop
Input Image
Reference
We need image harmonization!
v.s
Professional editingColors are not matched
System
Input Image
Sky
Segmentation
Reference Images
Sky
Search
Sky
Replacement
Results
Sky Segmentation
Input Image
Sky
Segmentation
Literatures
• Sky/non-sky classifier [Tao et al. SIGGRAPH’09]
• Scene parsing [Long et al. CVPR’15]
• Online refinement [Rother et al. SIGGRAPH’04]
Challenges
• Sky appearance varies widely
• skylines/landscapes, clouds, lighting conditions
• Need accurate sky boundaries
Sky Search
Input Image Reference Images
Sky
Search
Literatures
• GIST [Hays and Efros SIGGRAPH’07, Liu et al. CGF’14]
• Only consider global scene layout
• Need a large database
Challenges
• Search compatible images
• Account for image content
Reference Image 1 Reference Image 2 Reference Image 3
Sky Replacement
Input Image
Literatures
• Global transfer [Reinhard et al. 2001, Tao et al. SIGGRAPH’09]
• Image contents are not considered
• Less realistic results
• Local transfer [Wu et al. CGF’13, Laffont et al. SIGGRAPH’14]
• Boundary artifacts
• Rely on filters for smoothing
Challenges
• Transfer foreground appearance
• Account for image content
Sky
Replacement
Semantic-Aware System
Input Image
Sky
Segmentation
Reference Images
Sky
Search
Sky
Replacement
Results
Fully Convolutional Networks
Fully Convolutional Networks
Scene Parsing
Fg
Road
Building
Sky
Tree
Semantic Response
Sky
. . .
Building Road
Fully Convolutional Networks (FCN)
• End-to-end model
• Pixel-wise segmentation
• Finetune with 11 scene labels
• Semantic response map
[Long et al. CVPR’15]
Sky Segmentation
Input Image
Scene
Parsing
Online
Refinement
Fully Convolutional
Networks
Sky Segmentation
Input Image
Scene
Parsing
Fully Convolutional
Networks
Online
Refinement
Conditional Random Field optimization
• Online models: color, texture
• Semantic response (sky/non-sky)
• Pairwise term: magnitude of gradient
Sky Segmentation
Results
Input Image FCN Results Our Results
Semantic-Aware Sky Replacement (SIGGRAPH 2016)
Results
DeepLab [Chen et al. ICLR’15]
Sky Search
Input Image
Sky Image Database (415 Images)
Sky
Search
Sky Search
Input Image
Reference Images
Semantic Layout Descriptor
• Account for local layouts
• Utilize semantic responses
Sky
Search
Sky Image Database (415 Images)
Sky Search
Input Image
Reference Images
Semantic Layout Descriptor
• Account for local layouts
• Utilize semantic responses
Sky
Search
Check Sky Properties
• Prevent large distortions
• Aspect ratio
• Resolution
• Ensure sky diversity
• Color similarity
Sky Image Database (415 Images)
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
Semantic Responses
• Pixel-wise responses
• Range from 0 to 1
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
Semantic Responses
• Pixel-wise responses
• Range from 0 to 1
Average pooling on spatial pyramids
• Global pooling
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
Semantic Responses
• Pixel-wise responses
• Range from 0 to 1
Average pooling on spatial pyramids
• Global pooling
• Local contents (3x3 grids)
. . .
...
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
. . . . . . . . .
Semantic Layout Descriptor
Input Image
. . .
Sky Building Road
. . . . . . . . .
Descriptor . . .
Sky Replacement
Input Image
Sky
Alignment
Sky Alignment
• Extract complete sky regions from reference
images
• Re-scale and paste on the input image
Reference Images
Sky Replacement
Input Image
Sky
Alignment
Semantic-aware
Transfer
Sky Alignment
• Extract complete sky regions from reference
images
• Re-scale and paste on the input image
Semantic-aware Transfer
• Adjustment foreground appearance
• Account for semantic regions
Reference Images
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
Input image Scene parsing
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
T1 (x)
Input image Scene parsing
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
T2 (x)
T1 (x)
Input image Scene parsing
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
Input image Scene parsing Direct local transfer
T2 (x)
T1 (x)
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
Propose a soft mapping method
• Utilize semantic responses as weights
for each category n
Input image Scene parsing Direct local transfer
T1 (x)
T2 (x)
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
Propose a soft mapping method
• Utilize semantic responses as weights
for each category n
Input image Scene parsing Direct local transfer
T1 (x)
T2 (x)
Semantic-aware Transfer
Direct local transfer [Laffont et al. SIGGRAPH’14]
• Match corresponding semantic regions
• Boundary artifacts
Propose a soft mapping method
• Utilize semantic responses as weights
for each category n
Input image Scene parsing Direct local transfer Soft mapping
Wn (x) = 1 or 0
T1 (x)
T2 (x)
Transfer Functions
Transfer Functions Tn (x) for each category n
• Transfer luminance and color
T1 (x)
T2 (x)
Luminance
• Shift mean
Transfer Functions
Transfer Functions Tn (x) for each category n
• Transfer luminance and color
Color
• Matched regions: chrominance
• Histogram matching [Lee et al. CVPR’16]
• Non-matched regions: color temperature
• Consider entire foreground
• More conservative
Not all the semantic regions are matched!
T1 (x)
T2 (x)
?
Sky Replacement
Results
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Sky Replacement with
User Preference
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Input Image Sky Replacement Results
Input Image
Preferred Sky
Sky Replacement ResultsInput Image
Preferred Sky
Comparisons to Other
Methods
Comparisons of different search methods
Semantic-Aware Sky Replacement (SIGGRAPH 2016)
Comparisons of different transfer methods
Semantic-Aware Sky Replacement (SIGGRAPH 2016)
Limitation
Light reflections
Conclusions
• Automatic sky replacement results can be realistic
• New sky image database
• Semantics helps a lot
• Sky segmentation
• Sky image search
• Appearance transfer
• Apply semantics to other tasks
• Scene completion
• Photo and video re-coloring
Summary of my Other Projects:
Visual Object Recognition
Joint Object Classification and Segmentation [BMVC’13]
• How do segmentation and classification help each other?
Class-specific Object Segmentation Hypotheses [ICCV’13]
• How to utilize exemplars to gain more information
during learning and inference?
Image Retrieval [ICIP’14]
• Compute label similarities to bridge semantic gaps
Exemplar-based Object Detection [CVPR’15]
• Discover representative exemplars to build models
• Region-based feature extraction and model learning
Image (Object) Recognition
• Classification
• Segmentation
• Retrieval
• Detection
Video Object Recognition
• Object (Co-)segmentation
• Scene (Co-)parsing
Video Segmentation via Object Flow [CVPR’16]
• How do segmentation and optical flow help each other?
• Segmentation: multi-scale, spatio-temporal graphical model
• Optical flow: use segmentation to refine boundaries
• Iteratively solve the joint model
Semantic Co-segmentation in Videos (submitted to ECCV’16)
• Temporal-consistent object tracklets
• Relations between objects from a collection of videos
Ongoing and future work
• Scene Parsing via Deep CNNs
• Attention to small objects
• Label co-occurrence
• Video Scene Co-parsing
• Weakly-supervised: video tags
• Use image-based classifier
Object Segmentation
96.4 MCL, 74.4
93.3 PMCut, 59.1
94.4 MCL, 53.083.6 PMCut, 47.3
Object Segmentation
93.5 PMCut, 26.6
89.2 MCL, 65.373.8 PMCut, 58.0
86.9 PMCut, 68.0
Object Detection
Object Detection
Video Object Segmentation
Video Object Segmentation
Segmentation Updated Optical Flow Initial Optical Flow
Joint Object Classification
and Segmentation [BMVC’13] Object Segmentation [ICCV’13]
Image Retrieval [ICIP’14]
Object Detection [CVPR’15]
Video Object Segmentation
[CVPR’16]
Sky Replacement [SIGGRAPH’16]
Semantic Co-segmentation in Videos
(submitted to ECCV’16)
Video Scene Co-parsing (ongoing)
Image (Object) Recognition via Exemplars
• Classification
• Segmentation
• Retrieval
• Detection
Video Object Recognition: Temporal + CNN
• Object (Co-)segmentation
• Scene (Co-)parsing
Image/Video Editing
• Background/Object Replacement
• Scene Completion
• Re-coloring
Semantic Information
My homepage:
https://sites.google.com/site/yihsuantsai/
Thank you!

More Related Content

PDF
DD18 - SEED - Raytracing in Hybrid Real-Time Rendering
PPT
SIGGRAPH 2010 - Style and Gameplay in the Mirror's Edge
PPTX
Past, Present and Future Challenges of Global Illumination in Games
PPTX
Lighting the City of Glass
PPTX
Moving Frostbite to Physically Based Rendering
PDF
presentation644v4
PDF
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
PPTX
Shiny Pixels and Beyond: Real-Time Raytracing at SEED
DD18 - SEED - Raytracing in Hybrid Real-Time Rendering
SIGGRAPH 2010 - Style and Gameplay in the Mirror's Edge
Past, Present and Future Challenges of Global Illumination in Games
Lighting the City of Glass
Moving Frostbite to Physically Based Rendering
presentation644v4
Unreal Summit 2016 Seoul Lighting the Planetary World of Project A1
Shiny Pixels and Beyond: Real-Time Raytracing at SEED

What's hot (20)

PPT
Graphical Objects and Scene Graphs
PPT
Models and architectures
PDF
Online video object segmentation via convolutional trident network
PPT
Advanced Lighting for Interactive Applications
PPTX
Stochastic Screen-Space Reflections
PPT
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
PPTX
Physically Based and Unified Volumetric Rendering in Frostbite
PDF
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
PPTX
A new Post-Processing Pipeline
PPTX
Lucene/Solr spatial in 2015
PPT
Shadow Techniques for Real-Time and Interactive Applications
PDF
Epic_GDC2011_Samaritan
PPT
Hands on Logo Search
PPT
Paris Master Class 2011 - 01 Deferred Lighting, MSAA
PDF
Taking Killzone Shadow Fall Image Quality Into The Next Generation
PPTX
Harpster, J. - Open data on buildings with satellite imagery processing
PDF
Edge detection-based post-processing in Warlords of Draenor
PPTX
2016-01 Lucene Solr spatial in 2015, NYC Meetup
PPT
Soft Shadow Maps for Linear Lights
PPT
The Unique Lighting of Mirror's Edge
Graphical Objects and Scene Graphs
Models and architectures
Online video object segmentation via convolutional trident network
Advanced Lighting for Interactive Applications
Stochastic Screen-Space Reflections
Interactive Stereoscopic Rendering for Non-Planar Projections (GRAPP 2009)
Physically Based and Unified Volumetric Rendering in Frostbite
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
A new Post-Processing Pipeline
Lucene/Solr spatial in 2015
Shadow Techniques for Real-Time and Interactive Applications
Epic_GDC2011_Samaritan
Hands on Logo Search
Paris Master Class 2011 - 01 Deferred Lighting, MSAA
Taking Killzone Shadow Fall Image Quality Into The Next Generation
Harpster, J. - Open data on buildings with satellite imagery processing
Edge detection-based post-processing in Warlords of Draenor
2016-01 Lucene Solr spatial in 2015, NYC Meetup
Soft Shadow Maps for Linear Lights
The Unique Lighting of Mirror's Edge
Ad

Viewers also liked (13)

PDF
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
PDF
Improving Spatial Codification in Semantic Segmentation
PDF
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
PPTX
Semantic Mapping of Road Scenes
PDF
crfasrnn_presentation
PPTX
Semantic Segmentation Methods using Deep Learning
PDF
#6 PyData Warsaw: Deep learning for image segmentation
PDF
Dataset for Semantic Urban Scene Understanding
PPTX
Deep learning intro
PDF
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
PDF
Semantic segmentation
PDF
Deep Learning for Computer Vision: Segmentation (UPC 2016)
PDF
Efficient exploration of region hierarchies for semantic segmentation
(Semantic Web Technologies and Applications track) "A Quantitative Comparison...
Improving Spatial Codification in Semantic Segmentation
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
Semantic Mapping of Road Scenes
crfasrnn_presentation
Semantic Segmentation Methods using Deep Learning
#6 PyData Warsaw: Deep learning for image segmentation
Dataset for Semantic Urban Scene Understanding
Deep learning intro
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
Semantic segmentation
Deep Learning for Computer Vision: Segmentation (UPC 2016)
Efficient exploration of region hierarchies for semantic segmentation
Ad

Similar to Semantic-Aware Sky Replacement (SIGGRAPH 2016) (20)

PDF
lecture_16_jiajun.pdf
PDF
PPT s11-machine vision-s2
PDF
視訊訊號處理與深度學習應用
PDF
NetVLAD: CNN architecture for weakly supervised place recognition
PDF
Knowing when to look
PPTX
Evolving a Medical Image Similarity Search
PPTX
Cross-Modal Scene Understanding presntation
PPTX
OpenGL basics
PPT
Multi-view Stereo and Structure from Motion
PPTX
Using Deep Learning to Derive 3D Cities from Satellite Imagery
PDF
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
PDF
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
PPT
affine transformation for computer graphics
PDF
Video Object Segmentation in Videos
PPTX
OpenGL - Bringing the 3D World into the Android
PDF
Content Based Image Retrieval
PPTX
Lec11 single view-converted
PPTX
Optical Computing for Fast Light Transport Analysis
PPTX
2D Transformation.pptx
PDF
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
lecture_16_jiajun.pdf
PPT s11-machine vision-s2
視訊訊號處理與深度學習應用
NetVLAD: CNN architecture for weakly supervised place recognition
Knowing when to look
Evolving a Medical Image Similarity Search
Cross-Modal Scene Understanding presntation
OpenGL basics
Multi-view Stereo and Structure from Motion
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Semantic Retrieval and Automatic Annotation: Linear Transformations, Correlat...
affine transformation for computer graphics
Video Object Segmentation in Videos
OpenGL - Bringing the 3D World into the Android
Content Based Image Retrieval
Lec11 single view-converted
Optical Computing for Fast Light Transport Analysis
2D Transformation.pptx
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)

Recently uploaded (20)

PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
Construction Project Organization Group 2.pptx
DOCX
573137875-Attendance-Management-System-original
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPT
Project quality management in manufacturing
PPTX
Artificial Intelligence
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Well-logging-methods_new................
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PDF
PPT on Performance Review to get promotions
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Current and future trends in Computer Vision.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Embodied AI: Ushering in the Next Era of Intelligent Systems
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Construction Project Organization Group 2.pptx
573137875-Attendance-Management-System-original
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Project quality management in manufacturing
Artificial Intelligence
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Well-logging-methods_new................
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
PPT on Performance Review to get promotions
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
UNIT 4 Total Quality Management .pptx
Current and future trends in Computer Vision.pptx

Semantic-Aware Sky Replacement (SIGGRAPH 2016)