SlideShare a Scribd company logo
Many-Objective Reinforcement Learning for Online
Testing of DNN-Enabled Systems
Fitash Ul Haq1, Donghwan Shin3, Lionel Briand1,2
1) University of Luxembourg
2) University of Ottawa
3) University of Sheffield
Date: 19th May 2023
2
Introduction
DNN-Enabled System (DADS):
Composed of multiple DNNs capable of various tasks as object tracking, object
classification, traffic light detection and traffic sign detection
Self-Driving Cars Autonomous Drones
3
Cannot Find
Safety Violation Online Testing
Can Find
Testing DNN-Enabled System (DADS)
Offline Testing
Introduction
4
Challenges for Online Testing:
To address the challenges, we propose MORLOT (Many-Objective Reinforcement
Learning for Online Testing) by leveraging many-objective search and
Reinforcement Learning (RL)
Static
Environment
Many Safety
Requirements
Large Input
Space
3rd Party DNNs
Introduction
5
Key Idea
Many-Objective Search
Reinforcement Learning
(Q-learning) Environment
Action (e.g,
increase speed)
Fitness Values
(rewards)
Reward (e.g, distance)
Action (e.g,
increase speed)
6
Many-Objective Reinforcement Learning for Online Testing
MORLOT: Overview
Objectives
(i.e., reqs),
Q-tables
Observe
State
Choose Action
Randomly or using
Many-Objective Search
Reset
Environment
Many-Objective Reinforcement Learning for Online Testing
7
Choose Action using Many-Objective Search
Actions are chosen based on the objective that is closest to being
satisfied
O1
O2
O3
QO_1
QO_2
QO_3
Select Action
F
_
v
=
0
.
3
F_v = 0.7
F
_
v
=
0
.
5
Action from Q table of O2
(e.g., Decrease speed of VIF)
VIF: Vehicle-In-Front
8
MORLOT: Overview
Objectives
(i.e., reqs),
Q-tables
Many-Objective Reinforcement Learning for Online Testing
Observe
State
Choose Action
Randomly or using
Many-Objective Search
Reset
Environment
Take Action/
Receive
Rewards
Update Q-
tables/
Archive
Minimal Test Suite
9
Research Questions
How do alternative
approaches fare in terms of
test effectiveness?
How do alternative
approaches fare in terms of
test efficiency?
RQ1
RQ2
10
• Transfuser (DNN-enabled
ADS):
Highest rank publicly available
DNN-enabled ADS in the CARLA
Autonomous Driving
Leaderboard Sensors Track
Case Study Subject
Transfuser
• CARLA (Simulator):
Open-source simulator based on
the Unreal Engine designed to
support training, development,
and validation of ADS
CARLA Simulator
11
Safety and Functional Requirements
We use the following six safety and
functional requirements:
• R1: EV should not go out of lane;
• R2: EV should not collide with other
vehicles;
EV
EV: Ego Vehicle
12
Safety and Functional Requirements
We use the following six safety and
functional requirements:
• R1: EV should not go out of lane;
• R2: EV should not collide with other
vehicles;
• R3: EV should not collide with
pedestrians;
• R4: EV should not collide with
static meshes (i.e., traffic lights, traffic
signs etc.);
• R5: EV should reach its destination in
defined time budget;
EV: Ego Vehicle
EV
13
Safety and Functional Requirements
We use the following six safety and
functional requirements:
• R1: EV should not go out of lane;
• R2: EV should not collide with other
vehicles;
• R3: EV should not collide with
pedestrians;
• R4: EV should not collide with
static meshes (i.e., traffic lights, traffic
signs etc.);
• R5: EV should reach its destination in
defined time budget;
• R6: EV should not violate traffic lights.
EV: Ego Vehicle
EV
14
State and Action Definition
State Contain information
about:
• Ego Vehicle (EV)
• Vehicle in Front (VIF)
• Pedestrians
• Weather Conditions
• Fog Conditions and
• Lighting conditions
EV
VIF
15
State and Action Definition
EV: Ego Vehicle,
VIF: Vehicle-In-Front
Possible Actions
• Changing throttle of Vehicle in
Front (VIF)
• Changing steering of Vehicle in
Front (VIF)
• Changing pedestrian movement
• Changing weather
• Changing fog
• Changing lighting
EV
VIF
16
RQ1: Test Effectiveness
Online Testing of
DADS
Test Suite
Effectiveness
(TSE)
§ 4 hours
§ 10 repetitions
§ Three
environments
§ Straight
§ Left-Turn
§ Right-Turn
Algorithms
MORLOT
MOSA
FITEST
Random
Search (RS)
𝑇𝑆𝐸 =
# 𝑜𝑓 𝑆𝑎𝑓𝑒𝑡𝑦 𝑣𝑖𝑜𝑙𝑎𝑡𝑖𝑜𝑛𝑠
# 𝑜𝑓 𝑆𝑎𝑓𝑒𝑡𝑦 𝑅𝑒𝑞𝑢𝑖𝑟𝑒𝑚𝑒𝑛𝑡𝑠
17
RQ1: Results
MORLOT is significantly more effective than random search and
alternative many-objective search approaches tailored for test suite
generation.
18
RQ2: Test Efficiency
Online Testing of
DADS
Test Efficiency
§ 4 hours
§ 10 repetitions
§ Three
environments
§ Straight
§ Left-Turn
§ Right-Turn
§ Average TSE achieved by
10 runs
§ 20 minutes interval
Algorithms
MORLOT
MOSA
FITEST
Random
Search (RS)
19
RQ2: Results
MORLOT is significantly more efficient than random search and alternative approaches.
For any given budget, MORLOT achieves a significantly higher average TSE, and this
difference keeps increasing over time.
20
Conclusion
Many-Objective Reinforcement Learning for Online
Testing of DNN-Enabled Systems
Fitash Ul Haq1, Donghwan Shin3, Lionel Briand1,2
1) University of Luxembourg
2) University of Ottawa
3) University of Sheffield
Date: 19th May 2023

More Related Content

PDF
Backjin lee autonomous vehicle slideshare
PDF
2018.07 Self-driving cars meetup
PDF
Presentation on National Road Safety.pdf
PDF
yolov3-4-5.pdf
PDF
Future Research Directions in Military Ground Vehicle Mobility
PPTX
Light Tactical All-terrain Vehicle (LTATV) Crew Augmentation
PDF
Deep Learning for Autonomous Vehicle Innovation Insights from Patents
PDF
Automated Testing of Autonomous Driving Assistance Systems
Backjin lee autonomous vehicle slideshare
2018.07 Self-driving cars meetup
Presentation on National Road Safety.pdf
yolov3-4-5.pdf
Future Research Directions in Military Ground Vehicle Mobility
Light Tactical All-terrain Vehicle (LTATV) Crew Augmentation
Deep Learning for Autonomous Vehicle Innovation Insights from Patents
Automated Testing of Autonomous Driving Assistance Systems

Similar to Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems (20)

PPT
Rosenbloom simulator
PPT
Commercial Drone Best Practices: How to Incorporate Data and Job Specs
PDF
AUTONOMOUS VEHICLES 2.pdf
PDF
Slides SDC Controls
PDF
Final 360 stakeholder_presentation_110515
DOCX
ROAD The Road Event Awareness Dataset for Autonomous Driving.docx
PPTX
Autonomous Industry Feedback
PPTX
PDF
Verification of IVI Over-The-Air using UML/OCL
PPTX
HARA ISO 26262: What is HARA and Why is it Required?
PPTX
Smart Roadside Initiative
 
PPTX
CarSafe (MobiSys 2013)
 
PDF
VSD_paper_Publish
PDF
Review of Environment Perception for Intelligent Vehicles
PDF
Autonomy Incubator Seminar Series: Tractable Robust Planning and Model Learni...
PPTX
OWASP A&D Project
PDF
Validation Framework for Autonomous Aerial Vehicles
PDF
VEHICLE DETECTION USING YOLO V3 FOR COUNTING THE VEHICLES AND TRAFFIC ANALYSIS
PDF
2015 HPVC West - Innovation Report - Cal Poly San Luis Obispo
PDF
HERE Mobility SDK Tutorial
Rosenbloom simulator
Commercial Drone Best Practices: How to Incorporate Data and Job Specs
AUTONOMOUS VEHICLES 2.pdf
Slides SDC Controls
Final 360 stakeholder_presentation_110515
ROAD The Road Event Awareness Dataset for Autonomous Driving.docx
Autonomous Industry Feedback
Verification of IVI Over-The-Air using UML/OCL
HARA ISO 26262: What is HARA and Why is it Required?
Smart Roadside Initiative
 
CarSafe (MobiSys 2013)
 
VSD_paper_Publish
Review of Environment Perception for Intelligent Vehicles
Autonomy Incubator Seminar Series: Tractable Robust Planning and Model Learni...
OWASP A&D Project
Validation Framework for Autonomous Aerial Vehicles
VEHICLE DETECTION USING YOLO V3 FOR COUNTING THE VEHICLES AND TRAFFIC ANALYSIS
2015 HPVC West - Innovation Report - Cal Poly San Luis Obispo
HERE Mobility SDK Tutorial
Ad

More from Lionel Briand (20)

PDF
LTM: Scalable and Black-box Similarity-based Test Suite Minimization based on...
PDF
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
PDF
Automated Test Case Repair Using Language Models
PDF
Automated Testing and Safety Analysis of Deep Neural Networks
PDF
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categorie...
PDF
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
PDF
Precise and Complete Requirements? An Elusive Goal
PDF
Large Language Models for Test Case Evolution and Repair
PDF
Metamorphic Testing for Web System Security
PDF
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
PDF
Fuzzing for CPS Mutation Testing
PDF
Data-driven Mutation Analysis for Cyber-Physical Systems
PDF
ATM: Black-box Test Case Minimization based on Test Code Similarity and Evolu...
PDF
Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction ...
PDF
PRINS: Scalable Model Inference for Component-based System Logs
PDF
Revisiting the Notion of Diversity in Software Testing
PDF
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
PDF
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
PDF
Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...
PDF
Reinforcement Learning for Test Case Prioritization
LTM: Scalable and Black-box Similarity-based Test Suite Minimization based on...
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
Automated Test Case Repair Using Language Models
Automated Testing and Safety Analysis of Deep Neural Networks
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categorie...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Precise and Complete Requirements? An Elusive Goal
Large Language Models for Test Case Evolution and Repair
Metamorphic Testing for Web System Security
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Fuzzing for CPS Mutation Testing
Data-driven Mutation Analysis for Cyber-Physical Systems
ATM: Black-box Test Case Minimization based on Test Code Similarity and Evolu...
Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction ...
PRINS: Scalable Model Inference for Component-based System Logs
Revisiting the Notion of Diversity in Software Testing
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...
Reinforcement Learning for Test Case Prioritization
Ad

Recently uploaded (20)

PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
CCleaner Pro 6.38.11537 Crack Final Latest Version 2025
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Cost to Outsource Software Development in 2025
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PDF
Autodesk AutoCAD Crack Free Download 2025
PDF
iTop VPN 6.5.0 Crack + License Key 2025 (Premium Version)
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Salesforce Agentforce AI Implementation.pdf
PPTX
Monitoring Stack: Grafana, Loki & Promtail
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by AndrĂŠ Kraus
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
DOCX
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
PDF
medical staffing services at VALiNTRY
PPTX
Patient Appointment Booking in Odoo with online payment
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Complete Guide to Website Development in Malaysia for SMEs
PDF
Designing Intelligence for the Shop Floor.pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
CCleaner Pro 6.38.11537 Crack Final Latest Version 2025
Wondershare Filmora 15 Crack With Activation Key [2025
wealthsignaloriginal-com-DS-text-... (1).pdf
Cost to Outsource Software Development in 2025
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
Autodesk AutoCAD Crack Free Download 2025
iTop VPN 6.5.0 Crack + License Key 2025 (Premium Version)
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Salesforce Agentforce AI Implementation.pdf
Monitoring Stack: Grafana, Loki & Promtail
T3DD25 TYPO3 Content Blocks - Deep Dive by AndrĂŠ Kraus
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Greta — No-Code AI for Building Full-Stack Web & Mobile Apps
medical staffing services at VALiNTRY
Patient Appointment Booking in Odoo with online payment
Design an Analysis of Algorithms II-SECS-1021-03
Complete Guide to Website Development in Malaysia for SMEs
Designing Intelligence for the Shop Floor.pdf

Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems

  • 1. Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems Fitash Ul Haq1, Donghwan Shin3, Lionel Briand1,2 1) University of Luxembourg 2) University of Ottawa 3) University of Sheffield Date: 19th May 2023
  • 2. 2 Introduction DNN-Enabled System (DADS): Composed of multiple DNNs capable of various tasks as object tracking, object classification, traffic light detection and traffic sign detection Self-Driving Cars Autonomous Drones
  • 3. 3 Cannot Find Safety Violation Online Testing Can Find Testing DNN-Enabled System (DADS) Offline Testing Introduction
  • 4. 4 Challenges for Online Testing: To address the challenges, we propose MORLOT (Many-Objective Reinforcement Learning for Online Testing) by leveraging many-objective search and Reinforcement Learning (RL) Static Environment Many Safety Requirements Large Input Space 3rd Party DNNs Introduction
  • 5. 5 Key Idea Many-Objective Search Reinforcement Learning (Q-learning) Environment Action (e.g, increase speed) Fitness Values (rewards) Reward (e.g, distance) Action (e.g, increase speed)
  • 6. 6 Many-Objective Reinforcement Learning for Online Testing MORLOT: Overview Objectives (i.e., reqs), Q-tables Observe State Choose Action Randomly or using Many-Objective Search Reset Environment Many-Objective Reinforcement Learning for Online Testing
  • 7. 7 Choose Action using Many-Objective Search Actions are chosen based on the objective that is closest to being satisfied O1 O2 O3 QO_1 QO_2 QO_3 Select Action F _ v = 0 . 3 F_v = 0.7 F _ v = 0 . 5 Action from Q table of O2 (e.g., Decrease speed of VIF) VIF: Vehicle-In-Front
  • 8. 8 MORLOT: Overview Objectives (i.e., reqs), Q-tables Many-Objective Reinforcement Learning for Online Testing Observe State Choose Action Randomly or using Many-Objective Search Reset Environment Take Action/ Receive Rewards Update Q- tables/ Archive Minimal Test Suite
  • 9. 9 Research Questions How do alternative approaches fare in terms of test effectiveness? How do alternative approaches fare in terms of test efficiency? RQ1 RQ2
  • 10. 10 • Transfuser (DNN-enabled ADS): Highest rank publicly available DNN-enabled ADS in the CARLA Autonomous Driving Leaderboard Sensors Track Case Study Subject Transfuser • CARLA (Simulator): Open-source simulator based on the Unreal Engine designed to support training, development, and validation of ADS CARLA Simulator
  • 11. 11 Safety and Functional Requirements We use the following six safety and functional requirements: • R1: EV should not go out of lane; • R2: EV should not collide with other vehicles; EV EV: Ego Vehicle
  • 12. 12 Safety and Functional Requirements We use the following six safety and functional requirements: • R1: EV should not go out of lane; • R2: EV should not collide with other vehicles; • R3: EV should not collide with pedestrians; • R4: EV should not collide with static meshes (i.e., traffic lights, traffic signs etc.); • R5: EV should reach its destination in defined time budget; EV: Ego Vehicle EV
  • 13. 13 Safety and Functional Requirements We use the following six safety and functional requirements: • R1: EV should not go out of lane; • R2: EV should not collide with other vehicles; • R3: EV should not collide with pedestrians; • R4: EV should not collide with static meshes (i.e., traffic lights, traffic signs etc.); • R5: EV should reach its destination in defined time budget; • R6: EV should not violate traffic lights. EV: Ego Vehicle EV
  • 14. 14 State and Action Definition State Contain information about: • Ego Vehicle (EV) • Vehicle in Front (VIF) • Pedestrians • Weather Conditions • Fog Conditions and • Lighting conditions EV VIF
  • 15. 15 State and Action Definition EV: Ego Vehicle, VIF: Vehicle-In-Front Possible Actions • Changing throttle of Vehicle in Front (VIF) • Changing steering of Vehicle in Front (VIF) • Changing pedestrian movement • Changing weather • Changing fog • Changing lighting EV VIF
  • 16. 16 RQ1: Test Effectiveness Online Testing of DADS Test Suite Effectiveness (TSE) § 4 hours § 10 repetitions § Three environments § Straight § Left-Turn § Right-Turn Algorithms MORLOT MOSA FITEST Random Search (RS) 𝑇𝑆𝐸 = # 𝑜𝑓 𝑆𝑎𝑓𝑒𝑡𝑦 𝑣𝑖𝑜𝑙𝑎𝑡𝑖𝑜𝑛𝑠 # 𝑜𝑓 𝑆𝑎𝑓𝑒𝑡𝑦 𝑅𝑒𝑞𝑢𝑖𝑟𝑒𝑚𝑒𝑛𝑡𝑠
  • 17. 17 RQ1: Results MORLOT is significantly more effective than random search and alternative many-objective search approaches tailored for test suite generation.
  • 18. 18 RQ2: Test Efficiency Online Testing of DADS Test Efficiency § 4 hours § 10 repetitions § Three environments § Straight § Left-Turn § Right-Turn § Average TSE achieved by 10 runs § 20 minutes interval Algorithms MORLOT MOSA FITEST Random Search (RS)
  • 19. 19 RQ2: Results MORLOT is significantly more efficient than random search and alternative approaches. For any given budget, MORLOT achieves a significantly higher average TSE, and this difference keeps increasing over time.
  • 21. Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems Fitash Ul Haq1, Donghwan Shin3, Lionel Briand1,2 1) University of Luxembourg 2) University of Ottawa 3) University of Sheffield Date: 19th May 2023