SlideShare a Scribd company logo
Quantifying Regional Error in Surrogates by Modeling its 
Relationship with Sample Density 
Ali Mehmani, Souma Chowdhury , Jie Zhang, Weiyang Tong, 
and Achille Messac 
Syracuse University, Department of Mechanical and Aerospace Engineering 
54th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and 
Materials Conference 
April 8-11, 2013, Boston, Massachusetts
Surrogate model 
• Surrogate models are commonly used for providing a tractable and 
inexpensive approximation of the actual system behavior in many 
routine engineering analysis and design activities: 
2
3 
Broad Research Question 
Lower Fidelity How Low? 
Structural Blade Design 
ANSYS, Inc. 
Surrogate Model 
Expensive Inexpensive
4 
Broad Research Question 
How to quantify the level of the surrogate accuracy ? 
 further improvement of the surrogate, 
 domain exploration, 
 assessing the reliability of the optimal design, 
 quantifying the uncertainty associated with the surrogate, 
 construction of a weighted surrogate model, and 
 …
Research Objective 
 Develop a reliable method to quantify the surrogate error, 
5 
 This method should have the following characteristics: 
 model independent 
 no additional system evaluations 
 local/global error measurement 
 quantify the error of the actual surrogate
Regional Error Estimation of Surrogate 
6
Regional Error Estimation of Surrogate 
7 
(REES)
Presentation Outline 
• Review of surrogate model error measurement methods 
8 
• Relation of surrogate accuracy with sample density 
• Regional Error Estimation of Surrogate 
• Numerical examples: benchmark and an engineering 
design problems
Presentation Outline 
• Review of surrogate model error measurement methods 
9 
• Relation of surrogate accuracy with sample density 
• Regional Error Estimation of Surrogate 
• Numerical examples: benchmark and an engineering 
design problems
Surrogate Model Error Measurement Methods 
 Error quantification methods can be classified, 
 based on their computational expense, into methods that require 
additional data, and methods that use existing data. 
 based on the region of interest, into: 
- Global error measure 
(e.g., split sample, cross-validation, Akaike’s information criterion, and 
bootstrapping). 
- Local or point-wise error measure 
(e.g., the mean squared errors for Kriging and the linear reference model 
(LRM).) 
10
Surrogate Model Error Measurement Methods 
 Error metrics, 
11 
• The mean squared error (MSE) (or root mean square error (RMSE) ) 
• The maximum absolute error (MAE) 
• The relative absolute error (RAE) 
actual values on ith test point 
predicted values on ith test point
Surrogate Model Error Measurement Methods 
 Error metrics, 
• The prediction sum of square (PRESS) is based on the leave-one-out 
12 
cross-validation error 
• The root mean square of PRESS (PRESSRMS) based on the k-fold cross-validation. 
• The relative absolute error of cross-validation (RAECV) based on leave-one- 
out approach.
Presentation Outline 
• Review of surrogate model error measurement methods 
13 
• Relation of surrogate accuracy with sample density 
• Regional Error Estimation of Surrogate 
• Numerical examples: benchmark and an engineering 
design problems
Methodology: Concept 
14 
Model accuracy ∝ Available resources 
In general, this concept can be applied for different methodologies 
- Surrogate modeling, 
- Finite Element Analysis, and 
- ...
Methodology: Concept 
15 
 Finite Element Analysis (numerical methods) 
The finer mesh, the stresses are more precise due to the larger 
number of elements 
coarse mesh 
(4 solid brick element) 
medium mesh 
(32 solid brick element) 
fine mesh 
(256 solid brick element) 
Estimate total shear force and flexural moment at vertical 
sections using Finite Element Analysis.
Methodology: Concept 
 Surrogate (mathematical model) 
Surrogate accuracy generally improves with increasing training points. 
3 training points 
9 training points 
7 training points 
The location of additional points has 
strong impact on surrogate accuracy. 
This impact is highly problem and model 
dependent. 
16
Presentation Outline 
• Review of surrogate model error measurement methods 
17 
• Relation of surrogate accuracy with sample density 
• Regional Error Estimation of Surrogate 
• Numerical examples: benchmark and an engineering 
design problems
Methodology: REES 
 The REES method formulates the variation of error as a 
18 
function of training points using intermediate surrogates. 
 This formulation is used to predict the level of error in a 
final surrogate.
Methodology: REES 
Step 1 : Generation of sample data 
The entire set of sample points is represented by 푿 . 
Step 2 : Identification of sample points inside/outside region of interest 
{푿풊풏} 
{푿풐풖풕} 
푿 = 푿풊풏 + 푿풐풖풕 
푿풊풏 : Inside-region data set 
푿풐풖풕 : Outside-region data set 
user-defined region of interest 
19
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
Outside-Training region Point 
point 
Inside-region point 
user-defined region 
of interest 
First Iteration : 
Test Point 
Second iteration : 
Third iteration : 
Final Surrogate : 
Training Point
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
 A position of sample points which are selected as training 
points, at each iteration, is critical to the surrogate accuracy. 
 The proposed error measure should be minimally sensitive to 
the location of the test points at each iteration. 
21 
 Intermediate surrogates are 
iteratively constructed (at each 
iteration) over a sample set 
comprising all samples outside the 
region of interest and heuristic 
subsets of samples inside the region 
of interest.
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
 The intermediate subset for each 
 The number of iterations (푁푖푡) is defined 
combination at specific iteration is defined 
by 
{휷풌} ⊂ 푿풊풏 
#{휷풌} = 풏풕, 풏풕−ퟏ < 풏풕 
풌 = 1,2, … , 퐾푡 
- dimension of a problem, 
- number of inside sample points, and 
- preference of the user 
 The intermediate training points and test 
points for each combination at each 
iteration is defined by 
 The number of sample combinations 
(푲풕 ) is defined, 
푿푻푹 = 푿풐풖풕 + 휷풌 
푿푻푬 = 푿 − 푿푻푹 
 The intermediate surrogates 
푓푘 , 풌 = ퟏ, ퟐ, . . , 푲풕 
are constructed for all combinations using the 
intermediate training points ( 푿푻푹 ), and are 
tested over the intermediate test points ( 푿푻푬 ).
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
 The median and the maximum errors are 
estimated for each combination 
풎풕: the number of test points in tth iteration 
풆: the RAE value estimated on intermediate test points 
23
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
 The median and the maximum errors are 
estimated for each combination 
The median is a useful measures of central 
tendency which is less vulnerable to outliers. 
24 
Median error 
Overall Fidelity Information 
Maximum error 
Minimum Fidelity Information
Methodology: REES 
Step 3 : Estimation of the variation of the error with sample density 
 Probabilistic models are developed using 
The mode of distribution is selected to 
represent the errors at each iteration. 
a lognormal distribution to represent 
median and maximum errors estimated 
over all 푲풕 combinations at each 
iterations. 
Mode of median error distribution 
Mode of maximum error distribution 
 These values are used to relate the 
variation of the surrogate error with 
number of training points (sample 
density).
The relation of the error with sample density 
 12-D Test Problem (Dixon & Price, n=12) 
Number of sample points # 푿 = ퟓퟓퟎ, Number of inside sample points # 푿풊풏 = # 푿 
Number of training points at each iteration,풏풕 = 5푡 + 50, 푡 = 1,2, … , 70 
Number of sample combination, 푲풕 = 500 
Number of Training Points 
MOmax 
# 푿푻푹 = ퟓퟓ 
# 푿푻푬 = ퟒퟒퟓ 
Number of Training Points 
Estimated mode of median errors Estimated mode of maximum errors 
MOmed 
First iteration 
Last iteration 
# 푿푻푹 = ퟒퟎퟎ 
# 푿푻푬 = ퟏퟎퟎ
The relation of the error with sample density 
 12-D Test Problem (Dixon & Price, n=12) 
Number of Training Points 
Meanmean 
Number of Training Points 
Estimated mode of median errors Estimated mean of mean errors 
MOmed 
REES Method Normalized k-fold CV
Methodology: REES 
Step 4 : Prediction of regional error in the final surrogate 
 The final surrogate model is constructed using the full set of training data. 
 Regression models are applied to relate 
- the statistical mode of the median error distribution(푴풐풎풆풅) 
- the statistical mode of the maximum error distributions(푴풐풎풂풙), and 
- the absolute maximum error (푨푩푺풎풂풙) 
at each iteration to the size of the inside-region training points (nt), 
 These regression models are called the variation of error with sample density 
(VESD). 
The regression models are used to predict the level of the 
error in the final surrogate within the region of interest. 
28
Methodology: REES 
Modeling the Variation of Regional Error with Training Point Density 
 In this study, three types of the regression functions are used to represent 
the variation of regional error with respect to the inside-region training points 
Exponential regression model 
Multiplicative regression model 
Linear regression model 
 The choice of these functions assume a smooth monotonic decrease of the 
regional error with the training point density within that region. 
 The root mean squared error metric is used to select the best-fit regression 
model 
29
Presentation Outline 
• Review of surrogate model error measurement methods 
30 
• Relation of surrogate accuracy with sample density 
• Regional Error Estimation of Surrogate 
• Numerical examples: benchmark and an engineering 
design problems
Numerical Examples 
 The effectiveness of the REES method is explored for applications with 
- Kriging, 
- Radial Basis Functions (RBF), 
- Extended Radial Basis Functions (E-RBF), and 
- Quadratic Response Surface (QRS). 
 To evaluate practical and numerical efficiencies of the REES method, 
three benchmark problems and an engineering design problem are tested. 
 The error evaluated using REES, and the relative absolute error given by 
leave-one-out cross-validation (푹푨푬풄풗) are compared with the actual 
error evaluated using relative absolute error on additional test 
points (푹푨푬풂풄풕풖풂풍). 
31
Median of RAEs 
Numerical Examples 
Results and Discussion 
VESD regression models within the region of interest of surrogate models 
constructed for the Branin-Hoo Function to predict, 
Distribution of 
median errors 
Mode of the median error 
distribution, 
Predicted mode of median error 
in the final surrogate, 
VESDmed 
Number of Inside-region Training Points 
32
Numerical Examples 
Results and Discussion 
VESD regression models 
within the region of interest of 
surrogate models constructed 
for the Branin-Hoo Function 
to predict, 
Type and coefficients of 
VESDmed 
Kriging RBF 
E-RBF QRS
Maximum of RAEs 
Numerical Examples 
Results and Discussion 
VESD regression models within the region of interest of surrogate models constructed for the 
Branin-Hoo Function to predict the mode of maximum ( ) and the absolute 
maximum ( ) error. 
Distribution of 
maximum errors 
Mode of the maximum 
error distribution, 
Absolute maximum error 
Predicted absolute 
maximum error in 
the final surrogate 
Predicted mode of 
maximum error in 
the final surrogate 
34 Number of Inside-region Training Points
Numerical Examples 
Results and Discussion 
VESD regression models within the region of interest of surrogate models constructed for the 
Branin-Hoo Function to predict the mode of maximum ( ) and the absolute 
maximum ( ) error. 
Type and coefficients of VESDmax 
Type and coefficients of VESDABS 
Kriging RBF 
E-RBF QRS
Numerical Examples 
Wind Farm Power Generation 
Surrogates are developed using Kriging, RBF, E-RBF, and QRS to 
represent the power generation of an array-like wind farm. 
36
Numerical Examples 
Results and Discussion 
VESD regression models in different surrogates for the wind farm power 
generation problem 
It. 1 It. 2 It. 3 It. 4 Predicted Error 
37
Numerical Examples 
38 
Results and Discussion 
The closer to one, the better the corresponding error measure. 
predicted mode of median errors 
median of RAEs evaluated on test 
points 
median of relative absolute 
errors of cross-validation
Concluding Remarks 
 We developed a new method to quantify surrogate error based on the 
hypothesis that: 
“The accuracy of the approximation model is related to the amount 
of available resources” 
 This relationship can be reliably quantified when the error measures is 
less sensitive to sample locations or a type of application. 
 The REES method addresses this issue. 
 The preliminary results on benchmark and wind farm power generation 
problems indicate that in majority of cases the REES method is more 
accurate than other measures. 
39 
It is not possible using any existing methods
Future Works 
 The scope for improvement the method 
 The implementation of the proposed error measurement in 
surrogate developments. 
40
Acknowledgement 
41 
 I would like to acknowledge my research adviser 
Prof. Achille Messac, and my co-adviser Prof. 
Souma Chowdhury for their immense help and 
support in this research. 
 Support from the NSF Awards is also acknowledged.
42 
Thank you 
Questions 
and 
Comments
Median of RAEs 
Numerical Examples 
Results and Discussion 
VESD regression models within the region of interest of surrogate models 
constructed for the Branin-Hoo Function to predict, 
Distribution of 
median errors 
k-fold CV 
Mode of the median error 
distribution, 
Predicted mode of median error 
in the final surrogate, 
VESDmed 
Number of Inside-region Training Points 
43 
Mean mean 
Number of Inside-region Training Points

More Related Content

PPTX
PEMF-1-MAO2012-Ali
PPTX
COSMOS1_Scitech_2014_Ali
PPTX
ModelSelection1_WCSMO_2013_Ali
PPTX
AIAA-SciTech-ModelSelection-2014-Mehmani
PPT
DSUS_MAO_2012_Jie
PPTX
AIAA-Aviation-2015-Mehmani
PPTX
ASS_SDM2012_Ali
PPTX
AMS_Aviation_2014_Ali
PEMF-1-MAO2012-Ali
COSMOS1_Scitech_2014_Ali
ModelSelection1_WCSMO_2013_Ali
AIAA-SciTech-ModelSelection-2014-Mehmani
DSUS_MAO_2012_Jie
AIAA-Aviation-2015-Mehmani
ASS_SDM2012_Ali
AMS_Aviation_2014_Ali

What's hot (19)

PPTX
Orthogonal array
PPTX
COSMOS-ASME-IDETC-2014
PPTX
Cross-validation aggregation for forecasting
PDF
Lecture7 cross validation
PDF
Classification using L1-Penalized Logistic Regression
PPT
Orthogonal array approach a case study
PDF
Performance Evaluation for Classifiers tutorial
PPT
Taguchi design of experiments nov 24 2013
PDF
Performance analysis of logic
PPTX
Sampling techniques 2
PPT
Orthogonal array testing
PDF
IDA 2015: Efficient model selection for regularized classification by exploit...
PDF
Sampling Theory Part 3
PDF
Essentials of monte carlo simulation
PPT
Quality of Analytical Procedures
PPTX
Unit 6 input modeling
PDF
Sampling Theory Part 2
PDF
Sampling theory
Orthogonal array
COSMOS-ASME-IDETC-2014
Cross-validation aggregation for forecasting
Lecture7 cross validation
Classification using L1-Penalized Logistic Regression
Orthogonal array approach a case study
Performance Evaluation for Classifiers tutorial
Taguchi design of experiments nov 24 2013
Performance analysis of logic
Sampling techniques 2
Orthogonal array testing
IDA 2015: Efficient model selection for regularized classification by exploit...
Sampling Theory Part 3
Essentials of monte carlo simulation
Quality of Analytical Procedures
Unit 6 input modeling
Sampling Theory Part 2
Sampling theory
Ad

Similar to PEMF2_SDM_2012_Ali (12)

PPTX
AIAA-SDM-PEMF-2013
PPTX
AIAA-MAO-RegionalError-2012
PPTX
WCSMO-ModelSelection-2013
PPTX
Sampling_WCSMO_2013_Jun
PPT
AIAA-MAO-DSUS-2012
PPT
RBHF_SDM_2011_Jie
PDF
Lecture 2 neural network covers the basic
PDF
7 - Model Assessment and Selection
PPTX
COSMOS_IDETC_2014_Souma
PDF
Bayesian Subset Simulation
PDF
Lecture 5 - Linear Regression Linear Regression
PPTX
Machine learning session4(linear regression)
AIAA-SDM-PEMF-2013
AIAA-MAO-RegionalError-2012
WCSMO-ModelSelection-2013
Sampling_WCSMO_2013_Jun
AIAA-MAO-DSUS-2012
RBHF_SDM_2011_Jie
Lecture 2 neural network covers the basic
7 - Model Assessment and Selection
COSMOS_IDETC_2014_Souma
Bayesian Subset Simulation
Lecture 5 - Linear Regression Linear Regression
Machine learning session4(linear regression)
Ad

More from MDO_Lab (20)

PPTX
MOWF_SchiTech_2015_Weiyang
PPTX
SA_SciTech_2014_Weiyang
PPTX
MOMDPSO_IDETC_2014_Weiyang
PPTX
MOWF_WCSMO_2013_Weiyang
PPTX
WFSA_IDETC_2013_Weiyang
PPTX
WM_MAO_2012_Weiyang
PPTX
CP3_SDM_2010_Souma
PPT
ATI_SDM_2010_Jun
PPTX
PF_MAO_2010_Souam
PPTX
WFO_MAO_2010_Souma
PPT
ATE_MAO_2010_Jun
PPT
COST_MAO_2010_Jie
PPTX
WFO_IDETC_2011_Souma
PPT
COSTMODEL_IDETC_2010_Jie
PPTX
WFO_TIERF_2011_Messac
PPTX
WFO_SDM_2011_Souma
PPTX
WFO_IDETC_2011_Souma
PPT
AHF_IDETC_2011_Jie
PPT
WPPE_ES_2011_Jie
PPTX
MMWD_ES_2011_Jie
MOWF_SchiTech_2015_Weiyang
SA_SciTech_2014_Weiyang
MOMDPSO_IDETC_2014_Weiyang
MOWF_WCSMO_2013_Weiyang
WFSA_IDETC_2013_Weiyang
WM_MAO_2012_Weiyang
CP3_SDM_2010_Souma
ATI_SDM_2010_Jun
PF_MAO_2010_Souam
WFO_MAO_2010_Souma
ATE_MAO_2010_Jun
COST_MAO_2010_Jie
WFO_IDETC_2011_Souma
COSTMODEL_IDETC_2010_Jie
WFO_TIERF_2011_Messac
WFO_SDM_2011_Souma
WFO_IDETC_2011_Souma
AHF_IDETC_2011_Jie
WPPE_ES_2011_Jie
MMWD_ES_2011_Jie

Recently uploaded (20)

PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
IGGE1 Understanding the Self1234567891011
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PPTX
Unit 4 Skeletal System.ppt.pptxopresentatiom
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Lesson notes of climatology university.
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
Hazard Identification & Risk Assessment .pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
advance database management system book.pdf
PDF
Empowerment Technology for Senior High School Guide
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Digestion and Absorption of Carbohydrates, Proteina and Fats
Orientation - ARALprogram of Deped to the Parents.pptx
Chinmaya Tiranga quiz Grand Finale.pdf
Cell Types and Its function , kingdom of life
IGGE1 Understanding the Self1234567891011
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Unit 4 Skeletal System.ppt.pptxopresentatiom
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Lesson notes of climatology university.
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Paper A Mock Exam 9_ Attempt review.pdf.
LDMMIA Reiki Yoga Finals Review Spring Summer
Hazard Identification & Risk Assessment .pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
advance database management system book.pdf
Empowerment Technology for Senior High School Guide
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
202450812 BayCHI UCSC-SV 20250812 v17.pptx

PEMF2_SDM_2012_Ali

  • 1. Quantifying Regional Error in Surrogates by Modeling its Relationship with Sample Density Ali Mehmani, Souma Chowdhury , Jie Zhang, Weiyang Tong, and Achille Messac Syracuse University, Department of Mechanical and Aerospace Engineering 54th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference April 8-11, 2013, Boston, Massachusetts
  • 2. Surrogate model • Surrogate models are commonly used for providing a tractable and inexpensive approximation of the actual system behavior in many routine engineering analysis and design activities: 2
  • 3. 3 Broad Research Question Lower Fidelity How Low? Structural Blade Design ANSYS, Inc. Surrogate Model Expensive Inexpensive
  • 4. 4 Broad Research Question How to quantify the level of the surrogate accuracy ?  further improvement of the surrogate,  domain exploration,  assessing the reliability of the optimal design,  quantifying the uncertainty associated with the surrogate,  construction of a weighted surrogate model, and  …
  • 5. Research Objective  Develop a reliable method to quantify the surrogate error, 5  This method should have the following characteristics:  model independent  no additional system evaluations  local/global error measurement  quantify the error of the actual surrogate
  • 6. Regional Error Estimation of Surrogate 6
  • 7. Regional Error Estimation of Surrogate 7 (REES)
  • 8. Presentation Outline • Review of surrogate model error measurement methods 8 • Relation of surrogate accuracy with sample density • Regional Error Estimation of Surrogate • Numerical examples: benchmark and an engineering design problems
  • 9. Presentation Outline • Review of surrogate model error measurement methods 9 • Relation of surrogate accuracy with sample density • Regional Error Estimation of Surrogate • Numerical examples: benchmark and an engineering design problems
  • 10. Surrogate Model Error Measurement Methods  Error quantification methods can be classified,  based on their computational expense, into methods that require additional data, and methods that use existing data.  based on the region of interest, into: - Global error measure (e.g., split sample, cross-validation, Akaike’s information criterion, and bootstrapping). - Local or point-wise error measure (e.g., the mean squared errors for Kriging and the linear reference model (LRM).) 10
  • 11. Surrogate Model Error Measurement Methods  Error metrics, 11 • The mean squared error (MSE) (or root mean square error (RMSE) ) • The maximum absolute error (MAE) • The relative absolute error (RAE) actual values on ith test point predicted values on ith test point
  • 12. Surrogate Model Error Measurement Methods  Error metrics, • The prediction sum of square (PRESS) is based on the leave-one-out 12 cross-validation error • The root mean square of PRESS (PRESSRMS) based on the k-fold cross-validation. • The relative absolute error of cross-validation (RAECV) based on leave-one- out approach.
  • 13. Presentation Outline • Review of surrogate model error measurement methods 13 • Relation of surrogate accuracy with sample density • Regional Error Estimation of Surrogate • Numerical examples: benchmark and an engineering design problems
  • 14. Methodology: Concept 14 Model accuracy ∝ Available resources In general, this concept can be applied for different methodologies - Surrogate modeling, - Finite Element Analysis, and - ...
  • 15. Methodology: Concept 15  Finite Element Analysis (numerical methods) The finer mesh, the stresses are more precise due to the larger number of elements coarse mesh (4 solid brick element) medium mesh (32 solid brick element) fine mesh (256 solid brick element) Estimate total shear force and flexural moment at vertical sections using Finite Element Analysis.
  • 16. Methodology: Concept  Surrogate (mathematical model) Surrogate accuracy generally improves with increasing training points. 3 training points 9 training points 7 training points The location of additional points has strong impact on surrogate accuracy. This impact is highly problem and model dependent. 16
  • 17. Presentation Outline • Review of surrogate model error measurement methods 17 • Relation of surrogate accuracy with sample density • Regional Error Estimation of Surrogate • Numerical examples: benchmark and an engineering design problems
  • 18. Methodology: REES  The REES method formulates the variation of error as a 18 function of training points using intermediate surrogates.  This formulation is used to predict the level of error in a final surrogate.
  • 19. Methodology: REES Step 1 : Generation of sample data The entire set of sample points is represented by 푿 . Step 2 : Identification of sample points inside/outside region of interest {푿풊풏} {푿풐풖풕} 푿 = 푿풊풏 + 푿풐풖풕 푿풊풏 : Inside-region data set 푿풐풖풕 : Outside-region data set user-defined region of interest 19
  • 20. Methodology: REES Step 3 : Estimation of the variation of the error with sample density Outside-Training region Point point Inside-region point user-defined region of interest First Iteration : Test Point Second iteration : Third iteration : Final Surrogate : Training Point
  • 21. Methodology: REES Step 3 : Estimation of the variation of the error with sample density  A position of sample points which are selected as training points, at each iteration, is critical to the surrogate accuracy.  The proposed error measure should be minimally sensitive to the location of the test points at each iteration. 21  Intermediate surrogates are iteratively constructed (at each iteration) over a sample set comprising all samples outside the region of interest and heuristic subsets of samples inside the region of interest.
  • 22. Methodology: REES Step 3 : Estimation of the variation of the error with sample density  The intermediate subset for each  The number of iterations (푁푖푡) is defined combination at specific iteration is defined by {휷풌} ⊂ 푿풊풏 #{휷풌} = 풏풕, 풏풕−ퟏ < 풏풕 풌 = 1,2, … , 퐾푡 - dimension of a problem, - number of inside sample points, and - preference of the user  The intermediate training points and test points for each combination at each iteration is defined by  The number of sample combinations (푲풕 ) is defined, 푿푻푹 = 푿풐풖풕 + 휷풌 푿푻푬 = 푿 − 푿푻푹  The intermediate surrogates 푓푘 , 풌 = ퟏ, ퟐ, . . , 푲풕 are constructed for all combinations using the intermediate training points ( 푿푻푹 ), and are tested over the intermediate test points ( 푿푻푬 ).
  • 23. Methodology: REES Step 3 : Estimation of the variation of the error with sample density  The median and the maximum errors are estimated for each combination 풎풕: the number of test points in tth iteration 풆: the RAE value estimated on intermediate test points 23
  • 24. Methodology: REES Step 3 : Estimation of the variation of the error with sample density  The median and the maximum errors are estimated for each combination The median is a useful measures of central tendency which is less vulnerable to outliers. 24 Median error Overall Fidelity Information Maximum error Minimum Fidelity Information
  • 25. Methodology: REES Step 3 : Estimation of the variation of the error with sample density  Probabilistic models are developed using The mode of distribution is selected to represent the errors at each iteration. a lognormal distribution to represent median and maximum errors estimated over all 푲풕 combinations at each iterations. Mode of median error distribution Mode of maximum error distribution  These values are used to relate the variation of the surrogate error with number of training points (sample density).
  • 26. The relation of the error with sample density  12-D Test Problem (Dixon & Price, n=12) Number of sample points # 푿 = ퟓퟓퟎ, Number of inside sample points # 푿풊풏 = # 푿 Number of training points at each iteration,풏풕 = 5푡 + 50, 푡 = 1,2, … , 70 Number of sample combination, 푲풕 = 500 Number of Training Points MOmax # 푿푻푹 = ퟓퟓ # 푿푻푬 = ퟒퟒퟓ Number of Training Points Estimated mode of median errors Estimated mode of maximum errors MOmed First iteration Last iteration # 푿푻푹 = ퟒퟎퟎ # 푿푻푬 = ퟏퟎퟎ
  • 27. The relation of the error with sample density  12-D Test Problem (Dixon & Price, n=12) Number of Training Points Meanmean Number of Training Points Estimated mode of median errors Estimated mean of mean errors MOmed REES Method Normalized k-fold CV
  • 28. Methodology: REES Step 4 : Prediction of regional error in the final surrogate  The final surrogate model is constructed using the full set of training data.  Regression models are applied to relate - the statistical mode of the median error distribution(푴풐풎풆풅) - the statistical mode of the maximum error distributions(푴풐풎풂풙), and - the absolute maximum error (푨푩푺풎풂풙) at each iteration to the size of the inside-region training points (nt),  These regression models are called the variation of error with sample density (VESD). The regression models are used to predict the level of the error in the final surrogate within the region of interest. 28
  • 29. Methodology: REES Modeling the Variation of Regional Error with Training Point Density  In this study, three types of the regression functions are used to represent the variation of regional error with respect to the inside-region training points Exponential regression model Multiplicative regression model Linear regression model  The choice of these functions assume a smooth monotonic decrease of the regional error with the training point density within that region.  The root mean squared error metric is used to select the best-fit regression model 29
  • 30. Presentation Outline • Review of surrogate model error measurement methods 30 • Relation of surrogate accuracy with sample density • Regional Error Estimation of Surrogate • Numerical examples: benchmark and an engineering design problems
  • 31. Numerical Examples  The effectiveness of the REES method is explored for applications with - Kriging, - Radial Basis Functions (RBF), - Extended Radial Basis Functions (E-RBF), and - Quadratic Response Surface (QRS).  To evaluate practical and numerical efficiencies of the REES method, three benchmark problems and an engineering design problem are tested.  The error evaluated using REES, and the relative absolute error given by leave-one-out cross-validation (푹푨푬풄풗) are compared with the actual error evaluated using relative absolute error on additional test points (푹푨푬풂풄풕풖풂풍). 31
  • 32. Median of RAEs Numerical Examples Results and Discussion VESD regression models within the region of interest of surrogate models constructed for the Branin-Hoo Function to predict, Distribution of median errors Mode of the median error distribution, Predicted mode of median error in the final surrogate, VESDmed Number of Inside-region Training Points 32
  • 33. Numerical Examples Results and Discussion VESD regression models within the region of interest of surrogate models constructed for the Branin-Hoo Function to predict, Type and coefficients of VESDmed Kriging RBF E-RBF QRS
  • 34. Maximum of RAEs Numerical Examples Results and Discussion VESD regression models within the region of interest of surrogate models constructed for the Branin-Hoo Function to predict the mode of maximum ( ) and the absolute maximum ( ) error. Distribution of maximum errors Mode of the maximum error distribution, Absolute maximum error Predicted absolute maximum error in the final surrogate Predicted mode of maximum error in the final surrogate 34 Number of Inside-region Training Points
  • 35. Numerical Examples Results and Discussion VESD regression models within the region of interest of surrogate models constructed for the Branin-Hoo Function to predict the mode of maximum ( ) and the absolute maximum ( ) error. Type and coefficients of VESDmax Type and coefficients of VESDABS Kriging RBF E-RBF QRS
  • 36. Numerical Examples Wind Farm Power Generation Surrogates are developed using Kriging, RBF, E-RBF, and QRS to represent the power generation of an array-like wind farm. 36
  • 37. Numerical Examples Results and Discussion VESD regression models in different surrogates for the wind farm power generation problem It. 1 It. 2 It. 3 It. 4 Predicted Error 37
  • 38. Numerical Examples 38 Results and Discussion The closer to one, the better the corresponding error measure. predicted mode of median errors median of RAEs evaluated on test points median of relative absolute errors of cross-validation
  • 39. Concluding Remarks  We developed a new method to quantify surrogate error based on the hypothesis that: “The accuracy of the approximation model is related to the amount of available resources”  This relationship can be reliably quantified when the error measures is less sensitive to sample locations or a type of application.  The REES method addresses this issue.  The preliminary results on benchmark and wind farm power generation problems indicate that in majority of cases the REES method is more accurate than other measures. 39 It is not possible using any existing methods
  • 40. Future Works  The scope for improvement the method  The implementation of the proposed error measurement in surrogate developments. 40
  • 41. Acknowledgement 41  I would like to acknowledge my research adviser Prof. Achille Messac, and my co-adviser Prof. Souma Chowdhury for their immense help and support in this research.  Support from the NSF Awards is also acknowledged.
  • 42. 42 Thank you Questions and Comments
  • 43. Median of RAEs Numerical Examples Results and Discussion VESD regression models within the region of interest of surrogate models constructed for the Branin-Hoo Function to predict, Distribution of median errors k-fold CV Mode of the median error distribution, Predicted mode of median error in the final surrogate, VESDmed Number of Inside-region Training Points 43 Mean mean Number of Inside-region Training Points