Development of Methods for Deep Neural
Network Architectures Optimization based on
Tensor Factorization Algorithms
Supervisor: Revin Ilya Evgenyevich, research associate,
Laboratory for Composite AI, Research Center “Strong AI in Industry”
Presented by: Zakharov Denis, J4232c
Pain: Current deep networks have an excessive number of parameters, which results in
long training times and expensive storage of weights in production.
Hypothesis: Using an optimization method, we can reduce the size of the
stored model and increase throughput.
Work: Development of Methods for Deep Neural Network Architectures
Optimization based on Tensor Factorization Algorithms
Problem
2
Purpose and objectives of study
Goal
Development of Methods for Deep Neural Network Architectures Optimization based on Tensor
Factorization Algorithms
Objectives
• Review the literature in the related field to provide background
• Study tensor algorithms (optimization, operations)
• Study the LoRA approach
• Run experiments with time series (TS) models
• Based on these findings, propose a product that can be integrated into an AutoML solution
• Develop the optimization method
• Contribute to the Fedot.Industrial framework
3
Modern Networks
4
[Figure: Model Trend. Number of parameters by year, 2018 to 2024: BERT 340M, T5 11B, GPT-3 175B, Megatron-Turing 530B, GPT-4 1.76T, Gemini Pro ≈30T, Gemini Ultra ≈60T]
Tensor Decomposition
5
[Figure: diagrams of four tensor decompositions: t-SVD (Singular Value Decomposition), CANDECOMP/PARAFAC, Tucker, and Block Term]
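As a rough illustration of two of the decompositions named on this slide, the sketch below rebuilds a 3-way tensor from CANDECOMP/PARAFAC factors and from Tucker factors using NumPy. The sizes, ranks, and variable names are arbitrary assumptions for the example and do not come from the thesis.

```python
import numpy as np

rng = np.random.default_rng(0)
I, J, K = 6, 5, 4          # tensor dimensions (illustrative)

# CANDECOMP/PARAFAC: a sum of R rank-one tensors, one factor matrix per mode.
R = 3
A = rng.standard_normal((I, R))
B = rng.standard_normal((J, R))
C = rng.standard_normal((K, R))
X_cp = np.einsum("ir,jr,kr->ijk", A, B, C)

# Tucker: a small core tensor multiplied by one factor matrix per mode.
r1, r2, r3 = 2, 3, 2
G = rng.standard_normal((r1, r2, r3))
U1 = rng.standard_normal((I, r1))
U2 = rng.standard_normal((J, r2))
U3 = rng.standard_normal((K, r3))
X_tucker = np.einsum("abc,ia,jb,kc->ijk", G, U1, U2, U3)

# Parameter counts: full tensor versus the factors that represent it.
full_params = I * J * K
cp_params = R * (I + J + K)
tucker_params = G.size + U1.size + U2.size + U3.size

print(full_params, cp_params, tucker_params)
```

In both cases the stored factors hold fewer numbers than the full tensor, which is the property the compression hypothesis relies on.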
Low-rank decomposition
6
$M \approx L_k \times R_k^{\top}$, where $M$ is $m \times n$, $L_k$ is $m \times k$, and $R_k^{\top}$ is $k \times n$
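A minimal sketch of the factorization on this slide, assuming a truncated SVD is used to obtain the two factors (the slides do not say which algorithm produces them). The function name `low_rank_factors` and the matrix sizes are hypothetical.

```python
import numpy as np

def low_rank_factors(M, k):
    """Return L (m x k) and R_T (k x n) so that L @ R_T is the best rank-k approximation of M."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    L = U[:, :k] * s[:k]   # fold the singular values into the left factor
    R_T = Vt[:k, :]
    return L, R_T

rng = np.random.default_rng(0)
m, n, k = 64, 48, 8
# Build a matrix of exact rank k so the rank-k factorization reproduces it.
M = rng.standard_normal((m, k)) @ rng.standard_normal((k, n))

L, R_T = low_rank_factors(M, k)
rel_error = np.linalg.norm(M - L @ R_T) / np.linalg.norm(M)

params_full = m * n             # numbers stored for M
params_low_rank = k * (m + n)   # numbers stored for L and R_T

print(rel_error, params_full, params_low_rank)
```

Storing the two factors takes k(m + n) values instead of mn, so the saving grows as k shrinks relative to m and n. For real weight matrices the approximation is not exact, and the error depends on how fast the singular values decay.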
Statement of Experiment
7
Compare the performance of the time series (TS) models NBEATS, Transformer, and ARIMA
on the M4 dataset to see how well they forecast time series.
Models
8
Transformer — state-of-the-art deep learning model introduced in
2017. It is an encoder-decoder architecture whose core feature is
the ‘multi-head attention’ mechanism
[Figure: Transformer architecture. Encoder (N×): Multi-Head Attention, Add & Norm, Feed Forward, Add & Norm. Decoder (N×): Masked Multi-Head Attention, Add & Norm, Multi-Head Attention, Add & Norm, Feed Forward, Add & Norm, followed by Linear and SoftMax. Insets: Multi-Head Attention (Linear projections, Scaled Dot-Product Attention, Concat, Linear) and Scaled Dot-Product Attention (MatMul, Scale, Mask, SoftMax, MatMul)]
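The Scaled Dot-Product Attention block from the diagram (MatMul, Scale, Mask, SoftMax, MatMul) can be sketched in a few lines of NumPy. This is a single-head illustration with made-up shapes, not the model used in the experiments.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V, mask=None):
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-1, -2) / np.sqrt(d_k)   # MatMul, then Scale
    if mask is not None:
        scores = np.where(mask, scores, -1e9)        # Mask: blocked positions get a very low score
    weights = softmax(scores, axis=-1)               # SoftMax over the keys
    return weights @ V, weights                      # MatMul with the values

rng = np.random.default_rng(0)
seq_len, d_k = 5, 16
Q = rng.standard_normal((seq_len, d_k))
K = rng.standard_normal((seq_len, d_k))
V = rng.standard_normal((seq_len, d_k))

# Causal mask as used by the masked attention in the decoder:
# position i may only attend to positions 0..i.
causal_mask = np.tril(np.ones((seq_len, seq_len), dtype=bool))
out, weights = scaled_dot_product_attention(Q, K, V, mask=causal_mask)

print(out.shape, weights.shape)
```

Multi-head attention runs several such heads on linearly projected inputs, concatenates the results, and applies a final linear layer. Those linear projections are the kind of dense weight matrices a low-rank method can target.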
Models
9
NBEATS — deep learning-based approach for time series forecasting.
ARIMA — popular statistical model used to forecast future values in a time
series based on past values.
[Figure: NBEATS architecture. Each block is a stack of fully connected (FC) layers that outputs a backcast and a forecast; Block 1 to Block K form a stack; Stack 1 to Stack M are combined into the global forecast]
M4 Dataset
10
The M4 dataset is a collection of 100,000 time series used in the fourth
edition of the Makridakis forecasting competition.
It consists of time series at six frequencies (average training length):
• Yearly: 63
• Quarterly: 125
• Monthly: 302
• Weekly: 2035
• Daily: 475
• Hourly: 682
Experiment - Monthly
11
[Figure: forecasts on monthly series; panels: ARIMA, NBEATS, Transformer]
Interpretation of results
12
On short series, all three models performed well and are suitable for forecasting.
However, over longer ranges:
• NBEATS performs better than the other models
• Some models differ critically in their predictions
• Not all models struggle over longer ranges
Training for just 50 epochs took:
• Almost an hour for NBEATS
• 20 min for Transformer
• 1 min for ARIMA
LoRA + rSVD
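[Figure: LoRA adapter. Input x, frozen weight $W \in \mathbb{R}^{d \times d}$, low-rank branch of rank r with $A = \mathcal{N}(0, \sigma^2)$ and $B = 0$ at initialization, output h]
[Figure: rSVD. A projected matrix is factorized with SVD into three factors, and the left factor of the original matrix is then recovered from the projected one]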
13
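The rSVD half of this slide can be illustrated with the standard randomized SVD recipe: project the matrix onto a random low-dimensional subspace, run an exact SVD on the small projected matrix, then recover the left singular vectors. The sketch below is an assumption about the general technique, not the thesis code. The function name `randomized_svd`, the `oversample` parameter, and the test matrix are hypothetical.

```python
import numpy as np

def randomized_svd(W, k, oversample=5, seed=0):
    """Approximate rank-k SVD of W via a random projection."""
    rng = np.random.default_rng(seed)
    m, n = W.shape
    omega = rng.standard_normal((n, k + oversample))   # random test matrix
    Q, _ = np.linalg.qr(W @ omega)                     # orthonormal basis for the sampled range of W
    small = Q.T @ W                                    # projected matrix, (k + oversample) x n
    U_small, s, Vt = np.linalg.svd(small, full_matrices=False)  # exact SVD of the small matrix
    U = Q @ U_small                                    # recover left singular vectors of W
    return U[:, :k], s[:k], Vt[:k, :]

rng = np.random.default_rng(1)
m, n, k = 80, 60, 6
# Exact rank-k matrix, so the randomized result can be checked against the exact SVD.
W = rng.standard_normal((m, k)) @ rng.standard_normal((k, n))

U, s, Vt = randomized_svd(W, k)
rel_error = np.linalg.norm(W - (U * s) @ Vt) / np.linalg.norm(W)
s_exact = np.linalg.svd(W, compute_uv=False)[:k]

print(rel_error)
```

The expensive SVD is applied only to the small projected matrix, which is why the method is attractive for large weight matrices. On matrices that are only approximately low rank, the error depends on the singular value decay and on the oversampling.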
Results NBEATS
[Figure: Base Model and LoRA Layers]
Model        latency    throughput
No LoRA      0.00108    4364808.0
LoRA Layer   0.00166    4159606.0
MS Default   0.00106    4748564.0
MS All       0.00108    4482670.0
MS LoRA      0.00105    5365210.0
14
Results Transformer
Model        latency    throughput
No LoRA      0.00085    9306328.0
LoRA Layer   0.00084    9782205.0
MS Default   0.00085    9572622.0
MS All       0.00085    9463282.0
MS LoRA      0.00084    8586088.0
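To make the two results tables easier to compare, the throughput of each variant can be expressed as a relative change against the "No LoRA" baseline. The numbers below are copied from the NBEATS and Transformer tables. The slides do not state units, so only ratios are computed, and the helper name `relative_change` is hypothetical.

```python
# Throughput values copied from the results tables (units are not given in the slides).
nbeats = {
    "No LoRA": 4364808.0,
    "LoRA Layer": 4159606.0,
    "MS Default": 4748564.0,
    "MS All": 4482670.0,
    "MS LoRA": 5365210.0,
}
transformer = {
    "No LoRA": 9306328.0,
    "LoRA Layer": 9782205.0,
    "MS Default": 9572622.0,
    "MS All": 9463282.0,
    "MS LoRA": 8586088.0,
}

def relative_change(results, baseline="No LoRA"):
    """Relative change of every variant against the baseline, as a fraction."""
    base = results[baseline]
    return {
        name: (value - base) / base
        for name, value in results.items()
        if name != baseline
    }

nbeats_change = relative_change(nbeats)
transformer_change = relative_change(transformer)

for model_name, change in (("NBEATS", nbeats_change), ("Transformer", transformer_change)):
    for variant, delta in change.items():
        print(f"{model_name:12s} {variant:11s} {delta:+.1%}")
```

Read this way, the "MS LoRA" variant has roughly 23% higher throughput than the baseline for NBEATS but roughly 8% lower throughput for the Transformer, so the effect is not uniform across the two models.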
[Figure: Model Train With Early Stopping. Series: Base, LoRA; x-axis: Epoch 1 to Epoch 8; y-axis range: 0 to 0.07]
15
LoRA Implementation
14
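The implementation shown on this slide is not recoverable from the transcript, so the following is only a generic sketch of a LoRA-style linear layer under common assumptions: the pretrained weight W is frozen, A is drawn from a zero-mean normal distribution, B starts at zero, and the update is scaled by alpha / r. The class name `LoRALinear` and the parameters `alpha` and `sigma` are hypothetical.

```python
import numpy as np

class LoRALinear:
    """Linear layer h = x W^T + (alpha / r) * x A^T B^T with a frozen W (illustrative sketch)."""

    def __init__(self, W, r, alpha=1.0, sigma=0.02, seed=0):
        rng = np.random.default_rng(seed)
        d_out, d_in = W.shape
        self.W = W                                        # frozen pretrained weight
        self.A = rng.normal(0.0, sigma, size=(r, d_in))   # A ~ N(0, sigma^2)
        self.B = np.zeros((d_out, r))                     # B = 0, so the layer starts unchanged
        self.scale = alpha / r

    def forward(self, x):
        return x @ self.W.T + self.scale * (x @ self.A.T) @ self.B.T

    def merged_weight(self):
        # After training, the low-rank update can be folded back into one matrix.
        return self.W + self.scale * self.B @ self.A

rng = np.random.default_rng(1)
d_in, d_out, r = 32, 16, 4
W = rng.standard_normal((d_out, d_in))
x = rng.standard_normal((8, d_in))

layer = LoRALinear(W, r=r)
h0 = layer.forward(x)                       # equals the frozen layer because B is zero

layer.B = rng.standard_normal((d_out, r))   # stand-in for a trained B
h1 = layer.forward(x)
h_merged = x @ layer.merged_weight().T

trainable = layer.A.size + layer.B.size
print(trainable, W.size)
```

Only A and B are trained, which here means 192 values instead of 512. Merging the update back into W removes the extra matrix multiplications at inference time.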
Summary
In this work:
• Ran an experiment to study the performance of:
  • NBEATS
  • ARIMA
  • Transformer
• Reasoned that layers in the models could be replaced using the LoRA approach
• Implemented this logic as a master's thesis
15
TN diagrams of some popular decompositions
[Figure: tensor network (TN) diagrams of several decompositions, drawn as core tensors connected by shared rank indices]
THANK YOU
FOR YOUR TIME!
@misterzurg