SlideShare a Scribd company logo
Santosh Pandey
Ram Sharan Chaulagain
Prakash Gyawali
- A supercomputer( HYPE -2 )
Supervisor
Prof. Dr. Subarna Shakya
Super COMPUTING Journal
Super COMPUTING Journal
Super COMPUTING Journal
OUR OPTIONS:
MULTIPROCESSOR SYSTEM MULTICOMPUTER SYSTEM
Super COMPUTING Journal
 Speedup in multiprocessing
 Depends on parallelizable code
S(P)=Speedup on P processors
T(1)=Time to process in 1 processors
T(P)= Time to process in processors
f=Inherently sequential code
p= Parallelizable code
 High performance computing for research
 Achieving super computing at a cheaper rate
than mainframes
 Muni Sakhya (1980’s)
 16 nodes
 First and the only one
Super COMPUTING Journal
 Middleware
 Network Architecture
 Multicore Computers
Super COMPUTING Journal
 SIMD (Single Instruction Multiple Data)
 MIMD (Multiple Instruction Multiple Data)
 Every application don’t have same parallelism model
 Specific Applications must be programmed
 Extend Methods of our Architecture
Super COMPUTING Journal
Super COMPUTING Journal
 Dynamic Worker Addition and Reduction
 Fault Tolerant
 Scalable System
Super COMPUTING Journal
Star Topology
 Parallel Working
 Server Thread for each Worker at Server side
 New Process for each Worker at Client side
SERVER
THREAD1
• Provide Chunk 1 To Client1
THREAD 2
• Provide Chunk 2 To Client2
THREAD
N
• Provide Chunk N To Client N
Connect
to server
Take chunk
to process
Process
Provide
output to
server
Connect
to server
Take chunk
to process
Process
Provide
output to
server
Connect
to server
Take chunk
to process
Process
Provide
output to
server
 Running thousands of flops operations
 Integration for finding the value of Pi
Super COMPUTING Journal
Time (ms)
0
10000
20000
30000
40000
50000
1,4
1,3
2,1
3,1
3,2
3,3
4,3
Time (ms)
1(10000) 2(20000) 5(50000) 7(70000) 15(150000)
Speedup 1 1.954 4.8 5.9 18
1
1.954
4.8
5.9
18
-5
0
5
10
15
20
25
Speedup
No. of Nodes
Speedup for 100 Million Iterations
Fig : Exponential Speedup
1(10000) 2(20000) 5(50000) 7(70000)15(150000)
Speedup 1 1.954 4.8 5.9 18
1
1.954
4.8
5.9
18
-5
0
5
10
15
20
25
Speedup
No. of Nodes
Speedup for 100 Million Iterations
Theory vs. Practical Data
 No official data for comparing
 Probably the fastest in Nepal
 Cryptography
 Data Mining
 Weather Forecasting
 Research
 Artificial Intelligence
Super COMPUTING Journal
Super COMPUTING Journal
Super COMPUTING Journal
Super COMPUTING Journal
 Not comparable with bigger super computer due to
less nodes
 Extension of Architecture library to define new
application
 Supporting Complex Computations
 Inter-process Communication for dependent tasks
 Implementing GPU for Computation
Websites:
 Don Berker. Robert G. Brown. Greg Lindahl. Forrest Hoffman.
Putchong Uthayopas. Kragen Sitaker. Frequently Asked Questions
[Online]. Available: http://www.beowulf.org/overview.faq.html
 Technopedia. Computer Cluster [Online]. Available:
http://www.technopedia.com/definition/6581/computer-cluster
 Dr. Wu-chun. Feng. (2015). The Green500 list- November 2015 [Online].
Available: http://www.green500/list/green201511
Books:
 Shiflet, Introduction to Computational Science: Modeling and
Simulation for Sciences, Princeton University Press, 2014.
 Kumar, Lenina, MATLAB: Easy Way to Learning, PHI Learning, 2016.
 Etter, Introduction to MATLAB, Prentice Hall, 2015
 Lemay Laura, Charles L. Perkins, Teach Yourself Java in 21 Days,
Samsnet, 1996.
Super COMPUTING Journal

More Related Content

PPTX
C-SAW: A Framework for Graph Sampling and Random Walk on GPUs
PPTX
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
PPTX
Real-time and long-time together
PPTX
Deep Learning on Aerial Imagery: What does it look like on a map?
PDF
Federated HPC Clouds applied to Radiation Therapy
PDF
CNIT 127 Ch 5: Introduction to heap overflows
PDF
cnsm2011_slide
PDF
HPC in the Cloud
C-SAW: A Framework for Graph Sampling and Random Walk on GPUs
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
Real-time and long-time together
Deep Learning on Aerial Imagery: What does it look like on a map?
Federated HPC Clouds applied to Radiation Therapy
CNIT 127 Ch 5: Introduction to heap overflows
cnsm2011_slide
HPC in the Cloud

What's hot (20)

PDF
Parallelization Strategies for Implementing Nbody Codes on Multicore Architec...
PPTX
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
PDF
Hadoop analytics provisioning based on a virtual infrastructure
PDF
HTCC poster for CERN Openlab opendays 2015
PDF
IIBMP2019 講演資料「オープンソースで始める深層学習」
PPT
Towards Utilizing GPUs in Information Visualization
PDF
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
PDF
post119s1-file3
PDF
resume-XinyuSui
PDF
Achitecture Aware Algorithms and Software for Peta and Exascale
PPTX
Brief Overview of a Parallel Nbody Code
PDF
PERFORMANCE PREDICTION OF GPU-BASED DEEP LEARNING APPLICATIONS
PDF
BDC-presentation
PPTX
Graph 500 DISLIB powered optimized version
PDF
PFN Summer Internship 2019 / Kenshin Abe: Extension of Chainer-Chemistry for ...
PPTX
Image Segmentation Chain
PDF
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
PPT
Harnessing OpenCL in Modern Coprocessors
PPTX
Ulrik De Bie - Newtec@ Virtual Wall
Parallelization Strategies for Implementing Nbody Codes on Multicore Architec...
Architectural Optimizations for High Performance and Energy Efficient Smith-W...
Hadoop analytics provisioning based on a virtual infrastructure
HTCC poster for CERN Openlab opendays 2015
IIBMP2019 講演資料「オープンソースで始める深層学習」
Towards Utilizing GPUs in Information Visualization
Histogram Equalized Heat Maps from Log Data via Apache Spark with Arvind Rao
post119s1-file3
resume-XinyuSui
Achitecture Aware Algorithms and Software for Peta and Exascale
Brief Overview of a Parallel Nbody Code
PERFORMANCE PREDICTION OF GPU-BASED DEEP LEARNING APPLICATIONS
BDC-presentation
Graph 500 DISLIB powered optimized version
PFN Summer Internship 2019 / Kenshin Abe: Extension of Chainer-Chemistry for ...
Image Segmentation Chain
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Harnessing OpenCL in Modern Coprocessors
Ulrik De Bie - Newtec@ Virtual Wall
Ad

Viewers also liked (16)

PDF
13 a westmacall..
PDF
BigAir Presentation
PDF
Botmetric iim preso may 7v2
DOC
Joseph Cyril. -Resume-
PDF
STRUCK_AND_PARTNERSdec 16
PDF
Representación10
PDF
CHT-G(XX)X-X-MOD-02-UOG-MOD-02
PPTX
De ing y sociedad
PDF
2015 annual results announcement
PDF
Lattice Energy LLC - LENR transmutation of Carbon better energy strategy than...
PPTX
Qui acd alyr
PDF
Vietnam e-commerce Pocket Guideline 2014
PPTX
Strategies in Teaching Mathematics -Principles of Teaching 2 (KMB)
PPTX
Guide to Crete
PPTX
E-commerce usage in Vietnam
PPTX
Presupuesto
13 a westmacall..
BigAir Presentation
Botmetric iim preso may 7v2
Joseph Cyril. -Resume-
STRUCK_AND_PARTNERSdec 16
Representación10
CHT-G(XX)X-X-MOD-02-UOG-MOD-02
De ing y sociedad
2015 annual results announcement
Lattice Energy LLC - LENR transmutation of Carbon better energy strategy than...
Qui acd alyr
Vietnam e-commerce Pocket Guideline 2014
Strategies in Teaching Mathematics -Principles of Teaching 2 (KMB)
Guide to Crete
E-commerce usage in Vietnam
Presupuesto
Ad

Similar to Super COMPUTING Journal (20)

PDF
Building A Linux Cluster Using Raspberry PI #2!
PDF
PPTX
Parallel computing in india
PPTX
Assignment-1 Updated Version advanced comp.pptx
PPT
Nbvtalkatjntuvizianagaram
PPT
Parallel architecture
PDF
Lecture 1 introduction to parallel and distributed computing
PDF
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
PPTX
PA CO-1.pptx on business analysis on systems
PPTX
Role of python in hpc
PPTX
Super computer 2017
ODP
Distributed Computing
PPTX
distributed system lab materials about ad
PPTX
lec1.pptx
PDF
ADVANCED COMPUTER ARCHITECTURE PARALLELISM SCALABILITY PROGRAMMABILITY Baas ...
PPTX
network ram parallel computing
PPTX
Cloud Computing-UNIT 1 claud computing basics
PDF
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
PPT
Cluster Tutorial
PPT
Current Trends in HPC
Building A Linux Cluster Using Raspberry PI #2!
Parallel computing in india
Assignment-1 Updated Version advanced comp.pptx
Nbvtalkatjntuvizianagaram
Parallel architecture
Lecture 1 introduction to parallel and distributed computing
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
PA CO-1.pptx on business analysis on systems
Role of python in hpc
Super computer 2017
Distributed Computing
distributed system lab materials about ad
lec1.pptx
ADVANCED COMPUTER ARCHITECTURE PARALLELISM SCALABILITY PROGRAMMABILITY Baas ...
network ram parallel computing
Cloud Computing-UNIT 1 claud computing basics
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
Cluster Tutorial
Current Trends in HPC

Super COMPUTING Journal

  • 1. Santosh Pandey Ram Sharan Chaulagain Prakash Gyawali - A supercomputer( HYPE -2 ) Supervisor Prof. Dr. Subarna Shakya
  • 5. OUR OPTIONS: MULTIPROCESSOR SYSTEM MULTICOMPUTER SYSTEM
  • 7.  Speedup in multiprocessing  Depends on parallelizable code S(P)=Speedup on P processors T(1)=Time to process in 1 processors T(P)= Time to process in processors f=Inherently sequential code p= Parallelizable code
  • 8.  High performance computing for research  Achieving super computing at a cheaper rate than mainframes
  • 9.  Muni Sakhya (1980’s)  16 nodes  First and the only one
  • 11.  Middleware  Network Architecture  Multicore Computers
  • 13.  SIMD (Single Instruction Multiple Data)  MIMD (Multiple Instruction Multiple Data)
  • 14.  Every application don’t have same parallelism model  Specific Applications must be programmed  Extend Methods of our Architecture
  • 17.  Dynamic Worker Addition and Reduction  Fault Tolerant  Scalable System
  • 21.  Server Thread for each Worker at Server side  New Process for each Worker at Client side SERVER THREAD1 • Provide Chunk 1 To Client1 THREAD 2 • Provide Chunk 2 To Client2 THREAD N • Provide Chunk N To Client N Connect to server Take chunk to process Process Provide output to server Connect to server Take chunk to process Process Provide output to server Connect to server Take chunk to process Process Provide output to server
  • 22.  Running thousands of flops operations  Integration for finding the value of Pi
  • 25. 1(10000) 2(20000) 5(50000) 7(70000) 15(150000) Speedup 1 1.954 4.8 5.9 18 1 1.954 4.8 5.9 18 -5 0 5 10 15 20 25 Speedup No. of Nodes Speedup for 100 Million Iterations Fig : Exponential Speedup
  • 26. 1(10000) 2(20000) 5(50000) 7(70000)15(150000) Speedup 1 1.954 4.8 5.9 18 1 1.954 4.8 5.9 18 -5 0 5 10 15 20 25 Speedup No. of Nodes Speedup for 100 Million Iterations Theory vs. Practical Data
  • 27.  No official data for comparing  Probably the fastest in Nepal
  • 28.  Cryptography  Data Mining  Weather Forecasting  Research  Artificial Intelligence
  • 33.  Not comparable with bigger super computer due to less nodes  Extension of Architecture library to define new application
  • 34.  Supporting Complex Computations  Inter-process Communication for dependent tasks  Implementing GPU for Computation
  • 35. Websites:  Don Berker. Robert G. Brown. Greg Lindahl. Forrest Hoffman. Putchong Uthayopas. Kragen Sitaker. Frequently Asked Questions [Online]. Available: http://www.beowulf.org/overview.faq.html  Technopedia. Computer Cluster [Online]. Available: http://www.technopedia.com/definition/6581/computer-cluster  Dr. Wu-chun. Feng. (2015). The Green500 list- November 2015 [Online]. Available: http://www.green500/list/green201511 Books:  Shiflet, Introduction to Computational Science: Modeling and Simulation for Sciences, Princeton University Press, 2014.  Kumar, Lenina, MATLAB: Easy Way to Learning, PHI Learning, 2016.  Etter, Introduction to MATLAB, Prentice Hall, 2015  Lemay Laura, Charles L. Perkins, Teach Yourself Java in 21 Days, Samsnet, 1996.