SlideShare a Scribd company logo
Graphics processing units - powerful, programmable, and highly parallel - are increasingly targeting general-purpose computing applications. GPU ComputingPresented By:Khan Muhammad Nafee Mostafa0507007, Dept of CSE, KUET
GPU ComputingJ. D. OwensM. HoustonD. LuebkeS. GreenJ. E. StoneJ. C. PhillipsProceedings of the IEEE | Vol 96, No. 5 | May 2008We would be concentrating on,What is GPU ComputingWhy GPU ComputingGPU Architecture and EvolutionGPU Computing ModelSoftware Environment Future
GPU for General Purpose ComputingWhat is GPU Computing ?
What is GPU Computing ?GPU computing is the use of a GPU to do general purpose scientific and engineering computingCPU and GPU together in a heterogeneous computing model.Sequential part of the application runs on the CPU and the computationally-intensive part runs on the GPU. From the user’s perspective, the application just runs faster because it is using the high-performance of the GPU to boost performance.
Over the past few years, the GPU has evolved from a fixed-function special-purpose processor into a full-fledged parallel programmable processor with additional fixed-function special-purpose functionalityWhy GPU Computing…
GPU for Non-Graphic AppsThe GPU is designed for a particular class of applications with the following characteristics,Computational requirements are largeParallelism is substantialThroughput is more important than latencya growing community has identified other applications with similar characteristics and successfully mapped these applications onto the GPU
GPU extends its hand towards CPU for performanceParallelism is the future of computingMany applications have to process huge set of data following same functionsSeveral stream processors can execute same  set of instructions on different data sets and give a higher throughput  If GPU take some share of computation load from CPU, many applications can be benefitted in speed-up
GPU is now turned into a programmable engineGPU Architecture and Evolution
GPU PipelineAvailable operations are configurable but not programmable
Evolution…
All GPU programs must be structured in this way: many parallel elements, each processed in parallel by a single programGPU Computing Model
Computing on the GPUProgramming a GPU for Graphicsprogrammer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
Computing on the GPUProgramming a GPU for GraphicsProgramming a GPU for General-Purpose Programs (Old)programmer specifies geometric primitive covering computation domain of interest; rasterizer generates fragmentEach fragment is shaded by an SPMD general purpose FPFP computes the fragment by a combination of math operations and ‘gather’ accesses from global memory. resulting buffer can be used as an input on future passes. programmer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
Computing on the GPUProgramming a GPU for General-Purpose Programs (New)programmer directly defines the computation domain of interest as a structured grid of threadsSPMD general-purpose program computes each threadeach thread is computed by a combination of math  operations and both ‘gather’ (read) accesses from and ‘scatter’ (write) accesses to global memory; (same buffer can be used for both allowing more flexible algorithms)resulting buffer in global memory can then be used as an input in future computation
Software Environments
Software EnvironmentsBrookGPUMicrosoft’s AcceleratorVendor Specific GPGPU systemsAMD ATI’s CTM (Close to the Metal)NVIDIA’s CUDA (Compute Unified Device Architecture)
Scan performance on CPU, graphics-based GPU (using OpenGL), and direct-compute GPU (using CUDA). Results obtained on a GeForce 8800 GTX GPU and Intel Core2-Duo Extreme 2.93 GHz CPU. (Figure adapted from Harris et al.)Scan performance on CPU, OpenGL and CUDA
Future…
Concluding for bright Future…support for double-precision floating-pointhigher bandwidth path between CPU and GPU (like ATI’s HyperTransport)more tightly coupled CPU and GPU (AMD’s fusion or nVidianForce)NVIDIA Quadro for Multiple GPU CollaborationFinally, let us wait for new era when GPU Computing will rule
Thank YouI would also like to thank,

More Related Content

PPTX
CPU vs GPU Comparison
PPTX
Lec04 gpu architecture
PDF
CPU vs. GPU presentation
PPT
Parallel computing with Gpu
PPTX
Graphics processing unit ppt
PDF
GPU - Basic Working
PDF
Apache Spark Introduction
PPTX
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...
CPU vs GPU Comparison
Lec04 gpu architecture
CPU vs. GPU presentation
Parallel computing with Gpu
Graphics processing unit ppt
GPU - Basic Working
Apache Spark Introduction
Hadoop Architecture | HDFS Architecture | Hadoop Architecture Tutorial | HDFS...

What's hot (20)

PDF
GPU Programming
PPTX
Graphics processing unit (GPU)
PPT
PPTX
Single and Multi core processor
PPTX
Introduction to Hadoop
PPTX
Graphics Processing Unit by Saurabh
PDF
HDFS Architecture
PPTX
Final draft intel core i5 processors architecture
PDF
AI Chip Trends and Forecast
PPTX
Hadoop Architecture
PPTX
Graphic Processing Unit (GPU)
PDF
Hadoop Overview & Architecture
 
PPTX
Introduction to Hadoop and Hadoop component
PPTX
GPU Architecture NVIDIA (GTX GeForce 480)
PPTX
graphics processing unit ppt
PDF
Introduction to GPU Programming
PPT
Introduction to MongoDB
PPTX
HADOOP TECHNOLOGY ppt
PPTX
Nvidia (History, GPU Architecture and New Pascal Architecture)
GPU Programming
Graphics processing unit (GPU)
Single and Multi core processor
Introduction to Hadoop
Graphics Processing Unit by Saurabh
HDFS Architecture
Final draft intel core i5 processors architecture
AI Chip Trends and Forecast
Hadoop Architecture
Graphic Processing Unit (GPU)
Hadoop Overview & Architecture
 
Introduction to Hadoop and Hadoop component
GPU Architecture NVIDIA (GTX GeForce 480)
graphics processing unit ppt
Introduction to GPU Programming
Introduction to MongoDB
HADOOP TECHNOLOGY ppt
Nvidia (History, GPU Architecture and New Pascal Architecture)
Ad

Viewers also liked (19)

PPT
Graphics Processing Unit - GPU
PPTX
GRAPHICS PROCESSING UNIT (GPU)
PPTX
Graphics processing unit (gpu)
PPT
Gpu presentation
PDF
GPU - An Introduction
PDF
Example Application of GPU
PDF
GPU Computing for Data Science
PDF
Automatically Defined Functions for Learning Classifier Systems
PPTX
The Effect of Heat on a GPU
PPTX
GPU Computing: A brief overview
PPTX
Graphics processing unit
PPTX
【セミナー資料】ソーシャル×ビッグデータ×Biで切り開くこれからの企業のあり方
PDF
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
PDF
Jug gpgpu
PPTX
GPU Computing
PPTX
How Persistent Memory Will Bring an Entirely New Structure to Large Data Comp...
PPTX
IDC Report on HPC Market Trends June 2013
PDF
GPU, CUDA, OpenCL and OpenACC for Parallel Applications
PPTX
Gpu with cuda architecture
Graphics Processing Unit - GPU
GRAPHICS PROCESSING UNIT (GPU)
Graphics processing unit (gpu)
Gpu presentation
GPU - An Introduction
Example Application of GPU
GPU Computing for Data Science
Automatically Defined Functions for Learning Classifier Systems
The Effect of Heat on a GPU
GPU Computing: A brief overview
Graphics processing unit
【セミナー資料】ソーシャル×ビッグデータ×Biで切り開くこれからの企業のあり方
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
Jug gpgpu
GPU Computing
How Persistent Memory Will Bring an Entirely New Structure to Large Data Comp...
IDC Report on HPC Market Trends June 2013
GPU, CUDA, OpenCL and OpenACC for Parallel Applications
Gpu with cuda architecture
Ad

Similar to GPU Computing (20)

PPTX
GPU in Computer Science advance topic .pptx
PDF
Computing using GPUs
PDF
GPU Computing: An Introduction
PDF
Report on GPGPU at FCA (Lyon, France, 11-15 October, 2010)
PDF
Graphics Processing Unit: An Introduction
PPT
Achieving Improved Performance In Multi-threaded Programming With GPU Computing
PDF
19564926 graphics-processing-unit
PDF
Newbie’s guide to_the_gpgpu_universe
PPT
Cuda intro
PDF
Image Processing Application on Graphics processors
PPTX
PDF
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
PDF
Raul sena - Apresentação Analiticsemtudo - Scientific Applications using GPU
PDF
Volume 2-issue-6-2040-2045
PDF
Volume 2-issue-6-2040-2045
PPTX
Introduction to Accelerators
PDF
A beginner’s guide to programming GPUs with CUDA
PPT
NVIDIA CUDA
PPTX
Gpu databases
PDF
Cuda Without a Phd - A practical guick start
GPU in Computer Science advance topic .pptx
Computing using GPUs
GPU Computing: An Introduction
Report on GPGPU at FCA (Lyon, France, 11-15 October, 2010)
Graphics Processing Unit: An Introduction
Achieving Improved Performance In Multi-threaded Programming With GPU Computing
19564926 graphics-processing-unit
Newbie’s guide to_the_gpgpu_universe
Cuda intro
Image Processing Application on Graphics processors
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
Raul sena - Apresentação Analiticsemtudo - Scientific Applications using GPU
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
Introduction to Accelerators
A beginner’s guide to programming GPUs with CUDA
NVIDIA CUDA
Gpu databases
Cuda Without a Phd - A practical guick start

More from Khan Mostafa (14)

PDF
Graph-based Analysis and Opinion Mining in Social Network
PDF
Research in the Computing Industry
PDF
Semantic matchmaking Local Closed-World Reasoning
PDF
Survey on real media paint simulation in Computer Graphics
PDF
Seminal works on watercolor painting simulation
PDF
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
PDF
Project Presentation: Graph-based Analysis and Opinion Mining in Social Network
PDF
A Survey on Sentiment Mining Techniques
PPTX
The Career (CSE)
PPTX
RDF by Structured Reference to Semantics, the RS2 framework
PDF
Study Tour (KUET CSE 2k5) Poster
PDF
Traffic Jam Detection System by Ratul, Sadh, Shams
PPTX
Open Document Format
PPTX
An Approach To Emerge Web 3.0
Graph-based Analysis and Opinion Mining in Social Network
Research in the Computing Industry
Semantic matchmaking Local Closed-World Reasoning
Survey on real media paint simulation in Computer Graphics
Seminal works on watercolor painting simulation
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Project Presentation: Graph-based Analysis and Opinion Mining in Social Network
A Survey on Sentiment Mining Techniques
The Career (CSE)
RDF by Structured Reference to Semantics, the RS2 framework
Study Tour (KUET CSE 2k5) Poster
Traffic Jam Detection System by Ratul, Sadh, Shams
Open Document Format
An Approach To Emerge Web 3.0

Recently uploaded (20)

PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Machine Learning_overview_presentation.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
1. Introduction to Computer Programming.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPT
Teaching material agriculture food technology
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Getting Started with Data Integration: FME Form 101
Encapsulation_ Review paper, used for researhc scholars
Network Security Unit 5.pdf for BCA BBA.
Per capita expenditure prediction using model stacking based on satellite ima...
Unlocking AI with Model Context Protocol (MCP)
Machine Learning_overview_presentation.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
1. Introduction to Computer Programming.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
cuic standard and advanced reporting.pdf
NewMind AI Weekly Chronicles - August'25-Week II
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Teaching material agriculture food technology
Spectral efficient network and resource selection model in 5G networks
SOPHOS-XG Firewall Administrator PPT.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Accuracy of neural networks in brain wave diagnosis of schizophrenia

GPU Computing

  • 1. Graphics processing units - powerful, programmable, and highly parallel - are increasingly targeting general-purpose computing applications. GPU ComputingPresented By:Khan Muhammad Nafee Mostafa0507007, Dept of CSE, KUET
  • 2. GPU ComputingJ. D. OwensM. HoustonD. LuebkeS. GreenJ. E. StoneJ. C. PhillipsProceedings of the IEEE | Vol 96, No. 5 | May 2008We would be concentrating on,What is GPU ComputingWhy GPU ComputingGPU Architecture and EvolutionGPU Computing ModelSoftware Environment Future
  • 3. GPU for General Purpose ComputingWhat is GPU Computing ?
  • 4. What is GPU Computing ?GPU computing is the use of a GPU to do general purpose scientific and engineering computingCPU and GPU together in a heterogeneous computing model.Sequential part of the application runs on the CPU and the computationally-intensive part runs on the GPU. From the user’s perspective, the application just runs faster because it is using the high-performance of the GPU to boost performance.
  • 5. Over the past few years, the GPU has evolved from a fixed-function special-purpose processor into a full-fledged parallel programmable processor with additional fixed-function special-purpose functionalityWhy GPU Computing…
  • 6. GPU for Non-Graphic AppsThe GPU is designed for a particular class of applications with the following characteristics,Computational requirements are largeParallelism is substantialThroughput is more important than latencya growing community has identified other applications with similar characteristics and successfully mapped these applications onto the GPU
  • 7. GPU extends its hand towards CPU for performanceParallelism is the future of computingMany applications have to process huge set of data following same functionsSeveral stream processors can execute same set of instructions on different data sets and give a higher throughput If GPU take some share of computation load from CPU, many applications can be benefitted in speed-up
  • 8. GPU is now turned into a programmable engineGPU Architecture and Evolution
  • 9. GPU PipelineAvailable operations are configurable but not programmable
  • 11. All GPU programs must be structured in this way: many parallel elements, each processed in parallel by a single programGPU Computing Model
  • 12. Computing on the GPUProgramming a GPU for Graphicsprogrammer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
  • 13. Computing on the GPUProgramming a GPU for GraphicsProgramming a GPU for General-Purpose Programs (Old)programmer specifies geometric primitive covering computation domain of interest; rasterizer generates fragmentEach fragment is shaded by an SPMD general purpose FPFP computes the fragment by a combination of math operations and ‘gather’ accesses from global memory. resulting buffer can be used as an input on future passes. programmer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
  • 14. Computing on the GPUProgramming a GPU for General-Purpose Programs (New)programmer directly defines the computation domain of interest as a structured grid of threadsSPMD general-purpose program computes each threadeach thread is computed by a combination of math operations and both ‘gather’ (read) accesses from and ‘scatter’ (write) accesses to global memory; (same buffer can be used for both allowing more flexible algorithms)resulting buffer in global memory can then be used as an input in future computation
  • 16. Software EnvironmentsBrookGPUMicrosoft’s AcceleratorVendor Specific GPGPU systemsAMD ATI’s CTM (Close to the Metal)NVIDIA’s CUDA (Compute Unified Device Architecture)
  • 17. Scan performance on CPU, graphics-based GPU (using OpenGL), and direct-compute GPU (using CUDA). Results obtained on a GeForce 8800 GTX GPU and Intel Core2-Duo Extreme 2.93 GHz CPU. (Figure adapted from Harris et al.)Scan performance on CPU, OpenGL and CUDA
  • 19. Concluding for bright Future…support for double-precision floating-pointhigher bandwidth path between CPU and GPU (like ATI’s HyperTransport)more tightly coupled CPU and GPU (AMD’s fusion or nVidianForce)NVIDIA Quadro for Multiple GPU CollaborationFinally, let us wait for new era when GPU Computing will rule
  • 20. Thank YouI would also like to thank,