SlideShare a Scribd company logo
3
Most read
6
Most read
16
Most read
CUDAArchitecture
Prof. Shashikant V. Athawale
Assistant Professor | Computer Engineering
Department | AISSMS College of Engineering,
Kennedy Road, Pune , MH, India - 411001
Contents
❖ CUDAArchitecture
❖ Applications of CUDA
❖ Introduction to CUDA C-Write and launch CUDA C
kernels
❖ Manage GPU memory
❖ Manage communication and synchronization
❖ Parallel programming in CUDA- C.
Communication And Synchronization in
Thread
Communication And Synchronization in
Thread
CUDAArchitecture
CUDAArchitecture
Applications of CUDA
CUDA C : The Basics
❖ Based on industry-standard C
❖ A handful of language extensions to allow heterogeneous
programs
❖ Straightforward APIs to manage devices, memory, etc.
❖ Terminology:
➢ Host – The CPU and its memory (host memory)
➢ Device – The GPU and its memory (device memory)
Device
CUDA Kernels
GPU Memory Management
Data Transfer Directions Keywords
❖ cudaMemcpyHostToHost
❖ cudaMemcpyHostToDevice
❖ cudaMemcpyDeviceToHost
❖ cudaMemcpyDeviceToDevice
Parallel Programming in CUDA C
❖ CUDA brings data-parallel computing to the masses.
❖ CUDA is a scalable parallel programming model.
❖ Program runs on any number of processors without
recompiling.
Architecture Of Parallel CUDA Programming
CUDA Uses Extensive Multithreading
❖ CUDA threads express fine-grained data parallelism.
➢ Map threads to GPU threads.
➢ Virtualize the processors.
❖ CUDA thread blocks express coarse-grained parallelism.
➢ Blocks hold arrays of GPU threads, define shared
memory boundaries.
➢ Allow scaling between smaller and larger GPUs.
CUDA Uses Extensive Multithreading
❖ GPUs execute thousands of lightweight threads.
➢ In graphics, each thread computes one pixel.
➢ One CUDA thread computes one result (or several
results).
➢ Hardware multithreading & zero-overhead
scheduling.
Applications
❖ High bandwidth
❖ Visual computing
❖ High arithmetic intensity

More Related Content

PDF
Introduction to CUDA
PPTX
Accountability And Auditing In Professional Practice
PDF
GAN in medical imaging
PDF
The Rise of Forerunners | Accenture
PDF
Best Practice on using Azure OpenAI Service
PPTX
Types Of Buses
PPTX
Online Shopping Agent in AI
Introduction to CUDA
Accountability And Auditing In Professional Practice
GAN in medical imaging
The Rise of Forerunners | Accenture
Best Practice on using Azure OpenAI Service
Types Of Buses
Online Shopping Agent in AI

What's hot (20)

PPTX
PPTX
Cuda Architecture
PPTX
Gpu with cuda architecture
PPTX
GPU Architecture NVIDIA (GTX GeForce 480)
PPTX
Graphics processing unit (GPU)
PPT
NVIDIA CUDA
PDF
GPU - Basic Working
PDF
Introduction to GPU Programming
PDF
FPGA Hardware Accelerator for Machine Learning
PPTX
Knowledge representation in AI
PPTX
graphics processing unit ppt
PDF
Parallel Algorithms
PPTX
Graphics processing unit
PPTX
Integration of Sensors & Actuators With Arduino.pptx
PDF
GPU - An Introduction
PPTX
GPU Computing
PPTX
Graphics processing unit ppt
PPTX
Deep neural networks
PPTX
Cuda Architecture
Gpu with cuda architecture
GPU Architecture NVIDIA (GTX GeForce 480)
Graphics processing unit (GPU)
NVIDIA CUDA
GPU - Basic Working
Introduction to GPU Programming
FPGA Hardware Accelerator for Machine Learning
Knowledge representation in AI
graphics processing unit ppt
Parallel Algorithms
Graphics processing unit
Integration of Sensors & Actuators With Arduino.pptx
GPU - An Introduction
GPU Computing
Graphics processing unit ppt
Deep neural networks
Ad

Similar to CUDA Architecture (20)

PPTX
Compute Unified Device Architecture (CUDA) Programmimg
PDF
IIT ropar_CUDA_Report_Ankita Dewan
PDF
IIT ropar_CUDA_Report_Ankita Dewan
PDF
NASA Advanced Supercomputing (NAS) Division - Programming and Building HPC Ap...
PDF
CUDA by Example : The Final Countdown : Notes
PPTX
PDF
PPTX
PPTX
Pycon2014 GPU computing
PDF
Cuda materials
PDF
GPU Programming with Java
PDF
Cuda lab manual
PDF
Engineer Engineering Software
PDF
The Past, Present, and Future of OpenACC
PDF
Kato Mivule: An Overview of CUDA for High Performance Computing
PPT
Guide to heterogeneous system architecture (hsa)
PPT
Cuda intro
PDF
S0333 gtc2012-gmac-programming-cuda
PPTX
GPU in Computer Science advance topic .pptx
PDF
Introduction to CUDA programming in C language
Compute Unified Device Architecture (CUDA) Programmimg
IIT ropar_CUDA_Report_Ankita Dewan
IIT ropar_CUDA_Report_Ankita Dewan
NASA Advanced Supercomputing (NAS) Division - Programming and Building HPC Ap...
CUDA by Example : The Final Countdown : Notes
Pycon2014 GPU computing
Cuda materials
GPU Programming with Java
Cuda lab manual
Engineer Engineering Software
The Past, Present, and Future of OpenACC
Kato Mivule: An Overview of CUDA for High Performance Computing
Guide to heterogeneous system architecture (hsa)
Cuda intro
S0333 gtc2012-gmac-programming-cuda
GPU in Computer Science advance topic .pptx
Introduction to CUDA programming in C language
Ad

More from Dr Shashikant Athawale (20)

PPT
multi threaded and distributed algorithms
PPT
Amortized analysis
PPT
Complexity theory
PPT
Divide and Conquer
PPT
Model and Design
PPT
Fundamental of Algorithms
PPT
Parallel Algorithms- Sorting and Graph
PPT
Analytical Models of Parallel Programs
PPT
Basic Communication
PPT
Parallel Processing Concepts
PPT
Parallel Processing Concepts
PPT
Dynamic programming
PPT
Parallel algorithms
PPT
Greedy method
PPT
Divide and conquer
PPT
Branch and bound
PPT
Asymptotic notation
PPT
String matching algorithms
PPTX
Advanced Wireless Technologies
multi threaded and distributed algorithms
Amortized analysis
Complexity theory
Divide and Conquer
Model and Design
Fundamental of Algorithms
Parallel Algorithms- Sorting and Graph
Analytical Models of Parallel Programs
Basic Communication
Parallel Processing Concepts
Parallel Processing Concepts
Dynamic programming
Parallel algorithms
Greedy method
Divide and conquer
Branch and bound
Asymptotic notation
String matching algorithms
Advanced Wireless Technologies

Recently uploaded (20)

PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPT
Project quality management in manufacturing
PPTX
additive manufacturing of ss316l using mig welding
PDF
III.4.1.2_The_Space_Environment.p pdffdf
PPTX
Construction Project Organization Group 2.pptx
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
Well-logging-methods_new................
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PPT
Total quality management ppt for engineering students
PDF
Categorization of Factors Affecting Classification Algorithms Selection
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
R24 SURVEYING LAB MANUAL for civil enggi
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Project quality management in manufacturing
additive manufacturing of ss316l using mig welding
III.4.1.2_The_Space_Environment.p pdffdf
Construction Project Organization Group 2.pptx
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Well-logging-methods_new................
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
Total quality management ppt for engineering students
Categorization of Factors Affecting Classification Algorithms Selection
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Fundamentals of safety and accident prevention -final (1).pptx

CUDA Architecture