FileMood

Download Heterogeneous Parallel Programming

Heterogeneous Parallel Programming

Name

Heterogeneous Parallel Programming

  DOWNLOAD Copy Link

Trouble downloading? see How To

Total Size

954.0 MB

Total Files

138

Last Seen

2025-07-06 02:45

Hash

E37CDF3092A28925C607F32BC4EFE562CDC1811E

/

4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.mp4

41.8 MB

4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.mp4

40.4 MB

4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.mp4

39.7 MB

2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.mp4

37.9 MB

3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.mp4

33.1 MB

4 - 1 - 4.1- Parallel Computation Patterns - Reduction.mp4

32.6 MB

5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.mp4

32.2 MB

3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.mp4

31.0 MB

2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.mp4

30.3 MB

5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.mp4

29.1 MB

1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.mp4

28.4 MB

4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).mp4

28.3 MB

2 - 5 - 2.5- Tiled Matrix Multiplication.mp4

27.1 MB

1 - 1 - 1.1- Course Overview.mp4

27.0 MB

2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.mp4

26.8 MB

3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.mp4

26.5 MB

4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.mp4

26.3 MB

2 - 4 - 2.4- Tiled Parallel Algorithms.mp4

26.2 MB

3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.mp4

26.1 MB

1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.mp4

25.7 MB

1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.mp4

25.4 MB

5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.mp4

24.4 MB

3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.mp4

23.8 MB

5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.mp4

23.3 MB

5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.mp4

23.2 MB

1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.mp4

22.8 MB

1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.mp4

22.7 MB

2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.mp4

22.5 MB

Рекомендуемая литература David B. Kirk, Wen-mei W. Hwu Programming Massively Parallel Processors, Second Edition.pdf

22.4 MB

3 - 3 - 3.3- Parallel Computation Patterns - Convolution.mp4

22.1 MB

4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.mp4

21.0 MB

2 - 2 - 2.2- Control Divergence.mp4

20.8 MB

2 - 7 - 2.7- Handling Boundary Conditions in Tiling.mp4

18.8 MB

1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.mp4

18.6 MB

1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.mp4

10.1 MB

hetero-lecture_slides_002-Lecture 1-Lecture-1-5-cuda-API.pdf

914.3 KB

Lecture-5-3-CUDA-atomic.pdf

788.2 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-4-cuda-intro.pdf

607.7 KB

Lecture-4-7-more-on-scan.pdf

593.6 KB

Lecture-5-5-privatized-histogram.pdf

554.1 KB

Lecture-4-4-scan.pdf

539.1 KB

Lecture-3-2-memory-coalescing.pdf

524.9 KB

Lecture-3-6-convolution-reuse.pdf

518.2 KB

Lecture-5-1-histogram.pdf

513.9 KB

Lecture-3-3-convolution.pdf

511.8 KB

Lecture-3-1-dram-bandwidth.pdf

510.2 KB

Lecture-4-6-work-efficient-scan-kernel.pdf

503.6 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-6-cuda-kernel.pdf

502.8 KB

Lecture-3-5-2D-convolution-kernel.pdf

488.5 KB

Lecture-3-4-tiled-convolution.pdf

463.2 KB

Lecture-5-4-atomic-performance.pdf

454.6 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-2-control-divergence.pdf

452.8 KB

Lecture-5-2-atomic-operations.pdf

447.5 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-1-transparent-scaling.pdf

440.6 KB

Lecture-4-3-better-reduction-kernel.pdf

423.9 KB

Lecture-4-2-reduction-kernel.pdf

372.1 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-7-kernel-multidimension.pdf

351.6 KB

Lecture-4-5-naive-scan-kernel.pdf

346.9 KB

Lecture-4-1-reduction.pdf

301.3 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-3-cuda-memories.pdf

301.2 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-3-software-cost.pdf

286.4 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-2-heterogeneous.pdf

278.8 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-8-kernel-matrix-multiplication.pdf

276.1 KB

hetero-lecture_slides_002-Lecture 1-Lecture-1-1-Overview.pdf

248.4 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-8-boundary-condition-kernel.pdf

239.0 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-6-tiled-kernel.pdf

236.7 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-4-tiled-algorithms.pdf

227.5 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-7-boundary-condition.pdf

180.1 KB

hetero-lecture_slides_002-Lecture 2-Lecture-2-5-tiled-matrix-multiplication.pdf

166.3 KB

2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.srt

33.6 KB

4 - 1 - 4.1- Parallel Computation Patterns - Reduction.srt

29.2 KB

3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.srt

28.9 KB

Гетерогенное параллельное программирование.docx

28.3 KB

3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.srt

27.9 KB

1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.srt

27.1 KB

2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.srt

27.0 KB

4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.srt

26.9 KB

1 - 1 - 1.1- Course Overview.srt

26.4 KB

4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.srt

25.6 KB

2 - 5 - 2.5- Tiled Matrix Multiplication.srt

25.6 KB

4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.srt

25.1 KB

4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).srt

24.7 KB

1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.srt

24.1 KB

2 - 4 - 2.4- Tiled Parallel Algorithms.srt

23.6 KB

1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.srt

23.3 KB

3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.srt

23.3 KB

2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.srt

23.2 KB

4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.srt

22.4 KB

5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.srt

22.1 KB

3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.srt

21.6 KB

2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.srt

20.9 KB

1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.srt

20.7 KB

2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.txt

20.7 KB

5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.srt

20.0 KB

1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.srt

19.9 KB

1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.srt

19.5 KB

4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.srt

19.1 KB

3 - 3 - 3.3- Parallel Computation Patterns - Convolution.srt

19.1 KB

3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.srt

18.9 KB

2 - 2 - 2.2- Control Divergence.srt

18.5 KB

4 - 1 - 4.1- Parallel Computation Patterns - Reduction.txt

17.9 KB

3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.txt

17.5 KB

5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.srt

17.2 KB

3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.txt

17.1 KB

2 - 7 - 2.7- Handling Boundary Conditions in Tiling.srt

16.9 KB

2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.txt

16.6 KB

4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.txt

16.5 KB

1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.txt

16.4 KB

1 - 1 - 1.1- Course Overview.txt

16.2 KB

2 - 5 - 2.5- Tiled Matrix Multiplication.txt

15.7 KB

5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.srt

15.7 KB

4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.txt

15.7 KB

5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.srt

15.7 KB

4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.txt

15.6 KB

4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).txt

15.0 KB

1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.txt

14.8 KB

2 - 4 - 2.4- Tiled Parallel Algorithms.txt

14.6 KB

1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.txt

14.5 KB

3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.txt

14.3 KB

2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.txt

14.2 KB

4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.txt

13.6 KB

3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.txt

13.5 KB

5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.txt

13.4 KB

1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.txt

12.8 KB

2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.txt

12.8 KB

1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.txt

12.5 KB

5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.txt

12.4 KB

1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.txt

12.0 KB

4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.txt

11.8 KB

3 - 3 - 3.3- Parallel Computation Patterns - Convolution.txt

11.7 KB

3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.txt

11.7 KB

2 - 2 - 2.2- Control Divergence.txt

11.5 KB

1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.srt

11.1 KB

5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.txt

10.6 KB

2 - 7 - 2.7- Handling Boundary Conditions in Tiling.txt

10.4 KB

5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.txt

9.6 KB

5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.txt

9.6 KB

1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.txt

6.8 KB

 

Total files 138


Copyright © 2025 FileMood.com