EECS-368-Programming-Massively-Parallel-Processors-with-CUDA

Winter 2016
Course: Programming-Massively-Parallel-Processors-with-CUDA, Northwestern University, Evanston, IL

Lab1:Matrix multiplication

The lab’s focus is on producing correct code. This project reinforces the acquisition of basic GPU/CUDA programming skills, the software interface, and the basic architecture of the device.

Lab2:Tiled matrix multiplication

This lab focuses on data layout and decomposition, and full utilization of shared memory resources and global bandwidth through bank conflict avoidance and memory coalescing.

Lab3:Histograms

In this lab you are called to define optimization goals and strategy, implement them, and keep a research lab journal on which you report statistics and analyze every optimization you tried, even ones that did not work or degraded performance. For this assignment you will need to read recent research papers that outline some of the best-known ways to solve this problem.

Lab4:Parallel prefix sum

This lab focuses on the application of efficient parallel algorithms that utilize shared memory and synchronization and minimize path divergence.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
lab1		lab1
lab2		lab2
lab3		lab3
lab4		lab4
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EECS-368-Programming-Massively-Parallel-Processors-with-CUDA

Lab1:Matrix multiplication

Lab2:Tiled matrix multiplication

Lab3:Histograms

Lab4:Parallel prefix sum

About

Releases

Packages

Languages

dawn-chu/EECS-368-Programming-Massively-Parallel-Processors-with-CUDA

Folders and files

Latest commit

History

Repository files navigation

EECS-368-Programming-Massively-Parallel-Processors-with-CUDA

Lab1:Matrix multiplication

Lab2:Tiled matrix multiplication

Lab3:Histograms

Lab4:Parallel prefix sum

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages