Relevant Papers for CS 267, Spring 2011
Lecture 1: Introduction
The Landscape of Parallel Computing Research: A View from Berkeley
The Parallel Computing Laboratory at U.C. Berkeley: A Research Agenda Based on the Berkeley View
Lecture 2: Single processor machines: Memory hierarchies and processor features
Computer Architecture: A Quantitative Approach
(book)
Cache Memories
PHiPAC - Optimizing Dense Matrix Multiply
Lecture 3: Introduction to Parallel Machines and Programming Models
The Sourcebook of Parallel Computing
(Section 10.1)
Lecture 4: Sources of Parallelism and Locality in Simulation
Lecture 5: Tricks with Trees
Scans as Primitive Parallel Operations
Parallel Prefix Sum (Scan) with CUDA
Lecture 6: Shared Memory Programming: Threads and OpenMP
Pthreads Tutorial
OpenMP Tutorial
The Sourcebook of Parallel Computing
Section 10.2-10.3 (Multithreading)
Sequential Consistency
Parallel Computer Architecture: A Hardware/Software Approach
Section 5.1 (Cache Coherence)
Parallel Computer Architecture: A Hardware/Software Approach
Sections 6.1-6.2 (Synchronization)
Lecture 7: Distributed Memory Programming
The Sourcebook of Parallel Computing
Section 10.1 (Message Passing)
Parallel Computer Architecture: A Hardware/Software Approach
Sections 10.4-10.5 (Interconnection Networks)
PRAM
PRAM (Berkeley notes)
LogP
LogGP
BSP
Roofline Model
Lecture 8: Performance Tools and Debugging
Twelve Ways to Fool the Masses
[
Back to the Resources Page
] [
Back to the Course Page
]