Optimized Matrix Multiplication and Validation Logic by BehrozKarim · Pull Request #2 · AA-parallel-computing/Assignment-4-Optional

BehrozKarim · 2026-05-30T13:14:36Z

Implemented naive, blocked and parallel matrix multiplication
Updated the README with performance results
Added a writeup.pdf explaining the implementation and the optimization journey

naive_matmul: the textbook i -> j -> k triple loop with a local sum accumulator.
blocked_matmul: a six-loop tile structure with block_size = 32. Inside each tile, the inner loops run i -> k -> j instead of the pseudocode's i -> j -> k. With j innermost, the A element is loop-invariant and both B and C are accessed with stride 1, which lets the hardware prefetcher and the compiler's auto-vectorizer work effectively. std::min(...) clamps the tile bounds when a dimension is not a multiple of the block size; C is zero-initialized because each tile accumulates into it.
parallel_matmul: #pragma omp parallel for over the outer i loop. Each thread zeros and then fills a disjoint set of rows of C (so no atomics or reductions are needed), using the same i -> k -> j inner ordering.
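
For readers without the diff open, the blocked and parallel variants described above could look roughly like the sketch below. It is an illustration, not the code in this PR: the function names with the `_sketch` suffix, the signatures, the flat row-major `std::vector<double>` storage, and the ii -> jj -> kk order of the outer tile loops are all assumptions. Only the i -> k -> j inner ordering, `block_size = 32`, the `std::min` clamping, the zero-initialization, and the `#pragma omp parallel for` over i come from the description.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical layout: row-major A (M x K), B (K x N), C (M x N).
void blocked_matmul_sketch(const std::vector<double>& A,
                           const std::vector<double>& B,
                           std::vector<double>& C,
                           std::size_t M, std::size_t N, std::size_t K,
                           std::size_t block_size = 32) {
    assert(A.size() == M * K && B.size() == K * N && C.size() == M * N);
    assert(block_size > 0);
    std::fill(C.begin(), C.end(), 0.0);  // every tile accumulates into C

    // Three outer loops walk the tiles (outer tile order is an assumption).
    for (std::size_t ii = 0; ii < M; ii += block_size) {
        const std::size_t i_end = std::min(ii + block_size, M);
        for (std::size_t jj = 0; jj < N; jj += block_size) {
            const std::size_t j_end = std::min(jj + block_size, N);
            for (std::size_t kk = 0; kk < K; kk += block_size) {
                const std::size_t k_end = std::min(kk + block_size, K);
                // Three inner loops in i -> k -> j order.
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const double a = A[i * K + k];  // invariant in the j loop
                        for (std::size_t j = jj; j < j_end; ++j) {
                            C[i * N + j] += a * B[k * N + j];  // stride 1 in B and C
                        }
                    }
                }
            }
        }
    }
}

void parallel_matmul_sketch(const std::vector<double>& A,
                            const std::vector<double>& B,
                            std::vector<double>& C,
                            std::size_t M, std::size_t N, std::size_t K) {
    assert(A.size() == M * K && B.size() == K * N && C.size() == M * N);

    // Each iteration owns row i of C, so threads never write the same element.
    #pragma omp parallel for
    for (std::ptrdiff_t i = 0; i < static_cast<std::ptrdiff_t>(M); ++i) {
        double* c_row = C.data() + static_cast<std::size_t>(i) * N;
        std::fill(c_row, c_row + N, 0.0);  // the thread zeros its own row first
        for (std::size_t k = 0; k < K; ++k) {
            const double a = A[static_cast<std::size_t>(i) * K + k];
            const double* b_row = B.data() + k * N;
            for (std::size_t j = 0; j < N; ++j) {
                c_row[j] += a * b_row[j];
            }
        }
    }
}
```

Compiled without OpenMP support (for example, without `-fopenmp` on GCC or Clang), the pragma is ignored and `parallel_matmul_sketch` simply runs serially with the same result.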

We switched to the i -> k -> j inner ordering because the pseudocode's i -> j -> k version of the implementation did not give us any speedup.
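
To make the difference between the two orderings concrete, here is a minimal side-by-side sketch on square row-major matrices, together with a simple element-wise check. The names `matmul_ijk`, `matmul_ikj`, and `max_abs_diff` are hypothetical and are not claimed to match the validation logic in this PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// i -> j -> k: the innermost loop reads B[k * n + j] with stride n.
std::vector<double> matmul_ijk(const std::vector<double>& A,
                               const std::vector<double>& B, std::size_t n) {
    assert(A.size() == n * n && B.size() == n * n);
    std::vector<double> C(n * n, 0.0);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            double sum = 0.0;  // local accumulator
            for (std::size_t k = 0; k < n; ++k) {
                sum += A[i * n + k] * B[k * n + j];
            }
            C[i * n + j] = sum;
        }
    }
    return C;
}

// i -> k -> j: the innermost loop walks one row of B and one row of C with stride 1.
std::vector<double> matmul_ikj(const std::vector<double>& A,
                               const std::vector<double>& B, std::size_t n) {
    assert(A.size() == n * n && B.size() == n * n);
    std::vector<double> C(n * n, 0.0);
    for (std::size_t i = 0; i < n; ++i) {
        for (std::size_t k = 0; k < n; ++k) {
            const double a = A[i * n + k];  // fixed for the whole j loop
            for (std::size_t j = 0; j < n; ++j) {
                C[i * n + j] += a * B[k * n + j];
            }
        }
    }
    return C;
}

// Largest element-wise absolute difference between two equally sized results.
double max_abs_diff(const std::vector<double>& X, const std::vector<double>& Y) {
    assert(X.size() == Y.size());
    double worst = 0.0;
    for (std::size_t idx = 0; idx < X.size(); ++idx) {
        worst = std::max(worst, std::fabs(X[idx] - Y[idx]));
    }
    return worst;
}
```

Both functions perform the same multiplications; for each output element the products are also added in the same k order, so the two results should agree. A check such as `max_abs_diff(matmul_ijk(A, B, n), matmul_ikj(A, B, n)) < 1e-9` is one way to compare an optimized variant against the naive reference (the tolerance is an arbitrary illustrative choice).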

BehrozKarim added 2 commits May 30, 2026 15:40

behroz-karim: Implemented optimized matrix multiplication

e19e374

Update writeup.pdf to fix some minor issues.

ff482dd

BehrozKarim changed the title ~~Optimized Matrix Multiplicagtion and Validation Logic~~ Optimized Matrix Multiplication and Validation Logic May 30, 2026

BehrozKarim wants to merge 2 commits into AA-parallel-computing:main from BehrozKarim:behroz-karim
