meteharun: Implemented optimized matrix multiplication by meteharun · Pull Request #3 · parallelcomputingabo/Homework-2

meteharun · 2025-05-02T08:39:20Z

Implemented and benchmarked three matrix multiplication methods:

- Used tiling in the blocked version to improve cache locality
- Used `#pragma omp parallel for` to parallelize row-wise computation in the parallel version
- Rounded float values to 2 decimals to ensure correct validation
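
The tiling and rounding steps might look roughly like the sketch below. This is not the code from the PR: the function names `blocked_matmul` and `round2`, the row-major `std::vector<float>` layout, and the tile size of 32 are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical tile edge; the PR does not state which value it used.
constexpr std::size_t kTile = 32;

// Computes C (m x p) = A (m x n) * B (n x p), all stored row-major.
// The three outer loops walk over tiles so that the working set of the
// three inner loops is small enough to stay in cache.
void blocked_matmul(const std::vector<float>& A, const std::vector<float>& B,
                    std::vector<float>& C,
                    std::size_t m, std::size_t n, std::size_t p) {
    assert(A.size() == m * n && B.size() == n * p);
    C.assign(m * p, 0.0f);
    for (std::size_t ii = 0; ii < m; ii += kTile) {
        for (std::size_t kk = 0; kk < n; kk += kTile) {
            for (std::size_t jj = 0; jj < p; jj += kTile) {
                // Clamp tile bounds so sizes need not be multiples of kTile.
                const std::size_t i_end = std::min(ii + kTile, m);
                const std::size_t k_end = std::min(kk + kTile, n);
                const std::size_t j_end = std::min(jj + kTile, p);
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const float a = A[i * n + k];
                        for (std::size_t j = jj; j < j_end; ++j) {
                            C[i * p + j] += a * B[k * p + j];
                        }
                    }
                }
            }
        }
    }
}

// Rounds to two decimals, the precision the PR says it uses for validation.
float round2(float x) {
    return std::round(x * 100.0f) / 100.0f;
}
```

The extra loop nesting and bounds clamping add a fixed cost per tile, which is one plausible reason a blocked version can lose to a plain triple loop when the matrices already fit in cache.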

Included detailed benchmark table in README.md.

- For smaller matrix sizes, the blocked matmul added overhead and performed worse
- Had to adjust file paths because CLion runs the executable from the `cmake-build-debug` directory
- Minor OpenMP data-scoping issues were resolved with `default(none)` and `shared(...)`
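
For reference, a row-parallel loop with explicit scoping could be written as in the sketch below. The function name `parallel_matmul`, the raw-pointer signature, and the row-major layout are assumptions, not the PR's actual code. With `default(none)`, every variable from the enclosing scope that the loop uses must be listed in a data-sharing clause, so a forgotten variable becomes a compile error rather than a silent race.

```cpp
#include <cassert>

// Computes C (m x p) = A (m x n) * B (n x p), all stored row-major.
// Each thread handles a subset of the rows of C, so no two threads
// ever write to the same element.
void parallel_matmul(const float* A, const float* B, float* C,
                     int m, int n, int p) {
    assert(m >= 0 && n >= 0 && p >= 0);
    // default(none) forces explicit scoping of everything used in the loop.
    // i, j, k and sum are declared inside the region and are therefore private.
    #pragma omp parallel for default(none) shared(A, B, C, m, n, p)
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < p; ++j) {
            float sum = 0.0f;
            for (int k = 0; k < n; ++k) {
                sum += A[i * n + k] * B[k * p + j];
            }
            C[i * p + j] = sum;
        }
    }
}
```

Build with OpenMP enabled (for example `-fopenmp` on GCC or Clang); without it the pragma is ignored and the loop simply runs serially.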

All tests pass. The speedup for large matrices is significant, especially with the parallel version.

meteharun: Implemented optimized matrix multiplication,

14e80b6