Igli Balla: Implemented optimized matrix multiplication by Igli333 · Pull Request #16 · parallelcomputingabo/Homework-2

Igli333 · 2025-05-07T20:00:35Z

In this assignment, I optimized the naive matrix multiplication in two ways: cache blocking (tiling) and multithreading.
The approach was straightforward. For cache blocking, the matrices are split into tiles, so smaller sub-matrix products are computed and accumulated into the full result. For parallelization, I used OpenMP, which splits the work across threads that compute different parts of the product at the same time, further improving performance.
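As a rough illustration of how tiling and OpenMP can be combined, here is a minimal sketch. It is not the code from this PR: it assumes C++, row-major `std::vector<float>` storage, and a hypothetical function name `blocked_matmul` with an illustrative tile size of 32.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the PR): C (m x p) = A (m x n) * B (n x p),
// all matrices stored row-major. The default tile size is an assumption.
void blocked_matmul(const std::vector<float>& A, const std::vector<float>& B,
                    std::vector<float>& C, int m, int n, int p, int tile = 32) {
    assert(tile > 0);
    assert(A.size() == static_cast<std::size_t>(m) * n);
    assert(B.size() == static_cast<std::size_t>(n) * p);
    assert(C.size() == static_cast<std::size_t>(m) * p);

    std::fill(C.begin(), C.end(), 0.0f);

    // Each thread takes whole row-tiles of C, so no two threads ever
    // write to the same output element and no locking is needed.
    #pragma omp parallel for schedule(static)
    for (int ii = 0; ii < m; ii += tile) {
        const int i_end = std::min(ii + tile, m);
        for (int kk = 0; kk < n; kk += tile) {
            const int k_end = std::min(kk + tile, n);
            for (int jj = 0; jj < p; jj += tile) {
                const int j_end = std::min(jj + tile, p);
                // Multiply one tile of A by one tile of B and accumulate
                // the partial product into the matching tile of C.
                for (int i = ii; i < i_end; ++i) {
                    for (int k = kk; k < k_end; ++k) {
                        const float a = A[static_cast<std::size_t>(i) * n + k];
                        for (int j = jj; j < j_end; ++j) {
                            C[static_cast<std::size_t>(i) * p + j] +=
                                a * B[static_cast<std::size_t>(k) * p + j];
                        }
                    }
                }
            }
        }
    }
}
```

With GCC or Clang the pragma takes effect only when compiling with `-fopenmp`. Without that flag it is ignored and the function runs serially, still producing the same result. The best tile size depends on the cache sizes of the target machine and is worth measuring rather than assuming.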

The main challenge was finding the small details that can reduce performance, such as unnecessary casts and inefficient memory access patterns.
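To make the memory-access point concrete, the sketch below contrasts two loop orders for the same product. It is an illustration under assumptions (C++, row-major storage, hypothetical function names `matmul_ijk` and `matmul_ikj`), not the code from this PR. Dimensions are taken as `std::size_t` so that index arithmetic needs no casts.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical i-j-k order: the inner loop reads B[k * p + j] with k
// varying, so consecutive reads are p elements apart (strided access).
void matmul_ijk(const std::vector<double>& A, const std::vector<double>& B,
                std::vector<double>& C,
                std::size_t m, std::size_t n, std::size_t p) {
    assert(A.size() == m * n && B.size() == n * p && C.size() == m * p);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < p; ++j) {
            double sum = 0.0;
            for (std::size_t k = 0; k < n; ++k) {
                sum += A[i * n + k] * B[k * p + j];
            }
            C[i * p + j] = sum;
        }
    }
}

// Hypothetical i-k-j order: the inner loop walks one row of B and one
// row of C contiguously, and A[i * n + k] is loaded once per inner loop.
void matmul_ikj(const std::vector<double>& A, const std::vector<double>& B,
                std::vector<double>& C,
                std::size_t m, std::size_t n, std::size_t p) {
    assert(A.size() == m * n && B.size() == n * p && C.size() == m * p);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < p; ++j) {
            C[i * p + j] = 0.0;
        }
        for (std::size_t k = 0; k < n; ++k) {
            const double a = A[i * n + k];
            for (std::size_t j = 0; j < p; ++j) {
                C[i * p + j] += a * B[k * p + j];
            }
        }
    }
}
```

Both functions compute the same product. On row-major data the second order usually makes better use of cache lines, though the size of the difference depends on the matrix dimensions and the hardware.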

Igli333 and others added 5 commits May 6, 2025 13:33

First part of the work

95e3460

Revert "First part of the work"

988862c

This reverts commit 95e3460.

First part of the work

e24eabb

Code almost finished -- results to be recorded

1853559

Code optimized and results added

138b08a

Igli333 wants to merge 5 commits into parallelcomputingabo:main from Igli333:igli-balla
