igli-balla: Implemented CUDA matrix multiplication by Igli333 · Pull Request #9 · parallelcomputingabo/Homework-3

Igli333 · 2025-05-28T16:56:02Z

The solution for this assignment contains naive and tiled implementations of matrix multiplication on GPUs. It tries to optimise the process by allowing the GPU to do more work at the same time.
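For readers unfamiliar with the naive variant, here is a minimal sketch of the idea, not the code from this PR. It assumes row-major `float` matrices, with `C` (M x N) = `A` (M x K) * `B` (K x N). The names `matmul_naive` and `matmul_naive_host` and the 16 x 16 block size are hypothetical choices made for illustration.

```cuda
#include <cuda_runtime.h>

// Hypothetical naive kernel: each thread computes one element of C.
// A is M x K, B is K x N, C is M x N, all row-major.
__global__ void matmul_naive(const float* A, const float* B, float* C,
                             int M, int K, int N) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= M || col >= N) return;  // threads outside the matrix do nothing

    float acc = 0.0f;
    for (int k = 0; k < K; ++k) {
        acc += A[row * K + k] * B[k * N + col];  // every operand comes from global memory
    }
    C[row * N + col] = acc;
}

// Hypothetical host helper: copies inputs to the device, launches the kernel,
// and copies the result back. Error checking is omitted for brevity.
void matmul_naive_host(const float* hA, const float* hB, float* hC,
                       int M, int K, int N) {
    float *dA, *dB, *dC;
    cudaMalloc(&dA, sizeof(float) * M * K);
    cudaMalloc(&dB, sizeof(float) * K * N);
    cudaMalloc(&dC, sizeof(float) * M * N);
    cudaMemcpy(dA, hA, sizeof(float) * M * K, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, sizeof(float) * K * N, cudaMemcpyHostToDevice);

    dim3 block(16, 16);  // assumed block size
    dim3 grid((N + block.x - 1) / block.x,   // ceiling division so the grid
              (M + block.y - 1) / block.y);  // covers the whole output matrix
    matmul_naive<<<grid, block>>>(dA, dB, dC, M, K, N);

    cudaMemcpy(hC, dC, sizeof(float) * M * N, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
}
```

Each thread here reads a full row of `A` and a full column of `B` from global memory, which is the redundancy a tiled version tries to reduce.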

The Dione cluster offered by Abo Akademi and the University of Turku was used to run the code.
The main challenge was fixing a few issues where the tiled multiplication computed only part of the output matrix rather than all of its values.
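The PR does not say what caused the partial results. One common cause of this symptom in tiled kernels is matrix dimensions that are not multiples of the tile width, combined with a tile count that rounds down or missing bounds checks. The sketch below shows one way to guard against that. It is an illustration under assumptions, not the code from this PR: `TILE`, `matmul_tiled`, and `matmul_tiled_host` are hypothetical names, and matrices are assumed to be row-major `float`.

```cuda
#include <cuda_runtime.h>

#define TILE 16  // assumed tile width; each block has TILE x TILE threads

// Hypothetical tiled kernel: C (M x N) = A (M x K) * B (K x N), row-major.
__global__ void matmul_tiled(const float* A, const float* B, float* C,
                             int M, int K, int N) {
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    // Ceiling division: the last, partially filled tile along K is included.
    int numTiles = (K + TILE - 1) / TILE;
    for (int t = 0; t < numTiles; ++t) {
        int aCol = t * TILE + threadIdx.x;
        int bRow = t * TILE + threadIdx.y;
        // Out-of-range positions are padded with zeros, so they add nothing.
        As[threadIdx.y][threadIdx.x] =
            (row < M && aCol < K) ? A[row * K + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] =
            (bRow < K && col < N) ? B[bRow * N + col] : 0.0f;
        __syncthreads();  // tile fully loaded before anyone reads it

        for (int k = 0; k < TILE; ++k) {
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        }
        __syncthreads();  // everyone done reading before the next load
    }

    // No early return above: every thread must reach each __syncthreads().
    if (row < M && col < N) {
        C[row * N + col] = acc;
    }
}

// Hypothetical host helper; error checking is omitted for brevity.
void matmul_tiled_host(const float* hA, const float* hB, float* hC,
                       int M, int K, int N) {
    float *dA, *dB, *dC;
    cudaMalloc(&dA, sizeof(float) * M * K);
    cudaMalloc(&dB, sizeof(float) * K * N);
    cudaMalloc(&dC, sizeof(float) * M * N);
    cudaMemcpy(dA, hA, sizeof(float) * M * K, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, sizeof(float) * K * N, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((N + TILE - 1) / TILE, (M + TILE - 1) / TILE);  // cover edge blocks
    matmul_tiled<<<grid, block>>>(dA, dB, dC, M, K, N);

    cudaMemcpy(hC, dC, sizeof(float) * M * N, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
}
```

A quick way to exercise the edge cases is to call `matmul_tiled_host` with sizes such as M = 17, K = 33, N = 18, where no dimension is a multiple of `TILE`, and compare the result against a CPU reference.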

igli-balla: Implemented CUDA matrix multiplication

937300e

Igli333 wants to merge 1 commit into parallelcomputingabo:main from Igli333:igli-balla
