Skip to content

igli-balla: Implemented CUDA matrix multiplication#9

Open
Igli333 wants to merge 1 commit into
parallelcomputingabo:mainfrom
Igli333:igli-balla
Open

igli-balla: Implemented CUDA matrix multiplication#9
Igli333 wants to merge 1 commit into
parallelcomputingabo:mainfrom
Igli333:igli-balla

Conversation

@Igli333
Copy link
Copy Markdown

@Igli333 Igli333 commented May 28, 2025

The solution for this assignment contains the naive and tiled implementation of matrix multiplication on GPUs. It tries to optimise the whole process by allowing more work do be done at the same time by the GPU.

  • The Dione cluster offered by Abo Akademi and University of Turku was used to run the code.
  • The main challenge was fixing a few issues where the tiled multiplication would calculate only a certain part of the matrix and not all the values.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant