niklas-pihl: Implemented CUDA matrix multiplication by pihlnikl · Pull Request #11 · parallelcomputingabo/Homework-3

pihlnikl · 2025-05-30T16:35:13Z

Implemented both naive and tiled multiplication.

The measured results are not the best. Had some problems due to the lack of an NVIDIA GPU, so I had to run the code in Google Colab which skewed the measurements somewhat and I couldn't even get measurable results on many runs. I suspect the tiled multiplication was too fast, because most of the runs just showed 0 seconds even with precision set to 20. This led to some funny results, comparing colabs speed to the speed of my own CPU (Parallel CPU).

niklas-pihl: Implemented CUDA matrix multiplication

940d0df

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

niklas-pihl: Implemented CUDA matrix multiplication#11

niklas-pihl: Implemented CUDA matrix multiplication#11
pihlnikl wants to merge 1 commit into
parallelcomputingabo:mainfrom
pihlnikl:niklas-pihl

pihlnikl commented May 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pihlnikl commented May 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant