Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 8 additions & 0 deletions .idea/.gitignore

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 1 addition & 0 deletions .idea/.name

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 2 additions & 0 deletions .idea/Homework-3.iml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

344 changes: 344 additions & 0 deletions .idea/editor.xml

Large diffs are not rendered by default.

7 changes: 7 additions & 0 deletions .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

8 changes: 8 additions & 0 deletions .idea/modules.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

7 changes: 7 additions & 0 deletions .idea/vcs.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

12 changes: 11 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -103,9 +103,19 @@ For each test case (0 through 9, using the same `data` folder from Assignment 2)

| Test Case | Dimensions (\( m \times n \times p \)) | Naive CPU (s) | Blocked CPU (s) | Parallel CPU (s) | Naive CUDA (s) | Tiled CUDA (s) | Tiled CUDA Speedup (vs. Naive CUDA) | Tiled CUDA Speedup (vs. Parallel CPU) |
|-----------|----------------------------------------|---------------|-----------------|------------------|----------------|----------------|-------------------------------------|---------------------------------------|
| | | | | | | | | |
| 0 | 64 × 64 × 64 | 0.00100017 | 0.000999928 | 0.000999928 | 0.000149568 | 6.7648e-05 | 2.211* | 14.78* |
| 1 | 128 * 64 * 128 | 0.00300002 | 0.00300002 | 0.00100017 | 0.000125472 | 7.3888e-05 | 1.698* | 13.53* |
| 2 | 100 * 128 * 56 | 0.00200009 | 0.00199986 | 0.00100017 | 0.000140832 | 7.3568e-05 | 1.914* | 13.59* |
| 3 | 128 * 64 * 128 | 0.00300002 | 0.00300002 | 0.00199986 | 0.000151168 | 7.936e-05 | 1.905* | 25.19* |
| 4 | 32 * 128 * 32 | 0.00100017 | 0.00100017 | 0.000999928 | 0.000135456 | 6.5984e-05 | 2.052* | 15.15* |
| 5 | 200 * 100 * 256 | 0.0150001 | 0.0149999 | 0.00499988 | 0.000186176 | 9.9872e-05 | 1.864* | 50.07* |
| 6 | 256 * 256 * 256 | 0.043 | 0.046 | 0.0110002 | 0.000225056 | 0.000133568 | 1.684* | 82.36* |
| 7 | 256 * 300 * 256 | 0.0500002 | 0.0539999 | 0.013 | 0.00034144 | 0.000144352 | 2.365* | 90.08* |
| 8 | 64 * 128 * 64 | 0.00100017 | 0.000999928 | 0.000999928 | 0.000126912 | 7.2224e-05 | 1.758* | 13.84* |
| 9 | 256 * 256 * 257 | 0.043 | 0.0450001 | 0.0110002 | 0.000227968 | 0.000134144 | 1.699* | 82.00* |

---
I used my own machine for this and it contains a NVIDIA GeForce RTX 3070

### Matrix Storage and Memory Management

Expand Down
Empty file.
Empty file.
Empty file.
Empty file.
Loading