chore: adjust comment in Dockerfile regarding RTX5090 support by kyteinsky · Pull Request #316 · nextcloud/context_chat_backend

kyteinsky · 2026-06-17T05:32:05Z

Assisted-by: Github Copilot:claude-sonnet-4-6

fixes: #305

🤖 AI (if applicable)

The content of this PR was partly or fully generated using AI

Signed-off-by: kyteinsky <kyteinsky@gmail.com>

marcelklehr · 2026-06-17T09:06:59Z

-# Real cubins for all shipping GPU generations through Blackwell (sm_100),
-# plus one forward-compatible PTX target to keep wheel size manageable.
+# CMAKE_CUDA_ARCHITECTURES is intentionally not set here. llama.cpp's CMake
+# selects sensible defaults based on the detected CUDA toolkit version:


Mmh, but this is at build time, right? So it does not depend on the CUDA version that the admin has, but on the CUDA version of the CUDA_DEVEL_IMAGE, correct?

kyteinsky · 2026-06-17

Yep, and we depend on llama.cpp to make the correct choice of cubins to bake in, so we don't have to check and adjust it when the CUDA version changes.
We could set it ourselves, but that selection already exists in llama.cpp, so I didn't bother.
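
For readers following the thread, here is a rough sketch of why the set of architectures baked in at build time matters for a card like the RTX 5090. It models the general CUDA compatibility rules as I understand them: a cubin runs on devices of the same major compute capability with an equal or higher minor version, while embedded PTX can be JIT-compiled by the driver for any equal or newer capability. The names `Capability` and `can_run` are hypothetical, the value (12, 0) for the RTX 5090 is an assumption, and architecture-specific targets (the `a` suffix variants) are ignored. This is not llama.cpp's selection logic, only an illustration of the rule it has to satisfy.

```python
from typing import Iterable, Tuple

# (major, minor) compute capability, e.g. (12, 0) stands for sm_120.
Capability = Tuple[int, int]


def can_run(device: Capability,
            cubins: Iterable[Capability],
            ptx: Iterable[Capability] = ()) -> bool:
    """Return True if a build with these targets can run on `device`.

    Hypothetical helper for illustration; not part of llama.cpp or CUDA.
    """
    # A cubin is usable on the same major architecture with an equal or
    # higher minor revision.
    for major, minor in cubins:
        if major == device[0] and minor <= device[1]:
            return True
    # Embedded PTX can be JIT-compiled by the driver for any device whose
    # compute capability is equal to or higher than the PTX target.
    return any(target <= device for target in ptx)


# Assumed compute capability of the RTX 5090.
rtx_5090 = (12, 0)

# Cubins only up to sm_100, no PTX: the card has nothing it can execute.
only_cubins = can_run(rtx_5090, cubins=[(8, 0), (8, 6), (9, 0), (10, 0)])

# Same cubins plus a PTX target: usable via JIT compilation at load time.
with_ptx = can_run(rtx_5090, cubins=[(8, 0), (9, 0), (10, 0)], ptx=[(10, 0)])

# A native sm_120 cubin: usable directly.
native = can_run(rtx_5090, cubins=[(10, 0), (12, 0)])
```

Since the image is built against the toolkit in CUDA_DEVEL_IMAGE, whatever list the build picks is fixed at that point and does not change with the admin's installed CUDA version.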


## 5.4.0 - 2026-06-24

### Highlights

- The indexing direction has been reversed now. Instead of the context_chat PHP app sending documents to the context_chat_backend ExApp, the ExApp downloads the documents from the server according to a list obtained from the PHP app. This also means that the `occ context_chat:scan` command serves no purpose and has been removed. Indexing should be smoother and run continuously now.
- Kubernetes support to scale the CPU computation
- Separate docker images for CPU, CUDA and ROCM (uses Vulkan) instead of one heavy CUDA/CPU image
- CUDA 12.8 is shipped in the CUDA image so the host drivers should be updated to this at the minimum.

### Added

- add network embedding batching (#276) @fcharlaix-opendsi
- add kubernetes support and reverse content/indexing flow (#284) @kyteinsky @marcelklehr
- add gh workflows for docker builds and do separate cpu, cuda and rocm (vulkan) images (#295) @kyteinsky

### Changed

- update readme according to the latest changes (#300) @kyteinsky
- bump llama_cpp_python to 0.3.23 (#301) @kyteinsky
- move task types to the backend (#321) @kyteinsky
- adjust comment in Dockerfile regarding RTX5090 support (#316) @kyteinsky

### Fixed

- improve loadSources error handling (#288) @kyteinsky
- fix(pgvector): add chunking to prevent long list of args in queries (#290) @kyteinsky
- fix(pgvector): make doc deletion query faster (#289) @kyteinsky
- drop latin-1 decode in source title and userIds (#306) @kyteinsky
- handle validation errors of files and content providers individually (#308) @kyteinsky
- prevent race condition in vectordb tables creation (#308) @kyteinsky
- pass actual error in the error object (#308) @kyteinsky
- add container hostname to /etc/hosts to silence sudo warning (#311) @sanzakicesarr

## 🤖 AI (if applicable)

- [ ] The content of this PR was partly or fully generated using AI

Signed-off-by: kyteinsky <kyteinsky@gmail.com>

chore: adjust comment in Dockerfile regarding RTX5090 support

2a9a2ec

kyteinsky requested a review from marcelklehr as a code owner June 17, 2026 05:32

marcelklehr reviewed Jun 17, 2026

marcelklehr approved these changes Jun 24, 2026

kyteinsky merged commit 13b33c0 into master Jun 24, 2026
12 checks passed

kyteinsky deleted the chore/explicit-support-for-cuda-gpus branch June 24, 2026 12:12

kyteinsky mentioned this pull request Jun 24, 2026

5.4.0 #322

Merged

1 task
