Run Apache Spark SQL workloads on Azure Cobalt 100 Arm64 using Gluten and Velox for accelerated analytics #3199

Merged
pareenaverma merged 2 commits into ArmDeveloperEcosystem:main from odidev:spark-velox on Apr 27, 2026

@odidev (Contributor) commented Apr 23, 2026

  1. Install and configure Hadoop, Spark, and Hive on Azure Cobalt 100 Arm64 virtual machines
  2. Build and integrate Gluten with the Velox backend for native query execution
  3. Configure Spark SQL for columnar and vectorized execution
  4. Generate and load TPC-DS datasets for benchmarking
  5. Run Spark SQL workloads and compare performance between vanilla Spark and Gluten + Velox
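As a rough illustration of items 3 and 5, the sketch below builds a set of Spark settings commonly used to enable Gluten with the Velox backend and computes a simple speedup ratio between two timed runs. It is not taken from the Learning Path itself: the helper names (`gluten_conf`, `to_cli_flags`, `speedup`) are hypothetical, the off-heap size is a placeholder, and the plugin and shuffle manager class names are assumptions that depend on the Gluten release in use, so check them against the version you build.

```python
# Hypothetical helpers; not part of the Learning Path content.

def gluten_conf(offheap_size="4g"):
    """Return Spark settings typically needed to load Gluten with Velox.

    Assumption: recent Apache Gluten releases use the org.apache.gluten
    package; older releases used a different package name.
    """
    return {
        "spark.plugins": "org.apache.gluten.GlutenPlugin",
        # Velox allocates native memory off-heap, so this must be enabled.
        "spark.memory.offHeap.enabled": "true",
        "spark.memory.offHeap.size": offheap_size,  # placeholder size
        "spark.shuffle.manager":
            "org.apache.spark.shuffle.sort.ColumnarShuffleManager",
    }


def to_cli_flags(conf):
    """Flatten a settings dict into --conf arguments for spark-sql or spark-submit."""
    flags = []
    for key, value in conf.items():
        flags.extend(["--conf", f"{key}={value}"])
    return flags


def speedup(vanilla_seconds, gluten_seconds):
    """Ratio of vanilla Spark runtime to Gluten + Velox runtime for one query."""
    if gluten_seconds <= 0:
        raise ValueError("gluten_seconds must be positive")
    return vanilla_seconds / gluten_seconds


if __name__ == "__main__":
    # Print the flags so they can be pasted after the spark-sql command.
    print(" ".join(to_cli_flags(gluten_conf())))
```

To compare the two modes, time the same TPC-DS query once without these flags and once with them, then pass both wall-clock times to `speedup`; a value above 1 means the Gluten + Velox run was faster.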
  • [x] I have reviewed Create a Learning Path
  • [x] I have checked my contribution for confidential information
    By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of the Creative Commons Attribution 4.0 International License.

Signed-off-by: odidev <odidev@puresoftware.com>

… and Velox for accelerated analytics

Signed-off-by: odidev <odidev@puresoftware.com>
Set the draft status for the Spark Velox Cobalt index page.
@pareenaverma pareenaverma merged commit 30eef4f into ArmDeveloperEcosystem:main Apr 27, 2026
2 checks passed

Labels

ACM Arm Cloud Migration tech_review

Projects

Status: In Progress

2 participants