Skip to content

Comments

Add Apache Spark within CMEM BUILD documentation#212

Merged
rpietzsch merged 16 commits intomainfrom
feature/cmem-build-spark-doc
Feb 23, 2026
Merged

Add Apache Spark within CMEM BUILD documentation#212
rpietzsch merged 16 commits intomainfrom
feature/cmem-build-spark-doc

Conversation

@edufuga
Copy link

@edufuga edufuga commented Jan 30, 2026

Description

Adds a new documentation page that explains Apache Spark in general terms and documents how Spark is used within Corporate Memory’s BUILD component. It covers Spark’s main abstractions (RDDs, DataFrames), the rationale for using Spark in BUILD, what “Spark-aware datasets” are (and which types exist), and how Spark-backed workflow execution relates to the Knowledge Graph (Spark computes in BUILD; EXPLORE persists; the workflow execution engine mediates).

Context

This documentation complements the plugin documentations in and from BUILD itself.

@edufuga edufuga requested a review from rpietzsch January 30, 2026 15:34
Copy link
Contributor

@rpietzsch rpietzsch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the corrections!
I updated the spark section and okay with merging it now.

@rpietzsch rpietzsch merged commit 53fc63d into main Feb 23, 2026
2 checks passed
@rpietzsch rpietzsch deleted the feature/cmem-build-spark-doc branch February 23, 2026 11:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

2 participants