Skip to content

[SPARK-56020][SQL] Improve GroupPartitions Spark UI#54842

Open
szehon-ho wants to merge 4 commits intoapache:masterfrom
szehon-ho:group_partitions_string_follow
Open

[SPARK-56020][SQL] Improve GroupPartitions Spark UI#54842
szehon-ho wants to merge 4 commits intoapache:masterfrom
szehon-ho:group_partitions_string_follow

Conversation

@szehon-ho
Copy link
Member

@szehon-ho szehon-ho commented Mar 17, 2026

What changes were proposed in this pull request?

Follow up for #54459

  1. It handled explain but not UI
  2. Reducers could be improved a bit
  3. Fix a test typo

Why are the changes needed?

Improve UX for the new SPJ group operator

Does this PR introduce any user-facing change?

Adds to Explain Extended, and changes unreleased Spark UI for the GroupPartitions node a bit

How was this patch tested?

Add unit test

Was this patch authored or co-authored using generative AI tooling?

Yes, cursor

@szehon-ho szehon-ho changed the title [SPARK-56020][SQL] Improve GroupPartitions Explain Extended [SPARK-56020][SQL] Improve GroupPartitions Spark UI Mar 17, 2026
}

test("SPARK-55992: GroupPartitions textual representatin in plans") {
test("SPARK-55992: GroupPartitions textual representation in simple and extended explain") {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fix a small typo here

}

/** Summary parts for plan string representation. */
private def planSummaryParts(joinKeyMaxFields: Int): Seq[String] = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is a bit weird that planSummaryParts() is used at 2 places but its result is immediately filtered by a .filter(_.nonEmpty) at both places. So how about returning only the non-empty parts and probably as Iterator[String].

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

@szehon-ho
Copy link
Member Author

@peter-toth thanks for taking a look, cleaned up the pr a bit

@szehon-ho szehon-ho force-pushed the group_partitions_string_follow branch from 0a7db13 to 34f989c Compare March 17, 2026 20:21
Copy link
Contributor

@peter-toth peter-toth left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, only minor suggestion.

…rces/v2/GroupPartitionsExec.scala

Co-authored-by: Peter Toth <peter.toth@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants