[SPARK-55450][SS][PYTHON][DOCS] Document admission control in PySpark streaming data sources#54807
[SPARK-55450][SS][PYTHON][DOCS] Document admission control in PySpark streaming data sources#54807jiteshsoni wants to merge 3 commits intoapache:masterfrom
Conversation
|
Thanks for your contribution! Could you make sure you have run the examples on your own? If you already did that, you can update the PR description (specifically Also we are asked to put the "model name" if you use LLM to generate the code, not the tool name. That is described in the PR template, specifically the form of string as well. Please update it. |
a2988f9 to
ec89c64
Compare
✅ Testing Verification - Examples Manually Tested on DatabricksI've successfully tested both examples on Databricks Dogfood Staging. Screenshots attached below showing the streaming query statistics. Example 1: Continuous ProcessingQuery Name:
Key Observations:
Example 2: Trigger.AvailableNow - Finite ProcessingQuery Name:
Key Observations:
Test Environment
Verified Functionality✅ Admission Control: Screenshots below show the Streaming Query Statistics from both runs: This comment was generated with GitHub MCP. |
Thanks for the feedback! I've addressed both items: #54807 (comment) (screenshot attached) ✅ Testing verification: Updated the PR description to confirm examples were manually tested on Databricks. I've also added a detailed comment below with screenshots showing |
…r.AvailableNow in PySpark streaming data sources This patch adds comprehensive documentation and examples for the new admission control and Trigger.AvailableNow features in Python streaming data sources (added in SPARK-55304). Changes: - New tutorial: streaming_admission_control.rst with step-by-step guide - Example: structured_blockchain_admission_control.py demonstrating: - Admission control via getDefaultReadLimit() and latestOffset() - Parallel partitioning (50 batches × 4 partitions = 200 tasks) - Trigger.AvailableNow for finite processing - SupportsTriggerAvailableNow mixin implementation Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
67a457f to
dd0f52f
Compare


What changes were proposed in this pull request?
This PR adds comprehensive documentation and examples for admission control and Trigger.AvailableNow support in PySpark custom streaming data sources (SPARK-55304).
Changes include:
New tutorial documentation (
python/docs/source/tutorial/sql/streaming_admission_control.rst):Example file (
structured_blockchain_admission_control.py):Why are the changes needed?
Users need documentation and practical examples to implement admission control in custom streaming sources (introduced in SPARK-55304).
Does this PR introduce any user-facing change?
No. Documentation and examples only.
How was this patch tested?
Testing approach:
What I checked:
Was this patch authored or co-authored using generative AI tooling?
Yes (Claude Sonnet 4.5)
🤖 Generated with Claude Code