fix: propagate SSE stream errors to waiting requests by heyhayes · Pull Request #2122 · modelcontextprotocol/python-sdk

heyhayes · 2026-02-21T13:20:07Z

Summary

Fixes #1401. Also fixes #1789 (closed as duplicate).

When an SSE read timeout occurs during a StreamableHTTP POST request, the pending send_request call hangs indefinitely. The transport catches the exception but never sends an error back through the read stream, leaving the caller blocked on response_stream_reader.receive() with nothing to receive.

This PR fixes error propagation at the transport level so that SSE stream failures produce a JSONRPCError keyed to the original request ID. BaseSession._handle_response routes it to the correct per-request response stream, and send_request surfaces it as MCPError to the caller. This approach keeps failures isolated to the affected request rather than tearing down the entire session.

What changed

_handle_sse_response now sends a JSONRPCError(INTERNAL_ERROR, "SSE stream ended without a response") when the SSE stream ends without delivering a complete response, whether due to a read timeout, network error, or unexpected server close. If a last_event_id was received, reconnection is attempted first; the error is only sent after reconnection is exhausted.

_handle_reconnection returns bool instead of None so callers can distinguish success (response delivered) from failure (attempts exhausted). The method also fixes an infinite recursion bug: the attempt counter was reset to 0 on every stream end (even when no complete response was delivered), which combined with httpx read timeouts causing graceful stream termination meant the reconnection loop could run forever.

handle_get_stream applies the same fix to the GET stream's reconnection loop: the attempt counter only resets when events were actually received during the connection. Empty connections that close immediately count toward MAX_RECONNECTION_ATTEMPTS.

_default_message_handler now logs a warning for exceptions instead of silently discarding them, providing observability for transport errors not tied to a specific request.

Test plan

New E2E test: client with 0.5s SSE read timeout reads a slow resource (2s server delay), asserts MCPError is raised instead of hanging
New unit test: _handle_reconnection returns False when called at max attempts
All 51 existing streamable HTTP tests pass (including reconnection, polling, and multi-reconnection tests)
All 233 client tests pass
Linting (ruff check .) and type checking (pyright) clean

…rsion _handle_reconnection previously returned None, making it impossible for callers to distinguish between a successful response delivery and exhausted retries. This changes the return type to bool (True on success, False when max attempts exceeded) and fixes two issues: - The attempt counter at line 426 was reset to 0 on each reconnection, causing infinite recursion when streams kept ending without delivering a response. Now increments attempt on each recursion. - All recursive calls now use `return await` so the result propagates back to the original caller. MAX_RECONNECTION_ATTEMPTS increased from 2 to 5 to accommodate legitimate multi-reconnection patterns where the server intentionally closes streams between checkpoints. Github-Issue: modelcontextprotocol#1401

When an SSE stream ends prematurely (e.g. due to a read timeout), the client would hang forever waiting for a response that will never arrive. Now _handle_sse_response checks the return value of _handle_reconnection and, if reconnection did not deliver a response, sends a JSONRPCError with INTERNAL_ERROR to the read stream. This unblocks the waiting request and surfaces the failure as an MCPError to the caller. Github-Issue: modelcontextprotocol#1401

Only reset the attempt counter when events were actually received during the connection. Connections that close immediately without delivering events now count toward MAX_RECONNECTION_ATTEMPTS. Github-Issue:modelcontextprotocol#1401

Transport errors that are not tied to a specific pending request (e.g., GET stream failures) were silently swallowed by the default message handler. Add a warning log so these exceptions are at least visible in logs as an observability safety net. Github-Issue: modelcontextprotocol#1401

Add test_sse_error_when_reconnection_exhausted to exercise the _handle_sse_response path where SSE events are received (setting last_event_id) but reconnection fails, ensuring the JSONRPCError is sent to unblock the waiting request.

fengjikui · 2026-06-23T03:24:11Z

I took a pass at refreshing this because the current PR is marked conflicting, and this issue is still referenced from #1401 by downstream users.

I did not want to open a duplicate PR without checking first, so I rebuilt the same narrow direction on current origin/main here:

branch: https://github.com/fengjikui/python-sdk/tree/codex/rebuild-pr2122-sse-error-propagation
commit: 684e38b (fix: propagate streamable HTTP SSE errors)
touched files: src/mcp/client/session.py, src/mcp/client/streamable_http.py, tests/shared/test_streamable_http.py

Validation run locally on the refreshed branch:

uv run pytest tests/shared/test_streamable_http.py::test_server_close_sse_stream_via_context 
  tests/shared/test_streamable_http.py::test_streamable_http_client_auto_reconnects 
  tests/shared/test_streamable_http.py::test_streamable_http_client_respects_retry_interval 
  tests/shared/test_streamable_http.py::test_handle_reconnection_returns_false_on_max_attempts 
  tests/shared/test_streamable_http.py::test_sse_stream_close_raises_when_reconnection_fails -q
# 5 passed

uv run ruff check src/mcp/client/streamable_http.py src/mcp/client/session.py tests/shared/test_streamable_http.py
# All checks passed

uv run ruff format --check src/mcp/client/streamable_http.py src/mcp/client/session.py tests/shared/test_streamable_http.py
# 3 files already formatted

git diff --check origin/main..HEAD
# clean

The refreshed patch keeps the original behavior intent: _handle_reconnection() returns whether a response was delivered, _handle_sse_response() sends a JSONRPC error back to unblock the pending request when reconnection cannot deliver the response, empty GET SSE streams count toward the reconnect budget, and the default message handler logs otherwise-unhandled exceptions.

Happy to either open this as a refresh PR against main, or leave the branch here for @heyhayes / maintainers to cherry-pick if that is preferred.

heyhayes added 4 commits February 21, 2026 13:19

heyhayes marked this pull request as draft February 21, 2026 13:22

heyhayes force-pushed the fix/sse-error-propagation-1401 branch 2 times, most recently from a20a405 to c1fffe8 Compare February 21, 2026 13:51

heyhayes force-pushed the fix/sse-error-propagation-1401 branch from 90c334d to f0af07e Compare February 21, 2026 14:03

heyhayes marked this pull request as ready for review February 21, 2026 14:07

This was referenced Feb 21, 2026

ClientSession Error Handling #1401

Open

fix: handle_sse_response causing dangling client's read_stream #1812

Open

maxisbey added bug Something isn't working P1 Significant bug affecting many users, highly requested feature labels Mar 5, 2026

maxisbey mentioned this pull request Mar 24, 2026

fix: Raise exceptions in default ClientSession message handler (#1401) #1595

Closed

9 tasks

Aboudjem mentioned this pull request Mar 29, 2026

fix(session): log exceptions in default message_handler instead of silently swallowing #2374

Open

6 tasks

This was referenced May 18, 2026

fix: send error to client when SSE stream disconnects without response #1949

Closed

Fix SSE timeout hang by propagating transport exceptions to pending r… #2430

Closed

fix: propagate SSE stream errors to waiting requests #2628

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: propagate SSE stream errors to waiting requests#2122

fix: propagate SSE stream errors to waiting requests#2122
heyhayes wants to merge 5 commits into
modelcontextprotocol:mainfrom
heyhayes:fix/sse-error-propagation-1401

heyhayes commented Feb 21, 2026 •

edited

Loading

Uh oh!

fengjikui commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

heyhayes commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Test plan

Uh oh!

fengjikui commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

heyhayes commented Feb 21, 2026 •

edited

Loading