deer-flow

mirror of https://github.com/bytedance/deer-flow.git synced 2026-06-10 01:22:09 +00:00

fix(summarization): tag summary LLM calls nostream to stop phantom stream messages (#2503) (#3378)

* fix(summarization): tag summary LLM calls nostream to stop phantom stream messages (#2503)

The SummarizationMiddleware runs its summary LLM call inside a before_model
hook. Without a nostream tag the summary tokens were captured by LangGraph's
messages-tuple stream callback and broadcast to the frontend as a phantom AI
message.

Generate a dedicated summary model copy tagged with "nostream" (merged on top
of any existing tags such as "middleware:summarize" so RunJournal attribution
is preserved) and override _create_summary / _acreate_summary to invoke it
directly. This avoids temporarily swapping the shared self.model, which would
otherwise leak the RunnableBinding across concurrent runs and break parent
logic that inspects the raw model (profile / _get_ls_params).
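
The approach described above can be illustrated with a minimal, dependency-free sketch. It is not the actual deer-flow or LangChain code: `StubModel`, `StubSummarizer`, `with_tags`, and `should_stream` are hypothetical stand-ins, assumed here only to show the shape of the idea (build a tagged copy once, invoke the copy, never swap the shared model).

```python
from dataclasses import dataclass, replace
from typing import Tuple

NOSTREAM_TAG = "nostream"


@dataclass(frozen=True)
class StubModel:
    """Hypothetical stand-in for a chat model; not a LangChain class."""

    name: str
    tags: Tuple[str, ...] = ()

    def with_tags(self, *extra: str) -> "StubModel":
        # Merge new tags on top of existing ones and return a new copy.
        # The original instance is frozen, so it cannot be mutated.
        merged = self.tags + tuple(t for t in extra if t not in self.tags)
        return replace(self, tags=merged)

    def invoke(self, prompt: str) -> dict:
        # Echo the tags so a caller can see which copy handled the call.
        return {"text": f"summary of: {prompt}", "tags": self.tags}


class StubSummarizer:
    """Hypothetical middleware that keeps the shared model untouched."""

    def __init__(self, model: StubModel) -> None:
        self.model = model  # raw shared model, still inspectable by parent logic
        self._summary_model = model.with_tags(NOSTREAM_TAG)  # dedicated copy

    def create_summary(self, prompt: str) -> dict:
        # Invoke the tagged copy directly instead of swapping self.model.
        return self._summary_model.invoke(prompt)


def should_stream(result: dict) -> bool:
    # Assumed stand-in for a stream callback that skips nostream-tagged calls.
    return NOSTREAM_TAG not in result["tags"]
```

Because the tagged copy is created once and `self.model` is never reassigned, concurrent callers in this sketch always see the same raw model, which mirrors the isolation property the commit message describes.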

Add regression tests covering nostream tagging, concurrent-run isolation, raw
model preservation, and existing-tag merge.

* fix(summarization): address nostream review feedback

2026-06-07 17:55:04 +08:00

harness

fix(summarization): tag summary LLM calls nostream to stop phantom stream messages (#2503) (#3378)

2026-06-07 17:55:04 +08:00