fix: strip <think> tokens from reasoning model output

Models with thinking/reasoning capabilities (DeepSeek-R1, MiniMax-M2.7, QwQ, etc.) include <think>...</think> blocks in their response content. These internal reasoning tokens leak into agent output and downstream node inputs, corrupting the workflow. Add _strip_thinking_tokens() classmethod to OpenAIProvider that filters <think>...</think> blocks via regex. Applied in both: - _deserialize_chat_response() (Message content) - _append_chat_response_output() (timeline content) The fix is zero-cost for models without thinking tokens (fast path checks for '<think>' substring before regex). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-07-26 07:58:00 +00:00 · 2026-05-09 10:13:10 +08:00 · 2026-05-09 10:13:10 +08:00 · 5789d78c34
commit 5789d78c34
parent c85e1de2a7
1 changed files with 16 additions and 2 deletions
--- a/runtime/node/agent/providers/openai_provider.py
+++ b/runtime/node/agent/providers/openai_provider.py
@ -2,6 +2,7 @@

 import base64
 import hashlib
+import re

 import binascii
 import os
@ -383,18 +384,31 @@ class OpenAIProvider(ModelProvider):
                    type="function"
                ))
        
+        content = self._get_attr(msg, "content") or ""
+        content = self._strip_thinking_tokens(content)
+
        return Message(
            role=MessageRole.ASSISTANT,
-            content=self._get_attr(msg, "content") or "",
+            content=content,
            tool_calls=tool_calls
        )

+    _THINK_PATTERN = re.compile(r"<think>.*?</think>\s*", re.DOTALL)
+
+    @classmethod
+    def _strip_thinking_tokens(cls, text: str) -> str:
+        """Strip <think>...</think> blocks from model output (e.g. DeepSeek-R1, MiniMax-M2.7)."""
+        if "<think>" not in text:
+            return text
+        return cls._THINK_PATTERN.sub("", text).strip()
+
    def _append_chat_response_output(self, timeline: List[Any], response: Any) -> None:
        """Add chat response to timeline, preserving tool_calls (Chat API compatible)."""
        msg = response.choices[0].message
+        content = self._strip_thinking_tokens(msg.content or "")
        assistant_msg = {
            "role": "assistant",
-            "content": msg.content or ""
+            "content": content
        }

        if getattr(msg, "tool_calls", None):