deer-flow

mirror of https://github.com/bytedance/deer-flow.git synced 2026-04-25 11:18:22 +00:00

Author	SHA1	Message	Date
DanielWalnut	b970993425	fix: read lead agent options from context (#2515 ) * fix: read lead agent options from context * fix: validate runtime context config	2026-04-24 22:46:51 +08:00
DanielWalnut	ec8a8cae38	fix: gate deferred MCP tool execution (#2513 ) * fix: gate deferred MCP tool execution * style: format deferred tool middleware * fix: address deferred tool review feedback	2026-04-24 22:45:41 +08:00
DanielWalnut	d78ed5c8f2	fix: inherit subagent skill allowlists (#2514 )	2026-04-24 21:24:42 +08:00
Nan Gao	f9ff3a698d	fix(middleware): avoid rescuing non-skill tool outputs during summarization (#2458 ) * fix(middelware): narrow skill rescue to skill-related tool outputs * fix(summarization): address skill rescue review feedback * fix: wire summarization skill rescue config * fix: remove dead skill tool helper * fix(lint): fix format --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-24 21:19:46 +08:00
Admire	c2332bb790	fix memory settings layout overflow (#2420 ) Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-24 20:29:55 +08:00
He Wang	3a61126824	fix: keep debug.py interactive terminal free from background log noise (#2466 ) * fix(debug): keep terminal clean by redirecting all logs to file - Redirect all logs to debug.log file to prevent background task logs from interfering with interactive terminal prompts - Honor AppConfig.log_level setting instead of hard-coding to INFO - Make logging setup idempotent by clearing pre-existing handlers - Defer deerflow imports until after logging is configured to ensure import-time side effects are captured in debug.log - Display active log level in startup banner - Add prompt_toolkit installation tip for enhanced readline support Made-with: Cursor * attaching the file handler before importing/calling get_app_config() Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-24 17:09:41 +08:00
Airene Fang	11f557a2c6	feat(trace):Add run_name to the trace info for system agents. (#2492 ) * feat(trace): Add `run_name` to the trace info for suggestions and memory. before(in langsmith): CodexChatModel CodexChatModel lead_agent after: suggest_agent memory_agent lead_agent feat(trace): Add `run_name` to the trace info for suggestions and memory. before(in langsmith): CodexChatModel CodexChatModel lead_agent after: suggest_agent memory_agent lead_agent * feat(trace): Add `run_name` to the trace info for system agents. before(in langsmith): CodexChatModel CodexChatModel CodexChatModel CodexChatModel lead_agent after: suggest_agent title_agent security_agent memory_agent lead_agent * chore(code format):code format --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-24 17:06:55 +08:00
d 🔹	e8572b9d0c	fix(jina): log transient failures at WARNING without traceback (#2484 ) (#2485 ) The exception handler in JinaClient.crawl used logger.exception, which emits an ERROR-level record with the full httpx/httpcore/anyio traceback for every transient network failure (timeout, connection refused). Other search/crawl providers in the project log the same class of recoverable failures as a single line. One offline/slow-network session could produce dozens of multi-frame ERROR stack traces, drowning out real problems. Switch to logger.warning with a concise message that includes the exception type and its str, matching the style used elsewhere for recoverable transient failures (aio_sandbox, ddg, etc.). The exception type now also surfaces into the returned "Error: ..." string so callers retain diagnostic signal. Adds a regression test that asserts the log record is WARNING, carries no exc_info, and includes the exception class name. Co-authored-by: voidborne-d <voidborne-d@users.noreply.github.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-24 16:00:14 +08:00
Willem Jiang	80a7446fd6	fix(backend): fix the unit test error in backend	2026-04-24 14:56:03 +08:00
Willem Jiang	cd12821134	fix(backend): Updated the uv.lock with new added dependency	2026-04-24 14:55:13 +08:00
Xinmin Zeng	30d619de08	feat(subagents): support per-subagent skill loading and custom subagent types (#2253 ) * feat(subagents): support per-subagent skill loading and custom subagent types (#2230) Add per-subagent skill configuration and custom subagent type registration, aligned with Codex's role-based config layering and per-session skill injection. Backend: - SubagentConfig gains `skills` field (None=all, []=none, list=whitelist) - New CustomSubagentConfig for user-defined subagent types in config.yaml - SubagentsAppConfig gains `custom_agents` section and `get_skills_for()` - Registry resolves custom agents with three-layer config precedence - SubagentExecutor loads skills per-session as conversation items (Codex pattern) - task_tool no longer appends skills to system_prompt - Lead agent system prompt dynamically lists all registered subagent types - setup_agent tool accepts optional skills parameter - Gateway agents API transparently passes skills in CRUD operations Frontend: - Agent/CreateAgentRequest/UpdateAgentRequest types include skills field - Agent card displays skills as badges alongside tool_groups Config: - config.example.yaml documents custom_agents and per-agent skills override Tests: - 40 new tests covering all skill config, custom agents, and registry logic - Existing tests updated for new get_skills_prompt_section signature Closes #2230 * fix: address review feedback on skills PR - Remove stale get_skills_prompt_section monkeypatches from test_task_tool_core_logic.py (task_tool no longer imports this function after skill injection moved to executor) - Add key prefixes (tg:/sk:) to agent-card badges to prevent React key collisions between tool_groups and skills * fix(ci): resolve lint and test failures - Format agent-card.tsx with prettier (lint-frontend) - Remove stale "Skills Appendix" system_prompt assertion — skills are now loaded per-session by SubagentExecutor, not appended to system_prompt * fix(ci): sort imports in 
test_subagent_skills_config.py (ruff I001) * fix(ci): use nullish coalescing in agent-card badge condition (eslint) * fix: address review feedback on skills PR - Use model_fields_set in AgentUpdateRequest to distinguish "field omitted" from "explicitly set to null" — fixes skills=None ambiguity where None means "inherit all" but was treated as "don't change" - Move lazy import of get_subagent_config outside loop in _build_available_subagents_description to avoid repeated import overhead --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-23 23:59:47 +08:00
JerryChaox	4e72410154	fix(gateway): bound lifespan shutdown hooks to prevent worker hang under uvicorn reload (#2331 ) * fix(gateway): bound lifespan shutdown hooks to prevent worker hang Gateway worker can hang indefinitely in `uvicorn --reload` mode with the listening socket still bound — all /api/* requests return 504, and SIGKILL is the only recovery. Root cause (py-spy dump from a reproduction showed 16+ stacked frames of signal_handler -> Event.set -> threading.Lock.__enter__ on the main thread): CPython's `threading.Event` uses `Condition(Lock())` where the inner Lock is non-reentrant. uvicorn's BaseReload signal handler calls `should_exit.set()` directly from signal context; if a second signal (SIGTERM/SIGHUP from the reload supervisor, or watchfiles-triggered reload) arrives while the first handler holds the Lock, the reentrant call deadlocks on itself. The reload supervisor keeps sending those signals only when the worker fails to exit promptly. DeerFlow's lifespan currently awaits `stop_channel_service()` with no timeout; if a channel's `stop()` stalls (e.g. Feishu/Slack WebSocket waiting for an ack), the worker can't exit, the supervisor keeps signaling, and the deadlock becomes reachable. This is a defense-in-depth fix — it does not repair the upstream uvicorn/CPython issue, but it ensures DeerFlow's lifespan exits within a bounded window so the supervisor has no reason to keep firing signals. No behavior change on the happy path. Wraps the shutdown hook in `asyncio.wait_for(timeout=5.0)` and logs a warning on timeout before proceeding to worker exit. 
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * Update backend/app/gateway/app.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * style: apply make format (ruff) to test assertions Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-23 19:41:26 +08:00
He Wang	c42ae3af79	feat: add optional prompt-toolkit support to debug.py (#2461 ) * feat: add optional prompt-toolkit support to debug.py Use PromptSession.prompt_async() for arrow-key navigation and input history when prompt-toolkit is available, falling back to plain input() with a helpful install tip otherwise. Made-with: Cursor * fix: handle EOFError gracefully in debug.py Catch EOFError alongside KeyboardInterrupt so that Ctrl-D exits cleanly instead of printing a traceback. Made-with: Cursor	2026-04-23 17:49:18 +08:00
dependabot[bot]	bd35cd39aa	chore(deps): bump uuid from 13.0.0 to 14.0.0 in /frontend (#2467 ) Bumps [uuid](https://github.com/uuidjs/uuid) from 13.0.0 to 14.0.0. - [Release notes](https://github.com/uuidjs/uuid/releases) - [Changelog](https://github.com/uuidjs/uuid/blob/main/CHANGELOG.md) - [Commits](https://github.com/uuidjs/uuid/compare/v13.0.0...v14.0.0) --- updated-dependencies: - dependency-name: uuid dependency-version: 14.0.0 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-23 14:47:15 +08:00
d 🔹	b90f219bd1	fix(skills): validate bundled SKILL.md front-matter in CI (fixes #2443 ) (#2457 ) * fix(skills): validate bundled SKILL.md front-matter in CI (fixes #2443) Adds a parametrized backend test that runs `_validate_skill_frontmatter` against every bundled SKILL.md under `skills/public/`, so a broken front-matter fails CI with a per-skill error message instead of surfacing as a runtime gateway-load warning. The new test caught two pre-existing breakages on `main` and fixes them: * `bootstrap/SKILL.md`: the unquoted description had a second `:` mid-line ("Also trigger for updates: ..."), which YAML parses as a nested mapping ("mapping values are not allowed here"). Rewrites the description as a folded scalar (`>-`), which preserves the original wording (including the embedded colon, double quotes, and apostrophes) without further escaping. This complements PR #2436 (single-file colon→hyphen patch) with a more general convention that survives future edits. * `chart-visualization/SKILL.md`: used `dependency:` which is not in `ALLOWED_FRONTMATTER_PROPERTIES`. Renamed to `compatibility:`, the documented field for "Required tools, dependencies" per skill-creator. No code reads `dependency` (verified by grep across backend/). * Apply suggestions from code review Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Fix the lint error --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-04-23 14:06:14 +08:00
dependabot[bot]	96d00f6073	chore(deps): bump dompurify from 3.3.1 to 3.4.1 in /frontend (#2462 ) Bumps [dompurify](https://github.com/cure53/DOMPurify) from 3.3.1 to 3.4.1. - [Release notes](https://github.com/cure53/DOMPurify/releases) - [Commits](https://github.com/cure53/DOMPurify/compare/3.3.1...3.4.1) --- updated-dependencies: - dependency-name: dompurify dependency-version: 3.4.1 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-23 12:18:59 +08:00
He Wang	c43c803f66	fix: remove mismatched context param in debug.py to suppress Pydantic warning (#2446 ) * fix: remove mismatched context param in debug.py to suppress Pydantic warning The ainvoke call passed context={"thread_id": ...} but the agent graph has no context_schema (ContextT defaults to None), causing a PydanticSerializationUnexpectedValue warning on every invocation. Align with the production run_agent path by injecting context via Runtime into configurable["__pregel_runtime"] instead. Closes #2445 Made-with: Cursor * refactor: derive runtime thread_id from config to avoid duplication Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Made-with: Cursor --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-23 09:56:57 +08:00
dependabot[bot]	dbd777fe62	chore(deps): bump python-dotenv from 1.2.1 to 1.2.2 in /backend (#2440 ) Bumps [python-dotenv](https://github.com/theskumar/python-dotenv) from 1.2.1 to 1.2.2. - [Release notes](https://github.com/theskumar/python-dotenv/releases) - [Changelog](https://github.com/theskumar/python-dotenv/blob/main/CHANGELOG.md) - [Commits](https://github.com/theskumar/python-dotenv/compare/v1.2.1...v1.2.2) --- updated-dependencies: - dependency-name: python-dotenv dependency-version: 1.2.2 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-22 16:48:09 +08:00
dependabot[bot]	1ca2621285	chore(deps): bump lxml from 6.0.2 to 6.1.0 in /backend (#2427 ) Bumps [lxml](https://github.com/lxml/lxml) from 6.0.2 to 6.1.0. - [Release notes](https://github.com/lxml/lxml/releases) - [Changelog](https://github.com/lxml/lxml/blob/master/CHANGES.txt) - [Commits](https://github.com/lxml/lxml/compare/lxml-6.0.2...lxml-6.1.0) --- updated-dependencies: - dependency-name: lxml dependency-version: 6.1.0 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-22 16:14:11 +08:00
Shawn Jasper	5ba1dacf25	fix: rename present_file to present_files in docs and prompts (#2393 ) The tool is registered as `present_files` (plural) in present_file_tool.py, but four references in documentation and prompt strings incorrectly used the singular form `present_file`. This could cause confusion and potentially lead to incorrect tool invocations. Changed files: - backend/docs/GUARDRAILS.md - backend/docs/ARCHITECTURE.md - backend/packages/harness/deerflow/agents/lead_agent/prompt.py (2 occurrences)	2026-04-21 16:10:14 +08:00
Reuben Bowlby	085c13edc7	fix: remove unnecessary f-string prefixes and unused import (#2352 ) - Remove f-string prefix on 7 strings with no placeholders (F541) in analyze.py, aggregate_benchmark.py, run_loop.py, generate_review.py - Remove unused `os` import in quick_validate.py (F401) Found by ruff via HUMMBL Arbiter (https://hummbl.io/audit).	2026-04-21 09:53:18 +08:00
Copilot	ef04174194	Fix invalid HTML nesting in reasoning trigger during complex task rendering (#2382 ) * Initial plan * fix(frontend): avoid invalid paragraph nesting in reasoning trigger Agent-Logs-Url: https://github.com/bytedance/deer-flow/sessions/4c9eb0c2-ff29-4629-a61c-4e33d736d918 Co-authored-by: WillemJiang <219644+WillemJiang@users.noreply.github.com> * test(frontend): strengthen reasoning trigger DOM nesting assertion Agent-Logs-Url: https://github.com/bytedance/deer-flow/sessions/4c9eb0c2-ff29-4629-a61c-4e33d736d918 Co-authored-by: WillemJiang <219644+WillemJiang@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: WillemJiang <219644+WillemJiang@users.noreply.github.com>	2026-04-21 09:41:28 +08:00
Ansel	6dce26a52e	fix: resolve tool duplication and skill parser YAML inconsistencies (#1803 ) (#2107 ) * Refactor tests for SKILL.md parser Updated tests for SKILL.md parser to handle quoted names and descriptions correctly. Added new tests for parsing plain and single-quoted names, and ensured multi-line descriptions are processed properly. * Implement tool name validation and deduplication Add tool name mismatch warning and deduplication logic * Refactor skill file parsing and error handling * Add tests for tool name deduplication Added tests for tool name deduplication in get_available_tools(). Ensured that duplicates are not returned, the first occurrence is kept, and warnings are logged for skipped duplicates. * Apply suggestions from code review Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Update minimal config to include tools list * Update test for nonexistent skill file Ensure the test for nonexistent files checks for None. * Refactor tool loading and add skill management support Refactor tool loading logic to include skill management tools based on configuration and clean up comments. * Enhance code comments for tool loading logic Added comments to clarify the purpose of various code sections related to tool loading and configuration. * Fix assertion for duplicate tool name warning * Fix indentation issues in tools.py * Fix the lint error of test_tool_deduplication * Fix the lint error of tools.py * Fix the lint error * Fix the lint error * make format --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-04-20 20:25:03 +08:00
imhaoran	fc94e90f6c	fix(setup-agent): prevent data loss when setup fails on existing agen… (#2254 ) * fix(setup-agent): prevent data loss when setup fails on existing agent directory Record whether the agent directory pre-existed before mkdir, and only run shutil.rmtree cleanup when the directory was newly created during this call. Previously, any failure would delete the entire directory including pre-existing SOUL.md and config.yaml. * fix: address PR review — init variables before try, remove unused result * style: fix ruff I001 import block formatting in test file * style: add missing blank lines between top-level definitions in test file	2026-04-20 20:17:30 +08:00
Eilen Shin	f2013f47aa	fix command palette hydration mismatch (#2301 ) * fix command palette hydration mismatch * style: format command dialog description	2026-04-20 11:36:16 +08:00
KiteEater	4be857f64b	fix: use Apple Container image pull syntax (#2366 )	2026-04-20 08:00:05 +08:00
Admire	c99865f53d	fix(token-usage): enable stream usage for openai-compatible models (#2217 ) * fix(token-usage): enable stream usage for openai-compatible models * fix(token-usage): narrow stream_usage default to ChatOpenAI	2026-04-19 22:42:55 +08:00
YYMa	05f1da03e5	fix(script): use portable locale for langgraph log pipeline on macOS (#2361 )	2026-04-19 22:41:00 +08:00
Xun	a62ca5dd47	fix: Catch httpx.ReadError in the error handling (#2309 ) * fix: Catch httpx.ReadError in the error handling * fix	2026-04-19 22:30:22 +08:00
Nan Gao	f514e35a36	fix(backend): make clarification messages idempotent (#2350 ) (#2351 )	2026-04-19 22:00:58 +08:00
Xun	7c87dc5bca	fix(reasoning): prevent LLM-hallucinated HTML tags from rendering as DOM elements (#2321 ) * fix * add test * fix	2026-04-19 19:27:34 +08:00
Hinotobi	80e210f5bb	[security] fix(uploads): require explicit opt-in for host-side document conversion (#2332 ) * fix: disable host-side upload conversion by default * fix: address PR review comments on upload conversion gate	2026-04-18 22:47:42 +08:00
dependabot[bot]	5656f90792	chore(deps-dev): bump pytest from 9.0.2 to 9.0.3 in /backend (#2349 ) Bumps [pytest](https://github.com/pytest-dev/pytest) from 9.0.2 to 9.0.3. - [Release notes](https://github.com/pytest-dev/pytest/releases) - [Changelog](https://github.com/pytest-dev/pytest/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pytest-dev/pytest/compare/9.0.2...9.0.3) --- updated-dependencies: - dependency-name: pytest dependency-version: 9.0.3 dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-18 22:22:40 +08:00
Shawn Jasper	55474011c9	fix(subagent): inherit parent agent's tool_groups in task_tool (#2305 ) * fix(subagent): inherit parent agent's tool_groups in task_tool When a custom agent defines tool_groups (e.g. [file:read, file:write, bash]), the restriction is correctly applied to the lead agent. However, when the lead agent delegates work to a subagent via the task tool, get_available_tools() is called without the groups parameter, causing the subagent to receive ALL tools (including web_search, web_fetch, image_search, etc.) regardless of the parent agent's configuration. This fix propagates tool_groups through run metadata so that task_tool passes the same group filter when building the subagent's tool set. Changes: - agent.py: include tool_groups in run metadata - task_tool.py: read tool_groups from metadata and pass to get_available_tools() * fix: initialize metadata before conditional block and update tests for tool_groups propagation - Initialize metadata = {} before the 'if runtime is not None' block to avoid Ruff F821 (possibly-undefined variable) and simplify the parent_tool_groups expression. - Update existing test assertion to expect groups=None in get_available_tools call signature. - Add 3 new test cases: - test_task_tool_propagates_tool_groups_to_subagent - test_task_tool_no_tool_groups_passes_none - test_task_tool_runtime_none_passes_groups_none	2026-04-18 22:17:37 +08:00
imhaoran	24fe5fbd8c	fix(mcp): prevent RuntimeError from escaping except block in get_cach… (#2252 ) * fix(mcp): prevent RuntimeError from escaping except block in get_cached_mcp_tools When `asyncio.get_event_loop()` raises RuntimeError and the fallback `asyncio.run()` also fails, the exception escapes unhandled because Python does not route exceptions raised inside an `except` block to sibling `except` clauses. Wrap the fallback call in its own try/except so failures are logged and the function returns [] as intended. * fix: use logger.exception to preserve stack traces on MCP init failure	2026-04-18 21:07:30 +08:00
Willem Jiang	be4663505a	chroe(script): disable the color log of langgraph	2026-04-18 20:03:05 +08:00
dependabot[bot]	aa6098e6a4	chore(deps): bump langsmith from 0.6.4 to 0.7.31 in /backend (#2291 ) Bumps [langsmith](https://github.com/langchain-ai/langsmith-sdk) from 0.6.4 to 0.7.31. - [Release notes](https://github.com/langchain-ai/langsmith-sdk/releases) - [Commits](https://github.com/langchain-ai/langsmith-sdk/compare/v0.6.4...v0.7.31) --- updated-dependencies: - dependency-name: langsmith dependency-version: 0.7.31 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-18 19:54:21 +08:00
Airene Fang	1221448029	fix(scripts): Cloud Provider Reports Security Issue（aliyun could） (#2323 ) ATT&CK matrix ID: T1059.004 Data source: process start triggered detection Alert reason: the command line of this process shows the characteristics of a reverse shell Command line: timeout 1 bash -c exec 3<>/dev/tcp/127.0.0.1/2024 Process path: /usr/bin/timeout Process chain: -[337650] /usr/sbin/sshd -D -[397971] /usr/sbin/sshd -D -R -[397977] -bash -[398903] make dev -[398920] bash ./scripts/serve.sh --dev -[399037] bash ./scripts/wait-for-port.sh 2024 60 LangGraph	2026-04-18 19:33:32 +08:00
Jason	3b91df2b18	fix(frontend): add catch-all API rewrite for gateway routes (#2335 ) When NEXT_PUBLIC_BACKEND_BASE_URL is unset, the frontend proxies API requests to the gateway. Only /api/agents and /api/skills had rewrite rules, causing 404s for /api/models, /api/threads, /api/memory, /api/mcp, /api/suggestions, /api/runs, etc. Add a catch-all /api/:path* rewrite that proxies all remaining gateway API routes. The existing /api/langgraph rewrite takes priority because it is pushed to the array first (Next.js checks rewrites in order). Fixes #2327 Co-authored-by: JasonOA888 <JasonOA888@users.noreply.github.com>	2026-04-18 11:35:19 +08:00
Shawn Jasper	ca1b7d5f48	fix(sandbox): add missing path masking in ls_tool output (#2317 ) ls_tool was the only file-system tool that did not call mask_local_paths_in_output() before returning its result, causing host absolute paths (e.g. /Users/.../backend/.deer-flow/knowledge-base/...) to leak to the LLM instead of the expected virtual paths (/mnt/knowledge-base/...). This patch: - Adds the mask_local_paths_in_output() call to ls_tool, consistent with bash_tool, glob_tool and grep_tool. - Initialises thread_data = None before the is_local_sandbox branch (same pattern as glob_tool) so the variable is always in scope. - Adds three new tests covering user-data path masking, skills path masking and the empty-directory edge case.	2026-04-18 08:46:59 +08:00
yangzheli	c6b0423558	feat(frontend): add Playwright E2E tests with CI workflow (#2279 ) * feat(frontend): add Playwright E2E tests with CI workflow Add end-to-end testing infrastructure using Playwright (Chromium only). 14 tests across 5 spec files cover landing page, chat workspace, thread history, sidebar navigation, and agent chat — all with mocked LangGraph/Backend APIs via network interception (zero backend dependency). New files: - playwright.config.ts — Chromium, 30s timeout, auto-start Next.js - tests/e2e/utils/mock-api.ts — shared API mocks & SSE stream helpers - tests/e2e/{landing,chat,thread-history,sidebar,agent-chat}.spec.ts - .github/workflows/e2e-tests.yml — push main + PR trigger, paths filter Updated: package.json, Makefile, .gitignore, CONTRIBUTING.md, frontend/CLAUDE.md, frontend/AGENTS.md, frontend/README.md Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: apply Copilot suggestions --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-18 08:21:08 +08:00
DanielWalnut	898f4e8ac2	fix: Memory update system has cache corruption, data loss, and thread-safety bugs (#2251 ) * fix(memory): cache corruption, thread-safety, and caller mutation bugs Bug 1 (updater.py): deep-copy current_memory before passing to _apply_updates() so a subsequent save() failure cannot leave a partially-mutated object in the storage cache. Bug 3 (storage.py): add _cache_lock (threading.Lock) to FileMemoryStorage and acquire it around every read/write of _memory_cache, fixing concurrent-access races between the background timer thread and HTTP reload calls. Bug 4 (storage.py): replace in-place mutation memory_data["lastUpdated"] = ... with a shallow copy memory_data = {**memory_data, "lastUpdated": ...} so save() no longer silently modifies the caller's dict. Regression tests added for all three bugs in test_memory_storage.py and test_memory_updater.py. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> style: format test_memory_updater.py with ruff Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style: remove stale bug-number labels from code comments and docstrings Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 12:00:31 +08:00
dependabot[bot]	259a6844bf	chore(deps): bump python-multipart from 0.0.22 to 0.0.26 in /backend (#2282 ) Bumps [python-multipart](https://github.com/Kludex/python-multipart) from 0.0.22 to 0.0.26. - [Release notes](https://github.com/Kludex/python-multipart/releases) - [Changelog](https://github.com/Kludex/python-multipart/blob/master/CHANGELOG.md) - [Commits](https://github.com/Kludex/python-multipart/compare/0.0.22...0.0.26) --- updated-dependencies: - dependency-name: python-multipart dependency-version: 0.0.26 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-04-16 09:07:28 +08:00
d 🔹	a664d2f5c4	fix(checkpointer): create parent directory before opening SQLite in sync provider (#2272 ) * fix(checkpointer): create parent directory before opening SQLite in sync provider The sync checkpointer factory (_sync_checkpointer_cm) opens a SQLite connection without first ensuring the parent directory exists. The async provider and both store providers already call ensure_sqlite_parent_dir(), but this call was missing from the sync path. When the deer-flow harness package is used from an external virtualenv (where the .deer-flow directory is not pre-created), the missing parent directory causes: sqlite3.OperationalError: unable to open database file Add the missing ensure_sqlite_parent_dir() call in the sync SQLite branch, consistent with the async provider, and add a regression test. Closes #2259 * style: fix ruff format + add call-order assertion for ensure_parent_dir - Fix formatting in test_checkpointer.py (ruff format) - Add test_sqlite_ensure_parent_dir_before_connect to verify ensure_sqlite_parent_dir is called before from_conn_string (addresses Copilot review suggestion) --------- Co-authored-by: voidborne-d <voidborne-d@users.noreply.github.com>	2026-04-16 09:06:38 +08:00
YuJitang	105db00987	feat: show token usage per assistant response (#2270 ) * feat: show token usage per assistant response * fix: align client models response with token usage * fix: address token usage review feedback * docs: clarify token usage config example --------- Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-16 08:56:49 +08:00
Nan Gao	0e16a7fe55	fix(frontend): make Suggestion button opaque in dark mode (#2276 ) * fix(frontend): make Suggestion button opaque in dark mode The outline Button variant applies dark:bg-input/30, leaving Suggestion pills ~70% transparent in dark mode. Scrolled chat content bled through the buttons, making suggestion text unreadable. Override with dark:bg-background so it matches the opaque light-mode appearance. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix the lint error of commit --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Willem Jiang <willem.jiang@gmail.com>	2026-04-16 08:55:16 +08:00
Nan Gao	4d3038a7b6	fix(frontend): stop artifact panel from auto-opening on rehydrated write_file (#2278 ) After a page refresh, the artifact panel's autoOpen/autoSelect state is reset to true. Submitting a new question flips thread.isLoading to true, which message-list passes to every MessageGroup — including historical ones. The previous response's last write_file step then satisfies the auto-open condition and re-pops the stale artifact. Gate the auto-open on the tool call having no result yet, so only a write_file that is still streaming in the current response can trigger it; rehydrated tool calls always carry a result and are now skipped. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-16 08:46:47 +08:00
Hinotobi	2176b2bbfc	fix: validate bootstrap agent names before filesystem writes (#2274 ) * fix: validate bootstrap agent names before filesystem writes * fix: tighten bootstrap agent-name validation	2026-04-16 08:36:42 +08:00
Wen	8e3591312a	test: add unit tests for ViewImageMiddleware (#2256 ) * test: add unit tests for ViewImageMiddleware - Add 33 test cases covering all 7 internal methods plus sync/async before_model hooks - Cover normal path, edge cases (missing keys, empty base64, stale ToolMessages before assistant turn), and deduplication logic - Related to Q2 Roadmap #1669 * test: add unit tests for ViewImageMiddleware Add 35 test cases covering all internal methods, before_model hooks, and edge cases (missing attrs, list-content dedup, stale ToolMessages). Related to #1669	2026-04-15 23:54:30 +08:00
Willem Jiang	242c654075	fix(frontend):lint error of message-list-item.tsx	2026-04-15 23:35:50 +08:00


1958 Commits