deer-flow

mirror of https://github.com/bytedance/deer-flow.git synced 2026-05-14 12:43:45 +00:00

History

fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )

* fix(agents): make update_agent honor runtime.context user_id like setup_agent

PR #2784 hardened setup_agent to prefer runtime.context["user_id"] (set by
inject_authenticated_user_context from the auth-validated request) over the
contextvar, so an agent created during the bootstrap flow always lands under
users/<auth_uid>/agents/<name>. update_agent was left calling
get_effective_user_id() unconditionally — the same class of bug that produced
issues #2782 / #2862 still applies whenever the contextvar is not available
on the executing task (background work, future cross-process drivers,
checkpoint resume on a different task). In that regime update_agent silently
routes writes to users/default/agents/<name>, corrupting the shared default
bucket and losing the user's edit.

Extract the resolution policy into a shared resolve_runtime_user_id helper
on deerflow.runtime.user_context and route both setup_agent and update_agent
through it so the two halves of the lifecycle stay in lockstep.

Add load-bearing end-to-end tests that drive a real langchain.agents
create_agent graph with a fake LLM, exercising the full pipeline:

  HTTP wire format
    -> app.gateway.services.start_run config-assembly
    -> deerflow.runtime.runs.worker._build_runtime_context
    -> langchain.agents create_agent graph
    -> ToolNode dispatch (sync + async + sub-graph + ContextThreadPoolExecutor)
    -> setup_agent / update_agent

The negative-control tests intentionally land in users/default/ to prove the
positive tests are actually load-bearing rather than vacuously passing.

The new test_update_agent_e2e_user_isolation suite included a test that
failed against main and now passes after this fix.

* style: ruff format on new e2e tests

* test(e2e): real-server HTTP test driving setup_agent through the full ASGI stack

Adds tests/test_setup_agent_http_e2e_real_server.py — a single load-bearing
test that drives the entire FastAPI gateway through starlette.testclient.
TestClient with no mocks above the LLM:

  - lifespan boots (config, sqlite engine, LangGraph runtime, channels)
  - POST /api/v1/auth/register (real password hash, real sqlite write,
    issues access_token + csrf_token cookies)
  - POST /api/threads (real thread_meta + checkpoint creation)
  - POST /api/threads/{id}/runs/stream with the exact wire shape the React
    frontend sends (assistant_id + input + config + context with
    agent_name/is_bootstrap)
  - AuthMiddleware -> CSRFMiddleware -> require_permission ->
    start_run -> inject_authenticated_user_context ->
    asyncio.create_task(run_agent) -> worker._build_runtime_context ->
    Runtime injection -> ToolNode dispatch -> real setup_agent
  - Asserts SOUL.md is under users/<authenticated_uid>/agents/<name>/
    and NOT under users/default/agents/<name>/.

DEER_FLOW_HOME and the sqlite path are redirected into tmp_path so the test
never touches the real .deer-flow directory or developer database. The only
patch above the LLM boundary is replacing create_chat_model with a fake that
emits a single setup_agent tool_call.

This is the "真实验证" answer: it reproduces what curl-against-uvicorn would
do, minus the network socket layer.

* test: address Copilot review on user-isolation e2e tests

- Drop "currently expected to FAIL" wording from update_agent e2e docstring
  and header (Copilot review): the fix is in this PR, the test pins the
  corrected behaviour rather than driving a future change.
- Rephrase the assertion failure messages from "BUG:" to "REGRESSION:" to
  match the test's role on the fixed branch.
- Bound _drain_stream with a wall-clock timeout, a max-bytes cap, and an
  early break on the "event: end" SSE frame (Copilot review). Stops the
  test from hanging on a stuck run or runaway heartbeat loop.
- Replace the misleading "patch both module aliases" comment with an
  explanation of why patching lead_agent.agent.create_chat_model is the
  only correct target (Copilot review): lead_agent rebinds the symbol
  into its own namespace at import time, so patching deerflow.models is
  too late.

* test(refactor): address WillemJiang review on user-isolation e2e tests

- Extract the duplicated FakeToolCallingModel (and a
  build_single_tool_call_model helper) into tests/_agent_e2e_helpers.py.
  All three e2e files now import from the shared module instead of
  redefining the shim locally.
- Convert the manual p.start() / p.stop() try/finally blocks in
  test_update_agent_e2e_user_isolation.py to contextlib.ExitStack so
  patch lifecycle is Pythonic and exception-safe.
- Lift the isolated_app fixture's private-attribute resets into a
  named _reset_process_singletons helper with a comment block
  explaining why each singleton has to be invalidated for true e2e
  isolation, and why raising=False is intentional. Makes the
  fragility visible and the intent self-documenting rather than
  leaving the resets inline as opaque monkeypatch calls.

Net change: -59 lines (143 -> 84) across the three test files, with
every assertion intact. Full suite remains 69 passed / lint clean.

* test(e2e): make real-server test self-supply its config

CI's actions/checkout only ships config.example.yaml (the real config.yaml
is gitignored), so the production config-discovery search
(./config.yaml -> ../config.yaml -> $DEER_FLOW_CONFIG_PATH) finds nothing
and the test fails at lifespan boot with FileNotFoundError. The dev-machine
run passed only because a local config.yaml happened to exist.

Write a minimal AppConfig-valid yaml into tmp_path and pin
DEER_FLOW_CONFIG_PATH to it. The yaml carries just what the schema requires
(a single fake-test-model entry, LocalSandboxProvider, sqlite database).
The LLM never gets instantiated because the test patches create_chat_model
on the lead agent module, so the api_key/base_url stay placeholders.

Verified by hiding the local config.yaml to mirror the CI checkout — the
test now passes in both environments.

2026-05-12 23:18:54 +08:00

_agent_e2e_helpers.py

fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )

2026-05-12 23:18:54 +08:00

_router_auth_helpers.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

conftest.py

refactor(skills): Unified skill storage capability (#2613 )

2026-05-01 13:23:26 +08:00

test_acp_config.py

feat(acp): add env field to ACPAgentConfig for subprocess env injection (#1447 )

2026-03-27 20:03:30 +08:00

test_aio_sandbox_local_backend.py

[security] fix(sandbox): bind local Docker ports to loopback (#2633 )

2026-04-30 11:40:28 +08:00

test_aio_sandbox_provider.py

feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )

2026-04-26 11:13:01 +08:00

test_aio_sandbox.py

fix(sandbox): pass no_change_timeout to exec_command to prevent 120s premature termination (#2685 )

2026-05-01 22:27:02 +08:00

test_app_config_reload.py

fix(config): reset config-backed singletons on hot reload (#2588 )

2026-05-06 10:17:55 +08:00

test_artifacts_router.py

feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )

2026-04-26 11:08:11 +08:00

test_auth_config.py

feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )

2026-04-26 11:08:11 +08:00

test_auth_errors.py

feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )

2026-04-26 11:08:11 +08:00

test_auth_middleware.py

feat: implement process-local internal authentication for Gateway and enhance CSRF handling

2026-04-26 22:20:57 +08:00

test_auth_type_system.py

feat(auth): release-validation pass for 2.0-rc — 12 blockers + simplify follow-ups (#2008 )

2026-04-26 11:08:11 +08:00

test_auth.py

fix(security): harden auth system and fix run journal logic bug (#2593 )

2026-04-28 11:34:07 +08:00

test_channel_file_attachments.py

[security] fix(upload): reject symlinked upload destinations (#2623 )

2026-05-02 15:19:28 +08:00

test_channels.py

fix(channels): authenticate gateway command requests (#2742 )

2026-05-06 15:27:34 +08:00

test_check_script.py

fix(check): windows pnpm version detection in check script (#2189 )

2026-04-14 10:29:44 +08:00

test_checkpointer_none_fix.py

feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )

2026-04-26 11:09:55 +08:00

test_checkpointer.py

fix(packaging): add postgres extra for store/checkpointer supportFix postgres extra install guidance (#2584 )

2026-05-09 09:49:08 +08:00

test_clarification_middleware.py

fix(backend): make clarification messages idempotent (#2350 ) (#2351 )

2026-04-19 22:00:58 +08:00

test_claude_provider_oauth_billing.py

fix(oauth): Harden Claude OAuth cache-control handling (#1583 )

2026-03-30 07:41:18 +08:00

test_claude_provider_prompt_caching.py

fix: cap prompt caching breakpoints at 4 to prevent API 400 errors (#2449 )

2026-04-25 19:40:06 +08:00

test_cli_auth_providers.py

fix(provider): preserve streamed Codex output when response.completed.output is empty (#1928 )

2026-04-07 18:21:22 +08:00

test_client_e2e.py

fix(harness): resolve runtime paths from project root (#2642 )

2026-05-01 22:19:50 +08:00

test_client_live.py

[Security] Address critical host-shell escape in LocalSandboxProvider (#1547 )

2026-03-29 21:03:58 +08:00

test_client_message_serialization.py

feat: refine token usage display modes (#2329 )

2026-05-04 09:56:16 +08:00

test_client.py

feat: refine token usage display modes (#2329 )

2026-05-04 09:56:16 +08:00

test_codex_provider.py

chroe(2585): keep polishing the code of codex token usage (#2689 )

2026-05-02 15:04:11 +08:00

test_config_version.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_converters.py

feat(persistence): add unified persistence layer with event store, token tracking, and feedback (#1930 )

2026-04-26 11:05:47 +08:00

test_create_deerflow_agent_live.py

feat: add create_deerflow_agent SDK entry point (Phase 1) (#1203 )

2026-03-29 15:31:18 +08:00

test_create_deerflow_agent.py

feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )

2026-05-07 16:15:15 +08:00

test_credential_loader.py

feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )

2026-05-07 16:15:15 +08:00

test_csrf_middleware.py

feat: static system prompt with DynamicContextMiddleware for prefix-cache optimization (#2801 )

2026-05-09 09:27:02 +08:00

test_custom_agent.py

feat(agent): add custom-agent self-updates with user isolation (#2713 )

2026-05-05 23:17:42 +08:00

test_dangling_tool_call_middleware.py

fix(middleware): Handle invalid tool calls in dangling pairing middleware (#2890 ) (#2891 )

2026-05-12 10:55:13 +08:00

test_detect_uv_extras.py

fix(scripts): preserve uv extras across make dev restarts (#2754 ) (#2767 )

2026-05-10 22:28:29 +08:00

test_dev_entrypoint.py

fix(scripts): preserve uv extras across make dev restarts (#2754 ) (#2767 )

2026-05-10 22:28:29 +08:00

test_dingtalk_channel.py

feat(channels): add DingTalk channel integration (#2628 )

2026-04-30 11:25:33 +08:00

test_discord_channel.py

feat(channels): add Discord channel integration (#1806 )

2026-04-11 17:48:04 +08:00

test_docker_sandbox_mode_detection.py

fix Windows Docker sandbox path mounting (#1634 )

2026-03-31 22:19:27 +08:00

test_doctor.py

feat(dx): Setup Wizard + doctor command — closes #2030 (#2034 )

2026-04-10 17:43:39 +08:00

test_dynamic_context_middleware.py

fix(harness): preserve dynamic context across summarization (#2823 )

2026-05-09 19:39:36 +08:00

test_ensure_admin.py

refactor: Remove init_token handling from admin initialization logic and related tests

2026-04-26 11:09:56 +08:00

test_exa_tools.py

feat(community): add Exa search as community tool provider (#1357 )

2026-04-08 17:13:39 +08:00

test_feedback.py

feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )

2026-04-26 11:09:55 +08:00

test_feishu_parser.py

Feature/feishu receive file (#1608 )

2026-04-06 22:14:12 +08:00

test_file_conversion.py

[security] fix(uploads): require explicit opt-in for host-side document conversion (#2332 )

2026-04-18 22:47:42 +08:00

test_firecrawl_tools.py

feat(dx): Setup Wizard + doctor command — closes #2030 (#2034 )

2026-04-10 17:43:39 +08:00

test_gateway_deps_config.py

refactor: root release config in gateway runtime (#2611 )

2026-04-28 00:13:04 +08:00

test_gateway_docs_toggle.py

fix(nginx): defer CORS to gateway allowlist (#2861 )

2026-05-11 17:38:37 +08:00

test_gateway_lifespan_shutdown.py

fix(gateway): bound lifespan shutdown hooks to prevent worker hang under uvicorn reload (#2331 )

2026-04-23 19:41:26 +08:00

test_gateway_runtime_cleanup.py

fix(nginx): defer CORS to gateway allowlist (#2861 )

2026-05-11 17:38:37 +08:00

test_gateway_services.py

fix: keep new agent bootstrap in user scope (#2784 )

2026-05-09 19:43:50 +08:00

test_guardrail_middleware.py

feat(guardrails): add pre-tool-call authorization middleware with pluggable providers (#1240 )

2026-03-23 18:07:33 +08:00

test_harness_boundary.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_infoquest_client.py

feat(harness): integration ACP agent tool (#1344 )

2026-03-26 14:20:18 +08:00

test_initialize_admin.py

fix(security): harden auth system and fix run journal logic bug (#2593 )

2026-04-28 11:34:07 +08:00

test_invoke_acp_agent_tool.py

refactor: thread app_config through lead and subagent task path (#2666 )

2026-05-02 06:37:49 +08:00

test_jina_client.py

fix(jina): log transient failures at WARNING without traceback (#2484 ) (#2485 )

2026-04-24 16:00:14 +08:00

test_langgraph_auth.py

fix(security): harden auth system and fix run journal logic bug (#2593 )

2026-04-28 11:34:07 +08:00

test_lead_agent_model_resolution.py

feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )

2026-05-07 16:15:15 +08:00

test_lead_agent_prompt.py

fix(skills): enforce allowed-tools metadata (#2626 )

2026-05-07 08:34:43 +08:00

test_lead_agent_skills.py

fix(skills): enforce allowed-tools metadata (#2626 )

2026-05-07 08:34:43 +08:00

test_llm_error_handling_middleware.py

refactor: thread app_config through middleware factories (#2652 )

2026-04-30 12:41:09 +08:00

test_local_bash_tool_loading.py

fix(sandbox): improve sandbox security and preserve multimodal content (#2114 )

2026-04-11 16:52:10 +08:00

test_local_sandbox_encoding.py

fix(sandbox): disable msys path conversion (#2766 )

2026-05-08 10:13:11 +08:00

test_local_sandbox_provider_mounts.py

fix(harness): reset local sandbox singleton with provider lifecycle (#2834 )

2026-05-11 07:42:15 +08:00

test_local_skill_storage_write.py

refactor(skills): Unified skill storage capability (#2613 )

2026-05-01 13:23:26 +08:00

test_logging_level_from_config.py

fix(config): unify log_level from config.yaml across Gateway and debug entry points (#2601 )

2026-04-30 22:27:14 +08:00

test_loop_detection_config.py

feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )

2026-05-07 16:15:15 +08:00

test_loop_detection_middleware.py

feat(loop-detection): make loop detection configurable with per-tool frequency overrides (#2711 )

2026-05-07 16:15:15 +08:00

test_mcp_client_config.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_mcp_custom_interceptors.py

feat(mcp): support custom tool interceptors via extensions_config.json (#2451 )

2026-04-25 09:18:13 +08:00

test_mcp_oauth.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_mcp_sync_wrapper.py

fix(harness): wrap async-only config tools for sync client execution (#2878 )

2026-05-11 22:14:13 +08:00

test_memory_prompt_injection.py

fix: inject longTermBackground into memory prompt (#1734 )

2026-04-03 11:21:58 +08:00

test_memory_queue_user_isolation.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_memory_queue.py

feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )

2026-04-26 11:13:01 +08:00

test_memory_router.py

feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )

2026-04-26 11:13:01 +08:00

test_memory_storage_user_isolation.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_memory_storage.py

fix: Memory update system has cache corruption, data loss, and thread-safety bugs (#2251 )

2026-04-17 12:00:31 +08:00

test_memory_thread_meta_isolation.py

feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )

2026-04-26 11:09:55 +08:00

test_memory_updater_user_isolation.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_memory_updater.py

fix(memory): replace short-lived asyncio.run() with persistent event loop (#2627 )

2026-04-30 17:59:57 +08:00

test_memory_upload_filtering.py

feat: flush memory before summarization (#2176 )

2026-04-14 15:01:06 +08:00

test_migration_user_isolation.py

feat(agent): add custom-agent self-updates with user isolation (#2713 )

2026-05-05 23:17:42 +08:00

test_mindie_provider.py

chore(adpator):Adapt MindIE engine model and improve testing and fixes (#2523 )

2026-04-28 15:09:31 +08:00

test_model_config.py

feat(codex): support explicit OpenAI Responses API config (#1235 )

2026-03-22 20:39:26 +08:00

test_model_factory.py

feat(persistence): add unified persistence layer with event store, token tracking, and feedback (#1930 )

2026-04-26 11:05:47 +08:00

test_owner_isolation.py

feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )

2026-04-26 11:09:55 +08:00

test_patched_deepseek.py

fix: resolve missing serialized kwargs in PatchedChatDeepSeek (#2025 )

2026-04-09 16:07:16 +08:00

test_patched_minimax.py

fix: improve MiniMax code plan integration (#1169 )

2026-03-20 17:18:59 +08:00

test_patched_openai.py

fix(LLM): fixing Gemini thinking + tool calls via OpenAI gateway (#1180 ) (#1205 )

2026-03-26 15:07:05 +08:00

test_paths_user_isolation.py

feat(agent): add custom-agent self-updates with user isolation (#2713 )

2026-05-05 23:17:42 +08:00

test_persistence_scaffold.py

fix(packaging): add postgres extra for store/checkpointer supportFix postgres extra install guidance (#2584 )

2026-05-09 09:49:08 +08:00

test_present_file_tool_core_logic.py

feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )

2026-04-26 11:13:01 +08:00

test_provisioner_kubeconfig.py

feat(provisioner): add optional PVC support for sandbox volumes (#2020 )

2026-04-10 20:40:30 +08:00

test_provisioner_pvc_volumes.py

feat(provisioner): add optional PVC support for sandbox volumes (#2020 )

2026-04-10 20:40:30 +08:00

test_readability.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_reflection_resolvers.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_remote_sandbox_backend.py

fix: Supplement list_running in RemoteSandboxBackend (#2716 )

2026-05-05 18:53:10 +08:00

test_run_event_store_pagination.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_run_event_store.py

fix(events): serialize structured db event content (#2762 )

2026-05-08 10:17:17 +08:00

test_run_journal.py

fix(runtime): persist run message summaries (#2850 )

2026-05-11 19:54:00 +08:00

test_run_manager.py

feat(run): Propagates model_name from the gateway request through the runtime and persistence stack to the SQLite database. (#2775 )

2026-05-11 21:45:18 +08:00

test_run_repository.py

feat(run): Propagates model_name from the gateway request through the runtime and persistence stack to the SQLite database. (#2775 )

2026-05-11 21:45:18 +08:00

test_run_worker_rollback.py

fix(runtime): make rollback restore checkpoint supersede newer checkpoints (#2582 )

2026-05-02 11:25:45 +08:00

test_runs_api_endpoints.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_runtime_paths.py

fix(harness): restore legacy skills path fallback (#2694 ) (#2696 )

2026-05-03 23:40:59 +08:00

test_sandbox_audit_middleware.py

feat(sandbox): strengthen bash command auditing with compound splitting and expanded patterns (#1881 )

2026-04-07 17:15:24 +08:00

test_sandbox_orphan_reconciliation_e2e.py

fix(sandbox): add startup reconciliation to prevent orphaned container leaks (#1976 )

2026-04-09 17:21:23 +08:00

test_sandbox_orphan_reconciliation.py

fix(sandbox): add startup reconciliation to prevent orphaned container leaks (#1976 )

2026-04-09 17:21:23 +08:00

test_sandbox_search_tools.py

fix(sandbox): add missing path masking in ls_tool output (#2317 )

2026-04-18 08:46:59 +08:00

test_sandbox_tools_security.py

fix(sandbox): block host bash traversal escapes (#2560 )

2026-04-28 12:18:41 +08:00

test_security_scanner.py

feat(trace):Add run_name to the trace info for system agents. (#2492 )

2026-04-24 17:06:55 +08:00

test_serialization.py

feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )

2026-03-30 16:02:23 +08:00

test_serialize_message_content.py

feat(harness): integration ACP agent tool (#1344 )

2026-03-26 14:20:18 +08:00

test_serper_tools.py

feat(community): add Serper web search provider (#2630 )

2026-05-02 16:22:35 +08:00

test_setup_agent_e2e_user_isolation.py

fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )

2026-05-12 23:18:54 +08:00

test_setup_agent_http_e2e_real_server.py

fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )

2026-05-12 23:18:54 +08:00

test_setup_agent_tool.py

fix: keep new agent bootstrap in user scope (#2784 )

2026-05-09 19:43:50 +08:00

test_setup_wizard.py

enable token usage by default (#2841 )

2026-05-10 22:00:57 +08:00

test_skill_manage_tool.py

refactor(skills): Unified skill storage capability (#2613 )

2026-05-01 13:23:26 +08:00

test_skills_archive_root.py

refactor: extract shared skill installer and upload manager to harness (#1202 )

2026-03-25 16:28:33 +08:00

test_skills_bundled.py

fix(skills): validate bundled SKILL.md front-matter in CI (fixes #2443 ) (#2457 )

2026-04-23 14:06:14 +08:00

test_skills_custom_router.py

refactor(skills): Unified skill storage capability (#2613 )

2026-05-01 13:23:26 +08:00

test_skills_installer.py

refactor(skills): Unified skill storage capability (#2613 )

2026-05-01 13:23:26 +08:00

test_skills_loader.py

fix(harness): restore legacy skills path fallback (#2694 ) (#2696 )

2026-05-03 23:40:59 +08:00

test_skills_parser.py

fix(skills): enforce allowed-tools metadata (#2626 )

2026-05-07 08:34:43 +08:00

test_skills_validation.py

fix(skills): enforce allowed-tools metadata (#2626 )

2026-05-07 08:34:43 +08:00

test_sse_format.py

feat(gateway): implement LangGraph Platform API in Gateway, replace langgraph-cli (#1403 )

2026-03-30 16:02:23 +08:00

test_stream_bridge.py

Fix(#1702 ): stream resume run (#1858 )

2026-04-06 14:51:10 +08:00

test_subagent_executor.py

fix(subagents): consolidate system_prompt and skills into single SystemMessage (#2701 )

2026-05-11 09:59:06 +08:00

test_subagent_limit_middleware.py

fix(middleware): sync raw tool call metadata (#2757 )

2026-05-08 10:08:53 +08:00

test_subagent_prompt_security.py

feat(subagents): support per-subagent skill loading and custom subagent types (#2253 )

2026-04-23 23:59:47 +08:00

test_subagent_skills_config.py

refactor: thread app_config through lead and subagent task path (#2666 )

2026-05-02 06:37:49 +08:00

test_subagent_timeout_config.py

feat(subagents): allow model override per subagent in config.yaml (#2064 )

2026-04-12 16:40:21 +08:00

test_subagent_token_collector.py

fix: bucket subagent token usage into parent run totals (#2838 )

2026-05-10 22:47:30 +08:00

test_suggestions_router.py

refactor: thread release config through lead path (#2612 )

2026-04-28 14:53:18 +08:00

test_summarization_middleware.py

fix(harness): preserve dynamic context across summarization (#2823 )

2026-05-09 19:39:36 +08:00

test_task_tool_core_logic.py

fix: bucket subagent token usage into parent run totals (#2838 )

2026-05-10 22:47:30 +08:00

test_thread_data_middleware.py

Fix Windows backend test compatibility (#1384 )

2026-03-26 17:39:16 +08:00

test_thread_meta_repo.py

feat(persistence):Unified persistence layer with event store, feedback, and rebase cleanup (#2134 )

2026-04-26 11:09:55 +08:00

test_thread_run_messages_pagination.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_thread_token_usage.py

fix: use backend thread token usage for header total (#2800 )

2026-05-09 19:40:32 +08:00

test_threads_router.py

fix(gateway): return ISO 8601 timestamps from threads endpoints (#2599 )

2026-05-02 15:16:16 +08:00

test_title_generation.py

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

test_title_middleware_core_logic.py

fix title generation with dynamic context reminder (#2830 )

2026-05-09 18:22:58 +08:00

test_todo_middleware.py

fix(todo-middleware): prevent premature agent exit with incomplete todos (#2135 )

2026-04-14 11:11:26 +08:00

test_token_usage_config.py

enable token usage by default (#2841 )

2026-05-10 22:00:57 +08:00

test_token_usage_middleware.py

feat: static system prompt with DynamicContextMiddleware for prefix-cache optimization (#2801 )

2026-05-09 09:27:02 +08:00

test_token_usage.py

feat(harness): integration ACP agent tool (#1344 )

2026-03-26 14:20:18 +08:00

test_tool_args_schema_no_pydantic_warning.py

fix(tools): make write_file append discoverable in model-facing schema (#2843 )

2026-05-10 23:09:03 +08:00

test_tool_deduplication.py

fix(harness): wrap async-only config tools for sync client execution (#2878 )

2026-05-11 22:14:13 +08:00

test_tool_error_handling_middleware.py

fix(subagents): use model override for tools and middleware (#2641 )

2026-05-01 22:21:10 +08:00

test_tool_output_truncation.py

fix: add output truncation to ls_tool to prevent context window overflow (#1896 )

2026-04-06 15:09:57 +08:00

test_tool_search.py

fix: gate deferred MCP tool execution (#2513 )

2026-04-24 22:45:41 +08:00

test_tracing_config.py

feat(tracing): add optional Langfuse support (#1717 )

2026-04-02 13:06:10 +08:00

test_tracing_factory.py

feat(tracing): add optional Langfuse support (#1717 )

2026-04-02 13:06:10 +08:00

test_update_agent_e2e_user_isolation.py

fix(agents): make update_agent honor runtime.context user_id like setup_agent (#2867 )

2026-05-12 23:18:54 +08:00

test_update_agent_tool.py

feat(agent): add custom-agent self-updates with user isolation (#2713 )

2026-05-05 23:17:42 +08:00

test_uploads_manager.py

fix(uploads): add Windows support for safe symlink-protected uploads (#2794 )

2026-05-09 18:21:54 +08:00

test_uploads_middleware_core_logic.py

feat(persistence): per-user filesystem isolation, run-scoped APIs, and state/history simplification (#2153 )

2026-04-26 11:13:01 +08:00

test_uploads_router.py

Fix duplicate gateway upload filenames (#2789 )

2026-05-09 18:02:40 +08:00

test_user_context.py

fix the lint error in backend

2026-04-26 15:09:25 +08:00

test_utils_time.py

fix(gateway): return ISO 8601 timestamps from threads endpoints (#2599 )

2026-05-02 15:16:16 +08:00

test_view_image_middleware.py

test: add unit tests for ViewImageMiddleware (#2256 )

2026-04-15 23:54:30 +08:00

test_view_image_tool.py

fix(harness): constrain view_image to thread data paths (#2557 )

2026-04-28 11:13:17 +08:00

test_vllm_provider.py

feat(models): add vLLM provider support (#1860 )

2026-04-06 15:18:34 +08:00

test_wechat_channel.py

feat: add WeChat channel integration (#1869 )

2026-04-10 20:49:28 +08:00