mirror of https://github.com/bytedance/deer-flow.git synced 2026-06-09 17:12:01 +00:00

test(e2e): deterministic record/replay front-back contract verification (#3365)

* test(e2e): record/replay front-back contract verification

Guards the front-back contract with a deterministic, key-free record/replay
harness (mirrors open-design's golden-trace approach):

- ReplayChatModel (tests/replay_provider.py): replays recorded LLM turns by a
  normalized hash of the model input. Strips <system-reminder>/date/uuid/tmp-path
  so one fixture replays across days and from both the browser and direct-POST
  paths; a miss raises loudly (no silent divergence).
- Recording is record-through-browser (scripts/record_gateway.py +
  build_fixture_from_jsonl.py + frontend/tests/e2e-record): a real run is driven
  through the real frontend so captured inputs match exactly what the browser
  sends; fixtures contain no API key.
- Layer 1 — backend golden (tests/test_replay_golden.py): replay through the real
  gateway, assert the SSE event sequence == committed golden.
- Layer 2 — full-stack render (frontend/tests/e2e-real-backend): real Next.js +
  real gateway (replay model) + Chromium; assert the replayed auto-title and
  follow-up suggestions render. DOM assertions are the gate; visual regression is
  a local dev gate (CI uploads the render as an artifact).
- CI (.github/workflows/replay-e2e.yml): both layers, triggered on EITHER side of
  the contract (frontend/** or backend gateway/harness/fixtures).
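
The normalized-hash lookup described in the first bullet can be sketched as follows. This is a minimal illustration, not the code in tests/replay_provider.py: the regular expressions, the `ReplayStore` class, the `ReplayMiss` exception, and the message shape (`role`/`content` dicts) are all assumptions made for the example.

```python
import hashlib
import json
import re

# Hypothetical normalization rules; the real harness may strip different things.
_VOLATILE = [
    (re.compile(r"<system-reminder>.*?</system-reminder>", re.DOTALL), ""),
    (re.compile(r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b", re.I), "<uuid>"),
    (re.compile(r"\b\d{4}-\d{2}-\d{2}\b"), "<date>"),
    (re.compile(r"/tmp/[^\s\"']+"), "<tmp>"),
]


def normalize(text):
    """Replace run-specific fragments with stable placeholders."""
    for pattern, replacement in _VOLATILE:
        text = pattern.sub(replacement, text)
    # Collapse whitespace so removed blocks do not leave different spacing.
    return " ".join(text.split())


def input_key(messages):
    """Hash the normalized model input so equivalent inputs share one key."""
    canonical = json.dumps(
        [[m["role"], normalize(m["content"])] for m in messages],
        ensure_ascii=False,
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class ReplayMiss(LookupError):
    """Raised when no recorded turn matches the model input."""


class ReplayStore:
    """Hypothetical in-memory stand-in for a recorded fixture."""

    def __init__(self):
        self._turns = {}

    def record(self, messages, response):
        self._turns[input_key(messages)] = response

    def replay(self, messages):
        key = input_key(messages)
        if key not in self._turns:
            # Fail loudly instead of silently diverging from the recording.
            raise ReplayMiss(f"no recorded turn for input hash {key[:12]}")
        return self._turns[key]
```

Hashing the normalized input rather than the raw input is what lets one recording match on a later day or from a different request path, while an unexpected change to the prompt still surfaces as a miss.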

* test(e2e): multi-run render-order cross-stack scenario (#3352)

Guards the dangerous front-back class where a backend ordering change
silently breaks a frontend assumption while both sides' unit tests stay
green. Reproduces issue #3352: backend list_by_thread returns runs
newest-first (#2932) and the frontend prepended per-run pages, inverting
chronological order once the checkpoint no longer held the older messages.

- tests/seed_runs_router.py: test-only seeder, mounted on the replay
  gateway only when DEERFLOW_ENABLE_TEST_SEED=1 (never in the production
  app). Seeds a thread with >=2 runs + per-run message events and no
  checkpoint -- the #3352 precondition -- so the frontend per-run reload
  path is the sole source of truth and the prepend inversion is observable.
- frontend/tests/e2e-real-backend/multi-run-order.spec.ts: drives the real
  frontend against the real gateway, asserts the first run renders above
  the second. Reverting the #3354 fix turns it red.
- replay-e2e.yml: trigger on the new replay test-infra paths.
- docs: REPLAY_E2E.md cross-stack scenario section.

* test(e2e): address Copilot review on the replay harness

- Fix stale recorder references (scripts/record_traces.py ->
  scripts/record_gateway.py + scripts/build_fixture_from_jsonl.py) in
  replay_provider.py, test_replay_golden.py, _replay_fixture.py.
- MODE_CONTEXT['ultra']: thinking_enabled False -> True, mirroring the
  frontend's `context.mode !== 'flash'` (hooks.ts). It did not affect the
  hashed input (Layer 1 golden still green), but the table now matches the
  real frontend context it claims to mirror.
- replay_provider.py docstring: stop claiming memory is recorded-enabled;
  the replay config disables memory/summarization for determinism (title
  stays, as an in-graph deterministic call).
- record_gateway.py / run_replay_gateway.py: override DEER_FLOW_HOME instead
  of setdefault, so an outer value can't leak into the hermetic harness.
- record_gateway.py: clear error when DEERFLOW_RECORD_OUT is unset (was a
  bare KeyError).
- playwright.record.config.ts: forward OPENAI_*/DEERFLOW_RECORD_OUT only when
  set, so the gateway raises a clear 'missing env' error instead of getting ''.
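
The environment handling in the last three bullets can be illustrated with a small sketch. The helper names (`hermetic_env`, `require_env`, `forward_if_set`) are hypothetical and do not come from record_gateway.py or playwright.record.config.ts; only the variable names DEER_FLOW_HOME and DEERFLOW_RECORD_OUT are taken from the commit message.

```python
def hermetic_env(outer, home):
    """Return a copy of the environment with DEER_FLOW_HOME forced to `home`.

    Using setdefault here would keep any outer value, which is the leak
    the commit message describes; plain assignment always overrides it.
    """
    env = dict(outer)
    env["DEER_FLOW_HOME"] = home
    return env


def require_env(env, name):
    """Fetch a required variable, failing with a readable message."""
    value = env.get(name)
    if not value:
        # Covers both "unset" and "set to an empty string".
        raise RuntimeError(f"{name} must be set before starting the recorder")
    return value


def forward_if_set(outer, names):
    """Forward only variables that are actually set and non-empty."""
    return {name: outer[name] for name in names if outer.get(name)}
```

Forwarding only variables that are set means the child process sees a missing variable rather than an empty string, so a check like `require_env` can report the real problem.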

* test(e2e): address Copilot review round 2

- seed_runs_router.py: constrain SeedMessage.role to Literal['human','ai']
  so a bad value is a clean 422 at the boundary instead of a 500
  (KeyError on _EVENT_TYPE).
- record-write-read-file.spec.ts: waitForCaptureStable now throws on
  timeout instead of returning the last count, so a truncated/partial
  recording can't pass silently.
- real-backend-render.spec.ts: guard the suggestions JSON.parse; a
  bracket-prefixed non-JSON turn falls back to '' so the existing
  not.toBe('') assertion fails clearly instead of a generic parse throw.
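
The two test-side fixes above follow the same idea: a failure should surface as a failure, not as a plausible-looking value. The sketch below shows that idea in Python for consistency with the backend harness; the actual specs are TypeScript, and the function names, parameters, and defaults here are assumptions for illustration only.

```python
import json
import time


def wait_for_stable(read_count, stable_for=0.05, timeout=1.0, interval=0.01):
    """Poll read_count() until its value stops changing for `stable_for` seconds.

    Raises TimeoutError instead of returning the last value seen, so a
    capture that is still growing at the deadline cannot pass silently.
    """
    deadline = time.monotonic() + timeout
    last = read_count()
    stable_since = time.monotonic()
    while time.monotonic() < deadline:
        time.sleep(interval)
        current = read_count()
        now = time.monotonic()
        if current != last:
            last, stable_since = current, now
        elif now - stable_since >= stable_for:
            return current
    raise TimeoutError(f"capture did not stabilize within {timeout}s (last count: {last})")


def parse_suggestions(text):
    """Parse a JSON list of suggestions, falling back to '' on bad input.

    The empty-string fallback lets a later "must not be empty" assertion
    fail with a clear message instead of a generic parse error.
    """
    try:
        value = json.loads(text)
    except json.JSONDecodeError:
        return ""
    return value if isinstance(value, list) else ""
```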

2026-06-08 12:35:03 +08:00

Name	Last commit	Date
.vscode	chore: specify project name	2026-01-14 09:58:53 +08:00
public	feat(frontend): support static website demo mode (#3170)	2026-05-23 00:10:56 +08:00
scripts	feat: add uploads	2026-01-24 19:38:08 +08:00
src	fix(frontend): truncate overflowing text in agent cards (#3391)	2026-06-07 23:29:59 +08:00
tests	test(e2e): deterministic record/replay front-back contract verification (#3365)	2026-06-08 12:35:03 +08:00
.env.example	docs: clean standalone LangGraph server remnants (#3301)	2026-05-29 11:36:45 +08:00
.gitignore	chore: create frontend project from boilerplate	2026-01-14 09:50:26 +08:00
.npmrc	chore: add .npmrc back	2026-02-10 22:07:25 +08:00
.prettierignore	Stabilize write artifact previews (#3172)	2026-05-23 16:56:14 +08:00
AGENTS.md	feat(frontend): add Playwright E2E tests with CI workflow (#2279)	2026-04-18 08:21:08 +08:00
CLAUDE.md	docs: clean standalone LangGraph server remnants (#3301)	2026-05-29 11:36:45 +08:00
components.json	feat: implement the first section of landing page	2026-01-23 00:15:21 +08:00
Dockerfile	chore(uv): speed up Docker builds with mirrors (#1600)	2026-03-30 20:16:44 +08:00
eslint.config.js	fix: fix eslint errors and warnings	2026-01-31 21:46:31 +08:00
Makefile	feat(frontend): support static website demo mode (#3170)	2026-05-23 00:10:56 +08:00
next.config.js	feat(frontend): support static website demo mode (#3170)	2026-05-23 00:10:56 +08:00
package.json	chore(deps): bump next from 16.1.7 to 16.2.6 in /frontend (#2899)	2026-05-12 10:45:40 +08:00
playwright.config.ts	fix: resolve make dev and test-e2e errors (#2570)	2026-04-26 17:27:32 +08:00
playwright.real-backend.config.ts	test(e2e): deterministic record/replay front-back contract verification (#3365)	2026-06-08 12:35:03 +08:00
playwright.record.config.ts	test(e2e): deterministic record/replay front-back contract verification (#3365)	2026-06-08 12:35:03 +08:00
pnpm-lock.yaml	chore(deps): bump uuid from 10.0.0 to 14.0.0 in /frontend (#3281)	2026-05-28 07:14:44 +08:00
pnpm-workspace.yaml	Add packages section to pnpm-workspace.yaml (#1382)	2026-03-26 16:09:35 +08:00
postcss.config.js	chore: create frontend project from boilerplate	2026-01-14 09:50:26 +08:00
prettier.config.js	chore: create frontend project from boilerplate	2026-01-14 09:50:26 +08:00
README.md	docs: align runtime docs with gateway mode (#2868)	2026-05-12 16:19:21 +08:00
tsconfig.json	feat: implement the first version of landing page	2026-01-23 13:24:03 +08:00
vitest.config.ts	feat(frontend): set up Vitest frontend testing infrastructure with CI workflow (#2147)	2026-04-12 18:00:43 +08:00

README.md

DeerFlow Frontend

As with the original DeerFlow 1.0, we want to give the community a minimal, easy-to-use web interface, now built on a more modern and flexible architecture.

Tech Stack

Framework: Next.js 16 with App Router
UI: React 19, Tailwind CSS 4, Shadcn UI, MagicUI and React Bits
AI Integration: LangGraph SDK and Vercel AI Elements

Quick Start

Prerequisites

Node.js 22+
pnpm 10.26.2+

Installation

# Install dependencies
pnpm install

# Copy environment variables
cp .env.example .env
# Edit .env with your configuration

Development

# Start development server
pnpm dev

# The app will be available at http://localhost:3000

Build & Test

# Type check
pnpm typecheck

# Check formatting
pnpm format

# Apply formatting
pnpm format:write

# Lint
pnpm lint

# Run unit tests
pnpm test

# One-time setup: install Playwright Chromium browser
pnpm exec playwright install chromium

# Run E2E tests (builds and starts production server automatically)
pnpm test:e2e

# Build for production
pnpm build

# Start production server
pnpm start

Site Map

├── /                    # Landing page
├── /chats               # Chat list
├── /chats/new           # New chat page
└── /chats/[thread_id]   # A specific chat page

Configuration

Environment Variables

Key environment variables (see .env.example for the full list):

# Backend API URL (optional, uses local Next.js/nginx proxy by default)
NEXT_PUBLIC_BACKEND_BASE_URL="http://localhost:8001"
# LangGraph-compatible API URL (optional, uses local Next.js/nginx proxy by default)
NEXT_PUBLIC_LANGGRAPH_BASE_URL="http://localhost:8001/api"

Project Structure

tests/
├── e2e/                    # E2E tests (Playwright, Chromium, mocked backend)
└── unit/                   # Unit tests (mirrors src/ layout)
src/
├── app/                    # Next.js App Router pages
│   ├── api/                # API routes
│   ├── workspace/          # Main workspace pages
│   └── mock/               # Mock/demo pages
├── components/             # React components
│   ├── ui/                 # Reusable UI components
│   ├── workspace/          # Workspace-specific components
│   ├── landing/            # Landing page components
│   └── ai-elements/        # AI-related UI elements
├── core/                   # Core business logic
│   ├── api/                # API client & data fetching
│   ├── artifacts/          # Artifact management
│   ├── config/             # App configuration
│   ├── i18n/               # Internationalization
│   ├── mcp/                # MCP integration
│   ├── messages/           # Message handling
│   ├── models/             # Data models & types
│   ├── settings/           # User settings
│   ├── skills/             # Skills system
│   ├── threads/            # Thread management
│   ├── todos/              # Todo system
│   └── utils/              # Utility functions
├── hooks/                  # Custom React hooks
├── lib/                    # Shared libraries & utilities
├── server/                 # Server-side code
│   └── better-auth/        # Authentication setup and session helpers
└── styles/                 # Global styles

Scripts

Command	Description
`pnpm dev`	Start development server with Turbopack
`pnpm build`	Build for production
`pnpm start`	Start production server
`pnpm test`	Run unit tests with Vitest
`pnpm test:e2e`	Run E2E tests with Playwright
`pnpm format`	Check formatting with Prettier
`pnpm format:write`	Apply formatting with Prettier
`pnpm lint`	Run ESLint
`pnpm lint:fix`	Fix ESLint issues
`pnpm typecheck`	Run TypeScript type checking
`pnpm check`	Run both lint and typecheck

Development Notes

Uses pnpm (version pinned via packageManager in package.json); workspace packages are declared in pnpm-workspace.yaml
Turbopack enabled by default in development for faster builds
Environment validation can be skipped with SKIP_ENV_VALIDATION=1 (useful for Docker)
Backend API URLs are optional; the local Next.js/nginx proxy is used by default in development

License

MIT License. See LICENSE for details.