mirror of https://github.com/bytedance/deer-flow.git synced 2026-06-09 17:12:01 +00:00

History

feat: MiniMax provider for image/video/podcast skills + new music-generation skill (#3437 )

* docs(spec): MiniMax integration for generation skills + new music skill

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* docs(plan): MiniMax generation providers implementation plan

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* test(skills): add importlib loader + FakeResp for skill tests

* test(skills): register loaded module in sys.modules; raise requests.HTTPError in FakeResp

* feat(image-generation): add MiniMax provider with env auto-detect

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor(image-generation): guard unknown provider, derive ref MIME, strengthen tests

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(video-generation): add MiniMax provider with async poll/download

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor(video-generation): surface base_resp errors while polling; add timeout test

* feat(podcast-generation): add MiniMax t2a_v2 provider with env auto-detect

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* refactor(podcast-generation): restore TTS credential guard; add volcengine + voice tests

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* feat(music-generation): new MiniMax music skill via skill-creator

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* refactor(music-generation): treat empty lyrics as absent; test no-audio-data path

* refactor(skills): add request timeouts to MiniMax network calls

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* Potential fix for pull request finding 'Explicit returns mixed with implicit (fall through) returns'

Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

* fix(models): strip inconsistent user-message names for MiniMax chat

DeerFlow middlewares tag user messages with provenance names (user-input, summary, loop_warning); langchain serializes them into the OpenAI-compatible payload and MiniMax rejects mismatched user-message names with "user name must be consistent (2013)". PatchedChatMiniMax now drops the per-message name from user-role messages. Point the config.example MiniMax models at PatchedChatMiniMax so they also get reasoning_content mapping.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(image-generation): MiniMax sends JSON prompt field, guard 1500-char limit

MiniMax image-01 takes one text string capped at 1500 chars, but the skill was sending the whole structured JSON. The MiniMax provider now extracts the JSON `prompt` field (relying on prompt_optimizer to expand it) and fails fast with a clear error before calling the API when that field exceeds 1500 chars. Authoring stays provider-agnostic; Gemini still receives the full JSON.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* feat(podcast-generation): per-provider TTS concurrency and retry/backoff

Each TTS provider owns its concurrency internally — MiniMax runs single-threaded to reduce rate-limit failures, Volcengine keeps 4 workers — with automatic retry and backoff on transient HTTP and base_resp errors. No caller-facing concurrency knob.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* fix(skills): address Copilot review comments on generation skills

- video: add raise_for_status + timeout to the Gemini download/POST/poll calls so non-2xx responses surface as clear HTTP errors instead of JSON/KeyError or hangs
- video: check the task Fail status before the generic base_resp check so the failure keeps its task_id context
- video/image: create the output file parent directory before writing (matching music-generation) so nested output paths do not raise FileNotFoundError
- music: require a non-empty prompt and fail fast with ValueError instead of sending an empty prompt to the API

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* fix(scripts): reclaim dev ports across worktrees in make stop/dev

All deer-flow worktrees (main checkout + linked worktrees) hardcode the same dev ports (8001/3000/2026), so a service started from any worktree must be reclaimable from another. stop_all now resolves the set of worktree roots (DEERFLOW_ROOTS) and treats a process as deer-flow-owned when its open files live under any of them. It also force-kills survivors on 2026 alongside 8001/3000, fixing `make dev` aborting on the nginx port preflight when a prior nginx lingered on 2026.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* fix(view-image): hide the injected image-context message from the UI

ViewImageMiddleware injects a HumanMessage (text + base64 images) so the vision model can see viewed images, but it was the only internal injector that set neither hide_from_ui nor a hidden name, so it leaked into the chat UI (and IM channels) as a user bubble reading "Here are the images you've viewed:". Mark it with additional_kwargs={"hide_from_ui": True}, matching todo/dynamic_context injections, which the frontend isHiddenFromUIMessage and the channel sender already honor. The model still receives the full content.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

* fix(minimax): mark M2.7 models as text-only (no vision)

MiniMax M2.7 / M2.7-highspeed do not support vision; only M3 does. The
provider config asserted vision support for M2.7 in four places.

- config.example.yaml: 4 M2.7 entries -> supports_vision: false
- backend/docs/CONFIGURATION.md: M2.7 + highspeed -> supports_vision: false
- wizard: add LLMProvider.model_vision_overrides + extra_config_for() so
  selecting an M2.7 model writes supports_vision: false while M3 (default)
  keeps vision; wire it through setup_wizard.py
- tests: M2.7-highspeed fixture -> supports_vision=False; add
  test_minimax_vision_is_per_model

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Co-authored-by: Willem Jiang <willem.jiang@gmail.com>
Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

2026-06-08 22:04:38 +08:00

.vscode

chore: specify project name

2026-01-14 09:58:53 +08:00

public

feat(frontend): support static website demo mode (#3170 )

2026-05-23 00:10:56 +08:00

scripts

feat: add uploads

2026-01-24 19:38:08 +08:00

src

feat: MiniMax provider for image/video/podcast skills + new music-generation skill (#3437 )

2026-06-08 22:04:38 +08:00

tests

fix(replay-e2e): match by conversation, not the living system prompt (#3436 )

2026-06-08 17:32:41 +08:00

.env.example

docs: clean standalone LangGraph server remnants (#3301 )

2026-05-29 11:36:45 +08:00

.gitignore

chore: create frontend project from boilerplate

2026-01-14 09:50:26 +08:00

.npmrc

chore: add .npmrc back

2026-02-10 22:07:25 +08:00

.prettierignore

Stabilize write artifact previews (#3172 )

2026-05-23 16:56:14 +08:00

AGENTS.md

feat(frontend): add Playwright E2E tests with CI workflow (#2279 )

2026-04-18 08:21:08 +08:00

CLAUDE.md

docs: clean standalone LangGraph server remnants (#3301 )

2026-05-29 11:36:45 +08:00

components.json

feat: implement the first section of landing page

2026-01-23 00:15:21 +08:00

Dockerfile

chore(uv): speed up Docker builds with mirrors (#1600 )

2026-03-30 20:16:44 +08:00

eslint.config.js

fix: fix eslint errors and warnings

2026-01-31 21:46:31 +08:00

Makefile

feat(frontend): support static website demo mode (#3170 )

2026-05-23 00:10:56 +08:00

next.config.js

feat(frontend): support static website demo mode (#3170 )

2026-05-23 00:10:56 +08:00

package.json

chore(deps): bump next from 16.1.7 to 16.2.6 in /frontend (#2899 )

2026-05-12 10:45:40 +08:00

playwright.config.ts

fix: resolve make dev and test-e2e errors (#2570 )

2026-04-26 17:27:32 +08:00

playwright.real-backend.config.ts

test(e2e): deterministic record/replay front-back contract verification (#3365 )

2026-06-08 12:35:03 +08:00

playwright.record.config.ts

test(e2e): deterministic record/replay front-back contract verification (#3365 )

2026-06-08 12:35:03 +08:00

pnpm-lock.yaml

chore(deps): bump uuid from 10.0.0 to 14.0.0 in /frontend (#3281 )

2026-05-28 07:14:44 +08:00

pnpm-workspace.yaml

Add packages section to pnpm-workspace.yaml (#1382 )

2026-03-26 16:09:35 +08:00

postcss.config.js

chore: create frontend project from boilerplate

2026-01-14 09:50:26 +08:00

prettier.config.js

chore: create frontend project from boilerplate

2026-01-14 09:50:26 +08:00

README.md

docs: align runtime docs with gateway mode (#2868 )

2026-05-12 16:19:21 +08:00

tsconfig.json

feat: implement the first version of landing page

2026-01-23 13:24:03 +08:00

vitest.config.ts

feat(frontend): set up Vitest frontend testing infrastructure with CI workflow (#2147 )

2026-04-12 18:00:43 +08:00

README.md

DeerFlow Frontend

Like the original DeerFlow 1.0, we would love to give the community a minimalistic and easy-to-use web interface with a more modern and flexible architecture.

Tech Stack

Framework: Next.js 16 with App Router
UI: React 19, Tailwind CSS 4, Shadcn UI, MagicUI and React Bits
AI Integration: LangGraph SDK and Vercel AI Elements

Quick Start

Prerequisites

Node.js 22+
pnpm 10.26.2+

Installation

# Install dependencies
pnpm install

# Copy environment variables
cp .env.example .env
# Edit .env with your configuration

Development

# Start development server
pnpm dev

# The app will be available at http://localhost:3000

Build & Test

# Type check
pnpm typecheck

# Check formatting
pnpm format

# Apply formatting
pnpm format:write

# Lint
pnpm lint

# Run unit tests
pnpm test

# One-time setup: install Playwright Chromium browser
pnpm exec playwright install chromium

# Run E2E tests (builds and starts production server automatically)
pnpm test:e2e

# Build for production
pnpm build

# Start production server
pnpm start

Site Map

├── /                    # Landing page
├── /chats               # Chat list
├── /chats/new           # New chat page
└── /chats/[thread_id]   # A specific chat page

Configuration

Environment Variables

Key environment variables (see .env.example for full list):

# Backend API URL (optional, uses local Next.js/nginx proxy by default)
NEXT_PUBLIC_BACKEND_BASE_URL="http://localhost:8001"
# LangGraph-compatible API URL (optional, uses local Next.js/nginx proxy by default)
NEXT_PUBLIC_LANGGRAPH_BASE_URL="http://localhost:8001/api"

Project Structure

tests/
├── e2e/                    # E2E tests (Playwright, Chromium, mocked backend)
└── unit/                   # Unit tests (mirrors src/ layout)
src/
├── app/                    # Next.js App Router pages
│   ├── api/                # API routes
│   ├── workspace/          # Main workspace pages
│   └── mock/               # Mock/demo pages
├── components/             # React components
│   ├── ui/                 # Reusable UI components
│   ├── workspace/          # Workspace-specific components
│   ├── landing/            # Landing page components
│   └── ai-elements/        # AI-related UI elements
├── core/                   # Core business logic
│   ├── api/                # API client & data fetching
│   ├── artifacts/          # Artifact management
│   ├── config/              # App configuration
│   ├── i18n/               # Internationalization
│   ├── mcp/                # MCP integration
│   ├── messages/           # Message handling
│   ├── models/             # Data models & types
│   ├── settings/           # User settings
│   ├── skills/             # Skills system
│   ├── threads/            # Thread management
│   ├── todos/              # Todo system
│   └── utils/              # Utility functions
├── hooks/                  # Custom React hooks
├── lib/                    # Shared libraries & utilities
├── server/                 # Server-side code
│   └── better-auth/        # Authentication setup and session helpers
└── styles/                 # Global styles

Scripts

Command	Description
`pnpm dev`	Start development server with Turbopack
`pnpm build`	Build for production
`pnpm start`	Start production server
`pnpm test`	Run unit tests with Vitest
`pnpm test:e2e`	Run E2E tests with Playwright
`pnpm format`	Check formatting with Prettier
`pnpm format:write`	Apply formatting with Prettier
`pnpm lint`	Run ESLint
`pnpm lint:fix`	Fix ESLint issues
`pnpm typecheck`	Run TypeScript type checking
`pnpm check`	Run both lint and typecheck

Development Notes

Uses pnpm workspaces (see packageManager in package.json)
Turbopack enabled by default in development for faster builds
Environment validation can be skipped with SKIP_ENV_VALIDATION=1 (useful for Docker)
Backend API URLs are optional; nginx proxy is used by default in development

License

MIT License. See LICENSE for details.