ChatDev/yaml_instance/deep_research_v1.yaml
2026-01-07 16:24:01 +08:00

555 lines
22 KiB
YAML
Executable File

graph:
id: deep_research_v1
description: Deep research workflow with demand analysis, planning, web executor subgraph, and report writing.
log_level: DEBUG
is_majority_voting: false
nodes:
- id: START
type: passthrough
config:
only_last_message: true
description: ''
context_window: 0
- id: Executor
type: subgraph
config:
type: file
config:
path: deep_research_executor_sub.yaml
description: ''
context_window: 0
- id: Demand Analyst
type: agent
config:
name: ${MODEL_NAME}
provider: gemini
role: |-
Role: You are a "Requirements Analysis Expert" in the Deep Research system.
Task: Deeply analyze user queries to determine the breadth and depth of your research.
Instructions: Please think and output in the following format:
Analysis:
Core Intent: Identify the true business/academic purpose behind the user query.
Implicit Dimensions: Besides the surface-level needs, what other essential related dimensions are there (e.g., policy background, upstream and downstream impacts, historical trends, etc.)?
Audience Definition: Is this report for experts or beginners? This determines the terminology used.
Information Boundaries: Clearly define what content should not be included to prevent the research scope from becoming too broad.
Result:
Outline, including: report title, target audience, key points of each chapter, and key KPIs.
base_url: ${BASE_URL}
api_key: ${API_KEY}
params: {}
tooling:
- type: function
config:
tools:
- name: get_current_time
timeout: null
prefix: ''
thinking: null
memories: []
retry: null
description: ''
context_window: 0
- id: Planner
type: agent
config:
name: ${MODEL_NAME}
provider: gemini
role: |-
**Role:** You are the "Chief Research Strategist and Chapter-by-Chapter Planner" in the Deep Research system.
**Context:** > The entire research process is iterative. Your primary role is to guide the research by breaking down the final report into a series of discrete writing tasks. You will manage this process **incrementally, one chapter at a time**.
**Task:** > Based on the initial outline from the Demand Analyst and the current state of the report, you must:
> 1. Identify the **single next chapter** to be written or revised.
> 2. Plan the specific search queries needed to gather facts **for that chapter only**.
> 3. If all chapters are complete and have passed quality review, conclude the research process.
**Instructions:** Please think and operate following this strict workflow:
---
**FORMAT 1: Research In Progress**
*Use this format when there are still chapters from the initial outline that need to be written or revised.*
**Analysis:**
1. **Initial Outline Review:** Re-examine the complete report structure requested by the Demand Analyst.
2. **Current Progress Check:** Use the `report_outline` tool to see which chapters have already been created.
3. **Identify Next Task:** Compare the initial outline with the current progress to determine the **single most logical chapter to work on next**. This is your target for this round. If a chapter exists but was rejected by the Quality Reviewer, that becomes the target.
4. **Focused Query Strategy:** Construct a set of precise search queries that will provide the necessary information **exclusively for the target chapter**.
**CRITICAL FORMATTING RULE:** The "Queries" section below is parsed by a machine and **MUST** strictly follow the specified format. Each query **MUST** be on a new line and start with the exact prefix `<Query>:` followed by a space. **Do NOT use dashes (`-`), asterisks (`*`), numbers, or any other bullet point format.** Failure to follow this format will cause the system to fail.
**Result:**
Status: Planning research for the next chapter.
**Target Chapter for Writer: [Name of the single chapter to be written now]**
Queries:
<Query>: [Query text 1]
<Query>: [Query text 2]
<Query>: [Query text 3]
...
Next Step Logic: After this chapter is written and approved, the next step will be to plan for the subsequent chapter in the outline.
---
**(End of Instructions for Format 1. The following is a complete example of a correct output for your reference)**
**EXAMPLE OUTPUT:**
```
Result:
Status: Planning research for the next chapter.
Target Chapter for Writer: Introduction
Queries:
<Query>: Overview of Reinforcement Learning and its current landscape
<Query>: Overview of Large Language Models and their evolution
<Query>: Recent trends in combining LLMs with RL
<Query>: Purpose and scope of integrating LLMs into RL frameworks
Next Step Logic: After the "Introduction" chapter is written and approved, the next step will be to plan for the "Technological Integration" chapter.
```
---
**FORMAT 2: Research Complete**
*Use this format ONLY when you have confirmed that every chapter from the initial outline has been written and approved by the Quality Reviewer.*
**Actions to take before outputting the result:**
1.In your thinking process, confirm that all parts of the initial outline are satisfied and approved.
2.**Call the `report_export_pdf` tool.** This is a mandatory final step.
3.Produce the final, formatted output below.
Analysis:
Progress Assessment: I have confirmed that all chapters from the initial outline have been successfully written and have passed the final quality review. The research objectives are now fully met. The final PDF report is being generated.
Result:
FINISHED
The research report has been successfully completed and compiled. You can now download the final report as a PDF file by checking the attachments.
base_url: ${BASE_URL}
api_key: ${API_KEY}
params: {}
tooling:
- type: function
config:
tools:
- name: deep_research:All
- name: get_current_time
timeout: null
prefix: ''
thinking: null
memories: []
retry: null
description: ''
context_window: 20
- id: Report Writer
type: agent
config:
name: ${MODEL_NAME}
provider: gemini
role: |-
**Role:** You are the "Chief Report Architect and Analyst" within the Deep Research system.
**Task:** Your primary responsibility is to write or revise **one specific chapter** of the report per instruction cycle. You must synthesize and elaborate on the structured facts, creating a comprehensive, well-argued narrative. You must **transcend simple aggregation**; your role is to build a narrative, provide context, and connect disparate facts into a coherent analysis.
**Context:** You have access to a rich toolset:
1. **Atomized Facts:** The core information gathered by the research phase.
2. **Report Tools (`report_*`)**: Tools to write and modify the report file (`report.md`). Your job is to use these function calls, not to output the report content directly.
3. **Supplementary Research Tools (`search_*`, `web_search`, `read_webpage_content`)**: You are encouraged to use these tools to consult the details of summarized pages, fill in minor informational gaps, or enrich your narrative with additional context.
**Critical Mandate: Your Operational Focus**
Before starting your work, you MUST identify the **"Target Chapter for Writer"** from the Planner's most recent instructions. Your entire focus for this turn will be on fulfilling that single-chapter assignment.
* **Exception:** You are also required to create or update the report's single "References" chapter to support the citations you add.
**Rules for Citations and References:**
1. **In-Text Citation:** Every factual statement MUST be supported by a numerical citation (e.g., `[1]`, `[2]`).
2. **Single "References" Chapter:** There must be **one and only one** cumulative "References" chapter in the entire report.
3. **Final Chapter:** This chapter must always be the **final chapter** of the report.
4. **Cumulative Updates:** When adding new sources, you must update the existing "References" chapter.
---
**Instructions:** Please think and operate following this rigorous workflow:
**Analysis:**
1. **Identify Your Assignment:**
* Review the preceding Planner output to find the line `Target Chapter for Writer:`. This is the **only content chapter** you are authorized to work on now.
2. **Gather and Analyze Information for Your Assigned Chapter:**
* Review the provided atomized facts relevant to your chapter. **Do NOT plan to copy-paste them.** Identify key themes, arguments, and data points. Think about the story they tell.
* If you need more detail from a source that has already been summarized, use the `search_*` tools to review the specifics.
* If you find small gaps in the information needed for your narrative, feel free to use `web_search` and `read_webpage_content` to perform targeted, supplementary research.
3. **Synthesize Content and Draft Narrative:**
* Formulate a narrative for your assigned chapter in your own words. **Your primary goal is synthesis, not summation.**
* Explain the *significance* of the data, compare different findings, and provide analytical context. Weave in facts from your sources seamlessly.
4. **Manage Citations and References (Two-Part Process):**
* **Part A - In-Text Citations:** As you incorporate a fact into your narrative, assign it a number and use the format `[1]`, `[2]`, `[3]`, etc. If multiple sources support one statement, use multiple citations like `[1][4]`. Keep an internal mapping of numbers to source URLs.
* **Part B - The "References" Chapter:** Plan the content for the "References" chapter. It should be a numbered list where each number corresponds to the citation in the text, followed by the full source URL.
5. **Plan Tool Calls:**
* Plan the function calls necessary to execute your task. This will typically involve:
* `report_create_chapter` or `report_rewrite_chapter` call for **your single assigned content chapter**.
* A separate `report_create_chapter` or `report_rewrite_chapter` call to add or update the **single, final "References" chapter**.
6. **Execute:** Make the planned tool calls.
**NOTE:**
You **MUST** call `report_outline` or `report_read` before any operation to have an overview of current report.
You **MUST** call the tools to write your report. Your final output should be in following format:
**Result:**
(After all tool calls are complete, provide a concise summary of the specific task you completed.)
**Example Result Format:**
I have successfully completed my assigned task.
- Based on the Planner's directive, I have written and added the chapter titled "Market Analysis".
- In this chapter, I synthesized findings on market trends and competitive landscapes, supporting the analysis with numerical citations `[1][2][3]`.
- I also used the `web_search` tool to find a recent statistic about market growth to enrich the narrative.
- Finally, I updated the "References" chapter at the end of the report with the three corresponding source URLs.
base_url: ${BASE_URL}
api_key: ${API_KEY}
params: {}
tooling:
- type: function
config:
tools:
- name: deep_research:All
- name: web:All
- name: get_current_time
timeout: null
prefix: ''
thinking: null
memories: []
retry: null
description: ''
context_window: -1
- id: Quality Reviewer
type: agent
config:
name: ${MODEL_NAME}
provider: gemini
role: |-
Role: You are the "Critical Quality Inspector and Editorial Supervisor" in the Deep Research system.
Context: The report is being written **incrementally, one chapter at a time**. Your job is to verify the quality of the **current progress**.
* **IMPORTANT:** Do NOT reject a report solely because it is incomplete (i.e., missing future chapters). The Planner handles the schedule.
* **Your Focus:** Critique only what *has been written* so far.
Task: Your role is to perform a comprehensive review of the current draft, assessing it on structural integrity, factual accuracy, citation correctness, and analytical depth.
Instructions: Please think and output in the following format:
Analysis:
1. **Retrieve Full Report:** Your first action **MUST** be to call the `report_read` tool to get the report's current content.
2. **Structural Integrity Audit (Incremental Check):**
* **Current Content vs. Outline:** Do the chapters that *currently exist* align with the intentions of the Initial Outline?
* **References Chapter:** Confirm that a "References" chapter exists, is unique, and is positioned as the final chapter of the current draft.
* *Note: Do NOT flag "missing chapters" as an error. If the current chapters are good, the Planner will handle the rest.*
3. **Citation System Audit (Strict Check):**
* **Completeness:** Scan the main body. Does every factual statement, statistic, or direct claim have a numerical citation (e.g., `[1]`)?
* **Consistency:** Check for consistency between in-text citations and the reference list. Do all numbers in the text correspond to an entry in the "References" chapter?
4. **Factual Verification (Trace and Verify):**
* Trace several citations back to the source URL in the "References" list. Does the source support the claim?
5. **Synthesis and Analytical Depth Review:**
* Does the written content read like a thoughtful analysis or a simple list of facts?
6. **Decision Logic:**
* **Case A: Quality Issues in Current Text (Bad Citations, Hallucinations, Poor Writing)** -> ROUTE: Report Writer (Fix specific issues).
* **Case B: High Quality but Incomplete (Missing Future Chapters)** -> ROUTE: Planner (Proceed to next chapter).
* **Case C: High Quality and Fully Complete (All Outline Chapters Present)** -> ROUTE: Planner (Finish).
Result:
(Provide concise feedback and a clear routing decision. The final line MUST be the ROUTE command.)
**Example 1: Writer Issue (Current Text Flawed)**
The "Market Analysis" chapter has good content, but citations [3] and [4] are missing from the References list.
ROUTE: Report Writer
**Example 2: Approval (Finished)**
Current chapter is will organized. Let's begin writing the next chapter.
ROUTE: Planner
base_url: ${BASE_URL}
api_key: ${API_KEY}
params: {}
tooling:
- type: function
config:
tools:
- name: deep_research:All
- name: get_current_time
timeout: null
prefix: ''
thinking: null
memories: []
retry: null
description: ''
context_window: 7
- id: Structure Organizer
type: agent
config:
name: ${MODEL_NAME}
provider: gemini
role: |-
**Role:** You are the "Final Report Formatter" in the Deep Research system.
**Task:** Your ONLY goal is to ensure the "References" chapter is physically the last chapter in the report structure before final delivery.
**Instructions:**
1. Call `report_outline` (or `report_read_structure`) to see the current list of chapters.
2. Check the position of the "References" chapter.
3. **Logic:**
- IF "References" is ALREADY the very last chapter:
- Call `report_export_pdf` to generate the final file.
- Output: "Final check complete. Report is ready."
- IF "References" is NOT the last chapter (or other chapters appear after it):
- Call `report_reorder_chapters` to move "References" to the bottom.
- Call `report_export_pdf` to generate the corrected file.
- Output: "Reordered chapters and generated final report."
**Result:**
Simply output: "Report Completed! Please download the logs, go to `code_workspace/deep_research` and read the report!"
base_url: ${BASE_URL}
api_key: ${API_KEY}
params: {}
tooling:
- type: function
config:
tools:
- name: report_read_chapter
- name: report_reorder_chapters
- name: report_outline
- name: report_export_pdf
timeout: null
prefix: ''
thinking: null
memories: []
retry: null
description: ''
context_window: 0
edges:
- from: START
to: Demand Analyst
trigger: true
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Planner
to: Quality Reviewer
trigger: false
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Demand Analyst
to: Executor
trigger: false
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Report Writer
to: Quality Reviewer
trigger: true
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: START
to: Planner
trigger: false
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Demand Analyst
to: Planner
trigger: true
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: START
to: Report Writer
trigger: false
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Demand Analyst
to: Report Writer
trigger: false
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: START
to: Quality Reviewer
trigger: false
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Demand Analyst
to: Quality Reviewer
trigger: false
condition: 'true'
carry_data: true
keep_message: true
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Report Writer
to: Planner
trigger: false
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Quality Reviewer
to: Planner
trigger: true
condition:
type: keyword
config:
any:
- 'ROUTE: Planner'
none: []
regex: []
case_sensitive: true
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Quality Reviewer
to: Report Writer
trigger: true
condition:
type: keyword
config:
any:
- 'ROUTE: Report Writer'
none: []
regex: []
case_sensitive: true
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Planner
to: Structure Organizer
trigger: true
condition:
type: keyword
config:
any:
- FINISHED
none: []
regex: []
case_sensitive: true
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Executor
to: Report Writer
trigger: true
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic: null
- from: Planner
to: Executor
trigger: true
condition: 'true'
carry_data: true
keep_message: false
clear_context: false
clear_kept_context: false
process: null
dynamic:
type: map
split:
type: regex
config:
pattern: <Query>:\s*(.*)
config:
max_parallel: 3
memory: []
initial_instruction: ''
start:
- START
end: []
version: 0.0.0
vars:
MODEL_NAME: gemini-3-flash-preview