🧮 Math Solver Agent

A Python CLI agent that solves math problems step-by-step using DeepSeek and the OpenAI-compatible Tool Use API.

基于 DeepSeek + Tool Use 构建的数学解题 Agent，展示 AI Agent 架构核心能力。

独立设计并实现，无框架依赖，手动实现完整 Agentic Loop。

Architecture / 架构

main.py               ← CLI 入口，对话循环
agent.py              ← Agentic Loop（核心）：调 API → 执行工具 → 循环
tools.py              ← 工具定义（JSON Schema）+ 工具实现（SymPy）
mcp_server.py         ← 把三个工具包装成标准 MCP Server（见下方 MCP 一节）
test_mcp_client.py    ← 用 MCP 官方 client SDK 验证 mcp_server.py
rag_formula_lookup.py ← formula_lookup 的 RAG 版本（见下方 RAG 一节）

Three tools / 三个工具：

Tool	Description
`step_decomposer`	分析题型，生成解题路线图（先规划再计算）
`formula_lookup`	从内置公式库检索相关公式（代数/几何/微积分…）
`calculator`	基于 SymPy 的符号计算引擎（求导/积分/解方程…）

Agent loop 流程：

用户输入
  → DeepSeek Chat (Tool Use)
  → finish_reason == "tool_calls"  → 执行工具 → 把结果塞回对话 → 继续
  → finish_reason != "tool_calls"  → 输出最终解答

Setup / 安装

# 1. 克隆项目
git clone https://github.com/Heliotrope-dev/math-agent.git
cd math-agent

# 2. 安装依赖
pip install openai sympy

# 3. 设置 API Key（从 https://platform.deepseek.com 获取）
export DEEPSEEK_API_KEY="sk-..."   # macOS/Linux
# 或 Windows: set DEEPSEEK_API_KEY=sk-...

# 4. 运行
python main.py

Demo / 演示

Agentic Loop 运行过程（三个工具依次调用）：

$demo-start$

最终输出（牛顿-莱布尼兹公式完整推导）：

$demo-output$

Example / 示例

📌 输入数学题：解方程 2x² + 5x - 3 = 0

🤔 Agent 思考中...

🔧 调用工具：step_decomposer
🔧 调用工具：formula_lookup   (topic: algebra)
🔧 调用工具：calculator        (solve: 2*x**2 + 5*x - 3 = 0)

📊 解题结果：

**解题思路**
这是一个标准二次方程，使用求根公式求解。

**分步解答**
1. 识别系数：a=2, b=5, c=-3
2. 代入求根公式：x = (-5 ± √(25+24)) / 4 = (-5 ± 7) / 4
3. 两个解：x₁ = 1/2,  x₂ = -3

**最终答案**
x = 1/2  或  x = -3

MCP Server / 标准协议化改造

tools.py 里的工具是手写的 OpenAI 格式 JSON Schema，只能被这一个 agent 用。 mcp_server.py 用 FastMCP 把同样的三个工具包装成标准 MCP Server —— schema 直接从 Python 类型注解 + docstring 自动生成，任何 MCP host（Claude Code / Claude Desktop / Cursor）都能直接发现并调用，不需要为每个 host 各写一套接入代码。

pip install "mcp[cli]"
mcp dev mcp_server.py              # 用 MCP Inspector 调试
# 或注册进 Claude Code：
claude mcp add math-agent -- python mcp_server.py

test_mcp_client.py 用官方 client SDK 连接 server、list tools 并实际调用一次，用来验证 server 本身没问题。

RAG / 语义检索版 formula_lookup

tools.py 里的 formula_lookup 必须先知道 topic（固定 enum：algebra / calculus / ...）才能查到公式，题目原文用不上。rag_formula_lookup.py 用本地 Ollama （nomic-embed-text 做 embedding，qwen3.5 做生成）重做了一版：

公式库 → 按条切块 + 中文语义描述 → embedding → 向量索引
用户题目（纯自然语言，不需要 topic）→ embedding → 余弦相似度检索 top-k → 拼进 prompt → 生成解题思路

调试时发现一个有意思的点：直接把公式 notation 和中文描述混在一起做 embedding，中文 query 检索效果很差（跨语言匹配被符号稀释）；把"用于检索的文本"（纯中文语义描述）和"返回内容"（完整公式）分开存之后，检索准确率明显提升。

pip install requests
ollama pull nomic-embed-text
python rag_formula_lookup.py

Key Design Decisions / 设计要点

deepseek-chat — 速度快、支持 Tool Use，兼容 OpenAI SDK，适合 agentic 场景
手动维护 messages 列表，每轮把 assistant 消息和 tool result 追加进去，循环直到 finish_reason != tool_calls
SymPy 做符号计算 — 精确，无浮点误差，支持代数化简
公式库 + 步骤分解 — 让模型输出有结构的教学式解答，而不只是答案

Built with Claude Code · 蒋天奇 · 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧮 Math Solver Agent

Architecture / 架构

Setup / 安装

Demo / 演示

Example / 示例

MCP Server / 标准协议化改造

RAG / 语义检索版 formula_lookup

Key Design Decisions / 设计要点

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
main.py		main.py
mcp_server.py		mcp_server.py
rag_formula_lookup.py		rag_formula_lookup.py
test_mcp_client.py		test_mcp_client.py
tools.py		tools.py

Folders and files

Latest commit

History

Repository files navigation

🧮 Math Solver Agent

Architecture / 架构

Setup / 安装

Demo / 演示

Example / 示例

MCP Server / 标准协议化改造

RAG / 语义检索版 formula_lookup

Key Design Decisions / 设计要点

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages