Check it out, n8n has an update! Build Multi-Domain RAG Systems with Specialized Knowledge Bases

Posted by qimuai on 2026-3-10 22:01 (first-hand compilation)

Source: https://blog.n8n.io/build-multi-domain-rag-systems-with-specialized-knowledge-bases/

Summary:

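A new approach for AI customer service: Pinecone presents a multi-knowledge-base design for precise retrieval that untangles mixed-up domain information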

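In vacation rentals, chain stores, customer support, and similar settings, a conventional AI assistant often answers the wrong question because its knowledge base mixes unrelated information. It might, for example, send the heating instructions for property A to a guest staying at property B, which undermines both professionalism and the guest experience. Behind this common pain point is a flaw in knowledge management architecture: information from different domains is stored without distinction in a single knowledge base, so the AI has to search through large amounts of irrelevant material on every query.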

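To address this, Jenna Pederson, Staff Developer Advocate at Pinecone, presents and demonstrates a workflow built on retrieval-augmented generation (RAG). Its core idea is divide and conquer: create a dedicated knowledge base for each business domain (such as each property or each client) and use a routing step to send every user query to the matching knowledge base, so that answers stay relevant and accurate.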

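Core architecture and advantages:

Precise retrieval: semantic search interprets the meaning behind a query ("It's freezing in here" points to the heating controls) rather than matching keywords, which avoids irrelevant results such as information about a freezer.
Modular knowledge bases: each domain (here, three vacation properties) gets its own Pinecone Assistant, which stores and manages that domain's documents (such as appliance guides and the Wi-Fi password). Uploads are automated with n8n and Google Drive.
Routing and response: an AI agent analyzes the user's query, identifies the property it refers to, and calls the matching knowledge base. If the query does not identify a property, the agent asks for clarification. A large language model (gpt-4.1-mini in the article) then turns the retrieved snippets into a natural reply.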

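The design brings three benefits:

Accuracy: information from one domain no longer contaminates answers for another, so each answer matches the specific property or client.
Maintainability: any knowledge base can be updated or debugged on its own without affecting the others, which lowers management complexity and risk.
Scalability: adding a domain (such as a fourth property) only requires creating a new knowledge base and updating the routing logic. Complexity grows linearly rather than exponentially as the business expands.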

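Implementation guidance:
Split your business into distinct domains (such as branches or client groups), create one Pinecone Assistant per domain, and build the routing logic that directs each query to the right assistant. Pinecone Assistant handles data processing, embedding, semantic search, and re-ranking, so developers can concentrate on business logic.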

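A complete workflow template for n8n is available. By aligning the AI architecture with the real structure of the business, this pattern offers a clear path to specialized customer service assistants that are efficient and reliable.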

English source:

This Verified Node Spotlight was written by Jenna Pederson, Staff Developer Advocate for Pinecone.
Imagine you manage multiple vacation rental properties. A guest at one of your properties texts asking how to turn on the heat, but you accidentally send them instructions for your other property's completely different thermostat. You look unprofessional, your guest is confused, and now they are cold.
This isn't just a customer service nightmare; it's a knowledge management problem. When you shove all your property documentation into one knowledge base, you're asking your AI to search through everything every time to figure out what's relevant. It's like creating a spreadsheet with 10,000 rows and 30 columns and never separating your data into tabs. Our brains don't work that way, and neither should our business or AI.
The same principle that pushes us to separate spreadsheet tabs should inform our AI architecture. Different domains need different contexts.
In this article, we'll solve this problem by building a workflow that routes queries to multiple specialized knowledge bases based on context (i.e. which property the guest is staying at). You can adapt this pattern to franchise locations, agency clients, or customer support tiers—any scenario where different users or steps in a workflow need different context.
Let's break down the main components we'll need and then connect it all together.
Our solution
Google Drive
We'll store our source files in a Google Drive folder (or any other document storage provider). This will hold the raw documents about each of our properties, which in our case are in Markdown format.
Chat interface
A chat interface will allow guests to ask questions about their property, request a service appointment, or get a call back from the property manager. For the purposes of this tutorial, we'll use n8n's built-in Chat trigger, but you could respond to a webhook, a Slack message, a WhatsApp message, or a Telegram message.
Search
Next, we'll need a way to search for answers in our data. Remember, our data is specialized and contains private details (like the Wi-Fi password) that a general-purpose model has never seen, so we can't just route all requests directly to a model such as Claude or one of OpenAI's models.
We'll need something a little smarter than a simple keyword search. We'll be using natural language to ask questions, so we'll need to search by meaning. For instance, if a guest says "It's freezing in here," notice that there's no mention of thermostat, heat, HVAC, or temperature control in the message. A semantic search will search by meaning and find information for controlling the heating system. If we only had access to a simple keyword search, then we might get results about a freezer for storing food.
When we search by meaning using semantic search, we'll get results similar to "freezing" using the surrounding context of both the query and the data in our knowledge base. The results about a chest freezer might still be returned, but they would likely be ranked lower in semantic meaning than results about adjusting the thermostat.
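The contrast can be illustrated with a toy sketch. This is not how Pinecone Assistant is implemented: the three-dimensional vectors below are hand-picked stand-ins for real embeddings (an assumption made purely for illustration), the document titles are invented, and `keyword_hits` and `cosine` are hypothetical helper names.

```python
import math

# Hand-made 3-d "embeddings" (axes: room temperature, food storage, connectivity).
# Real embeddings come from a model and have far more dimensions.
DOCS = {
    "Adjusting the thermostat to heat the living room": [0.9, 0.1, 0.0],
    "The chest freezer in the garage keeps food freezing cold": [0.3, 0.9, 0.0],
    "Connecting to the guest Wi-Fi network": [0.0, 0.0, 1.0],
}
QUERY = "It's freezing in here"
QUERY_VECTOR = [0.8, 0.2, 0.0]  # assumed embedding: mostly about room temperature


def keyword_hits(query: str, doc: str) -> int:
    """Count query words that literally appear in the document."""
    doc_words = set(doc.lower().split())
    return sum(1 for word in query.lower().split() if word in doc_words)


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Keyword search picks the document sharing the most literal words.
keyword_best = max(DOCS, key=lambda d: keyword_hits(QUERY, d))
# Semantic search ranks documents by vector similarity to the query.
semantic_ranking = sorted(DOCS, key=lambda d: cosine(QUERY_VECTOR, DOCS[d]), reverse=True)
```

With these assumed vectors, the keyword match lands on the freezer document because it shares the word "freezing", while the similarity ranking puts the thermostat document first and the freezer document second.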
This type of search is more sophisticated, but it's also more complex to implement. In our example, we'll use Pinecone Assistant to manage this complexity for us. The Assistant handles chunking our data with the right chunking strategy, converting our chunk data into vector embeddings to encode the meaning, query planning, executing the semantic search, and re-ranking the results.
Note: If you've used the Pinecone Vector Store node before, some of these steps may be familiar or have even tripped you up.
The results we get back from this search are chunks of our data, also known as context snippets.
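Pinecone Assistant picks and applies its own chunking strategy, which this article does not describe. As a rough illustration of what a chunk is, the sketch below splits a Markdown document at its headings. The function name `chunk_markdown` and the sample text are hypothetical.

```python
def chunk_markdown(text: str) -> list[str]:
    """Split Markdown into chunks, starting a new chunk at every heading line."""
    chunks: list[str] = []
    current: list[str] = []
    for line in text.splitlines():
        # A heading closes the chunk collected so far and opens a new one.
        if line.startswith("#") and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    return [chunk for chunk in chunks if chunk]


# Invented sample document for illustration only.
SAMPLE = """# Heating
Set the thermostat in the hallway to HEAT.

# Wi-Fi
The network name is printed on the router.
"""

chunks = chunk_markdown(SAMPLE)
```

Each resulting chunk keeps its heading, so a snippet about heating stays separate from one about Wi-Fi when it is returned as context.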
Output generation
Finally, once we have our search results in chunks, we need to turn them back into something we can read and understand. We'll use a large language model to do that. We pass the context snippets to the model with instructions to transform our data back into a natural text response that the user can understand.
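A minimal sketch of this step, assuming the snippets arrive as plain strings: the function below assembles them and the guest's question into a single prompt for the model. The instruction wording, the name `build_prompt`, and the sample snippets are assumptions for illustration, not the prompt the workflow actually sends.

```python
def build_prompt(question: str, snippets: list[str]) -> str:
    """Combine retrieved context snippets and the guest's question into one prompt."""
    numbered = "\n".join(f"[{i}] {s}" for i, s in enumerate(snippets, start=1))
    return (
        "Answer the guest's question using only the context below. "
        "If the context does not contain the answer, say so.\n\n"
        f"Context:\n{numbered}\n\n"
        f"Question: {question}"
    )


# Invented snippets standing in for search results.
prompt = build_prompt(
    "It's freezing in here",
    ["Set the hallway thermostat to HEAT.", "Extra blankets are in the bedroom closet."],
)
```

Numbering the snippets is a design choice that makes it easier to trace which piece of context an answer came from.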
What is RAG?
This pattern of retrieving relevant context from your knowledge base and augmenting the model's response with that context is called Retrieval-Augmented Generation (RAG).
Let's build
Now that we understand these main components, let's build this workflow!
Just want the workflow template? Import it into your n8n instance and get started quickly. Or, follow the instructions in this post to learn how to build it step-by-step.
Prerequisites
You'll need the following to get started:

A Pinecone account and API key
A GCP project with Google Drive API enabled and configured
An OpenAI account and API key
Three empty Google Drive folders named hillcrest, lakeside, birchwood
1. Create knowledge bases
  First, we'll create three Pinecone Assistants in the Pinecone console.
Point your browser at: https://app.pinecone.io/organizations/-/projects/-/assistant
Create a new Assistant named: n8n-vacation-rental-property-lakeside
Repeat the same for Assistants named n8n-vacation-rental-property-birchwood and n8n-vacation-rental-property-hillcrest
There's no need to configure a Chat model, Assistant instructions, or upload files as we'll handle this in n8n.
2. Install the Pinecone Assistant community node
  We'll need to create a workflow and install the Pinecone Assistant node. If you've already installed the node, create the workflow and move to the next step to set up your credentials.
In your n8n workspace, start by creating a new, empty workflow
Select the + icon to view the nodes panel and search for pinecone assistant
Select the install button to install the community node
Get these special perks when you sign up for a Pinecone Standard free trial (3 weeks, $300 credits) before May 1st:
$40/month in credits for 6 months (starting after your trial ends)
Waived Assistant hourly fees
Offer details: Start using the n8n Pinecone Assistant node and claim this promotion before May 1, 2026. Upgrade to a paid Standard plan before July 1st, 2026 to activate your 6-month benefits.
3. Set up your credentials
  Next, we'll need to set up credentials for each of the services we use.
  Google Drive OAuth 2 credential
1. In your n8n workspace, create a new credential for Google Drive OAuth 2 API
2. Copy the OAuth Redirect URL. We'll come back to the Client ID and Client Secret shortly.
3. In the GCP Console, select your project
4. Go to the APIs & Services menu and select Credentials
5. Select + Create credentials and select OAuth client ID
   Note: If you haven't used OAuth in this project yet, you'll also need to set up the OAuth consent screen.
6. Select Web application for Application type
7. Select Add URI in the Authorized redirect URIs section
8. Paste in the redirect URL from step 2
9. Select Create
10. Make a note of the Client ID and Client secret for your OAuth client
11. Back in your n8n workspace, paste in the Client ID and Client secret from step 10
12. Select the Sign in with Google button to authenticate
13. Save your credential
You can find the full instructions for setting up your GCP project, APIs, OAuth consent screen, and OAuth client here.
OpenAI credential
In your n8n workspace, create a new credential for OpenAI
Paste in your OpenAI API key
Save your credential
Pinecone credential
In your n8n workspace, create a new credential for Pinecone Assistant API
Paste in your Pinecone API key
Save your credential
Now that your credentials are set up in n8n, let's build our workflow.
4. Build the workflow
  Set up the file upload path
  Next, we'll set up the file upload path to get your data from Google Drive into your three Pinecone Assistants representing the three properties.
For the first property, Hillcrest, set up a Google Drive Trigger for On changes involving a specific folder
Select the hillcrest folder in the Folder list
In Watch For, select File Created
Connected to this node, add another Google Drive node for the Download file action
In the File name field, set it to {{ $json.name }}
Then connect the Pinecone Assistant node for the Upload file action
Select the hillcrest assistant in the Assistant Name field
Set the External File ID field to {{ $json.id }}
Now, select all three of the nodes you just created, copy them, and paste them for the Lakeside and Birchwood assistant upload paths. You'll need to adjust both the Folder in the trigger and the Assistant Name in the Pinecone Assistant node.
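The three upload paths differ only in the folder they watch and the assistant they target. The sketch below captures that mapping outside of n8n. The dictionary keys (`assistant_name`, `file_name`, `external_file_id`) and the function `upload_plan` are hypothetical names chosen for illustration, not the node's parameter names, and the file ID and file name in the example are invented.

```python
PROPERTIES = ["hillcrest", "lakeside", "birchwood"]
ASSISTANT_PREFIX = "n8n-vacation-rental-property-"  # naming used when creating the assistants


def upload_plan(folder: str, drive_file: dict) -> dict:
    """Describe the upload that one path would perform for a newly created Drive file."""
    if folder not in PROPERTIES:
        raise ValueError(f"No upload path configured for folder {folder!r}")
    return {
        "assistant_name": ASSISTANT_PREFIX + folder,
        "file_name": drive_file["name"],       # {{ $json.name }} in the workflow
        "external_file_id": drive_file["id"],  # {{ $json.id }} in the workflow
    }


# Invented Drive file metadata for illustration.
plan = upload_plan("lakeside", {"id": "example-file-id", "name": "house-guide.md"})
```

Keeping the Drive file ID as the external file ID gives each uploaded document a stable link back to its source file.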
Test the upload path
Before building the rest of the workflow, let's make sure this works. We'll work with some fictional data for this demo and load it into the assistants.
Download the fictional data files from here
Add the files to three separate Google Drive folders named lakeside, birchwood, and hillcrest
Activate the workflow so the documents are uploaded to your assistants
Go to the Pinecone Console and verify each assistant's files have been uploaded.
Set up the chat path
Now we need to give guests a way to ask their questions and request information about their property. We'll use n8n's built-in Chat trigger.
1. Add the Chat Trigger node
2. Connect an AI Agent node to the Chat Trigger. Add an option for System Message and set it to:
You are a helpful assistant for a vacation rental property manager and their guests. Based on the user's message, decide if they are requesting information about the "hillcrest", "birchwood", "lakeside" property. You route requests based on property name to the appropriate pinecone assistant tool to fetch answers about the property.
If you cannot infer the property from the user's message, do not call any tools and instead ask for more information in the chat.
If the person requests to be contacted, do not call any tools and instead return a response indicating that someone will reach out to them.
Use a friendly, helpful tone.
This tells the model (which we'll add next) how to behave. It has instructions for what to do when it's not clear which property the guest is referring to and what to output when the property is known.
3. Add the OpenAI Chat Model node and set the Model to gpt-4.1-mini
4. Add the Simple Memory node to the AI Agent and set the Key to {{ $('When chat message received').item.json.sessionId }}
5. Add three Pinecone Assistant Tool nodes to the AI Agent tools, one for each property.
6. In each tool, set the Assistant Name to the corresponding property's assistant.
7. Update the names of the three Pinecone Assistant Tool nodes to reflect which property each one refers to. This will help the AI Agent know which tool to invoke based on the info collected from the guest.
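In the workflow, the language model makes the routing decision from the System Message. To show the three possible outcomes in a deterministic form, here is a rule-based stand-in that uses simple keyword matching. The function `route`, the contact phrases, and the reply texts are assumptions for illustration; the real agent infers intent rather than matching strings.

```python
import re

PROPERTIES = ("hillcrest", "birchwood", "lakeside")
# Assumed phrases that signal a request to be contacted.
CONTACT_PATTERN = re.compile(r"\b(call me|call back|contact me|reach me)\b", re.IGNORECASE)


def route(message: str) -> dict:
    """Return a routing decision: call a property tool, ask to clarify, or promise contact."""
    if CONTACT_PATTERN.search(message):
        # Contact requests never call a tool.
        return {"action": "contact", "reply": "Thanks! Someone will reach out to you shortly."}
    mentioned = [p for p in PROPERTIES if p in message.lower()]
    if len(mentioned) == 1:
        return {"action": "call_tool", "tool": mentioned[0]}
    # No property (or more than one) could be identified, so ask instead of guessing.
    return {"action": "clarify", "reply": "Happy to help! Which property are you staying at?"}
```

For example, `route("I need help with the coffee maker")` yields a clarify action, while a message that names Hillcrest yields a call to the hillcrest tool.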
Now you should have a full workflow that looks something like this, with two paths: an upload path and chat path.
Let's test it out!
Test the chat path
To test the chat path:
Open the chat interface
Start by asking this question about a property: I need help with the coffee maker
Notice how the response requests more information about which property the guest is referring to. This happens because in our System Message, we've instructed the model to ask for more information from the guest if it cannot infer the property from the original message.
If you changed properties and wanted to ask another question, include the property in the query: The air fryer isn't working at the Hillcrest property.
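The article does not describe how the Simple Memory node works internally. As a sketch of why keying memory by session ID matters, the example below keeps a message history per session and resolves the property from the most recent mention. The function `property_in_context`, the `history` store, and the session IDs are hypothetical.

```python
from collections import defaultdict
from typing import Optional

PROPERTIES = ("hillcrest", "birchwood", "lakeside")
history = defaultdict(list)  # session ID -> messages received so far


def property_in_context(session_id: str, message: str) -> Optional[str]:
    """Record the message, then return the most recently mentioned property, if any."""
    history[session_id].append(message)
    # Walk backwards so a newer mention overrides an older one.
    for past in reversed(history[session_id]):
        for name in PROPERTIES:
            if name in past.lower():
                return name
    return None


first = property_in_context("session-a", "I need help with the coffee maker")
second = property_in_context("session-a", "The air fryer isn't working at the Hillcrest property.")
third = property_in_context("session-a", "How do I clean it afterwards?")
other = property_in_context("session-b", "How do I clean it afterwards?")
```

Here the follow-up question in `session-a` resolves to hillcrest because of the earlier message, while the same question in a fresh session resolves to nothing and would trigger a clarifying question.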
Why this works
By separating our knowledge bases by domain, we've fundamentally changed how our system retrieves information. Here are three reasons this works so well:
Accuracy
Just like you wouldn't search across all spreadsheet tabs when you need data from the Sales tab, one domain per assistant eliminates context pollution. You'll get precise answers, not blended confusion. When you ask about Lakeside's hot tub, you get Lakeside's answer, not a blend of information about other properties.
Maintainability
You can now change, debug, or add domains independently without cascading effects. Change the Wi-Fi password for one property. Test and debug one property's assistant independently of others. Let one team member own a single assistant and the information about that property. This limits the blast radius to exactly what is being changed, making it safer to make changes and easier to reason about.
Scalability
The pattern that works for three domains works for thirty. The complexity grows linearly, not exponentially, so you can add a fourth property without having to touch the other three. Create new documentation, spin up a new assistant, add one condition to your routing logic, and you're done.
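As a small sketch of that linear growth, assume the routing logic is backed by a registry with one entry per property. Adding a domain is then a single new entry that leaves the existing ones untouched. The `registry`, `add_property`, and the fourth property name `pinegrove` are all hypothetical.

```python
ASSISTANT_PREFIX = "n8n-vacation-rental-property-"

# One entry per domain; each maps a property name to its dedicated assistant.
registry = {name: ASSISTANT_PREFIX + name for name in ("hillcrest", "birchwood", "lakeside")}


def add_property(name: str) -> None:
    """Register a new property without modifying any existing entry."""
    if name in registry:
        raise ValueError(f"{name!r} is already registered")
    registry[name] = ASSISTANT_PREFIX + name


before = dict(registry)     # snapshot of the three original entries
add_property("pinegrove")   # hypothetical fourth property
```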
Wrap up
RAG isn't a monolithic system—it's a building block. The multi-assistant pattern we've built here gives you precise, maintainable retrieval by matching your AI architecture to how your business actually works.
Because Pinecone Assistant handles the complexity of chunking, embeddings, and re-ranking, you can focus on what matters: your business.
Ready to implement this pattern for your use case? Start with:
Identify your distinct domains (e.g. franchise locations, client accounts, customer support workflows)
Create one Pinecone Assistant per domain
Build routing logic that maps context to the right assistant
The workflow template is available here. Adapt the routing logic to your domains, and you'll have specialized knowledge bases working in harmony.
