Brand Knowledge MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Brand Knowledge MCP ServerWhat inventory does dealer 42 have?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Brand Knowledge Agent — Architecture Document
Phase 1: Knowledge Feed Layer Author: Julianna Lacaden Date: June 22, 2026 Status: Planning — Pending supervisor confirmation of scope
⚠️ Working Assumption
This document was drafted proactively during an unassigned period. The scope assumes the company's directive — "strengthen brand to feed to LLMs" — maps to Phase 1: a structured, LLM-queryable knowledge layer for dealer brand data. This must be confirmed with the supervisor before development begins.
Related MCP server: carvector-mcp
1. Problem Statement
Car dealer clients are underrepresented or inaccurately represented in LLM-generated answers. When a user asks an LLM about a specific dealer — their inventory, promotions, or reputation — the model either hallucinates, falls back to generic information, or omits the dealer entirely.
The goal of this agent is to build a structured, MCP-accessible knowledge layer that feeds accurate, up-to-date dealer brand data to an LLM at query time, so responses about that dealer are grounded in real information.
This is Phase 1. Phase 2 (not scoped here) will build a monitoring agent on top of this layer to measure how accurately LLMs represent the brand without the knowledge feed — establishing a before/after baseline.
2. Architecture Overview
Dealer Brand Data (mock)
│
▼
┌─────────────────────────┐
│ MCP Server │ ← FastMCP (Python), stdio transport
│ Tools: │
│ - get_inventory │
│ - get_promotions │
│ - get_reviews │
│ - get_dealer_profile │
│ │
│ Resource: │
│ - dealer://{id}/profile│ ← requires raw MCP SDK (see §4.1)
└───────────┬─────────────┘
│ stdio
▼
┌─────────────────────────┐
│ Orchestration Agent │ ← LangGraph (Python)
│ Model: qwen2.5 (Ollama)│
│ Routing: conditional │
└───────────┬─────────────┘
│
▼
┌─────────────────────────┐
│ Test Interface │ ← CLI or lightweight HTML
│ "What inventory does │
│ Dealer X have?" │
└─────────────────────────┘
- - - - - - - - - - - - - - - - (Phase 2, not built here)
┌─────────────────────────┐
│ Monitoring Agent │ ← Queries multiple LLMs, scores brand
│ (future) │ accuracy against MCP server as ground
└─────────────────────────┘ truth3. Scope
In scope (Phase 1)
MCP server with four tools and one resource exposing mock dealer data
LangGraph orchestration agent that queries MCP tools to answer brand questions
Type-coercion helpers on the MCP server side for model compatibility
Mock dealer data schema (no real client data required in Phase 1)
CLI test interface or simple HTML page
Documentation of all architectural decisions and known limitations
Out of scope (Phase 1)
Real dealer data integration (pending client onboarding)
Phase 2 monitoring/scoring agent
Multi-dealer support (single mock dealer is sufficient for Phase 1 validation)
Frontend UI beyond a minimal test interface
Deployment or hosting
4. Technical Design
4.1 MCP Server
Framework: FastMCP (Python) Transport: stdio
stdio is selected over HTTP for two reasons grounded in prior debugging:
HTTP (Streamable HTTP) introduced "Connection closed" errors requiring hours of debugging in an earlier build (June 11, 2026).
genkitx-mcpSSE transport produced 400 errors against FastMCP's HTTP endpoint (June 19, 2026). stdio has been validated as stable across both CrewAI and Genkit integrations.
Tools (four):
Tool | Input | Output | Notes |
|
| List of vehicle listings | Core tool, validated first |
|
| List of active promos | |
|
| List of customer reviews | |
|
| Dealer name, location, USPs | Duplicates resource content for Tools-only adapters |
Resource (one):
dealer://{id}/profileReturns structured JSON with dealer name, location, contact info, and USPs.
Limitation note: If the orchestration layer uses CrewAI's
MCPServerAdapter, this resource will not be exposed — the adapter only surfaces Tools. Theget_dealer_profiletool is included as a fallback so the resource content remains accessible regardless of the adapter used. If Resources are required, use raw MCP SDK instead of the adapter.
Type coercion helpers (mandatory):
qwen2.5 and other local models occasionally pass integer arguments as strings (e.g. "3" instead of 3) and list arguments as comma-separated strings or JSON-encoded strings (e.g. "[1,2]" instead of [1, 2]). These helpers must be included in the MCP server from the start:
def _to_int(val) -> int:
if isinstance(val, int):
return val
return int(str(val).strip())
def _to_int_list(val) -> list[int]:
if isinstance(val, list):
return [int(v) for v in val]
val = str(val).strip().strip("[]")
return [int(v.strip()) for v in val.split(",") if v.strip()]This pattern was validated on June 19, 2026. Do not defer this to debugging — apply it at build time.
4.2 Orchestration Layer
Framework: LangGraph (Python)
LangGraph is selected over CrewAI for the following reasons:
Consideration | LangGraph | CrewAI |
MCP Resources/Prompts | Accessible via raw MCP SDK | Blocked — |
Conditional routing | First-class (conditional edges) | Workaround via task chaining |
Loop-back support | Built-in (validated June 18) | Not native |
Phase 2 fit | Credibility scoring = conditional edges (already prototyped) | Would require rebuilding orchestration layer |
LangGraph's conditional edges and loop-back patterns were validated in the June 18 multi-node research pipeline. Phase 2's monitoring agent (score → loop if accuracy below threshold) reuses the same pattern. Choosing CrewAI now means rebuilding the orchestration layer when Phase 2 is scoped.
Model: qwen2.5 (via Ollama, local)
qwen2.5 is selected over llama3.2. This is a closed finding from two independent validation sessions (June 11 CrewAI integration, June 19 Genkit integration): llama3.2 sends tool arguments as strings instead of typed values, breaking schema validation. qwen2.5 handles function calling correctly.
Graph structure (Phase 1):
[START]
│
▼
[router_node] ─── decides which MCP tool to call based on query
│
▼
[tool_call_node] ─── calls MCP server via stdio, gets result
│
▼
[response_node] ─── formats LLM answer grounded in tool output
│
▼
[END]Phase 2 will insert a [scoring_node] with a conditional edge between [response_node] and [END] — the same pattern used in the June 18 credibility-scoring pipeline.
4.3 Mock Data Schema
Phase 1 does not require real client data. The following mock schema is sufficient to validate all four tools and the resource:
{
"dealer_id": "toyota-metro-01",
"name": "Toyota Metro",
"location": "Quezon City, Metro Manila",
"contact": "+63 2 8123 4567",
"usps": [
"10-year powertrain warranty",
"Same-day financing approval",
"Free first year PMS"
],
"inventory": [
{ "model": "Toyota Vios", "year": 2025, "price_php": 798000, "stock": 4 },
{ "model": "Toyota Fortuner", "year": 2025, "price_php": 1950000, "stock": 2 }
],
"promotions": [
{ "title": "Zero interest June promo", "valid_until": "2026-06-30", "details": "0% interest for 24 months on Vios" }
],
"reviews": [
{ "author": "Juan D.", "rating": 5, "text": "Fast processing and friendly staff." },
{ "author": "Maria S.", "rating": 4, "text": "Good experience but waiting area could be improved." }
]
}Flag in any handoff: this is mock data. Real dealer data integration is a Phase 2 dependency, not a Phase 1 blocker.
4.4 Test Interface
A minimal CLI interface is sufficient for Phase 1 validation:
python test_agent.py --query "What promotions does Toyota Metro have this month?"Optional: a simple HTML page (similar to the Genkit test page built June 16–17) that sends a query and displays the agent's response. This is not a deliverable — it's a test harness.
5. Known Limitations and Risks
Limitation | Impact | Mitigation |
Mock data only | Cannot validate against real dealer content | Flag clearly; Phase 2 requires real data |
| Resource |
|
qwen2.5 inconsistency on ambiguous inputs | Observed in June 18 credibility scoring: same input produced scores of 67 and 20 on separate runs | Document as known limitation; do not build scoring logic on top of qwen2.5 without a deterministic fallback |
Single-dealer scope | Phase 1 validates architecture, not scale | Multi-dealer support deferred to post-Phase-1 |
Supervisor scope unconfirmed | This doc may be planning against the wrong target | Send scope confirmation question before development begins |
6. Build Sequence
This is the order in which to build, not a daily schedule. Each step must be validated before the next begins.
Define and validate mock data — JSON file, no code yet
Build
get_inventorytool only — smallest working MCP tool, test with MCP InspectorWire one LangGraph node to one MCP tool — thin vertical slice proving the pipeline
Add remaining three tools —
get_promotions,get_reviews,get_dealer_profileAdd MCP Resource (
dealer://{id}/profile) — only if raw MCP SDK is used; skip if adapter is final choiceAdd router node — conditional routing based on query type
Add type coercion helpers — do not skip; apply before any model testing
End-to-end test — three sample queries, log results
Document findings — bugs, model behavior, transport observations
7. Phase 2 Sketch (not scoped, for reference only)
Phase 2 builds a monitoring agent on top of this architecture. Its job: take the same dealer query, send it to multiple LLMs without the MCP knowledge feed, then compare their answers against the MCP server's ground truth and score accuracy.
This reuses:
The same MCP server (Phase 1 output becomes the ground truth source)
LangGraph conditional edges (scoring node → loop-back if score below threshold)
The credibility-scoring pattern already validated June 18
Phase 2 is not blocked on Phase 1 being complete — it can be planned in parallel once Phase 1's architecture is confirmed. It is blocked on having a real answer to "what does accurate brand representation actually look like?" — which requires real dealer data.
8. Open Questions (requires supervisor input)
Scope confirmation: Is Phase 1 correct — build a knowledge feed first, monitoring second?
Real data timeline: When will actual dealer data be available, and in what format?
Model preference: Is there a company preference for cloud models (Gemini, OpenAI) over local Ollama for the orchestration layer?
Deployment target: Is this a local tool, a server-side service, or an API endpoint?
Multi-dealer requirement: Does Phase 1 need to handle more than one dealer, or is single-dealer sufficient for validation?
This document is a living planning artifact. It should be updated after each supervisor sync and after each major build milestone.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/LacadenJulianna/AI-Agents-with-MCP-and-LangGraph'
If you have feedback or need assistance with the MCP directory API, please join our Discord server