What can you do with this server?

This server enables AI-powered analysis of Indian contracts and lookup of Indian contract law with three tools: * analyze_contract: Submit the full text of an Indian contract to identify unfair or unenforceable clauses, each rated with a severity score (1–5), a plain-language rationale, and verified citations to the Indian Contract Act, 1872. * search_indian_law: Query the Indian Contract Act, 1872 by legal topic or question (e.g. "agreement in restraint of trade") and retrieve the most relevant statutory sections, with a configurable number of results (default 5). * verify_citation: Check whether a specific cited section (e.g. ICA_1872:27 or Section 27) exists in the corpus and genuinely supports a given legal claim — serving as an anti-hallucination safeguard.

How do I use ClauseIQ?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@ClauseIQ Analyze this lease agreement for unfair clauses under Indian law." That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

ClauseIQ

by Pulkitgupta17

Overview Schema Related Servers Score Discussions

Python

Hybrid

ClauseIQ

A multi-agent AI system that reads an Indian contract, flags the clauses that are unfair to you with a severity score, and cites the exact section of Indian law behind each one.

Runs as a streaming web app and as an MCP server inside Claude Desktop.

▶ Live app · Install into Claude Desktop (MCP)

The problem

People in India sign rental, employment, and freelance contracts every day with clauses that are quietly unfair — or unenforceable under Indian law, and have no way to tell which. A 12-month lock-in that forfeits your entire deposit. A two-year, all-India non-compete. "Raise any dispute within 7 days or lose it forever." Lawyers are expensive; generic AI chatbots hallucinate sections of law that don't exist.

ClauseIQ is the tool I wanted to exist: it reads the contract, flags the risky clauses with a 1–5 severity score, explains why in plain language, and — the part that matters — backs every flag with a real, verified citation to the Indian Contract Act, 1872.

Related MCP server: Document OCR MCP Server

Live demo

Add a GIF/screenshots here: docs/assets/demo.gif, docs/assets/analysis.png.

Web app: https://clauseiq-app-fawn.vercel.app — paste a contract or upload a PDF and watch the agents work live.
API: https://clauseiq-api-124621416027.asia-south1.run.app (Cloud Run; GET /health for liveness, /docs for the OpenAPI UI).
In Claude Desktop (MCP): ask "analyse this contract for unfair clauses" — see docs/MCP_INSTALL.md.

System architecture

One application core, exposed through two front doors (a REST/streaming API and an MCP server). The analysis itself is a LangGraph state machine of four agents:

flowchart LR
    subgraph Clients
      Web[React app<br/>SSE streaming]
      Claude[Claude Desktop<br/>MCP]
    end
    Web -->|POST /analyze/stream| API[FastAPI]
    Claude -->|stdio| MCP[MCP server]
    API --> Core
    MCP --> Core
    subgraph Core[Application core - LangGraph]
      direction LR
      S[Supervisor<br/>segment + screen] --> R[Retriever<br/>hybrid search] --> RA[Risk Analyzer<br/>score 1-5] --> CV[Citation Verifier<br/>drop hallucinations]
    end
    R -.-> VS[(ChromaDB + BM25<br/>Indian Contract Act)]
    S & RA -.-> G[Gemini 2.5<br/>flash + pro]
    Core --> OBS[Langfuse + cost tracking]

Hybrid retrieval fuses dense vectors (ChromaDB) and lexical BM25 via Reciprocal Rank Fusion. The Citation Verifier is the anti-hallucination guarantee: any cited section that isn't in the corpus is dropped before you ever see it.

Key design decisions

One core, two front doors (dual-mode MCP). The agents live in the application layer with zero knowledge of transport; FastAPI and the MCP server are thin adapters over the same ContractAnalyzer. Tradeoff: a strict hexagonal boundary is more upfront structure, but it's why the exact same analysis runs in a browser and inside Claude Desktop with no duplicated logic.
Gemini via Vertex AI, with automatic fallback to AI Studio. Orchestration on gemini-2.5-flash, analysis on gemini-2.5-pro. In production it runs on Vertex AI (billed to GCP credits, authenticated by the Cloud Run service account — no key on the server); if Vertex is unavailable or the credits run out, the client automatically falls back to an AI Studio key, so the app keeps working. Tradeoff: a single-vendor (Google) dependency, in exchange for ~zero cost, strong structured output, and resilience — and Claude is still free via the user's own Claude Desktop over MCP.
A deterministic citation metric, not just LLM judges. Citation accuracy is checked against the actual statute (existence + text overlap), so the anti-hallucination guarantee never depends on another model's opinion. Tradeoff: it only catches citation faults, so it's paired with LLM-judged faithfulness for the rest.
Eval-gated CI. A 20-case golden dataset runs in CI; the build fails if faithfulness or citation accuracy regress. Tradeoff: slower, token-spending CI, but prompt regressions can't silently ship.
Result types + guardrails over exceptions for expected failures. Expected failures (not-a-contract, prompt injection, missing law) are typed Result values and explicit guardrails, not stack traces. Tradeoff: more verbose call sites, far more predictable behaviour at the API/MCP boundary.

Evaluation results

Measured with DeepEval over the full 20-case golden dataset (5 each: rental, employment, NDA, vendor). LLM-judged metrics use Gemini as judge; Citation Accuracy is deterministic (checked against the statute).

Metric	Score	Gate	Type
Citation Accuracy	1.00	≥ 0.90 ✅	deterministic
Faithfulness	0.90	≥ 0.85 ✅	LLM judge
Answer Relevancy	0.90	informational	LLM judge
Legal Soundness (G-Eval)	0.82	informational	LLM judge
Contextual Precision	0.57	informational	LLM judge
Contextual Recall	0.26	informational †	LLM judge

CI gates on Citation Accuracy + Faithfulness — the deterministic anti-hallucination guarantee plus grounding. The rest are reported for insight.

† Contextual Recall compares the gold summary against the raw statute snippets cited, which structurally understates it (the summary states legal conclusions not verbatim in the statute) — it's not a retrieval failure, as the perfect Citation Accuracy and 0.90 Faithfulness show.
Scores are produced by tests/evaluation/ and gated in CI.

Cost per query

Per analysis (segmentation on gemini-2.5-flash + per-clause analysis on gemini-2.5-pro), tracked live via a per-model price table and surfaced in logs, the SSE done event, and Langfuse.

Measured across the 20-case eval run (avg; range $0.0034–$0.0190 by contract size).

Component	Model	~Cost / contract
Segmentation / orchestration	gemini-2.5-flash	~$0.0008
Clause analysis	gemini-2.5-pro	~$0.0063
Retrieval + citation verify	local / deterministic	$0
Embeddings	all-MiniLM-L6-v2 (local)	$0
Total per contract		~$0.007

What I'd do differently

Hosted vector DB + per-tenant isolation. Embedded ChromaDB is perfect for a single-instance demo; a real product needs a managed store and isolation.
Expand the law corpus. Today it's the Indian Contract Act, 1872; rent-control and labour statutes are stubbed. State-specific coverage is the obvious next step.
Severity-calibration metric. The eval checks faithfulness and citations; I'd add a per-clause severity-MAE metric against expert labels.
Adversarial golden cases. Add benign/near-miss contracts to measure the false-positive rate, not just recall on unfair clauses.

Tech stack

Backend	Frontend
Python 3.11, FastAPI	React 18, Vite, TypeScript (strict)
LangGraph (multi-agent)	Tailwind v4, shadcn/ui
ChromaDB + rank-bm25 (RRF)	TanStack Query, Zustand
sentence-transformers (MiniLM)	Zod (runtime validation)
Google Gemini 2.5 (flash + pro)	Framer Motion (streaming UI)
MCP (FastMCP, stdio)	Biome, Vitest
DeepEval, Langfuse, structlog	pnpm
uv, ruff, mypy --strict, pytest	Lighthouse 100/98/100/91

Quickstart

# Backend (terminal 1)
uv sync
echo "CLAUSEIQ_GEMINI_API_KEY=your_key" > .env
uv run python scripts/ingest_laws.py          # build the vector index (once)
uv run uvicorn clauseiq.interfaces.api.main:app --app-dir src --reload --port 8000

# Frontend (terminal 2)
cd frontend && pnpm install && pnpm dev        # http://localhost:5173

API docs at http://localhost:8000/docs. Run the test suite with uv run pytest.

Deployed on Google Cloud Run (backend) and Vercel (frontend), with Gemini on Vertex AI.

Install into Claude Desktop (MCP)

ClauseIQ runs as an MCP server, so you can analyse contracts and look up Indian law directly inside Claude Desktop. Setup takes under a minute — see docs/MCP_INSTALL.md.

Disclaimer: ClauseIQ is automated decision-support, not legal advice. Amendment history is not tracked; verify current law for time-sensitive matters.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Pulkitgupta17/ClauseIQ'

If you have feedback or need assistance with the MCP directory API, please join our Discord server