Skip to main content
Glama

MindPulse Academic Suite

Scholar Agent is part of the MindPulse Academic Suite, forming a powerful synergy between local open-source tools and fully-managed cloud services:

  • ๐ŸŒŒ Scholar Agent (Open Source & Local): A local-first knowledge flywheel that integrates with your IDE via MCP (Model Context Protocol). It saves research answers as local Markdown knowledge cards, building your custom personal LLM-Wiki.

  • โšก PaperPulse (Cloud SaaS): A fully-managed daily academic digest SaaS that monitors arXiv/Semantic Scholar, scores papers based on your personalized research preferences, and delivers distilled summaries straight to your WeChat or Email.

Feature

Scholar Agent (Local)

PaperPulse (Cloud SaaS)

Hosting & Mode

Local MCP Server (Open Source)

Fully-Managed SaaS (Closed Source)

Core Workflow

On-demand research query & knowledge synthesis

Automated daily crawling, scoring & email/WeChat push

Storage

Local Markdown Files / Vector DB

Cloud Postgres / Managed Index

IDE Integration

Deeply integrated with Claude Code, VS Code, Cursor

Web-based Dashboard & Chatbot

Pricing

Free & Open Source

Free Tier / Premium Subscriptions

๐Ÿ’ก Synergy (One-Click Local Sync):

  • One-Click Sync: When your Scholar Agent MCP server is running locally, simply click the "Import to Local Scholar Agent" button on the PaperPulse web interface. The note will be instantly synced and written to your local knowledge/ directory via a secure local loopback interface, bypass browser sandbox constraints and rebuild your search index automatically!

  • Manual Export: You can also click "Export Markdown" to download the standard Markdown note and place it into your knowledge base directory manually.


Why

Every AI conversation generates knowledge โ€” research findings, technical explanations, citations. But LLMs are stateless: each new session starts from zero. The research your AI completed yesterday is not available today.

Scholar Agent makes AI knowledge persistent. It saves research and answers as local knowledge cards โ€” structured, citable, and interconnected. Before answering, the AI checks existing local knowledge first, building on what it has already learned rather than starting from scratch each time.

The result is a personal LLM-Wiki: structured, traceable, continuously growing โ€” making your AI increasingly accurate in the domains you care about.


Demo


What It Does

Architecture & Data Flow

When you ask a question, the agent routes the query through a local-first retrieval loop before falling back to external sources:

sequenceDiagram
    actor User
    participant Host as Claude Code / VS Code
    participant MCP as Scholar Agent (MCP Server)
    participant Local as Local Index (BM25)
    participant Web as arXiv / Semantic Scholar

    User->>Host: Prompt: "Explain MoE"
    Host->>MCP: query_knowledge("MoE")
    MCP->>Local: BM25 Query
    alt Local Hit (BM25 Score >= Threshold)
        Local-->>MCP: Match (e.g. mixture-of-experts.md)
        MCP-->>Host: Local Note Context
    else Local Miss
        MCP->>Web: API Fallback (arxiv + web search)
        Web-->>MCP: Raw Papers & Metadata
        MCP->>MCP: Synthesize & Distill
        MCP->>Local: Save Card (Staging -> Validate -> Promote)
        MCP-->>Host: Synthesized Answer + Citations
    end
    Host->>User: Natural Language Response

Knowledge Persistence

Each conversation can produce a knowledge card โ€” a structured record with:

  • The question asked

  • Evidence-backed answer with citations

  • Confidence scores and uncertainty flags

  • Source references you can trace back

These cards accumulate into a searchable local knowledge base. Next time a similar question comes up, the AI draws from what's already been researched.

Knowledge Network

Cards aren't isolated files. Scholar Agent:

  • Maintains a quality lifecycle for each card: draft โ†’ reviewed โ†’ trusted โ†’ stale โ†’ deprecated

  • Auto-generates [[wiki-links]] between related cards

  • Tracks provenance โ€” every claim links back to its source evidence

  • Outputs Obsidian-compatible Markdown (YAML frontmatter + wiki-links)

  • Obsidian Graph Ready โ€” Open your data directory (e.g. ~/scholar/) directly as an Obsidian Vault to navigate your visual knowledge graph.

Evidence-Based Answers

When researching a question, Scholar Agent:

  1. Searches local knowledge (BM25 keyword index)

  2. Falls back to web and academic APIs when local knowledge is insufficient

  3. Synthesizes answers where every claim cites its source

  4. Flags claims that lack supporting evidence

  5. Returns structured results with confidence levels and suggested next steps

Academic Research Pipeline

For paper research, Scholar Agent provides:

  • Paper Search โ€” arXiv, DBLP, Semantic Scholar with 10+ top-conference filters

  • Smart Scoring โ€” 4-dimensional ranking: relevance, recency, popularity, quality

  • Deep Analysis โ€” 20+ section structured notes with AI-assisted completion

  • Figure Extraction โ€” From arXiv source archives and PDFs

  • Daily Recommendations โ€” Dual-track: 2 top-conference papers + 2 arXiv innovation papers

  • Paper โ†’ Knowledge Card โ€” Feed analyses back into the knowledge base


Quick Start

Install

pip install py-scholar-agent

Or with pipx (isolated environment):

pipx install py-scholar-agent

Or from source:

git clone https://github.com/zfy465914233/scholar-agent.git
cd scholar-agent
pip install -e .

Setup

scholar-agent init

One command creates data directories, writes config, and registers MCP with Claude Code. Done.

Modes

Mode

Command

Data Location

Scope

Global (recommended)

scholar-agent init

~/scholar/

Every project

Project-Local

SCHOLAR_HOME=./scholar scholar-agent init

my-project/scholar/

Current project only

Docker

docker run -v ~/scholar:/data scholar-agent serve-mcp

Container volume

Isolated


MCP Integration

Scholar Agent runs as an MCP server, integrating directly into your tools:

  • Claude Code โ€” scholar-agent install claude --write

  • VS Code Copilot โ€” scholar-agent install vscode --write

  • OpenCode โ€” scholar-agent install opencode --write

Core tools (always available): query_knowledge ยท save_research ยท list_knowledge ยท capture_answer ยท ingest_source ยท build_graph

Academic tools (set SCHOLAR_ACADEMIC=1): search_papers ยท search_conf_papers ยท download_paper ยท analyze_paper ยท extract_paper_images ยท paper_to_card ยท daily_recommend ยท link_paper_keywords

Add this to your claude_desktop_config.json:

{
  "mcpServers": {
    "scholar-agent": {
      "command": "scholar-agent",
      "args": ["serve-mcp"],
      "env": {
        "SCHOLAR_ACADEMIC": "1"
      }
    }
  }
}

Local Retrieval

Knowledge is indexed with BM25 for fast keyword search โ€” no external dependencies required. An optional embedding layer can be enabled for semantic similarity with scholar-agent index --build-embedding-index.


CLI Reference

Command

Description

scholar-agent init

One-command setup: data dirs + config + MCP registration

scholar-agent serve-mcp

Start the MCP server

scholar-agent doctor

Show environment and config diagnostics

scholar-agent config show

Show resolved configuration

scholar-agent install claude --write

Register MCP with Claude Code

scholar-agent install vscode --write

Register MCP with VS Code Copilot

scholar-agent install opencode --write

Register MCP with OpenCode


Configuration

Environment Variables

Variable

Required

Description

SCHOLAR_ACADEMIC

No

Set to 1 to enable academic tools

SCHOLAR_HOME

No

Override data directory (default: ~/scholar/)

S2_API_KEY

No

Semantic Scholar API key (get one free)

LLM_API_KEY

No

LLM API key for advanced synthesis pipeline

Config File

See .scholar.example.json for a full example. Key sections:

  • knowledge_dir โ€” Knowledge cards directory

  • index_path โ€” BM25 search index

  • academic.research_interests โ€” Your domains, keywords, arXiv categories

  • academic.scoring โ€” Paper scoring weights

Data Directory

scholar/
โ”œโ”€โ”€ config/         # Configuration files
โ”œโ”€โ”€ knowledge/      # Knowledge cards
โ”œโ”€โ”€ paper-notes/    # Paper analysis notes
โ”œโ”€โ”€ daily-notes/    # Daily paper recommendations
โ”œโ”€โ”€ indexes/        # BM25 search index
โ”œโ”€โ”€ cache/          # Cached data
โ””โ”€โ”€ outputs/        # Generated outputs

Daily research flow

Ask a question (via MCP)
  โ†’ Scholar Agent searches local knowledge first
  โ†’ Falls back to web/academic APIs when needed
  โ†’ Synthesizes answer with citations
  โ†’ Saves as a knowledge card
  โ†’ Next similar question draws from local knowledge

Paper analysis flow

For best paper analysis quality:

  1. Download: download_paper("2510.24701", title="Paper Title", domain="LLM")

  2. Extract images: extract_paper_images("2510.24701")

  3. Deep analysis: analyze_paper(paper_json)

  4. Feed into knowledge base: paper_to_card(paper_json)

Downloading the PDF first enables full-text extraction, producing notes with specific data, formulas, and experimental results.


Development

make dev       # Install with dev dependencies + pre-commit hooks
make lint      # Run ruff + mypy
make test      # Run test suite (1121 tests, ~20s, fully offline)
make coverage  # Run tests with coverage report
make build     # Build distribution package
make docker    # Build Docker image

See CONTRIBUTING.md for detailed guidelines.

Highlights

  • Knowledge persistence โ€” Every conversation can produce a reusable knowledge card; the local knowledge base grows over time

  • Evidence-based โ€” Every claim cites its source, with confidence scores and uncertainty flags

  • Quality lifecycle โ€” Cards are validated, scored, promoted, and deprecated. Full provenance tracking

  • Knowledge network โ€” Wiki-links connect related cards into a navigable knowledge graph

  • Obsidian compatible โ€” Markdown + YAML frontmatter + [[wiki-links]]. Your data, no lock-in

  • Academic pipeline โ€” Search โ†’ Score โ†’ Analyze โ†’ Extract โ†’ Recommend, fully automated

  • MCP integration โ€” Works with Claude Code, VS Code Copilot, and OpenCode out of the box

  • Offline-first โ€” Local BM25 index, graceful degradation when external APIs are unavailable

Comparison

Wondering how Scholar Agent compares to mem0, MemGPT, or Zep? See docs/comparison.md for a detailed breakdown.

License

MIT โ€” see LICENSE.

A
license - permissive license
-
quality - not tested
A
maintenance

Maintenance

โ€“Maintainers
โ€“Response time
โ€“Release cycle
1Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/zfy465914233/scholar-agent'

If you have feedback or need assistance with the MCP directory API, please join our Discord server