Extracts video and live streaming content for summarization and knowledge graph generation
Supports outputting processed video content in Markdown format
Uses PaddleOCR for accurate Chinese character recognition from images in content
Extracts content from Douyin (TikTok China) short videos for summarization and knowledge graph generation
Extracts content from social media posts with OCR support for text recognition from images
Extracts content from Q&A platform for summarization and knowledge graph generation
Video Content Summarization MCP Server
A Model Context Protocol (MCP) server that extracts content from multiple video platforms and generates intelligent knowledge graphs.
Features
🌐 Multi-Platform Support
- Douyin (TikTok China) - Short video content extraction
- Bilibili - Video and live streaming content
- Xiaohongshu (Little Red Book) - Social media posts with OCR support
- Zhihu - Q&A platform content
✨ Advanced Capabilities
- OCR Text Recognition - Extract text from images using PaddleOCR
- Knowledge Graph Generation - Intelligent content structuring
- Chinese Content Optimization - Specialized processing for Chinese text
- Context-Aware Extraction - Smart content understanding and quality control
Installation
Prerequisites
- Python 3.8 or higher
- Anaconda (recommended for dependency management)
Setup
- Clone the repository:
- Create and activate conda environment:
- Install dependencies:
Configuration
For Claude Desktop
Add this configuration to your Claude Desktop config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
For Other MCP Clients
The server can be started directly:
Usage
Basic Video Processing
Supported URL Formats
- Douyin:
https://v.douyin.com/...
or full URLs - Bilibili:
https://www.bilibili.com/video/...
- Xiaohongshu:
https://www.xiaohongshu.com/discovery/item/...
- Zhihu:
https://www.zhihu.com/question/...
Context-Enhanced Processing
For platforms with anti-crawling measures, you can provide context:
Features in Detail
OCR Integration
- Automatic image text extraction from Xiaohongshu posts
- PaddleOCR for accurate Chinese character recognition
- Batch processing for multiple images
Knowledge Graph Generation
- Structured content analysis
- Intelligent relationship mapping
- Quality control and validation
Anti-Crawling Strategies
- Smart fallback mechanisms
- Context-based extraction
- User guidance for optimal results
Development
Project Structure
Running Tests
Dependencies
Key dependencies include:
bilibili-api-python
- Bilibili API integrationyt-dlp
- Video downloading capabilitiesPaddleOCR
- OCR text recognitionbeautifulsoup4
- Web scrapingrequests
- HTTP requests
See requirements.txt
for complete list.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature
) - Commit your changes (
git commit -m 'Add some amazing feature'
) - Push to the branch (
git push origin feature/amazing-feature
) - Open a Pull Request
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- Built using the Model Context Protocol
- OCR powered by PaddleOCR
- Platform integrations using various open-source APIs
This server cannot be installed
Extracts content from multiple video platforms (Douyin, Bilibili, Xiaohongshu, Zhihu) and generates intelligent knowledge graphs with OCR text recognition capabilities.
Related MCP Servers
- -securityFlicense-qualityEnables extraction of transcript text from YouTube videos by providing the video URL, supporting standard, shortened, and embed URL formats.Last updated -1JavaScript
- -securityFlicense-qualityA Model Context Protocol server that enables AI assistants to extract transcripts from YouTube videos, allowing AI to analyze and work with video content directly.Last updated -61TypeScript
- -security-license-qualityEnables interaction with YouTube videos by extracting metadata, captions in multiple languages, and converting content to markdown with various templates.Last updated -TypeScript
- -securityFlicense-qualityA Model Context Protocol server that provides AI models with real-time trending content from 18 major Chinese internet platforms, including Weibo, Zhihu, and Bilibili.Last updated -TypeScript