Why this server?
This server provides direct integration with the Kaggle API, allowing you to search for ML/AI datasets, competitions, kernels, and pre-trained models specifically requested in your query.
-securityAlicense-qualityConnects Claude AI to the Kaggle API through the Model Context Protocol, enabling users to browse competitions, search and download datasets, analyze kernels, and access pre-trained models through natural language interactions.Last updated 7 months agoMITWhy this server?
This server is designed to search the extensive arXiv research repository, providing direct access to academic papers, which is highly relevant for ML/AI models training research.
AsecurityAlicense-qualityThe ArXiv MCP Server bridges the gap between AI models and academic research by providing a sophisticated interface to arXiv's extensive research repository. This server enables AI assistants to perform precise paper searches and access full paper content, enhancing their ability to engage with scientific literature.Last updated 5 days ago342,482Apache 2.0Why this server?
A specialized server for academic papers, offering advanced features like semantic search and analysis of research from arXiv, directly fitting your need for research paper data.
-security-license-qualityEnables AI-powered academic paper discovery, search, and analysis from arXiv with advanced features like semantic search, citation network analysis, and multi-format exports (BibTeX, RIS, JSON, CSV). Provides intelligent research assistance through specialized AI prompts for summarization, trend tracking, and literature review automation.Last updated 2 hours ago14MITWhy this server?
This provides access to search and retrieve ML models, datasets, and their metadata directly from the Hugging Face Hub, a crucial resource for ML/AI training.
AsecurityFlicense-qualityEnables access to the Hugging Face Hub API to search and retrieve information about machine learning models, datasets, and their metadata. Provides comprehensive tools for exploring the Hugging Face ecosystem including model details, dataset information, and parquet file access.Last updated 8 months ago8Why this server?
Excellent for finding data on arbitrary 'websites' as it enables scraping and extraction from virtually any website globally, bypassing anti-bot systems to gather training data.
-security-license-qualityEnables AI models to scrape and extract data from any website globally using Thordata's 195+ country proxy network. Bypasses anti-bot systems and renders JavaScript content, outputting structured data in Markdown, HTML, or Links format.Last updated 7 months agoMITWhy this server?
Enables comprehensive web and local document crawling and data extraction, perfect for gathering large volumes of varied data for ML/AI model training context.
-securityFlicense-qualityEnables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.Last updated 23 days ago55Why this server?
This server combines web search, content extraction, web crawling, and scraping capabilities using the Firecrawl API, making it a robust tool for general data collection from websites.
AsecurityFlicense-qualityBuilt as a Model Context Protocol (MCP) server that provides advanced web search, content extraction, web crawling, and scraping capabilities using the Firecrawl API.Last updated 12 days ago41Why this server?
Designed to ingest, index, and retrieve structured knowledge from diverse sources (including web, documents, GitHub), making it useful for building a knowledge base for ML/AI context.

Graphlit MCP Serverofficial
AsecurityAlicense-qualityThe Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.Last updated 3 months ago469373MITWhy this server?
If your research papers are stored locally as PDFs, this tool allows for semantic search and retrieval within that document collection using vector embeddings.
-securityAlicense-qualityA Model Context Protocol server that enables intelligent document search and retrieval from PDF collections, providing semantic search capabilities powered by OpenAI embeddings and ChromaDB vector storage.Last updated 7 months ago11MIT