Javelin MCP Server
AI security guardrails for the Model Context Protocol (MCP). This server integrates with Javelin's AI security platform to provide comprehensive guardrails for AI applications.
Features
🛡️ Trust & Safety: Detect harmful content across multiple categories including violence, weapons, hate speech, crime, sexual content, and profanity
🔒 Prompt Injection Detection: Identify prompt injection attempts and jailbreak techniques to prevent model manipulation
🌍 Language Detection: Detect language with confidence scores and enforce language policies
Usage
This server is hosted in the cloud and accessible via the MCP registry. Connect your MCP client to the hosted endpoint.
Available Tools
promptInjectionDetection
- Detect prompt injection and jailbreak attemptstrustSafetyDetection
- Analyze content for harmful categorieslanguageDetection
- Detect language with confidence scoring
Example Usage
Local Development
Setup
Environment Variables
Run Locally
set MCP_TRANSPORT environment variable to sse or http based on application layer protocol used.
Test
API Documentation
All tools return structured assessments with:
- Categories: Boolean flags for each threat type
- Category Scores: Confidence scores (0.0-1.0)
- Request Reject: Boolean indicating policy decision
See Javelin Documentation for detailed API reference.
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
Provides AI security guardrails through Javelin's platform to detect harmful content, prompt injection attempts, and language policies. Enables comprehensive content safety analysis with trust & safety detection, prompt injection protection, and language identification.