ocr_paddle
Extract text from images using PaddleOCR with support for over 100 languages. Ideal for multi-language documents and CPU-only servers.
Instructions
Extract text from an image using PaddleOCR (100+ languages, production-grade).
PaddleOCR is purpose-built for text extraction with superior accuracy and speed compared to general vision models. Best for:
Multi-language documents (100+ languages supported)
CPU-only servers (PP-OCRv6 Tiny is only 1.5M parameters)
High-volume batch OCR (5.2× faster than previous versions)
Args: image_path: Absolute or relative path to the image file (supports PNG, JPEG, etc.). language: Language code - 'en' (English), 'ch' (Chinese), 'japan' (Japanese), 'korean', 'french', 'german', 'spanish', 'arabic', 'multilingual', etc. See PaddleOCR docs for full list. detail_level: 'normal' for plain text, 'high' for text with bounding boxes and confidence. use_angle_cls: If True, use angle classification to correct rotated text (default True).
Returns: Dict with extracted text, and optionally regions with bounding boxes and confidence.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| language | No | en | |
| image_path | Yes | ||
| detail_level | No | normal | |
| use_angle_cls | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||