alefba_read_document
Read a document from a URL and extract its text using OCR. Returns structured page data with text and bounding boxes for Persian, Arabic, and English.
Instructions
Read (OCR) a document or image from a URL. استخراج متن از سند یا تصویر با آدرس اینترنتی. By default the call blocks (wait=true) and returns the full structured result {document_url, pages:[{page_url,width,height,angle,text,parts:[...]}]} where each box is 'left top width height' in pixels. Set wait=false to queue the job and get {state, task_ids} back, then poll with alefba_get_result. type is one of general, ID-card, excel; priority is 1 (highest) to 4 (lowest). Pass an optional task_id to fetch a queued result. Optional name of the configured Alefba instance to target. Omit to use the default instance.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| document_url | Yes | ||
| type | No | general | |
| fix_orientation | No | ||
| word_positions | No | ||
| wait | No | ||
| priority | No | ||
| task_id | No | ||
| instance | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||