ocr-service

Extracts text from images, with support for multiple image formats and languages.

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Install skill "ocr-service" with this command: npx skills add lin-a1/skills-agent/lin-a1-skills-agent-ocr-service

Features

Extracts text from images, with support for multiple image formats and languages.

Usage

from services.ocr_service.client import OCRServiceClient

client = OCRServiceClient()

# Health check
status = client.health_check()

# Encode the image as base64, then run OCR on it
image_base64 = client.image_to_base64("/path/to/image.jpg")
result = client.ocr(image_base64)

# Read the recognized texts and their confidence scores
texts = result["rec_texts"]    # ["recognized text 1", "recognized text 2", ...]
scores = result["rec_scores"]  # [0.98, 0.95, ...]
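Callers often want to drop low-confidence lines before using the recognized text. The following is a minimal sketch of that, not part of the upstream client: the helper name `confident_texts` and the `min_score` threshold are hypothetical, and it assumes that `rec_texts` and `rec_scores` are parallel lists, so that the i-th score belongs to the i-th text. It runs on a plain dict, so it can be tried without the service.

```python
from typing import Any


def confident_texts(result: dict[str, Any], min_score: float = 0.9) -> list[str]:
    """Return the recognized texts whose score is at least min_score.

    Hypothetical helper; assumes rec_texts[i] pairs with rec_scores[i].
    """
    texts = result.get("rec_texts", [])
    scores = result.get("rec_scores", [])
    # zip stops at the shorter list, so unpaired entries are ignored
    return [text for text, score in zip(texts, scores) if score >= min_score]


# Sample data shaped like the result shown above
sample = {
    "rec_texts": ["recognized text 1", "recognized text 2"],
    "rec_scores": [0.98, 0.95],
}
print(confident_texts(sample, min_score=0.96))  # ['recognized text 1']
```

A sensible threshold depends on image quality and language, so 0.9 is only a placeholder default.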

Response format

{
  "doc_preprocessor_res": {"angle": 0},
  "dt_polys": [[x1,y1], [x2,y2], ...],
  "rec_texts": ["recognized text 1", "recognized text 2"],
  "rec_scores": [0.98, 0.95]
}
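To make the response easier to work with, it can be loaded into a small typed structure. This is a sketch under assumptions rather than anything the upstream client provides: `OCRResult` and `parse_ocr_response` are hypothetical names, the length check assumes that each text has exactly one score, and `dt_polys` is left unparsed because its exact layout is abbreviated in the format shown above.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class OCRResult:
    """Hypothetical container for the fields shown in the response format."""
    angle: int
    texts: list[str]
    scores: list[float]


def parse_ocr_response(payload: dict[str, Any]) -> OCRResult:
    """Convert a raw response dict into an OCRResult, checking basic consistency."""
    texts = list(payload["rec_texts"])
    scores = [float(score) for score in payload["rec_scores"]]
    # Assumption: one score per recognized text
    if len(texts) != len(scores):
        raise ValueError(
            f"rec_texts has {len(texts)} entries but rec_scores has {len(scores)}"
        )
    # Fall back to angle 0 if the preprocessor result is absent
    angle = payload.get("doc_preprocessor_res", {}).get("angle", 0)
    return OCRResult(angle=angle, texts=texts, scores=scores)


payload = {
    "doc_preprocessor_res": {"angle": 0},
    "rec_texts": ["recognized text 1", "recognized text 2"],
    "rec_scores": [0.98, 0.95],
}
parsed = parse_ocr_response(payload)
print(parsed.texts, parsed.scores, parsed.angle)
```

Raising on a length mismatch surfaces a malformed response early instead of silently mispairing texts and scores.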

Field descriptions
