ocr-service

从图像中提取文字内容,支持多种图像格式和语言。

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "ocr-service" with this command: npx skills add lin-a1/skills-agent/lin-a1-skills-agent-ocr-service

功能

从图像中提取文字内容,支持多种图像格式和语言。

调用方式

from services.ocr_service.client import OCRServiceClient

client = OCRServiceClient()

健康检查

status = client.health_check()

OCR识别

image_base64 = client.image_to_base64("/path/to/image.jpg") result = client.ocr(image_base64)

获取识别结果

texts = result["rec_texts"] # ["识别的文字1", "识别的文字2", ...] scores = result["rec_scores"] # [0.98, 0.95, ...]

返回格式

{ "doc_preprocessor_res": {"angle": 0}, "dt_polys": [[x1,y1], [x2,y2], ...], "rec_texts": ["识别的文字1", "识别的文字2"], "rec_scores": [0.98, 0.95] }

字段说明

  • rec_texts : 识别出的文字列表

  • rec_scores : 每个文字块的置信度

  • dt_polys : 检测到的文本区域坐标

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Automation

websearch-service

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

deepsearch-service

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

embedding-service

No summary provided by upstream source.

Repository SourceNeeds Review
Automation

rag-service

No summary provided by upstream source.

Repository SourceNeeds Review