diepre-vision-cognition

DiePre Vision Cognition Skill: a cognitive framework that fuses machine-vision perception for packaging and die-cutting with SOUL reasoning

Safety Notice

This listing is from the official public ClawHub registry. Review SKILL.md and referenced scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "diepre-vision-cognition" with this command: npx skills add kingofzhao/diepre-vision-cognition

DiePre Vision Cognition Skill

Metadata

Field          Value
Name           diepre-vision-cognition
Version        1.0.0
Author         KingOfZhao
Release date   2026-03-31
Confidence     96%

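Academic References

The technical approach of this vision framework is inspired by the following recent research:

  1. Generating CAD Code with Vision-Language Models. A VLM generates CAD code with iterative verification (CADCodeVerify); directly upgrades the photo → DXF pipeline.
  2. From 2D CAD to 3D Parametric via VLM. 2D drawings → parametric 3D; addresses perspective correction and parameterization.
  3. Tool-Augmented VLLMs as Generic CAD Task Solvers (ICCV 2025). VLLM plus tool calling for general CAD tasks; wraps the OpenCV pipeline as a callable Skill.
  4. Efficient Vision-Language-Action Models. Efficiency optimizations for VLA (low latency, memory optimization); suited to local deployment.
  5. Vlaser: Synergistic Embodied Reasoning. An embodied-reasoning VLA; the theoretical basis for a future "photo → action decision" workflow.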

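Core Capabilities

Fuses DiePre (die-cutting and creasing) machine-vision perception with the SOUL cognitive framework:

  1. Known/unknown visual separation: extracts definite features (known) and ambiguous regions (unknown) from the image
  2. File memory: every inspection result is written to vision_log/YYYY-MM-DD.jsonl
  3. Four-way visual collision: analyzes four dimensions at once (front view, reversed view, side lighting, overall layout)
  4. Human-in-the-loop quality inspection: AI initial judgment → human review → annotation feedback → continuous model evolution
  5. Confidence-gated QC output: defects below 90% confidence are automatically escalated to human review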
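
The file-memory and confidence-escalation capabilities in the list above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the function names (`route_defect`, `append_vision_log`), the record fields (`timestamp`, `image_path`, `defects`, `needs_human_review`), and the shape of a defect dict are all assumptions. Only the `vision_log/YYYY-MM-DD.jsonl` path pattern and the 90% threshold come from the description.

```python
from __future__ import annotations

import json
from datetime import date, datetime, timezone
from pathlib import Path

# From the skill description: defects below 90% confidence go to human review.
REVIEW_THRESHOLD = 0.90


def route_defect(defect: dict) -> dict:
    """Return a copy of a defect dict with a review flag (hypothetical field name)."""
    routed = dict(defect)
    routed["needs_human_review"] = defect["confidence"] < REVIEW_THRESHOLD
    return routed


def append_vision_log(
    workspace: Path,
    image_path: str,
    defects: list[dict],
    day: date | None = None,
) -> Path:
    """Append one inspection record to vision_log/YYYY-MM-DD.jsonl under workspace."""
    day = day or date.today()
    log_dir = Path(workspace) / "vision_log"
    log_dir.mkdir(parents=True, exist_ok=True)
    log_file = log_dir / f"{day.isoformat()}.jsonl"

    # The record layout is an assumption; one JSON object per line.
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "image_path": image_path,
        "defects": [route_defect(d) for d in defects],
    }
    with log_file.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record, ensure_ascii=False) + "\n")
    return log_file


if __name__ == "__main__":
    # Illustrative defects only; not real inspection output.
    sample = [
        {"type": "crease_offset", "confidence": 0.97},
        {"type": "burr", "confidence": 0.72},
    ]
    path = append_vision_log(Path("."), "path/to/dieline.png", sample)
    print(f"logged to {path}")
```

Appending one JSON object per line keeps each day's log readable line by line and safe to extend without rewriting earlier records. This is presumably why a `.jsonl` file is used.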

Installation

clawhub install diepre-vision-cognition
# Or install manually
cp -r skills/diepre-vision-cognition ~/.openclaw/skills/

Usage

from skills.diepre_vision_cognition import DiePrevisionCognition

vision = DiePrevisionCognition(workspace=".")
result = vision.analyze(
    image_path="path/to/dieline.png",
    context={"material": "corrugated", "thickness_mm": 3.0}
)

print(result.confidence)     # overall confidence
print(result.defects)        # list of detected defects
print(result.collision_log)  # details of the four-way analysis

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
