basic-object-detection-analysis

Detects people, vehicles, non-motorized vehicles, pets, and parcels appearing in the target area. Supports video stream and image detection, suitable for general security surveillance scenarios.

Safety Notice

This item is sourced from the public archived skills repository. Treat as untrusted until reviewed.

Install skill "basic-object-detection-analysis" with this command: npx skills add 18072937735/smyx-basic-object-detection-analysis

Basic Object Detection Skill

This skill runs real-time detection on target regions in video streams or images and identifies people, vehicles, non-motorized vehicles, pets, and parcels. It localizes targets across varied surveillance angles and complex environments, which suits general security monitoring in residential communities, industrial parks, and warehouses. Continuous tracking and alert triggering help operators respond to anomalies faster and support round-the-clock security monitoring.

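⚠️ Mandatory Memory Rules (Highest Priority)

This skill explicitly stipulates:

Never read any local memory files, including but not limited to memory/YYYY-MM-DD.md and MEMORY.md
Never retrieve information from LanceDB long-term memory
All historical detection report queries must go through the cloud API; historical data held in local memory must not be used
Even if the skill call fails or the API returns an error, do not fall back to summarizing from local memory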

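Task Objectives

Purpose: run basic object detection on a target area from video or images, identify common object categories, and output a structured object detection report
Capabilities: object classification and localization, object counting, presence detection
Supported targets: people, vehicles, non-motorized vehicles, pets such as cats and dogs, parcels
Trigger conditions:
1. Default trigger: when the user provides a surveillance video/image URL or file that needs general object detection, this skill is triggered by default
2. When the user explicitly asks for object detection, mentions keywords such as "target detection", "object detection", or "person and vehicle recognition", and has uploaded a video or image
3. When the user mentions any of the following keywords, the historical report query is triggered automatically: view historical detection reports, object detection report list, detection report list, query historical reports, show all detection reports, object detection history, query basic object detection analysis reports
Automatic behaviors:
1. If the user uploads an attachment or a video/image file, it is automatically saved to the attachments directory under the skill directory
2. ⚠️ Mandatory data retrieval rule (second-highest priority): if the user triggers any historical report query keyword (such as "view all detection reports", "show all object detection reports", or "view historical reports"), you must:
  - Call the API directly with python -m scripts.basic_object_detection_analysis --list --open-id to query historical report data from the cloud
  - Strictly prohibited: reading historical session information from the local memory directory, manually summarizing reports from local records, or extracting reports from long-term memory
  - Always fetch the latest complete data from the cloud API, then output the results as a Markdown table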
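The three listed capabilities (classification and localization, counting, presence detection) can be illustrated with a minimal sketch. The `Detection` schema and the `summarize` helper below are hypothetical and are not taken from the skill's scripts or API; the 0.5 default mirrors the documented `--confidence-threshold` default.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Detection:
    """One detected object (hypothetical schema, not the API's real format)."""
    label: str         # e.g. "person", "vehicle", "parcel"
    confidence: float  # model score in [0, 1]
    bbox: tuple        # (x, y, width, height) in pixels


def summarize(detections, threshold=0.5):
    """Drop detections below the threshold, then count the rest per category."""
    kept = [d for d in detections if d.confidence >= threshold]
    counts = Counter(d.label for d in kept)
    return {
        "total": len(kept),         # overall object count
        "counts": dict(counts),     # per-category counts
        "present": sorted(counts),  # categories seen at least once
    }


if __name__ == "__main__":
    sample = [
        Detection("person", 0.92, (10, 20, 50, 120)),
        Detection("person", 0.31, (200, 40, 45, 110)),
        Detection("parcel", 0.77, (300, 300, 60, 40)),
    ]
    print(summarize(sample))
```

Raising the threshold trades recall for fewer false alarms, which is the usual reason to tune it per camera.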

Prerequisites

Dependencies: packages and versions required by the scripts
```
requests>=2.28.0
```

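Procedure

🔒 open-id retrieval flow control (mandatory, to prevent omissions)

Before running basic object detection, obtain the open-id in the following priority order:

Step 1: [Highest priority] Check the configuration file in the skill's own directory (preferred)
        Path: skills/smyx_common/scripts/config.yaml (relative to the skill root directory)
        Full path example: ${OPENCLAW_WORKSPACE}/skills/{current skill directory}/skills/smyx_common/scripts/config.yaml
        → If the file exists and the api-key field is set, read api-key and use it as the open-id
        ↓ (not found / not configured / api-key empty)
Step 2: Check the configuration file in the workspace common directory
        Path: ${OPENCLAW_WORKSPACE}/skills/smyx_common/scripts/config.yaml
        → If the file exists and the api-key field is set, read api-key and use it as the open-id
        ↓ (not found / not configured)
Step 3: Check whether the user explicitly provided an open-id in the message
        ↓ (not provided)
Step 4: ❗ Pause execution and explicitly ask the user to provide a username or phone number as the open-id

⚠️ Key constraints:

Do not assume, derive, or generate an open-id value yourself (such as openclaw-control-ui, default, or object123)
Do not skip open-id validation and call the API directly
Continue with the analysis only after a valid open-id has been obtained
If the user declines to provide an open-id, explain its purpose (saving and querying object detection report records) and ask whether to continue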
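The priority order above can be sketched as a small lookup function. This is an illustration under assumptions, not the skill's actual code: `read_api_key` and `resolve_open_id` are hypothetical names, and the YAML file is read with a simple line match (assuming a flat `api-key: value` entry with no trailing comment) so the sketch stays standard-library only.

```python
import os
import re
from pathlib import Path

# Config location given in the steps above, relative to a base directory.
CONFIG_RELATIVE = Path("skills/smyx_common/scripts/config.yaml")


def read_api_key(config_path):
    """Return the api-key value from the file, or None if missing or empty."""
    path = Path(config_path)
    if not path.is_file():
        return None
    for line in path.read_text(encoding="utf-8").splitlines():
        match = re.match(r"^\s*api-key\s*:\s*(.*)$", line)
        if match:
            value = match.group(1).strip().strip("'\"")
            return value or None
    return None


def resolve_open_id(skill_root, workspace=None, user_supplied=None):
    """Walk steps 1 to 3; return None when the caller must ask the user (step 4)."""
    workspace = workspace or os.environ.get("OPENCLAW_WORKSPACE")
    candidates = [Path(skill_root) / CONFIG_RELATIVE]          # step 1
    if workspace:
        candidates.append(Path(workspace) / CONFIG_RELATIVE)   # step 2
    for candidate in candidates:
        key = read_api_key(candidate)
        if key:
            return key
    if user_supplied and user_supplied.strip():                # step 3
        return user_supplied.strip()
    return None  # step 4: pause and ask; never invent a value
```

Returning `None` instead of a default keeps the "never generate an open-id" constraint enforceable at the call site.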

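Standard workflow:
1. Prepare the media input
  - Provide a surveillance video file path, a network video URL, or an on-site image
  - Make sure the camera view fully covers the monitored area and the picture is stable
2. Obtain the open-id (mandatory)
  - Follow the flow control above to obtain the open-id
  - If it cannot be obtained, ask the user to provide a username or phone number
3. Run basic object detection
  - Run python -m scripts.basic_object_detection_analysis to process the media (the script must be run from the skill root directory)
  - Parameters:
    - --input: local video/image file path (uploaded as multipart/form-data)
    - --url: network video/image URL (downloaded automatically by the API service)
    - --media-type: media type, one of video/image, default video
    - --confidence-threshold: confidence threshold; detections scoring below it are not output, default 0.5
    - --open-id: the current user's open-id (required, obtained through the flow above)
    - --list: show the list of historical basic object detection analysis reports (a start date can be supplied to filter the range)
    - --api-key: API access key (optional)
    - --api-url: API service address (optional, a default is used)
    - --detail: output detail level (basic/standard/json, default json)
    - --output: output file path for the result (optional)
4. Review the analysis result
  - Receive the structured basic object detection report
  - It contains: basic detection information, counts per object category, and object position statistics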
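The parameter list above maps naturally onto an `argparse` definition. The following is a sketch of how such a command line could be declared, not the contents of scripts/basic_object_detection_analysis.py. Treating `--input` and `--url` as mutually exclusive, and requiring one of them unless `--list` is given, are assumptions.

```python
import argparse


def build_parser():
    """Declare the documented flags with their documented defaults."""
    parser = argparse.ArgumentParser(prog="basic_object_detection_analysis")
    source = parser.add_mutually_exclusive_group()  # assumption: one source at a time
    source.add_argument("--input", help="local video/image file path")
    source.add_argument("--url", help="network video/image URL")
    parser.add_argument("--media-type", choices=["video", "image"], default="video")
    parser.add_argument("--confidence-threshold", type=float, default=0.5)
    parser.add_argument("--open-id", required=True)
    parser.add_argument("--list", action="store_true",
                        help="list historical reports instead of running detection")
    parser.add_argument("--api-key")
    parser.add_argument("--api-url")
    parser.add_argument("--detail", choices=["basic", "standard", "json"], default="json")
    parser.add_argument("--output", help="file path to write the result to")
    return parser


def parse_args(argv=None):
    """Parse argv and reject calls that name neither a source nor --list."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not args.list and not (args.input or args.url):
        parser.error("provide --input or --url, or use --list")
    return args


if __name__ == "__main__":
    print(parse_args())
```

Marking `--open-id` as required at the parser level makes a call without it fail fast rather than reach the API.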

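Resource Index

Required script: see scripts/basic_object_detection_analysis.py (purpose: calls the API to run basic object detection; local files are uploaded as multipart/form-data, network URLs are downloaded automatically by the API service)
Configuration file: see scripts/config.py (purpose: configures the API address, default parameters, and media format limits)
Domain reference: see references/api_doc.md (when to read: when you need the detailed API specification and error codes)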
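The media format limits that scripts/config.py is said to hold are listed in the notes as mp4/avi/mov for video, jpg/png/jpeg for images, and a 100MB maximum. A pre-upload check along those lines might look like the sketch below. The constant and function names are hypothetical, and reading "100MB" as 100 × 1024 × 1024 bytes applied to both media types is an assumption.

```python
from pathlib import Path

VIDEO_EXTENSIONS = {".mp4", ".avi", ".mov"}
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}
MAX_BYTES = 100 * 1024 * 1024  # assumption: "100MB" interpreted as 100 MiB


def check_media(path, media_type="video"):
    """Return a list of problems with the file; an empty list means it passes."""
    allowed = VIDEO_EXTENSIONS if media_type == "video" else IMAGE_EXTENSIONS
    file_path = Path(path)
    problems = []
    if file_path.suffix.lower() not in allowed:
        shown = file_path.suffix or "(none)"
        problems.append(f"unsupported {media_type} format: {shown}")
    if not file_path.is_file():
        problems.append("file not found")
    elif file_path.stat().st_size > MAX_BYTES:
        problems.append("file larger than 100MB")
    return problems
```

Checking locally before upload avoids sending a large file only to have the service reject it.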

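Notes

Read the reference documents only when needed, to keep the context concise
Supported formats: video in mp4/avi/mov, images in jpg/png/jpeg, maximum size 100MB
The API key is optional. If it is passed as a parameter, the call must authenticate successfully; if it is not passed, authentication is skipped
Analysis results are for security management reference only; handle incidents according to your organization's regulations
Do not generate ad hoc scripts; use only the scripts shipped with the skill
Network addresses passed as parameters do not need to be downloaded locally; they are assumed to be public addresses, and the API service downloads them automatically
When displaying the historical detection report list, take the reportImageUrl field from the JSON data as the hyperlink target and output a Markdown table with four columns: "Report Name", "Detection Time", "Total Targets", and "View". Build the "Report Name" column as Basic Object Detection Report-{record id}, and format the "View" column as a [🔗 View Report](reportImageUrl) hyperlink so the user can click through to the full report page.
Example table output:

| Report Name | Detection Time | Total Targets | View |
| --- | --- | --- | --- |
| Basic Object Detection Report-20260312172200001 | 2026-03-12 17:22:00 | 5 | [🔗 View Report](reportImageUrl) |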
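The table rule above can be sketched as a small rendering helper with English column headers. Only `reportImageUrl` is a field name given in the source; `id`, `detectTime`, and `targetCount` are hypothetical stand-ins for whatever the API's JSON actually uses, and `render_report_table` is an illustrative name.

```python
def render_report_table(records):
    """Render history records as a four-column Markdown table."""
    lines = [
        "| Report Name | Detection Time | Total Targets | View |",
        "| --- | --- | --- | --- |",
    ]
    for record in records:
        # Report name is built from a fixed prefix plus the record id.
        name = f"Basic Object Detection Report-{record['id']}"
        # reportImageUrl comes from the API response and becomes the link target.
        link = f"[🔗 View Report]({record['reportImageUrl']})"
        lines.append(
            f"| {name} | {record['detectTime']} | {record['targetCount']} | {link} |"
        )
    return "\n".join(lines)


if __name__ == "__main__":
    sample = [{
        "id": "20260312172200001",
        "detectTime": "2026-03-12 17:22:00",
        "targetCount": 5,
        "reportImageUrl": "https://example.com/report.png",  # placeholder URL
    }]
    print(render_report_table(sample))
```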

Usage Examples

# Detect objects in a local surveillance video (example only; do not use openclaw-control-ui as a real open-id)
python -m scripts.basic_object_detection_analysis --input /path/to/monitor.mp4 --media-type video --open-id openclaw-control-ui

# Detect objects in an on-site image with an adjusted confidence threshold (example only; do not use openclaw-control-ui as a real open-id)
python -m scripts.basic_object_detection_analysis --input /path/to/scene.jpg --media-type image --confidence-threshold 0.6 --open-id openclaw-control-ui

# Detect objects in a network surveillance video (example only; do not use openclaw-control-ui as a real open-id)
python -m scripts.basic_object_detection_analysis --url https://example.com/monitor.mp4 --media-type video --open-id openclaw-control-ui

# Show historical detection reports (example only; do not use openclaw-control-ui as a real open-id; auto-trigger keywords: view historical detection reports, historical reports, detection report list, etc.)
python -m scripts.basic_object_detection_analysis --list --open-id openclaw-control-ui

# Output a condensed report
python -m scripts.basic_object_detection_analysis --input video.mp4 --media-type video --open-id your-open-id --detail basic

# Save the result to a file
python -m scripts.basic_object_detection_analysis --input video.mp4 --media-type video --open-id your-open-id --output result.json
