screen-control-operator-v3

Screen Control Operator V3

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "screen-control-operator-v3" with this command: npx skills add breverdbidder/life-os/breverdbidder-life-os-screen-control-operator-v3

Screen Control Operator V3

Built 3 weeks before Claude Cowork announcement. Now with feature parity + our advantages.

Core Advantages Over Claude Cowork

Feature Claude Cowork Screen Control Operator V3

Vision Method Screenshots CDP + Accessibility Tree

Speed 1-5 seconds 50-200ms (10x faster)

Cost Vision tokens ($$$) Text tokens only ($)

Reliability ~85% OCR 100% semantic queries

Skill Recording ✅ Yes ✅ Yes

Parallel Execution ✅ Yes ✅ Yes

Domain Skills Generic Foreclosure-specific

Smart Router N/A 90% FREE tier

When to Use This Skill

Trigger phrases:

  • "control my screen"

  • "record a workflow"

  • "replay this skill"

  • "verify the Lovable preview"

  • "test the scraper"

  • "debug DOM selectors"

  • "inspect page structure"

  • "run BECA lookup"

  • "search BCPAO"

  • "autonomous browser testing"

Quick Start

from screen_control_operator_v3 import ScreenControlOperatorV3

operator = ScreenControlOperatorV3(headless=False) operator.launch()

Option 1: Record new skill

skill = operator.record_skill("My Workflow", domain="foreclosure") operator.save_skill(skill, "my_workflow.json")

Option 2: Play recorded skill

result = operator.play_skill_file("my_workflow.json", variables={"case_number": "2025-CA-001234"})

Option 3: Use pre-built foreclosure skills

result = operator.lookup_beca_case("2025-CA-001234") result = operator.lookup_bcpao_property("12-34-56-78-90") result = operator.search_acclaimweb_liens("SMITH JOHN")

Option 4: Parallel execution

results = operator.execute_parallel([ {"skill": skill1, "variables": {"case": "001"}}, {"skill": skill2, "variables": {"case": "002"}}, {"skill": skill3, "variables": {"case": "003"}}, ])

operator.close()

CLI Commands

Record new skill

python screen_control_operator_v3.py record
--name "BECA Deep Search"
--domain foreclosure
--output skills/beca_deep.json
--start-url "https://vmatrix1.brevardclerk.us/beca/"

Play skill with variables

python screen_control_operator_v3.py play
--skill skills/beca_deep.json
--vars '{"case_number": "2025-CA-001234"}'

Pre-built skills

python screen_control_operator_v3.py beca --case "2025-CA-001234" python screen_control_operator_v3.py bcpao --parcel "12-34-56-78-90"

Inspect page DOM (NOT screenshot)

python screen_control_operator_v3.py inspect
--url "https://example.com"
--output structure.json

Pre-Built Foreclosure Skills

Skill ID Name Variables Description

beca_lookup_001

BECA Case Lookup case_number

Search BECA, extract judgment

bcpao_lookup_001

BCPAO Property parcel_id

Get property details, photo

acclaimweb_lien_001

Lien Search party_name

Search recorded liens

realforeclose_list_001

Auction List auction_date

Get auction calendar

Skill Recording (Cowork Feature)

How Recording Works

  • Launch browser and navigate to starting page

  • Call record_skill(name, domain)

  • Perform your workflow manually

  • Press Enter or Ctrl+C to stop

  • Skill is captured with:

  • All navigation events

  • All click events (with selectors)

  • All type events (with values)

  • Success criteria (inferred)

What Gets Recorded

{ "skill_id": "abc123def456", "name": "BECA Deep Search", "domain": "foreclosure", "actions": [ { "action_type": "navigate", "url": "https://vmatrix1.brevardclerk.us/beca/" }, { "action_type": "type", "selector": "#caseNumber", "value": "{{case_number}}", "element_info": {"label": "Case Number", "role": "textbox"} }, { "action_type": "click", "selector": "button[type='submit']", "element_info": {"label": "Search", "role": "button"} } ], "variables": {"case_number": ""}, "success_criteria": [ {"type": "element_visible", "selector": ".case-details"} ] }

Parallel Execution (Cowork Feature)

Execute multiple skills simultaneously in isolated browser contexts:

executor = ParallelExecutor(max_parallel=5) executor.launch()

tasks = [ {"skill": beca_skill, "variables": {"case": "2025-CA-001"}}, {"skill": beca_skill, "variables": {"case": "2025-CA-002"}}, {"skill": beca_skill, "variables": {"case": "2025-CA-003"}}, {"skill": beca_skill, "variables": {"case": "2025-CA-004"}}, {"skill": beca_skill, "variables": {"case": "2025-CA-005"}}, ]

results = executor.execute_parallel(tasks)

All 5 run simultaneously in separate contexts

DOM Inspection (Our Advantage)

Get page structure via accessibility tree (NOT screenshots):

structure = operator.get_page_structure()

Returns:

{ "url": "https://example.com", "title": "Page Title", "elements": [ {"tag": "BUTTON", "role": "button", "label": "Submit", "testid": "submit-btn"}, {"tag": "INPUT", "role": "textbox", "label": "Search", "id": "search-input"}, ... ] }

Find element by natural language:

selector = operator.find_element_semantic("search button")

Returns: "[data-testid='search-btn']" or "#search-button" or similar

Smart Router Integration

Operation Model Tier Cost

Page navigation FREE (Gemini) $0

Element finding FREE (Gemini) $0

Skill recording FREE (Gemini) $0

Skill playback FREE (Gemini) $0

Error recovery BALANCED (Sonnet) $

Result: 95%+ operations in FREE tier

GitHub Actions Integration

.github/workflows/browser_agent.yml

name: Browser Agent Daily on: schedule: - cron: '0 4 * * *' # 11 PM EST

jobs: scrape: runs-on: ubuntu-latest steps: - uses: actions/checkout@v4

  - name: Set up Python
    uses: actions/setup-python@v4
    with:
      python-version: '3.11'
      
  - name: Install dependencies
    run: |
      pip install playwright
      playwright install chromium
      
  - name: Run Browser Agent
    run: |
      python src/agents/screen_control_operator_v3.py beca \
        --case "${{ github.event.inputs.case_number }}"

Error Recovery

When element not found, V3 tries multiple strategies:

  • data-testid (most reliable)

  • aria-label (semantic)

  • role + text (accessibility)

  • ID (traditional)

  • Original selector (fallback)

  • Text content (last resort)

This is why we're 100% reliable vs Cowork's ~85%.

Dependencies

pip install playwright --break-system-packages playwright install chromium

File Locations

  • Script: src/agents/screen_control_operator_v3.py

  • Skills: skills/*.json

  • Skill Lib: .claude/skills/screen-control-operator-v3/SKILL.md

Built by Claude AI Architect + Ariel Shapira

December 25, 2025 (original) → January 15, 2026 (V3)

We are the team of Claude innovators!

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

amazon-bestseller-launch

No summary provided by upstream source.

Repository SourceNeeds Review
General

kdp-listing-optimizer

No summary provided by upstream source.

Repository SourceNeeds Review
General

screen-control-operator

No summary provided by upstream source.

Repository SourceNeeds Review
General

adhd-task-management

No summary provided by upstream source.

Repository SourceNeeds Review