Browser Automation
Help the user automate browser-based tasks using Yutori's Navigator agent.
Process
Understand the task
- What website needs to be automated?
- What actions are required? (clicking, typing, extracting data)
- Does it require login or authentication?
Define the task clearly
- Break complex workflows into clear steps
- Specify what data to extract if applicable
- Note any buttons or elements to interact with
Start the browsing task
Use run_browsing_task with:
- task: Clear natural language instructions
- start_url: The URL to begin browsing
- max_steps: 25 (default) to 100 for complex flows
- output_fields: For structured data extraction (e.g., ["name", "price", "url"])
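As a rough illustration of how these four parameters fit together, the sketch below assembles them into a plain dictionary. The helper name build_browsing_task_args and its validation rules are hypothetical and are not part of the Navigator tool. How the dictionary is actually passed to run_browsing_task depends on the client in use.

```python
from typing import Any, Dict, List, Optional


def build_browsing_task_args(
    task: str,
    start_url: str,
    max_steps: int = 25,
    output_fields: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: collect the run_browsing_task parameters in one dict."""
    if not task.strip():
        raise ValueError("task must contain instructions")
    if not start_url.startswith(("http://", "https://")):
        raise ValueError("start_url should be an absolute http(s) URL")
    if max_steps < 1:
        raise ValueError("max_steps must be positive")

    args: Dict[str, Any] = {
        "task": task,
        "start_url": start_url,
        "max_steps": max_steps,
    }
    # Only include output_fields when structured extraction is wanted.
    if output_fields:
        args["output_fields"] = list(output_fields)
    return args


example_args = build_browsing_task_args(
    task="List the first three products with their name, price, and URL.",
    start_url="https://example.com/products",  # placeholder URL
    output_fields=["name", "price", "url"],
)
print(example_args)
```

Leaving output_fields out of the dictionary when it is not needed keeps simple click-and-confirm tasks free of extraction settings.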
Poll for results
- Browsing typically takes 30-120 seconds depending on complexity
- Use get_browsing_task_result to check status
- Poll every 10-15 seconds until complete
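The polling guidance above can be sketched as a small loop. This is a sketch under assumptions: check_result stands in for whatever call wraps get_browsing_task_result, and the "status" key with the values "succeeded" and "failed" is a guess at the result shape, not something the tool is confirmed to return. The timeout default is also illustrative.

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal status values; adjust to what the tool actually reports.
TERMINAL_STATUSES = ("succeeded", "failed")


def poll_until_complete(
    check_result: Callable[[], Dict[str, Any]],
    interval_s: float = 10.0,
    timeout_s: float = 180.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call check_result every interval_s seconds until it reports a terminal status."""
    waited = 0.0
    while True:
        result = check_result()
        if result.get("status") in TERMINAL_STATUSES:
            return result
        if waited >= timeout_s:
            raise TimeoutError(f"task still running after {waited:.0f} seconds")
        sleep(interval_s)
        waited += interval_s


# Demo with a stubbed status source and a no-op sleep, so it runs instantly.
_fake_responses = iter([{"status": "running"}, {"status": "succeeded"}])
final = poll_until_complete(lambda: next(_fake_responses), sleep=lambda _: None)
print(final["status"])
```

Passing the sleep function as a parameter keeps the loop testable without real waiting. In normal use the default time.sleep applies.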
Review and validate
- Check the extracted data or confirmation
- Verify the task completed as expected
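One way to check extracted data is to confirm that every record carries each requested field. The sketch below assumes the extraction result arrives as a list of dictionaries keyed by the output_fields names. That shape is an assumption, and missing_fields is a hypothetical helper.

```python
from typing import Any, Dict, List


def missing_fields(
    records: List[Dict[str, Any]],
    output_fields: List[str],
) -> Dict[int, List[str]]:
    """Map each record index to the requested fields that are absent or empty."""
    problems: Dict[int, List[str]] = {}
    for index, record in enumerate(records):
        missing = [name for name in output_fields if record.get(name) in (None, "")]
        if missing:
            problems[index] = missing
    return problems


# Illustrative data only; real records come from the browsing task result.
sample = [
    {"name": "Widget", "price": "9.99", "url": "https://example.com/widget"},
    {"name": "Gadget", "price": ""},
]
print(missing_fields(sample, ["name", "price", "url"]))
```

An empty result means every record has a value for each field. Any entries point at the records worth re-checking or re-running.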
Task Writing Tips
- Be specific about UI elements: "Click the blue 'Submit' button"
- Reference visible text when possible
- For forms, specify which fields get which values
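For form-filling tasks, the tips above can be applied mechanically by turning a field-to-value mapping into numbered instructions. The helper form_task below is hypothetical, and the field labels in the example are placeholders for whatever text is visible on the page.

```python
from typing import Dict


def form_task(fields: Dict[str, str], submit_label: str) -> str:
    """Build numbered instructions that name each visible field and its value."""
    steps = [
        f"Type '{value}' into the '{label}' field."
        for label, value in fields.items()
    ]
    steps.append(f"Click the '{submit_label}' button.")
    return "\n".join(f"{number}. {step}" for number, step in enumerate(steps, start=1))


print(form_task({"First name": "Ada", "Email": "ada@example.com"}, "Submit"))
```

The resulting string can be used as the task text, with each step naming one visible element.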
$ARGUMENTS