ethpandaops Query Guide

Query Ethereum network data through the ethpandaops tools. Execute Python code in sandboxed containers with access to ClickHouse blockchain data, Prometheus metrics, Loki logs, and Dora explorer APIs.

Workflow

Discover - Find available datasources and schemas
Find patterns - Search for query examples and runbooks
Execute - Run Python using the ethpandaops library

Access Methods

This skill works with either the CLI (panda ) or the MCP server. Use whichever is available.

CLI (panda binary)

Discovery

panda datasources # List all datasources panda datasources --type clickhouse # Filter by type panda schema # List ClickHouse tables panda schema beacon_api_eth_v1_events_block # Show table schema panda docs # List Python API modules panda docs clickhouse # Show module docs

Search

panda search examples "block arrival time" panda search examples "attestation" --category attestations --limit 5 panda search runbooks "finality delay" panda search runbooks "validator" --tag performance

Execute

panda execute --code 'from ethpandaops import clickhouse; print(clickhouse.list_datasources())' panda execute --file script.py panda execute --code '...' --session <id> # Reuse session echo 'print("hello")' | panda execute

Sessions

panda session list panda session create panda session destroy <session-id>

All commands support --json for structured output.

MCP Server (when available as plugin)

Resource Description

datasources://list

All configured datasources

datasources://clickhouse

ClickHouse clusters

datasources://prometheus

Prometheus instances

datasources://loki

Loki instances

networks://active

Active Ethereum networks

clickhouse://tables

Available tables

clickhouse://tables/{table}

Table schema details

python://ethpandaops

Python library API docs

search_examples(query="block arrival time") search_runbooks(query="network not finalizing") execute_python(code="...") manage_session(operation="list")

The ethpandaops Python Library

ClickHouse - Blockchain Data

from ethpandaops import clickhouse

List available clusters

clusters = clickhouse.list_datasources()

Returns: [{"name": "xatu", "database": "default"}, {"name": "xatu-cbt", ...}]

Query data (returns pandas DataFrame)

df = clickhouse.query("xatu-cbt", """ SELECT slot, avg(seen_slot_start_diff) as avg_arrival_ms FROM mainnet.fct_block_first_seen_by_node WHERE slot_start_date_time >= now() - INTERVAL 1 HOUR GROUP BY slot ORDER BY slot DESC """)

Parameterized queries

df = clickhouse.query("xatu", "SELECT * FROM blocks WHERE slot > {slot}", {"slot": 1000})

Cluster selection:

xatu-cbt
Pre-aggregated tables (faster, use for metrics)
xatu
Raw event data (use for detailed analysis)

Required filters:

ALWAYS filter on partition key: slot_start_date_time >= now() - INTERVAL X HOUR
Filter by network: meta_network_name = 'mainnet' or use schema like mainnet.table_name

Prometheus - Infrastructure Metrics

from ethpandaops import prometheus

List instances

instances = prometheus.list_datasources()

Instant query

result = prometheus.query("ethpandaops", "up")

Range query

result = prometheus.query_range( "ethpandaops", "rate(http_requests_total[5m])", start="now-1h", end="now", step="1m" )

Time formats: RFC3339 or relative (now , now-1h , now-30m )

Loki - Log Data

Always discover labels first. Before querying logs, fetch the available labels and their values so you can add the right filters. Unfiltered Loki queries are slow and may time out — label filters narrow the search at the storage level and are essential for efficient log retrieval.

from ethpandaops import loki

Step 1: List instances

instances = loki.list_datasources()

Step 2: Fetch all available labels

labels = loki.get_labels("ethpandaops") print(labels)

Example: ['app', 'cluster', 'ethereum_cl', 'ethereum_el', 'ethereum_network',

'instance', 'namespace', 'node', 'testnet', 'validator_client', ...]

Step 3: Get values for a specific label to build your filter

networks = loki.get_label_values("ethpandaops", "testnet") print(networks) # e.g. ['fusaka-devnet-3', 'hoodi', 'sepolia', ...]

cl_clients = loki.get_label_values("ethpandaops", "ethereum_cl") print(cl_clients) # e.g. ['lighthouse', 'prysm', 'teku', 'nimbus', 'lodestar', 'grandine']

Step 4: Query logs with label filters

logs = loki.query( "ethpandaops", '{testnet="hoodi", ethereum_cl="lighthouse"} |= "error"', start="now-1h", limit=100 )

Key labels for Ethereum log queries:

testnet — network/devnet name (e.g. hoodi , fusaka-devnet-3 )
ethereum_cl — consensus layer client (e.g. lighthouse , prysm , teku )
ethereum_el — execution layer client (e.g. geth , nethermind , besu )
ethereum_network — Ethereum network name
instance — specific node instance
validator_client — validator client name

Log level formats vary by client. When filtering logs by severity, be aware that Ethereum clients format log levels differently:

Keywords: CRIT , ERR , ERROR , WARN , INFO , DEBUG
Structured fields: level=error , "level":"error" , "severity":"ERROR"
Shorthand: E , W , C

Start with |~ "(?i)(CRIT|ERR)" as a default filter. If it returns no results, fetch a few unfiltered log lines to identify the client's format, then adapt the regex (e.g. |~ "level=(error|fatal)" ).

Dora - Beacon Chain Explorer

Discovering all Dora API endpoints:

Before using Dora, discover the full set of available API endpoints by fetching the Swagger documentation. The swagger page is always at <dora-url>/api/swagger/index.html .

First, get the Dora base URL for the network:

from ethpandaops import dora base_url = dora.get_base_url("mainnet") print(f"Swagger docs: {base_url}/api/swagger/index.html")

Then use WebFetch to read the swagger page at {base_url}/api/swagger/index.html to discover all supported API endpoints for that Dora instance. This is important because different Dora deployments may support different endpoints.

Use the discovered endpoints to make targeted API calls via the Python dora module or direct HTTP requests.

Common API usage:

from ethpandaops import dora

Get network health

overview = dora.get_network_overview("mainnet") print(f"Current epoch: {overview['current_epoch']}") print(f"Active validators: {overview['active_validator_count']}")

Check finality

epochs_behind = overview['current_epoch'] - overview.get('finalized_epoch', 0) if epochs_behind > 2: print(f"Warning: {epochs_behind} epochs behind finality")

Generate explorer links

link = dora.link_validator("mainnet", "12345") link = dora.link_slot("mainnet", "9000000") link = dora.link_epoch("mainnet", 280000)

Direct HTTP calls for endpoints not in the Python module:

from ethpandaops import dora import httpx

base_url = dora.get_base_url("mainnet")

Call any endpoint discovered from swagger

with httpx.Client(timeout=30) as client: resp = client.get(f"{base_url}/api/v1/<endpoint>") data = resp.json()

Storage - Upload Outputs

from ethpandaops import storage

Save visualization

import matplotlib.pyplot as plt plt.savefig("/workspace/chart.png")

Upload for public URL

url = storage.upload("/workspace/chart.png") print(f"Chart URL: {url}")

List uploaded files

files = storage.list_files()

Session Management

Critical: Each execution runs in a fresh Python process. Variables do NOT persist.

Files persist: Save to /workspace/ to share data between calls.

Reuse sessions: Pass --session <id> (CLI) or session_id (MCP) for faster startup and workspace persistence.

Multi-Step Analysis Pattern

Call 1: Query and save

from ethpandaops import clickhouse df = clickhouse.query("xatu-cbt", "SELECT ...") df.to_parquet("/workspace/data.parquet")

Call 2: Load and visualize (reuse session from Call 1)

import pandas as pd import matplotlib.pyplot as plt from ethpandaops import storage

df = pd.read_parquet("/workspace/data.parquet") plt.figure(figsize=(12, 6)) plt.plot(df["slot"], df["value"]) plt.savefig("/workspace/chart.png") url = storage.upload("/workspace/chart.png") print(f"Chart: {url}")

Error Handling

ClickHouse errors include actionable suggestions:

Missing date filter → "Add slot_start_date_time >= now() - INTERVAL X HOUR "
Wrong cluster → "Use xatu-cbt for aggregated metrics"
Query timeout → Break into smaller time windows

Default execution timeout is 60s, max 600s. For large analyses:

Search for optimized patterns first (panda search examples "..." )
Break work into smaller time windows
Save intermediate results to /workspace/

Notes

Always filter ClickHouse queries on partition keys (slot_start_date_time )
Use xatu-cbt for pre-aggregated metrics, xatu for raw event data
Use panda docs or python://ethpandaops resource for complete API documentation
Search for examples before writing complex queries from scratch
Search for runbooks to find common investigation workflows
Upload visualizations with storage.upload() for shareable URLs
NEVER just copy/paste/recite base64 of images. You MUST save the image to the workspace and upload it to give it back to the user.

query

Safety Notice

Copy this and send it to your AI assistant to learn