Exploring Data

Workflow

Check if installed (instant)

bash /mnt/skills/user/exploring-data/scripts/check_install.sh

Returns: installed or not_installed

Install if needed (one-time, ~19s)

if [ "$(bash /mnt/skills/user/exploring-data/scripts/check_install.sh)" = "not_installed" ]; then
  bash /mnt/skills/user/exploring-data/scripts/install_ydata.sh
fi
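The same check-then-install step can be sketched in Python for callers that drive the skill from a script. This is a minimal sketch, not part of the skill itself: the function name `ensure_installed` and the injectable `run` parameter are hypothetical, and it assumes check_install.sh prints exactly `installed` or `not_installed` on stdout, as described above.

```python
import subprocess

# Script directory taken from the commands in this document.
SCRIPTS = "/mnt/skills/user/exploring-data/scripts"


def ensure_installed(run=subprocess.run):
    """Run the installer only when check_install.sh reports not_installed.

    `run` is injectable (hypothetical convenience) so the logic can be
    exercised without the real scripts. Returns True if the installer ran.
    """
    check = run(
        ["bash", f"{SCRIPTS}/check_install.sh"],
        capture_output=True,
        text=True,
        check=True,
    )
    if check.stdout.strip() == "not_installed":
        run(["bash", f"{SCRIPTS}/install_ydata.sh"], check=True)
        return True
    return False
```

Calling `ensure_installed()` before the first analysis keeps the one-time install cost out of later runs.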

Run analysis (generates HTML + JSON by default)

bash /mnt/skills/user/exploring-data/scripts/analyze.sh <filepath> [minimal|full] [html|json]

Defaults: minimal + html (also generates JSON)

Output:

eda_report.html: Interactive report for the user
eda_report.json: Machine-readable report for Claude analysis
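Because the mode and format arguments are positional, a format can only be passed when a mode is also given. The sketch below builds the argument list for analyze.sh and rejects values outside the documented choices. The helper name `analyze_command` is hypothetical, and it assumes that passing the defaults explicitly behaves the same as omitting them.

```python
# Script directory taken from the commands in this document.
SCRIPTS = "/mnt/skills/user/exploring-data/scripts"

# Choices documented in the usage line: [minimal|full] [html|json]
MODES = ("minimal", "full")
FORMATS = ("html", "json")


def analyze_command(filepath, mode="minimal", fmt="html"):
    """Return the argv list for analyze.sh, validating mode and format."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}, got {mode!r}")
    if fmt not in FORMATS:
        raise ValueError(f"format must be one of {FORMATS}, got {fmt!r}")
    # Always pass both positional arguments so the format is never
    # mistaken for the mode.
    return ["bash", f"{SCRIPTS}/analyze.sh", filepath, mode, fmt]
```

The returned list can be handed to `subprocess.run(..., check=True)` as-is.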

If Claude needs to analyze (user asks "what do you think?" etc.)

python /mnt/skills/user/exploring-data/scripts/summarize_insights.py /mnt/user-data/outputs/eda_report.json

Reads: eda_report.json (comprehensive ydata output)

Writes: eda_insights_summary.md (condensed for Claude). When this repo's artifact conventions are in use (per /docs/layout), write instead to {REPORTS_DIR}/report-<endeavor>-eda-insights-<timeframe>.md (e.g. report-repo-eda-insights-2026-02-06.md).

Outputs to stdout: Formatted markdown summary

Claude should read the stdout markdown summary, NOT the full JSON report.
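A minimal sketch of the summarize step follows: one helper captures the summarizer's stdout, and another builds the alternate report filename from the pattern above. The function names, the injectable `run` parameter, and the `reports_dir` argument (standing in for {REPORTS_DIR}) are hypothetical; how the summarizer itself chooses between the two output paths is not specified here.

```python
import subprocess
from pathlib import Path

# Script path taken from the command in this document.
SUMMARIZER = "/mnt/skills/user/exploring-data/scripts/summarize_insights.py"


def insights_path(reports_dir, endeavor, timeframe):
    """Build <reports_dir>/report-<endeavor>-eda-insights-<timeframe>.md."""
    return Path(reports_dir) / f"report-{endeavor}-eda-insights-{timeframe}.md"


def summarize(report_json, run=subprocess.run):
    """Run the summarizer and return its stdout (the markdown summary).

    Only stdout is returned; the full JSON report is never loaded here.
    """
    result = run(
        ["python", SUMMARIZER, report_json],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For example, `insights_path("reports", "repo", "2026-02-06")` yields a path whose file name matches the example given above.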

Invocation Examples

Standard workflow (user views HTML)

bash analyze.sh /mnt/user-data/uploads/data.csv

Produces: eda_report.html + eda_report.json

Link user to: computer:///mnt/user-data/outputs/eda_report.html

User asks Claude to analyze

bash analyze.sh /mnt/user-data/uploads/data.csv
python summarize_insights.py /mnt/user-data/outputs/eda_report.json

Claude reads the stdout markdown summary

Claude can then provide analysis based on patterns/insights

Full mode for comprehensive analysis

bash analyze.sh /mnt/user-data/uploads/data.csv full

JSON-only output (skip HTML generation)

bash analyze.sh /mnt/user-data/uploads/data.csv minimal json

Modes

Minimal (default, 5-10s): Dataset overview, variable analysis, correlations, missing values, alerts

Full (10-20s): Everything in minimal + scatter matrices, sample data, character analysis, more visualizations

User Triggers for Full Mode

"comprehensive analysis", "detailed EDA", "full profiling", "deep analysis"

Otherwise use minimal.
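The trigger rule can be sketched as a simple phrase lookup. This is an illustration only: the function name `choose_mode` is hypothetical, and case-insensitive substring matching is an assumption, since the text above lists the phrases without saying how they are matched.

```python
# Trigger phrases copied from the list above.
FULL_MODE_TRIGGERS = (
    "comprehensive analysis",
    "detailed EDA",
    "full profiling",
    "deep analysis",
)


def choose_mode(user_message):
    """Return "full" if the message contains a trigger phrase, else "minimal"."""
    text = user_message.lower()
    if any(trigger.lower() in text for trigger in FULL_MODE_TRIGGERS):
        return "full"
    return "minimal"
```

The result can be passed directly as the mode argument of analyze.sh.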
