Figshare Data Download
Use this skill to fetch Figshare dataset files behind anti-bot checks.
Workflow
- Normalize inputs:
item_or_file_url: DOI URL, item page URL, orndownloaderURLoutput_path: final local pathexpected_name: expected browser filename (optional)
- Resolve the file download link in browser:
- Open the DOI/item page with browser tooling.
- Find and click the dataset "Download" / "Download file" element.
- If already given an
ndownloaderURL, open it directly in browser.
-
Complete any browser challenge/verification and trigger the real file download.
-
Copy the downloaded file from
~/Downloadstooutput_path. -
Verify output:
- Ensure file size is non-zero.
- Ensure extension/type matches expectation (
.xlsx,.csv, etc.).
Decision Rules
- Do not use
web_fetchfor this skill. - Do not run curl probing as part of this skill workflow.
- Always use browser path first and keep it end-to-end.
Output Contract
- Always produce one final file at
output_path. - Keep the downloaded source file in
~/Downloadsunchanged. - Report:
- final path
- file size
- that the transfer used browser-only workflow
Script
scripts/figshare_data_download.py(browser-only helper)- supports semi-automatic flow:
--open-browser+ wait/poll in~/Downloads+ copy/verify - does not perform curl/web_fetch probing
- supports semi-automatic flow: