data-collector

A utility skill to fetch data from remote sources (URLs) and store it locally with rich metadata. It is designed to be the "Extract" part of an ETL pipeline, focusing on reliable data retrieval and change tracking.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "data-collector" with this command: npx skills add famaoai-creator/gemini-skills/famaoai-creator-gemini-skills-data-collector

Data Collector

A utility skill to fetch data from remote sources (URLs) and store it locally with rich metadata. It is designed to be the "Extract" part of an ETL pipeline, focusing on reliable data retrieval and change tracking.

Features

  • Fetch: Downloads content from HTTP/HTTPS URLs.

  • Traceability: Saves a manifest.json tracking source URL, fetch time, and content headers.

  • Incremental: Skips download if the remote content hasn't changed (based on Content-Length or ETag/Hash comparison if implemented).

Usage

node data-collector/scripts/collect.cjs --url <URL> --out <output_dir> [options]

Options

  • --url : The source URL to fetch.

  • --out : Target directory to save the file.

  • --name : (Optional) Specific filename to save as. If omitted, derived from URL or Content-Disposition.

  • --force : Force download even if content hasn't changed.

Example

Fetch a user list and save to ./raw_data

node data-collector/scripts/collect.cjs --url "https://jsonplaceholder.typicode.com/users" --out "./raw_data" --name "users.json"

Knowledge Protocol

  • This skill adheres to the knowledge/orchestration/knowledge-protocol.md . It automatically integrates Public, Confidential (Company/Client), and Personal knowledge tiers, prioritizing the most specific secrets while ensuring no leaks to public outputs.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

data-transformer

No summary provided by upstream source.

Repository SourceNeeds Review
General

api-fetcher

No summary provided by upstream source.

Repository SourceNeeds Review
General

local-reviewer

No summary provided by upstream source.

Repository SourceNeeds Review
General

pmo-governance-lead

No summary provided by upstream source.

Repository SourceNeeds Review