runpod-serverless

RunPod Serverless Creator

This skill helps you create and configure serverless endpoints on RunPod.io. It handles both the Template creation (software config) and the Endpoint creation (hardware config).

How to Use

The primary tool is the scripts/create_serverless.py script. It is idempotent: if a template or endpoint with the same name exists, it will reuse the template and update the endpoint.

Prerequisites

API Key: You need a RunPod API Key.
Ask the user to provide it or check if RUNPOD_API_KEY is in the environment.
Security: Do not hardcode the key in the script. Pass it via environment variable RUNPOD_API_KEY .

Command

python runpod-serverless/scripts/create_serverless.py
--name <NAME>
--runtime <python|node>
--gpu <GPU_TYPE>
[--disk <GB>]
[--min-workers <INT>]
[--max-workers <INT>]

Arguments

--name (Required): Unique name for the template and endpoint.
--runtime (Required): python or node . Maps to standard RunPod serverless base images.
--gpu (Required): GPU type alias (e.g., 3090 , 4090 , a100 , a6000 ) or specific ID (e.g., AMPERE_24 ).
--disk : Container disk size in GB (default: 10).
--min-workers : Minimum active workers (default: 0 for cold-start serverless).
--max-workers : Maximum active workers (default: 1).

GPU Options (Aliases)

3090 -> AMPERE_24 (24GB VRAM)
4090 -> ADA_24 (24GB VRAM)
a4000 -> AMPERE_16 (16GB VRAM)
a6000 -> AMPERE_48 (48GB VRAM)
a100 -> AMPERE_80 (80GB VRAM)

See references/runpod_api.md for full API details and ID mappings.

Example Workflow

Ask User for:

Application Name
Runtime (Python/Node)
Preferred GPU
API Key (if not known)

Run Script:

$env:RUNPOD_API_KEY="<USER_KEY>"; python runpod-serverless/scripts/create_serverless.py --name my-ai-api --runtime python --gpu 3090

Verify: The script outputs the Template ID and Endpoint ID upon success.

runpod-serverless

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

xlsx

pdf

frontend-design

skill-creator