infrastructure

Infrastructure Management

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "infrastructure" with this command: npx skills add fred-drake/nix/fred-drake-nix-infrastructure

Infrastructure Management

Quick Reference

Deploy with Colmena

Single host

colmena apply --on <hostname> --impure

Multiple hosts

colmena apply --on host1,host2,host3 --impure

Build only (no deploy)

colmena build --on <hostname> --impure

Proxmox Container Management

SSH to Proxmox host first, then use pct :

List containers on a host

ssh <proxmox-host> "pct list"

Container status

ssh <proxmox-host> "pct status <vmid>" ssh <proxmox-host> "pct status <vmid> --verbose"

Start/stop/reboot

ssh <proxmox-host> "pct start <vmid>" ssh <proxmox-host> "pct stop <vmid>" ssh <proxmox-host> "pct reboot <vmid>"

Execute command in container

ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/<command>"

Common commands via pct exec

ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/systemctl status <service>" ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/journalctl -u <service> -n 50"

Server Inventory

Proxmox Hosts

Host Description

thrall Proxmox cluster node

sylvanas Proxmox cluster node

voljin Proxmox cluster node

Proxmox LXC Containers

All other hosts are LXC containers. Use pct list on Proxmox hosts to see VMIDs.

Common hosts: gitea-runner-1/2/3, prometheus, grafana, uptime-kuma, sonarqube, jellyseerr, prowlarr, n8n, minio, scanner, external-metrics, ironforge (gitea, woodpecker, paperless, calibre, nixarr, resume)

NixOS Workstation Services

  • fredpc : glance dashboard (native NixOS module, port 8084)

Troubleshooting Workflows

Container Won't Respond

  • Check status: ssh <proxmox-host> "pct status <vmid> --verbose"

  • If running but commands fail: ssh <proxmox-host> "pct reboot <vmid>"

  • Wait 15-30 seconds, verify: ssh <proxmox-host> "pct status <vmid>"

  • Re-deploy if needed: colmena apply --on <hostname> --impure

Service Not Working

  • Check service status: ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/systemctl status <service>"

  • Check logs: ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/journalctl -u <service> -n 100"

  • Restart service: ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/systemctl restart <service>"

Podman/Container Issues

Check socket status:

ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/systemctl status podman.socket"

List running containers:

ssh <proxmox-host> "pct exec <vmid> -- /run/current-system/sw/bin/podman ps -a"

SSH Connection Issues

If colmena fails with SSH errors:

  • Verify container is running on Proxmox

  • Check if SSH is listening: pct exec <vmid> -- /run/current-system/sw/bin/ss -tlnp | grep 22

  • Reboot container if necessary

Common Colmena Patterns

Deploy All Gitea Runners

colmena apply --on gitea-runner-1,gitea-runner-2,gitea-runner-3 --impure

Deploy Monitoring Stack

colmena apply --on prometheus,grafana --impure

Update Secrets Before Deploy

just update-secrets colmena apply --on <hostname> --impure

File Locations

Purpose Path

Colmena host configs colmena/hosts/<hostname>.nix

NixOS host configs modules/nixos/host/<hostname>/configuration.nix

Application configs apps/<appname>.nix

Secrets configs modules/secrets/<hostname>.nix

Container image SHAs apps/fetcher/containers-sha.nix

Container definitions apps/fetcher/containers.toml

Related Skills

  • provision-nixos-server: Create new servers from scratch

  • For creating new hosts, use /provision-nixos-server skill instead

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

General

container-update-report

No summary provided by upstream source.

Repository SourceNeeds Review
General

provision-nixos-server

No summary provided by upstream source.

Repository SourceNeeds Review
General

Layered Memory

基于 L0/L1/L2 三层结构的分层记忆管理系统,大幅减少 Token 消耗。L0 节省 99% Token(摘要层),L1 节省 88%(概览层),L2 为完整内容。支持自动生成分层文件、智能按需加载、防重复写入。适用于需要高效管理大型记忆文件的场景。

Registry SourceRecently Updated
900Profile unavailable
General

OpenClaw AWS Deploy

Deploy OpenClaw securely on AWS with a single command. Creates VPC, EC2 (ARM64), Telegram channel, and configurable AI model (Bedrock, Gemini, or any provide...

Registry SourceRecently Updated
4100Profile unavailable