ai-inference-service-mesh

AI Inference Service Mesh

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "ai-inference-service-mesh" with this command: npx skills add bagelhole/devops-security-agent-skills/bagelhole-devops-security-agent-skills-ai-inference-service-mesh

AI Inference Service Mesh

Apply Istio/Linkerd mesh controls to secure and optimize east-west AI traffic across inference microservices.

Why Mesh for AI

  • Enforce mTLS between gateway, retriever, reranker, and model services

  • Apply fine-grained traffic policies without app code changes

  • Run progressive delivery for model-serving backends

  • Observe latency hops for retrieval + generation chains

Core Patterns

Security

  • mTLS strict mode cluster-wide

  • AuthorizationPolicy per service account

  • Egress policies for approved model endpoints only

Traffic Management

  • Canary by header or percentage for new model versions

  • Retry budgets tuned for long-running streaming requests

  • Circuit breakers to protect overloaded inference backends

Resilience

  • Outlier detection on failing pods

  • Locality-aware routing in multi-zone clusters

  • Failover to secondary cluster/provider

Observability

  • Capture distributed traces across the full AI request path

  • Emit service-level and route-level p95/p99 latency

  • Segment metrics by model and tenant labels

Pitfalls to Avoid

  • Aggressive timeouts that break streaming responses

  • Blanket retries that amplify expensive generation calls

  • Missing identity boundaries between tenant-facing and internal services

Related Skills

  • service-mesh - Foundational mesh concepts

  • llm-gateway - North-south API gateway controls

  • opentelemetry - End-to-end tracing and metrics

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Security

linux-administration

No summary provided by upstream source.

Repository SourceNeeds Review
Security

linux-hardening

No summary provided by upstream source.

Repository SourceNeeds Review
Security

sops-encryption

No summary provided by upstream source.

Repository SourceNeeds Review
Security

vpn-setup

No summary provided by upstream source.

Repository SourceNeeds Review