wiki-onboarding

Wiki Onboarding Guide Generator

Generate four audience-tailored onboarding documents in an onboarding/ folder, each giving a different stakeholder exactly the understanding they need.

Source Repository Resolution (MUST DO FIRST)

Before generating any guides, you MUST determine the source repository context:

Check for git remote: Run git remote get-url origin to detect if a remote exists
Ask the user: "Is this a local-only repository, or do you have a source repository URL (e.g., GitHub, Azure DevOps)?"
Remote URL provided → store as REPO_URL , use linked citations: file:line
Local-only → use local citations: (file_path:line_number)
Determine default branch: Run git rev-parse --abbrev-ref HEAD
Do NOT proceed until source repo context is resolved

When to Activate

User asks for onboarding docs or getting-started guides
User runs /deep-wiki:onboard command
User wants to help new team members understand a codebase

Output Structure

Generate an onboarding/ folder with these files:

onboarding/ ├── index.md # Onboarding hub — links to all 4 guides with audience descriptions ├── contributor-guide.md # For new contributors (assumes Python or JS background) ├── staff-engineer-guide.md # For staff/principal engineers ├── executive-guide.md # For VP/director-level engineering leaders └── product-manager-guide.md # For product managers and non-engineering stakeholders

index.md — Onboarding Hub

A landing page with:

One-paragraph project summary
Guide selector table:

Guide Audience What You'll Learn Time

Contributor Guide New contributors with Python/JS experience Setup, first PR, codebase patterns ~30 min

Staff Engineer Guide Staff/principal engineers Architecture, design decisions, system boundaries ~45 min

Executive Guide VP/directors of engineering Capabilities, risks, team topology, investment thesis ~20 min

Product Manager Guide Product managers Features, user journeys, constraints, data model ~20 min

Language Detection

Scan the repository for build files to determine the primary language for code examples:

package.json / tsconfig.json → TypeScript/JavaScript
*.csproj / *.sln → C# / .NET
Cargo.toml → Rust
pyproject.toml / setup.py / requirements.txt → Python
go.mod → Go
pom.xml / build.gradle → Java

Guide 1: Contributor Guide

File: onboarding/contributor-guide.md

Audience: Engineers joining the project. Assumes proficiency in Python or JavaScript and general software engineering experience. Length: 1000–2500 lines. Progressive — each section builds on the last.

Required Sections

Part I: Foundations (skip if repo uses Python or JS)

{Primary Language} for Python/JS Engineers — Syntax comparison tables, async model, collections, type system, package management. Concrete code side-by-side, NOT abstract descriptions.
{Primary Framework} Essentials — Compare to equivalent Python/JS frameworks (e.g., FastAPI, Express). Request pipeline, routing, DI, config.

Part II: This Codebase 3. What This Project Does — 2-3 sentence elevator pitch 4. Project Structure — Annotated directory tree (what lives where and why). Include graph TB architecture overview. 5. Core Concepts — Domain-specific terminology explained with code examples. Use erDiagram for data model. 6. Request Lifecycle — sequenceDiagram (with autonumber ) tracing a typical request end-to-end. 7. Key Patterns — "If you want to add X, follow this pattern" templates with real code

Part III: Getting Productive 8. Prerequisites & Setup — Table: Tool, Version, Install Command. Step-by-step with expected output at each step. 9. Your First Task — End-to-end walkthrough of adding a simple feature 10. Development Workflow — Branch strategy, commit conventions, PR process. Use flowchart diagram. 11. Running Tests — All tests, single file, single test, coverage commands 12. Debugging Guide — Common issues table: Symptom, Cause, Fix 13. Common Pitfalls — Mistakes every new contributor makes and how to avoid them

Appendices

Glossary (40+ terms)
Key File Reference — Table: Path, Purpose, Why It Matters, Source
Quick Reference Card — Cheat sheet of most-used commands and patterns

Rules

All code examples in the detected primary language
Every command must be copy-pasteable with expected output
Minimum 5 Mermaid diagrams (architecture, ER, sequence, flowchart, state)
Use Mermaid for workflow diagrams (dark-mode colors) — add  comment block after each
Ground all claims in actual code — cite using linked format

Guide 2: Staff Engineer Guide

File: onboarding/staff-engineer-guide.md

Audience: Staff/principal engineers who need the "why" behind every decision. Deep systems experience, may not know this repo's language. Length: 800–1200 lines. Dense, opinionated, architectural.

Required Sections

Executive Summary — What the system is in one dense paragraph. What it owns vs delegates.
The Core Architectural Insight — The SINGLE most important concept. Include pseudocode in a DIFFERENT language from the repo.
System Architecture — Full Mermaid graph TB diagram. Call out the "heart" of the system.
Domain Model — Mermaid erDiagram of core entities. Data invariants table: Entity, Invariant, Enforced By, Source.
Key Abstractions & Interfaces — classDiagram showing load-bearing abstractions.
Request Lifecycle — sequenceDiagram (with autonumber ) showing typical request from entry to response.
State Transitions — stateDiagram-v2 for entities with meaningful lifecycle states.
Decision Log — Table: Decision, Alternatives Considered, Rationale, Source.
Dependency Rationale — Table: Dependency, Purpose, What It Replaced, Source.
Data Flow & State — How data moves through the system. Storage comparison table.
Failure Modes & Error Handling — flowchart for error propagation paths.
Performance Characteristics — Bottlenecks, scaling limits, hot paths.
Security Model — Auth, authorization, trust boundaries, data sensitivity.
Testing Strategy — What's tested, what isn't, testing philosophy.
Known Technical Debt — Table: Issue, Risk Level, Affected Files, Source.
Where to Go Deep — Recommended reading order of source files, links to wiki sections.

Rules

Use pseudocode in a different language to explain concepts
Use comparison tables to map unfamiliar concepts (e.g., Task<T> = Awaitable[T] )
Dense prose with tables, NOT shallow bullet lists
Every claim backed by linked citation
Minimum 5 Mermaid diagrams (architecture, ER, class, sequence, state, flowchart)
Each diagram followed by  comment block
Use tables aggressively — decisions, dependencies, debt should ALL be tables with Source columns
Focus on WHY decisions were made, not just WHAT exists

Guide 3: Executive Guide

File: onboarding/executive-guide.md

Audience: VP/director of engineering. Needs capability overview, risk assessment, and investment context — NOT code-level details. Length: 400–800 lines. Strategic, concise, decision-oriented.

Required Sections

System Overview — What it does, who uses it, business value in 2-3 sentences
Capability Map — Table: Capability, Status (Built/Partial/Planned), Maturity, Dependencies. What the system can and cannot do today.
Architecture at a Glance — High-level Mermaid graph LR diagram. Services, data stores, external integrations — NO internal code details. Focus on deployment units and team boundaries.
Team Topology — Which team/person owns which components. Table: Component, Owner, Criticality, Bus Factor.
Technology Investment Thesis — Why these technologies were chosen. Table: Technology, Purpose, Alternatives Considered, Risk Level.
Risk Assessment — Table: Risk, Likelihood, Impact, Mitigation, Owner. Cover reliability, security, scalability, compliance.
Cost & Scaling Model — How costs scale with usage. What the bottlenecks are. When the next scaling investment is needed.
Dependency Map — graph TB showing critical external dependencies. Table: Dependency, Type (Service/Library/Platform), Risk if Unavailable.
Key Metrics & Observability — What's measured, what dashboards exist, alerting coverage. Table: Metric, Current Value, Target, Source.
Roadmap Alignment — Engineering workstreams mapped to business priorities. What's in progress, what's planned, what's blocked.
Technical Debt Summary — Top 5 debt items with business impact. Table: Issue, Business Impact, Effort to Fix, Priority.
Recommendations — 3-5 actionable recommendations for the next quarter, prioritized by impact.

Rules

NO code snippets — this guide is for engineering leaders, not coders
Diagrams at service/team level, not class/function level
Every claim backed by evidence — cite wiki sections, architecture docs, or source files
Minimum 3 Mermaid diagrams (architecture overview, dependency map, capability/roadmap)
Tables for every structured finding — this audience reads tables, not prose
Business language — translate technical concepts into impact (reliability, velocity, cost, risk)

Guide 4: Product Manager Guide

File: onboarding/product-manager-guide.md

Audience: Product managers and non-engineering stakeholders. Needs to understand what the system does, what's possible, and where the boundaries are — NOT how it's built. Length: 400–800 lines. User-centric, feature-focused, constraint-aware.

Required Sections

What This System Does — 2-3 sentence elevator pitch in user-facing language (no jargon)
User Journey Map — Mermaid graph LR or journey diagram showing primary user flows through the system
Feature Capability Map — Table: Feature, Status (Live/Beta/Planned/Not Possible), User-Facing Behavior, Limitations. Comprehensive map of what's built and what's not.
Data Model (Product View) — Simplified Mermaid erDiagram showing entities users interact with. Explain in business terms (e.g., "A Project has many Documents" not "FK relationship").
Configuration & Feature Flags — Table: Flag/Config, What It Controls, Default, Who Can Change It. What can be toggled without engineering work.
API Capabilities — What integrations are possible. Table: Capability, Endpoint/Method, Authentication, Rate Limits. Written for integration partners, not developers.
Performance & SLAs — Response times, throughput limits, availability targets. Table: Operation, Expected Latency, Throughput Limit, Current SLA.
Known Limitations & Constraints — Honest list of what the system can't do or does poorly. Table: Limitation, User Impact, Workaround, Planned Fix.
Data & Privacy — What data is collected, where it's stored, retention policies, compliance status. Table: Data Type, Storage Location, Retention, Compliance.
Glossary — Domain terms explained in plain language (not engineering jargon)
FAQ — 10+ common questions a PM would ask, answered concisely

Rules

ZERO engineering jargon — no "middleware", "dependency injection", "ORM". Use plain language.
User-centric framing — describe everything in terms of what users experience, not how code works
Minimum 3 Mermaid diagrams (user journey, data model, feature map/capability overview)
Tables for every structured finding — PMs scan tables, not prose
If a technical concept must be mentioned, explain it in one sentence (e.g., "Feature flags — toggles that let us turn features on/off without deploying code")
Every claim grounded in evidence — cite wiki sections or source files for verification

Mermaid Diagram Rules (ALL guides)

ALL diagrams must use dark-mode colors:

Node fills: #2d333b , borders: #6d5dfc , text: #e6edf3
Subgraph backgrounds: #161b22 , borders: #30363d
Lines: #8b949e
If using inline style directives, use dark fills with ,color:#e6edf3
Do NOT use <br/> in Mermaid labels (use <br> or line breaks)

Validation

After generating each guide, verify:

All file paths mentioned actually exist in the repo
All class/method names are accurate (not hallucinated)
Mermaid diagrams render (no syntax errors)
No bare HTML-like tags (generics like List<T> ) outside code fences — wrap in backticks
Each guide is appropriate for its audience — no code in Executive/PM guides

wiki-onboarding

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

azure-observability

azure-appconfiguration-java

wiki-architect

azure-ai-formrecognizer-java