Engineering Manager Operating System

Your complete playbook for engineering leadership. Not generic management advice — this is the specific system that high-performing engineering managers run daily.

Phase 1: Team Architecture

Team Topology Assessment

Before managing people, understand the system they work in.

team_topology:
  name: "[Team Name]"
  type: stream-aligned | platform | enabling | complicated-subsystem
  mission: "[One sentence — what does this team exist to do?]"
  boundaries:
    owns: ["service-x", "domain-y", "pipeline-z"]
    consumes: ["auth-service", "data-platform"]
    provides: ["checkout-api", "payment-events"]
  cognitive_load: low | medium | high | overloaded
  interaction_modes:
    - team: "[Other Team]"
      mode: collaboration | x-as-a-service | facilitating
      friction: low | medium | high
      notes: "[What's working/not working]"
  current_headcount: N
  ideal_headcount: N
  skill_gaps: ["observability", "mobile", "ML"]

Team Health Radar (Monthly)

Score 1-5 for each dimension. Track trends over time.

Dimension	Score	Signal
Delivery pace	_ /5	Are we shipping what we committed?
Quality	_ /5	Bug rate, incident frequency, tech debt trajectory
Collaboration	_ /5	Cross-functional work, PR review speed, knowledge sharing
Morale	_ /5	Energy in meetings, voluntary contributions, retention signals
Learning	_ /5	New skills adopted, conference talks, internal tech talks
Autonomy	_ /5	Can the team make decisions without waiting for me?
Psychological safety	_ /5	Do people raise concerns, admit mistakes, challenge ideas?
On-call health	_ /5	Page frequency, off-hours burden, burnout signals

Action rules:

Any dimension ≤2 → Address THIS WEEK (it's a fire)
Any dimension at 3 → Create improvement plan within 2 weeks
Overall average <3.5 → Team is struggling, block new commitments until fixed
Track quarter-over-quarter — sustained decline in any dimension = systemic issue

Team Composition Model

The ideal team has these roles covered (not necessarily 1:1 with people):

Role	Description	Gap Impact
Tech lead	Architecture decisions, code quality bar	Decisions bottleneck through you
Senior IC (2-3)	Carry complex work, mentor juniors	Velocity drops, quality suffers
Mid-level (2-3)	Reliable delivery, growing scope	No bench for senior pipeline
Junior (0-2)	Learning, fresh perspective	No talent pipeline
Domain expert	Deep knowledge of the problem space	Constantly solving wrong problems

Rule of thumb: Never have >60% of team at same level. Mix creates natural mentorship.

Phase 2: 1:1 System

1:1 Cadence

Report Level	Frequency	Duration	Focus
Direct reports	Weekly	30 min	Career + blockers + feedback
Skip-levels	Monthly	30 min	Team health + career + honesty check
Your manager	Weekly	30 min	Priorities + asks + air cover
Cross-functional peers	Bi-weekly	25 min	Dependencies + alignment

1:1 Template (Direct Reports)

one_on_one:
  date: "YYYY-MM-DD"
  person: "[Name]"
  role: "[Title]"
  tenure: "[X months on team]"
  
  # Their agenda first — ALWAYS
  their_topics: []
  
  # Check-in (2 min)
  energy_level: 1-10  # "How are you feeling about work this week?"
  energy_trend: up | stable | down
  
  # Delivery (5 min)
  current_work: "[What they're working on]"
  blockers: []
  help_needed: "[What can I unblock?]"
  
  # Growth (10 min — skip if urgent topics dominate, but never 3 weeks in a row)
  career_conversation: "[Topic discussed]"
  feedback_given: "[Specific behavior → impact → request]"
  feedback_received: "[What they told me]"
  stretch_opportunity: "[Current or upcoming]"
  
  # Action items
  my_actions: []  # What I committed to do
  their_actions: []  # What they committed to do
  
  # Signals (private — don't share these)
  flight_risk: low | medium | high
  performance_trajectory: improving | stable | declining
  notes: "[Anything notable]"

1:1 Question Bank

Opening (rotate these — never use the same opener 3 weeks in a row):

"What's on your mind?"
"What was the best/worst part of your week?"
"If you could change one thing about how we work, what would it be?"
"What's something you're proud of from this week that I might not know about?"
"On a scale of 1-10, how's your energy? What would move it up one point?"

Career development (monthly deep-dive):

"Where do you want to be in 2 years? What's the gap between here and there?"
"What skills are you not using that you'd like to use more?"
"Who in the org (or industry) has a role you'd want? What specifically about it?"
"What's the hardest technical problem you've solved recently? What did you learn?"
"If you left tomorrow, what would you regret not doing here?"

Team health (probe with care):

"Who on the team do you learn the most from? The least?"
"Is there anyone whose work you don't trust to review?"
"What's something the team avoids talking about?"
"If you were me, what would you change about how this team operates?"

Feedback solicitation (for YOU):

"What's one thing I could do differently that would help you most?"
"Am I giving you too much direction or too little?"
"Is there context I have that I'm not sharing that would help you?"
"When was the last time I frustrated you? What happened?"

Flight Risk Detection

Monitor these signals — if 3+ present, have a retention conversation within a week:

Signal	Weight	Detection
LinkedIn profile update	🔴 High	Someone mentions it, or you notice
Declining 1:1 engagement	🔴 High	Shorter answers, less eye contact, "everything's fine"
Stopped volunteering for projects	🟡 Medium	Used to raise hand, now doesn't
Increased PTO without travel	🟡 Medium	Interviewing signal
Disengaged in meetings	🟡 Medium	Camera off, multitasking, no opinions
Complaining shifted from specific to general	🟡 Medium	"This sprint is rough" → "This place..."
Stopped arguing for their ideas	🔴 High	They've mentally checked out
Life event (new baby, move, partner change)	🟡 Medium	Re-evaluating everything

Retention conversation framework:

Name it: "I've noticed [specific behavior change]. I want to check in."
Listen: Let them talk. Don't interrupt. Don't get defensive.
Understand: "What would make this the best job you've ever had?"
Act: Make a concrete commitment within 48 hours — title, comp, scope, flexibility
Follow up: Check back in 1 week. Did what you promised make a difference?

Phase 3: Performance Management

Performance Calibration Framework

Rate on two axes (both matter):

Delivery Impact (What)

Level	Description
1 - Below	Missing commitments, quality issues, needs close oversight
2 - Meeting	Delivering assigned work reliably
3 - Exceeding	Delivering beyond scope, finding better solutions
4 - Outstanding	Multiplying team output, solving problems no one asked them to

Behaviors (How)

Level	Description
1 - Below	Creating friction, not collaborating, ignoring feedback
2 - Meeting	Professional, collaborative, receptive to feedback
3 - Exceeding	Mentoring others, proactively improving processes
4 - Outstanding	Shaping culture, attracting talent, raising the entire bar

Calibration matrix:

	Behavior 1	Behavior 2	Behavior 3	Behavior 4
Delivery 4	Coach behaviors	Strong	Top performer	Superstar
Delivery 3	Coach behaviors	Solid	Strong	Top performer
Delivery 2	PIP candidate	Meets expectations	Developing	Growing
Delivery 1	Exit	PIP	Coach delivery	Coach delivery

Feedback Framework: SBI-I (Situation-Behavior-Impact-Intent)

Template: "In [situation], when you [specific behavior], the impact was [concrete effect]. I'd like to see [specific change] because [intent/why it matters]."

Examples:

✅ Good: "In yesterday's design review, when you challenged the API schema with the versioning concern, it caught a breaking change we would have shipped. That's exactly the kind of technical leadership I want to see more of."

❌ Bad: "You're doing great work. Keep it up." (Too vague — they learn nothing)

✅ Good: "In the last two sprints, PRs have been sitting in review for 3+ days. The impact is features are merging late and we're missing sprint commitments. I'd like us to commit to <24h first review because velocity depends on review speed."

❌ Bad: "You need to review PRs faster." (No situation, no impact, no collaboration)

Performance Improvement Plan (PIP) Template

pip:
  employee: "[Name]"
  role: "[Title]"
  manager: "[Your name]"
  start_date: "YYYY-MM-DD"
  end_date: "YYYY-MM-DD"  # 30-60 days, never >90
  
  context: |
    [Specific pattern of underperformance with dates and examples.
     Must reference prior feedback conversations and dates they occurred.]
  
  expectations:
    - area: "[Specific skill/behavior]"
      current_state: "[What's happening now — with examples]"
      target_state: "[What success looks like — measurable]"
      measurement: "[How we'll measure — PR metrics, sprint completion, etc.]"
      support: "[What I'll provide — pairing, training, reduced scope]"
  
  check_ins:
    frequency: weekly
    day: "[Day]"
    format: "[30 min 1:1 with written summary]"
  
  outcomes:
    success: "[What happens if targets met — return to normal performance management]"
    failure: "[What happens if targets not met — typically termination]"
  
  # CRITICAL: Have HR review before sharing. Document every check-in.
  hr_reviewed: false
  hr_reviewer: "[Name]"

PIP rules:

A PIP should never be a surprise — if it is, YOU failed at feedback
PIPs are for capability gaps, not attitude problems (attitude = manage out faster)
70% of PIPs end in termination — be honest with yourself about whether this is a development tool or a documentation exercise
Weekly check-ins are non-negotiable — document everything in writing
If performance improves during PIP then declines after: second PIP is rarely worth it

Promotion Case Template

promotion_case:
  candidate: "[Name]"
  current_level: "[Level]"
  target_level: "[Level]"
  manager: "[Your name]"
  date: "YYYY-MM-DD"
  
  # Already operating at next level (past 6+ months)
  evidence:
    - dimension: "Technical complexity"
      examples:
        - "[Specific project/decision with measurable impact]"
        - "[Another example]"
    - dimension: "Scope & ownership"
      examples:
        - "[Owned X end-to-end, previously needed guidance]"
    - dimension: "Influence & leadership"
      examples:
        - "[Mentored Y, led Z initiative, shaped team direction]"
    - dimension: "Business impact"
      examples:
        - "[Revenue/efficiency/reliability improvement with numbers]"
  
  peer_feedback:
    - from: "[Name, role]"
      quote: "[Specific praise with examples]"
  
  # Why now, not 6 months from now?
  timing_justification: |
    [They've been consistently operating at next level for X months.
     Delaying creates retention risk and sends wrong signal to team.]
  
  # What's the gap? (Be honest — calibration committees will find it)
  growth_areas: |
    [Areas they're still developing. Frame as "growing into" not "lacking."]

Phase 4: Hiring Machine

Hiring Pipeline

Role opened → Job description → Sourcing (5-7 days)
→ Resume screen → Recruiter screen (30 min)
→ Technical phone screen (60 min) → Take-home OR live coding (2-4 hrs)
→ Onsite/virtual loop (3-4 hrs) → Debrief → Offer → Close

Target: <21 days from first screen to offer

Job Description Template

# [Role Title] — [Team Name]

## What you'll do
[3-5 bullet points of ACTUAL work, not generic responsibilities]
- Ship [specific feature/system] that [specific impact]
- Own [specific domain] end-to-end
- [Concrete example of a recent problem this person would solve]

## What you'll need
[Must-haves only — each one must be a genuine filter]
- X years building [specific technology/domain]
- Experience with [specific technical requirement]
- [Skill that actually differentiates candidates]

## Nice to have (genuinely nice, not secretly required)
- [Thing that would accelerate ramp-up]
- [Adjacent skill that adds value]

## What we offer
[Be specific — "competitive salary" means nothing]
- Salary range: $X-$Y (based on [location/level])
- [Specific benefits that matter to engineers]
- [Team/culture thing that's actually true and differentiating]

## How we hire
[Timeline and what to expect — respect their time]
1. [Step]: [Duration] — [What we're assessing]
2. [Step]: [Duration] — [What we're assessing]
Total time investment: ~X hours

Interview Scorecard (Per Interviewer)

scorecard:
  candidate: "[Name]"
  interviewer: "[Name]"
  interview_type: "technical | system design | behavioral | culture"
  date: "YYYY-MM-DD"
  
  # Score each dimension 1-4 (no 3s allowed — forces a decision)
  dimensions:
    - name: "Technical depth"
      score: _  # 1=no hire, 2=lean no, 4=lean yes, 5=strong yes (skip 3)
      evidence: "[Specific examples from the interview]"
    - name: "Problem solving approach"
      score: _
      evidence: "[How they broke down the problem, handled hints]"
    - name: "Communication clarity"
      score: _
      evidence: "[Could they explain their thinking? Did they ask good questions?]"
    - name: "Collaboration signals"
      score: _
      evidence: "[How did they respond to pushback? Did they build on ideas?]"
  
  # Overall
  hire_recommendation: strong_no | no | yes | strong_yes
  level_recommendation: "[What level would you place them?]"
  concerns: "[Anything that gave you pause]"
  highlights: "[What impressed you most]"

Debrief Protocol

No pre-discussion — Submit scorecards BEFORE the debrief meeting
Hire bar holder speaks last — Prevent anchoring
Discuss each dimension, not overall vibes — "Tell me about their system design approach" not "What did you think?"
Any strong_no is a veto — Unless the interviewer can be convinced their signal was a misread
Decide in the room — Don't "sleep on it" unless genuinely torn (then it's probably a no)
Leveling before offer — Agree on level first, then comp follows from band

Closing Candidates

The 3 things that close engineers:

The problem — "Here's the specific hard problem you'd work on"
The people — Connect them with future teammates before offer
The growth — "Here's where this role leads in 18 months"

Offer call structure (15-20 min):

Express genuine excitement (2 min)
Present offer details — base, equity, bonus, start date (3 min)
Explain equity/comp philosophy (3 min)
Ask: "How does this compare to what you were expecting?" (listen)
Address concerns immediately if possible
Set a decision deadline (3-5 business days, not open-ended)
Ask: "Is there anything that would make this a clear yes?"

Phase 5: Technical Leadership

Architecture Decision Record (ADR)

adr:
  id: "ADR-NNN"
  title: "[Decision title]"
  date: "YYYY-MM-DD"
  status: proposed | accepted | deprecated | superseded
  superseded_by: "ADR-NNN"  # if applicable
  
  context: |
    [What situation are we in? What forces are at play?
     Include constraints: timeline, team skill, budget, scale requirements.]
  
  options:
    - name: "[Option A]"
      pros: ["pro 1", "pro 2"]
      cons: ["con 1", "con 2"]
      effort: "[T-shirt size]"
      risk: low | medium | high
    - name: "[Option B]"
      pros: ["pro 1"]
      cons: ["con 1", "con 2", "con 3"]
      effort: "[T-shirt size]"
      risk: low | medium | high
  
  decision: |
    [What we decided and WHY. The "why" is the most important part.
     Future readers need to understand the reasoning, not just the choice.]
  
  consequences: |
    [What follows from this decision? What becomes easier/harder?
     What do we need to monitor?]
  
  review_date: "YYYY-MM-DD"  # When to revisit this decision

Tech Debt Prioritization

Score each debt item on two axes:

Impact of fixing (1-5):

5: Unblocks multiple teams or critical features
4: Significant velocity improvement for our team
3: Moderate improvement, prevents future problems
2: Nice to have, minor improvement
1: Cosmetic or theoretical benefit

Cost of NOT fixing (1-5):

5: Will cause incidents or data loss
4: Blocking hiring/onboarding (can't explain the code)
3: Slowing every feature by >20%
2: Occasional friction, workarounds exist
1: Annoying but harmless

Priority = Impact × Cost-of-not-fixing

Score	Action
20-25	Fix THIS sprint — it's an emergency
12-19	Schedule within 2 sprints
6-11	Add to quarterly tech debt budget (allocate 15-20% of sprint capacity)
1-5	Backlog — revisit quarterly

Code Review Culture Guidelines

code_review_standards:
  sla:
    first_review: "< 4 hours during work hours"
    follow_up: "< 2 hours"
    max_pr_size: 400  # lines changed — larger needs pre-review or splitting
  
  what_to_review:
    always:
      - "Correctness — does it do what it claims?"
      - "Edge cases — what happens with nil/empty/max/concurrent?"
      - "Security — auth checks, input validation, secrets exposure"
      - "Naming — will someone understand this in 6 months?"
    sometimes:
      - "Performance — only if in hot path or O(n²)+ risk"
      - "Style — only if it significantly hurts readability"
    never:
      - "Personal preference disguised as improvement"
      - "Premature optimization suggestions"
      - "Rewriting working code to your style"
  
  tone_rules:
    - "Ask questions instead of making demands: 'What happens if X is nil?' not 'Handle the nil case'"
    - "Prefix opinion with 'nit:' or 'optional:' — make severity clear"
    - "Praise good code — 'Nice abstraction here' costs nothing"
    - "If >5 comments, offer to pair instead"
    - "Approve with comments when nothing is blocking — trust your team"

Phase 6: Sprint & Delivery

Sprint Ceremony Cheat Sheet

Ceremony	Duration	Who	Purpose	Your Role
Sprint planning	1-2 hrs	Team + PO	Commit to sprint goal	Facilitate, challenge estimates, protect capacity
Daily standup	15 min	Team	Surface blockers	Listen for problems, DON'T manage tasks
Backlog refinement	1 hr	Team + PO	Prepare future work	Ensure technical feasibility, flag risks
Sprint review	30 min	Team + stakeholders	Demo working software	Let the team present, handle stakeholder Qs
Retrospective	1 hr	Team only	Improve process	Facilitate, ensure psychological safety, track actions

Sprint Health Metrics

Track these weekly — trend matters more than absolute numbers:

Metric	Healthy Range	Red Flag
Sprint completion rate	80-100% of committed points	<70% for 2+ sprints
Carry-over stories	0-1 per sprint	Same story carried 3+ sprints
PR cycle time	<48 hours open to merge	>72 hours consistently
Bug escape rate	<10% of stories create bugs	Rising trend
Deployment frequency	Daily to weekly	Monthly or less
Sprint goal achievement	Yes/No binary	No for 3+ consecutive sprints

Estimation Heuristic

When the team struggles with estimation:

Certainty Level	Approach
"We've done this exact thing before"	Size by comparison to past work
"We understand the problem but not the solution"	Spike first (timeboxed), then estimate
"We don't fully understand the problem"	Discovery task (1-2 days), then re-scope
"We have no idea"	Break it down until you reach pieces you can estimate

Rule: If an estimate is >8 points (or >5 days), it's not estimated — it's a guess. Break it down further.

Phase 7: Incident Management

Incident Response Framework

incident:
  id: "INC-YYYY-NNN"
  severity: SEV1 | SEV2 | SEV3 | SEV4
  detected: "YYYY-MM-DD HH:MM UTC"
  resolved: "YYYY-MM-DD HH:MM UTC"
  duration: "Xh Ym"
  commander: "[Name]"
  
  # Severity guide
  # SEV1: Revenue impact, data loss, full outage — ALL HANDS, exec notification
  # SEV2: Degraded service, partial outage — On-call + team lead
  # SEV3: Minor degradation, workaround exists — On-call handles
  # SEV4: Cosmetic, no user impact — Normal ticket
  
  timeline:
    - time: "HH:MM"
      action: "[What happened / what was done]"
      who: "[Name]"
  
  root_cause: |
    [Technical root cause — be specific. 
     "Human error" is never the root cause. What system allowed the error?]
  
  contributing_factors:
    - "[Factor 1 — e.g., missing monitoring on X]"
    - "[Factor 2 — e.g., deployment during peak without feature flag]"
  
  action_items:
    - description: "[Specific fix]"
      owner: "[Name]"
      due_date: "YYYY-MM-DD"
      priority: P0 | P1 | P2
      status: open | in_progress | done

Blameless Post-Mortem Template

Facilitation rules:

Focus on systems, not individuals
"What" and "how," never "who"
Everyone involved attends (including on-call who was paged)
Schedule within 48 hours of resolution (memories fade)
Write it up and share publicly within the engineering org

Structure (60-90 min):

Timeline review (20 min) — Walk through chronologically. Fill gaps.
Root cause analysis (15 min) — "5 Whys" until you hit a systemic issue
What went well (10 min) — Reinforce good incident response behaviors
What went wrong (15 min) — Process failures, detection gaps, communication issues
Action items (15 min) — Each must have an owner and due date. Max 5 items — focus beats volume.

On-Call Health Guidelines

Metric	Healthy	Unhealthy
Pages per week	<5	>10
Off-hours pages	<2/week	>5/week
Time to acknowledge	<5 min	>15 min
False positive rate	<20%	>50%
Rotation size	4+ people	<3 people
Consecutive weeks on-call	Never >2	Regular 3+ week stretches

If on-call is unhealthy: This is a tech debt problem, not a people problem. Invest in reliability before adding headcount.

Phase 8: Scaling & Org Design

When to Split a Team

Signal	Action
Team >8 people	Split before communication overhead kills velocity
Two distinct domains in one team	Split along domain boundaries
Standup takes >15 min	Too many threads — people are tuning out
PR review queue >48 hours consistently	Not enough context overlap — specialize
On-call covers too many services	Reduce blast radius per team

Splitting Protocol

Define boundaries clearly — What does each new team OWN? Write it down.
Split the backlog — Every ticket gets a home. Shared backlogs = shared ownership = no ownership.
Split on-call — Each team owns their services' reliability.
Name the teams — Sounds trivial, matters for identity.
Designate tech leads — Don't leave both teams looking to you for technical decisions.
Give it 3 months — Resist re-orging again too quickly. Turbulence is normal.

Manager-to-IC Ratio

Team Size	Structure
3-5 ICs	Player-coach (you're still coding ~30-40%)
5-8 ICs	Full-time manager (stop coding in critical path)
8-12 ICs	Split the team OR add a tech lead as force multiplier
12+ ICs	Must split — you cannot manage this effectively

The IC-to-Manager Transition

If you're newly managing (or coaching someone through it):

Stop doing:

Writing code in the critical path (you're now the bottleneck)
Solving every technical problem yourself
Being the best engineer on the team (your job changed)

Start doing:

Asking "who should own this?" instead of doing it yourself
Measuring success by team output, not your output
Having uncomfortable conversations early (feedback, performance, conflict)
Blocking time for thinking, not just meetings

Keep doing:

Staying technical enough to evaluate decisions (read code, review designs)
Coding on side projects, tools, or prototypes (stay sharp)
Having strong technical opinions (but hold them loosely)

Timeline to competence:

Month 1-3: Imposter syndrome, everything feels slow. Normal.
Month 3-6: Finding your rhythm, some wins, some failures. Normal.
Month 6-12: Confident in the role, building systems. Target.
Month 12+: Multiplying impact. If you're not here by month 18, honest conversation needed.

Phase 9: Communication & Stakeholder Management

Weekly Status Update Template

Send this to your manager and stakeholders every Friday:

# [Team Name] — Week of [Date]

## 🎯 Sprint Goal: [Goal] — On Track / At Risk / Off Track

## ✅ Shipped This Week
- [Feature/fix] — [Impact in user/business terms]
- [Feature/fix] — [Impact]

## 🔨 In Progress
- [Work item] — [Status, ETA, any blockers]

## 🚨 Risks & Blockers
- [Risk] — [What you're doing about it, what you need]

## 📊 Key Metrics
- Deploy frequency: X
- Incident count: X (SEV breakdown)
- Sprint completion: X%

## 🔮 Next Week
- [Priority 1]
- [Priority 2]

Managing Up Checklist

Do	Don't
Bring solutions with problems	Dump problems without proposals
Flag risks early with mitigation plans	Surprise with bad news at the last minute
Quantify impact (hours, $$, users)	Use vague language ("it's kinda slow")
Say "I need X from you by Y"	Hope they'll figure out you need help
Send written updates proactively	Wait to be asked for status
Disagree in private	Disagree in public meetings
Ask for feedback regularly	Assume no news is good news

Cross-Functional Relationship Map

stakeholders:
  - name: "[Product Manager]"
    relationship: partner
    cadence: "Daily async + weekly 1:1"
    currency: "Scope clarity, user data, priority decisions"
    
  - name: "[Design Lead]"
    relationship: partner
    cadence: "Bi-weekly sync + ad-hoc"
    currency: "Early technical feasibility input"
    
  - name: "[Platform/Infra Team]"
    relationship: dependency
    cadence: "Monthly sync + Slack"
    currency: "Clear requirements, advance notice of needs"
    
  - name: "[Your Manager]"
    relationship: air_cover
    cadence: "Weekly 1:1"
    currency: "No surprises, clear asks, good judgment"

Phase 10: Engineering Manager Rituals

Daily (15 min total)

Scan Slack/email for blockers — unblock before standup
Attend standup — listen for patterns, not task updates
Check PR queue — nudge any >24h reviews
One piece of feedback (positive or constructive) to someone

Weekly

All 1:1s completed (never cancel — reschedule if needed)
Sprint metrics reviewed
Status update sent to stakeholders
Calendar audit — am I in meetings I shouldn't be in?
One skip-level or cross-functional conversation

Monthly

Team health radar updated
Career development conversation with each report
Tech debt review and prioritization
On-call health review
Update team topology doc

Quarterly

Performance calibration (formal or informal)
Team goals review and reset
Architecture review — any ADRs need revisiting?
Headcount planning — what do we need in 6 months?
Retrospective on YOUR performance — ask your team for feedback

Phase 11: Difficult Situations Playbook

Scenario: Two Senior Engineers Disagree on Architecture

Let them present both approaches in a design doc (each writes their own section)
Define decision criteria BEFORE evaluating: reversibility, maintenance cost, team familiarity, timeline
Facilitate a time-boxed discussion (60 min max)
If no consensus: the tech lead or DRI decides. Not you (unless you must).
Document the decision as an ADR — the "why" matters more than the "what"
The person who "lost" must commit fully. Monitor for passive resistance.

Scenario: High Performer Wants to Be a Manager

Explore motivation: "Tell me what you think a manager does day-to-day"
Test with real work: lead a project, mentor a junior, run a retrospective
Be honest about tradeoffs: less coding, more meetings, slower feedback loops, ambiguous success metrics
Offer the Staff/Principal IC path as a genuine alternative, not a consolation prize
If they proceed: set explicit check-in at 3 months — "Is this what you wanted?"

Scenario: You Inherit a Low-Performing Team

Week 1-2: Listen. 1:1 with every person. Don't change anything yet.
Week 3-4: Identify the 1-2 systemic issues (usually: unclear priorities, no accountability, or trust deficit)
Month 2: Make ONE process change. Get a quick win. Build credibility.
Month 3: Address performance issues you've now observed firsthand
Never: Blame the previous manager publicly. Never say "things are going to change around here."

Scenario: Layoffs / Reorg Affecting Your Team

Before announcement: Prepare a plan for remaining team — who covers what?
During: Be honest about what you know and what you don't. "I don't know" > corporate-speak.
After: 1:1 with every remaining person within 48 hours. Expect anger, fear, guilt.
Ongoing: Workload audit — don't expect same output from fewer people. Push back on scope.
Self-care: This is one of the hardest parts of the job. Talk to your own manager or a coach.

Scenario: Your Best Engineer Gives Notice

Same day: Have a real conversation. Not a counteroffer — understand why.
If it's about money: Match or beat if they're worth it. If your company won't, that tells you something.
If it's about growth/role: Can you create what they want? Be honest if you can't.
If they're leaving for the right reasons: Celebrate them. Write a recommendation. Don't make it weird.
Immediately: Start knowledge transfer plan. Identify what only they know.
To the team: Transparent but positive. "X is leaving for a great opportunity. Here's our transition plan."

Scoring Rubric: Engineering Manager Effectiveness (0-100)

Dimension	Weight	Indicators
Team health	20%	Retention, engagement scores, psychological safety signals
Delivery	20%	Sprint completion, quality metrics, stakeholder satisfaction
People development	20%	Promotions, skill growth, 1:1 quality, mentorship
Technical stewardship	15%	Tech debt trajectory, architecture quality, incident trends
Hiring	10%	Pipeline health, offer acceptance rate, new hire ramp time
Communication	10%	Stakeholder relationships, information flow, no surprises
Self-improvement	5%	Seeking feedback, adapting, growing as a leader

Scoring:

90-100: Exceptional — team thriving, people growing, shipping reliably
75-89: Strong — most things working, some areas to develop
60-74: Developing — foundational skills present, needs coaching
40-59: Struggling — significant gaps, at risk of losing team trust
<40: Intervention needed — coaching, role change, or transition

Natural Language Commands

"Prepare 1:1 with [name]" → Generate agenda from recent context
"Write performance review for [name]" → Calibrate and draft using framework
"Create job description for [role]" → Generate using template
"Run team health check" → Walk through radar dimensions
"Draft ADR for [decision]" → Structure architecture decision
"Incident post-mortem for [incident]" → Generate post-mortem template
"Sprint health report" → Analyze metrics and flag issues
"Promotion case for [name]" → Build evidence-based promotion doc
"Evaluate tech debt [item]" → Score using prioritization matrix
"Flight risk assessment" → Review signals for each team member
"Stakeholder update" → Generate weekly status from context
"Interview scorecard for [candidate]" → Create structured evaluation