Database Migration
Safe patterns for evolving database schemas in production.
Migration Principles
-
Backward compatible - New code works with old schema
-
Reversible - Can rollback if needed
-
Tested - Verify on staging before production
-
Incremental - Small changes, not big-bang
-
Zero downtime - No service interruption
Safe Migration Pattern
Phase 1: Add New (Compatible)
-- Add new column (nullable initially) ALTER TABLE users ADD COLUMN full_name VARCHAR(255) NULL;
-- Deploy new code that writes to both old and new UPDATE users SET full_name = CONCAT(first_name, ' ', last_name);
Phase 2: Migrate Data
-- Backfill existing data UPDATE users SET full_name = CONCAT(first_name, ' ', last_name) WHERE full_name IS NULL;
Phase 3: Make Required
-- Make column required ALTER TABLE users ALTER COLUMN full_name SET NOT NULL;
Phase 4: Remove Old (After New Code Deployed)
-- Remove old columns ALTER TABLE users DROP COLUMN first_name; ALTER TABLE users DROP COLUMN last_name;
Common Migrations
Adding Index
-- Create index concurrently (PostgreSQL) CREATE INDEX CONCURRENTLY idx_users_email ON users(email);
Renaming Column
-- Phase 1: Add new column ALTER TABLE users ADD COLUMN email_address VARCHAR(255);
-- Phase 2: Copy data UPDATE users SET email_address = email;
-- Phase 3: Drop old column (after deploy) ALTER TABLE users DROP COLUMN email;
Changing Column Type
-- Phase 1: Add new column with new type ALTER TABLE products ADD COLUMN price_cents INTEGER;
-- Phase 2: Migrate data UPDATE products SET price_cents = CAST(price * 100 AS INTEGER);
-- Phase 3: Drop old column ALTER TABLE products DROP COLUMN price; ALTER TABLE products RENAME COLUMN price_cents TO price;
Adding Foreign Key
-- Add column first ALTER TABLE orders ADD COLUMN user_id INTEGER NULL;
-- Populate data UPDATE orders SET user_id = ( SELECT id FROM users WHERE users.email = orders.user_email );
-- Add foreign key ALTER TABLE orders ADD CONSTRAINT fk_orders_users FOREIGN KEY (user_id) REFERENCES users(id);
Migration Tools
Python (Alembic)
Generate migration
alembic revision --autogenerate -m "add user full_name"
Apply migration
alembic upgrade head
Rollback
alembic downgrade -1
JavaScript (Knex)
// Create migration knex migrate:make add_full_name
// Apply migrations knex migrate:latest
// Rollback knex migrate:rollback
Rails
Generate migration
rails generate migration AddFullNameToUsers full_name:string
Run migrations
rails db:migrate
Rollback
rails db:rollback
Testing Migrations
def test_migration_forward_backward(): # Apply migration apply_migration("add_full_name")
# Verify schema
assert column_exists("users", "full_name")
# Rollback
rollback_migration()
# Verify rollback
assert not column_exists("users", "full_name")
Dangerous Operations
❌ Avoid in Production
-- Locks table for long time ALTER TABLE users ADD COLUMN email VARCHAR(255) NOT NULL;
-- Can't rollback DROP TABLE old_users;
-- Breaks existing code immediately ALTER TABLE users DROP COLUMN email;
✅ Safe Alternatives
-- Add as nullable first ALTER TABLE users ADD COLUMN email VARCHAR(255) NULL;
-- Rename instead of drop ALTER TABLE old_users RENAME TO archived_users;
-- Keep old column until new code deployed -- (multi-phase approach)
Rollback Strategy
-- Every migration needs DOWN -- UP ALTER TABLE users ADD COLUMN full_name VARCHAR(255);
-- DOWN ALTER TABLE users DROP COLUMN full_name;
Decision Support
Quick Decision Guide
Making a schema change?
-
Breaking change (drops/modifies data) → Multi-phase migration (expand-contract)
-
Additive change (new columns/tables) → Single-phase migration
-
Large table (millions of rows) → Use CONCURRENTLY for indexes
Need zero downtime?
-
Schema change → Expand-contract pattern (5 phases)
-
Data migration (< 10k rows) → Synchronous in-migration
-
Data migration (> 1M rows) → Background worker pattern
Planning rollback?
-
Added new schema only → Simple DOWN migration
-
Modified/removed schema → Multi-phase rollback or fix forward
-
Cannot lose data → Point-in-time recovery (PITR)
Choosing migration tool?
-
Python/Django → Django Migrations
-
Python/SQLAlchemy → Alembic
-
Node.js/TypeScript → Prisma Migrate or Knex.js
-
Enterprise/multi-language → Flyway or Liquibase
→ See references/decision-trees.md for comprehensive decision frameworks
Troubleshooting
Common Issues Quick Reference
Migration failed halfway → Check database state, fix forward with repair migration
Schema drift detected → Use autogenerate to create reconciliation migration
Cannot rollback (no downgrade) → Create reverse migration or fix forward
Foreign key violation → Clean data before adding constraint, or add as NOT VALID
Migration locks table too long → Use CONCURRENTLY, add columns in phases, batch updates
Circular dependency → Create merge migration or reorder dependencies
→ See references/troubleshooting.md for detailed solutions with examples
Navigation
Detailed References
🌳 Decision Trees - Schema migration strategies, zero-downtime patterns, rollback strategies, migration tool selection, and data migration approaches. Load when planning migrations or choosing strategies.
🔧 Troubleshooting - Failed migration recovery, schema drift detection, migration conflicts, rollback failures, data integrity issues, and performance problems. Load when debugging migration issues.
Remember
-
Test migrations on copy of production data
-
Have rollback plan ready
-
Monitor during deployment
-
Communicate with team about schema changes
-
Keep migrations small and focused