Code Quality Standards

Non-negotiable code quality standards. These are not preferences — they are requirements.

Testing Standards

The Testing Pyramid

Layer What it Tests Speed Purpose

Unit Tests Individual functions/components Fast TDD lives here. Catches logic errors early.

Integration Tests Components working together Medium Catches connection and data flow issues.

E2E Tests Full user flows Slowest Confirms the system does the thing.

Human Review Visual correctness, UX Manual Irreducible quality judgment.

Test-Driven Development (TDD)

TDD is mandatory at the unit test level:

Tests are written BEFORE implementation — Never implement without a failing test first
Red -> Green -> Refactor is mandatory — No exceptions
Tests define behavior — Implementation serves tests
Small incremental steps — Tiny, safe changes over large speculative edits
Tests are the source of truth — If it's not tested, it doesn't work

When in doubt: Slow down, write the test, make the smallest possible change.

What Makes a Good Test

Structure every test as Arrange-Act-Assert:

def test_apply_discount_reduces_total(): # Arrange — set up the scenario cart = Cart(items=[Item(price=100)]) discount = Discount(percent=20)

# Act — perform the action under test
cart.apply_discount(discount)

# Assert — verify the outcome
assert cart.total == 80

One concept per test. If a test name has "and" in it, split it into two tests.

Name tests to describe behavior, not implementation:

Bad Good

test_calculate

test_calculate_total_sums_item_prices

test_error

test_negative_quantity_raises_validation_error

test_user_service

test_deactivated_user_cannot_place_order

Tests must be:

Isolated — No test depends on another test's state or execution order
Deterministic — Same input, same result. No randomness, no clock dependency, no network calls.
Fast — Unit tests run in milliseconds. If they're slow, they're not unit tests.
Readable — A failing test name should tell you what broke without reading the test body

Unit Tests

Foundation of testing
Run in milliseconds
Test one function/component in isolation
Mock external dependencies, not internal logic

Integration Tests

Verify modules work together
Use test databases or containers, not mocks
Reset state between tests

E2E Tests

Critical user paths only
Keep the suite small and focused
Accept some flakiness, build in retries

Human Review

Does it work correctly?
Does it look right?
Does it feel good?
Is it accessible?

Code Structure

Early Returns Over Nesting

Guard clauses first. Flatten control flow.

Bad — nested, hard to follow:

def process(order): if order: if order.items: if order.is_valid: return calculate_total(order) return None

Good — flat, clear:

def process(order): if not order: return None if not order.items: return None if not order.is_valid: return None

return calculate_total(order)

Max nesting depth: 3 levels. If deeper, extract to a function.

Function Size

A function should do one thing. If you need a comment to separate "sections" inside a function, those sections should be separate functions.

Guidelines:

If a function exceeds ~30 lines, look for extraction opportunities
If a function takes more than 3-4 parameters, it's probably doing too much
If you can't name the function clearly, it has too many responsibilities

Single Responsibility

Every function, class, and module should have one reason to change.

Smell: "This function handles validation AND formatting AND saving." Fix: Three functions — validate , format , save .

Explicit Over Clever

Readability beats brevity. Separate operations into clear steps.

Bad — clever but hard to debug:

names = [u.name for u in users if u.is_active and u.role in allowed]

Good — clear intent, debuggable:

active_users = filter_active(users) authorized_users = filter_by_role(active_users, allowed) names = extract_names(authorized_users)

When a one-liner requires mental parsing, break it apart. Optimize for the reader, not the writer.

Error Handling

Fail Fast

Validate inputs at the boundary. Don't let bad data travel deep into the system.

def create_user(email, name): if not email: raise ValidationError("Email is required") if not is_valid_email(email): raise ValidationError("Invalid email format")

return save_user(email, name)

Specific Errors Over Generic

Catch what you expect. Re-raise what you don't. Never write except Exception — identify the actual failure mode first.

Match the exception type to the operation:

Operation Catch Why

File open/read/write OSError

Covers FileNotFoundError, PermissionError, IsADirectoryError

File read + parse content (OSError, UnicodeDecodeError)

File may exist but contain invalid encoding

JSON/YAML parsing (json.JSONDecodeError, ValueError)

Malformed content

String → number conversion ValueError

Invalid format

Dict/list access (KeyError, IndexError)

Missing key or out-of-range index

Network requests (ConnectionError, TimeoutError)

Network-specific failures

Subprocess execution (subprocess.SubprocessError, OSError)

Process launch or execution failure

Regex operations re.error

Invalid pattern

Bad — swallows everything:

try: do_risky_thing() except Exception: pass

Good — handles what it understands:

try: do_risky_thing() except NetworkError: return retry() except ValidationError as e: return error_response(e.message)

Unexpected errors propagate up

When broad catch IS acceptable: Only at top-level application boundaries (CLI main() , API request handlers) where the alternative is an unhandled crash. Even then, log the full exception before continuing.

Never Swallow Errors

If you catch an error, you must either:

Handle it — take a meaningful recovery action
Log and re-raise it — make the failure visible
Transform it — wrap in a more specific error for the caller

Empty catch / except blocks are bugs.

Naming Conventions

Names must clearly communicate:

Who is acting — The subject performing the action
What action is occurring — The verb describing the behavior
Direction of data or ownership flow — Where things are going to/from

Directional Clarity

Use prepositions (to , from , into , onto ) or named parameters.

Bad — Ambiguous:

shop.buy_item(item_id, buyer) # Who is buying? transfer(amount, account) # Transfer to or from?

Good — Clear:

shop.sell_item_to(item_id, buyer) # Shop sells TO buyer shop.sell(item_id, to=buyer) # Named parameter clarifies transfer_from(account, amount) # Direction explicit account.transfer_to(other, amount) # Direction in method name

The Read-Aloud Test

If a method call doesn't read naturally when spoken aloud, the name is wrong.

"shop buy item buyer" — confusing

shop.buy_item(item_id, buyer)

"shop sell item to buyer" — clear

shop.sell_item_to(item_id, buyer)

Boolean Naming

Always prefix booleans with is , has , should , can , will , or did :

Bad — ambiguous (is it a noun? a verb? a state?)

active = True permission = True refresh = True

Good — clearly a yes/no question

is_active = True has_permission = True should_refresh = True

Naming Patterns

Pattern Use When Example

verb_noun_to(target)

Action flows to target send_message_to(user)

verb_noun_from(source)

Action flows from source receive_payment_from(customer)

noun.verb_to(target)

Object performs action toward target cart.transfer_to(order)

verb(noun, to=target)

Named parameter clarifies assign(task, to=developer)

Never Abbreviate

Write the full word. Every time. The only acceptable abbreviations are universally understood technical terms: id , url , api , db , io .

No single-character variables. Not even loop counters. i and j hide what you're iterating over:

Bad — what is i? what is j?

for i in range(len(rows)): for j in range(len(columns)): grid[i][j] = calculate(i, j)

Good — names describe the iteration

for row_index in range(len(rows)): for column_index in range(len(columns)): grid[row_index][column_index] = calculate(row_index, column_index)

Common violations — these appear constantly and must always be expanded:

Write This Not This

dependency

dep

index / position

idx

source

src

destination

dst

description

desc

threshold

thresh

config / configuration

cfg

message

msg

request

req

response

res

context

ctx

error

err

value

val

count

cnt

button

btn

user

usr

callback

function

manager

mgr

service

svc

repository

repo

implementation

impl

password

pwd

temporary

tmp

number

num

Full list: references/naming-reference.md

Avoid

Don't Instead

Single-character names (i , x , e ) Descriptive name (row_index , coordinate , error )

Generic names (data , list , temp ) Specific noun (user_data , order_list )

Negated booleans (is_not_disabled ) Positive form (is_enabled )

Constants & Clarity

No Magic Values

Every number and string literal should have a name. Apply the extraction test before writing any literal:

The Extraction Test: If a literal isn't 0 , 1 , -1 , True , False , None , or "" — it needs a named constant.

This includes:

Thresholds and limits — MAX_RETRIES = 3 , HIGH_COUPLING_THRESHOLD = 10
Sizes and measurements — MIN_FONT_SIZE_PX = 12 , MASK_VISIBLE_CHARACTERS = 4
String patterns — DEFAULT_ENCODING = "utf-8" , CSV_DELIMITER = ","
Configuration values — TOP_RESULTS_DISPLAY_LIMIT = 20 , SCAN_DEPTH = 3

Name the constant by what it means, not what it is. THREE = 3 is pointless. MAX_RETRIES = 3 communicates intent.

Bad:

if retry_count > 3: sleep(60) if len(results) > 20: results = results[:20]

Good:

MAX_RETRIES = 3 RETRY_DELAY_SECONDS = 60 TOP_RESULTS_DISPLAY_LIMIT = 20

if retry_count > MAX_RETRIES: sleep(RETRY_DELAY_SECONDS) if len(results) > TOP_RESULTS_DISPLAY_LIMIT: results = results[:TOP_RESULTS_DISPLAY_LIMIT]

Place constants at the top of the module, grouped by purpose, before any function definitions.

Boolean Parameters

Boolean arguments hide meaning at the call site.

Bad — what does True mean?

create_user(data, True, False)

Good — named parameters or options:

create_user(data, send_welcome=True, require_verification=False)

If the language doesn't support named parameters, use an options object/struct.

Documentation

Docstrings

Docstrings are living documentation. Public APIs must be self-explanatory without reading implementation.

Required Elements

Every public function, method, and class must include:

Purpose — What it does (one line)
Parameters — Each parameter with type and meaning
Returns — What is returned and when
Side effects — Any state changes, I/O, or mutations
Errors — What exceptions/errors can occur
Examples — Realistic usage showing common cases

Example Docstring

def sell_item_to(self, item_id: str, buyer: Customer) -> Receipt: """Sell an item from shop inventory to a customer.

Transfers ownership of the item from the shop to the buyer,
processes payment, and updates inventory.

Args:
    item_id: Unique identifier of the item to sell.
    buyer: Customer purchasing the item. Must have sufficient balance.

Returns:
    Receipt containing transaction details and timestamp.

Raises:
    ItemNotFoundError: If item_id doesn't exist in inventory.
    InsufficientBalanceError: If buyer can't afford the item.
    ItemAlreadySoldError: If item was sold between check and purchase.

Examples:
    Basic sale:
    >>> shop = Shop(inventory=[item])
    >>> buyer = Customer(balance=100)
    >>> receipt = shop.sell_item_to(item.id, buyer)
    >>> assert receipt.amount == item.price
    >>> assert item.id not in shop.inventory

    Handling insufficient balance:
    >>> poor_buyer = Customer(balance=0)
    >>> shop.sell_item_to(item.id, poor_buyer)
    Raises InsufficientBalanceError
"""

Docstring Rules

Examples should mirror actual test scenarios
Update docstrings when behavior changes
Treat docstrings as first-class code, not decoration

Comments

Before writing any comment, apply the Delete Test: mentally delete the comment. Is anything lost? If the code already communicates the same information through naming and structure, don't write the comment.

Do Comment Don't Comment

Why — intent, business reason, non-obvious context What — the code already says this

Non-obvious gotchas or edge cases Obvious operations

Complex algorithm summaries Bad code to explain it (fix the code instead)

TODO with ticket/issue reference TODO without context

Regex pattern documentation (what the pattern matches) Restating a function call (# Send the email )

Bad — restates the code:

Get the users

users = get_users()

Filter active users

active_users = filter_active(users)

Count the results

count = len(active_users)

Good — no comments needed (the code speaks for itself):

users = get_users() active_users = filter_active(users) count = len(active_users)

Good — explains why:

Offset by 1 because CSS cascade position is 1-indexed but array is 0-indexed

cascade_position = file_index + 1

If you need a comment to explain what code does, the code should be clearer. Rename variables, extract functions, simplify logic — then the comment becomes unnecessary.

Quick Reference

Tests written BEFORE implementation
Red -> Green -> Refactor followed
Each test has one concept, Arrange-Act-Assert structure
Tests are isolated, deterministic, fast
All tests pass, edge cases covered
Functions are short, single-responsibility
Max 3 levels of nesting, early returns used
Errors fail fast at boundaries with specific types
Exception types match the operation (no bare except Exception )
No empty catch/except blocks
Names pass the read-aloud test
No single-character variables — use descriptive names
No abbreviated names — write the full word (dependency not dep )
Directional clarity in method names (to/from)
Booleans prefixed with is/has/should/can
Every literal passes the extraction test — named constant if not 0/1/True/False/None/""
Constants at module top, grouped by purpose
Boolean parameters use named args or options
All public APIs have complete docstrings
Comments pass the delete test — only explain why, never what

Enforced Rules

These rules are deterministically checked by check.js (clean-team). When updating these standards, update the corresponding check.js rules to match — and vice versa.

Rule ID Severity What It Checks

no-debugger

error debugger statements left in code

no-var

error var declarations (use const /let )

no-empty-catch

error Empty catch blocks with no handling

no-console

warn console.log /warn/error statements

no-double-equals

warn

/!= instead of strict equality (allows == null )

References

references/testing-reference.md — Testing pyramid deep-dive, mocking guidelines, anti-patterns
references/naming-reference.md — Complete naming conventions, abbreviation rules, domain naming
references/error-handling-reference.md — Error hierarchies, retry/fallback patterns, error boundaries

Assets

assets/tdd-checklist.md — Step-by-step TDD workflow checklist
assets/docstring-templates.md — Copy-paste docstring templates (Python, JS/TS, C#, Rust, Go)
assets/code-review-checklist.md — Comprehensive code review checklist

Scripts

scripts/check_naming.py — Validate naming conventions across any codebase
scripts/check_complexity.py — Check function length, nesting depth, parameter count

code-quality

Safety Notice

Copy this and send it to your AI assistant to learn