Problem Decomposition & Scoping

Breaking down complex AI safety problems into tractable research questions

⏱️ Intermediate

Problem Decomposition & Scoping

Core Principles

1. Start with the End Goal

Before decomposing, clearly articulate:

What does success look like?
How will we know when we've made progress?
What are the observable outcomes?

2. The Decomposition Process

Level 1: Grand Challenge Example: "Ensure AI systems remain aligned with human values as they become more capable"

Level 2: Research Areas

Value learning and specification
Robustness to distribution shift
Interpretability of learned values
Corrigibility mechanisms

Level 3: Concrete Questions

"Can we detect when a model's behavior diverges from training?"
"How do reward models generalize to new domains?"
"What features correlate with deceptive behavior?"

Level 4: Tractable Experiments

"Build a toy environment where we can induce goal misgeneralization"
"Compare reward model outputs on held-out ethical dilemmas"
"Train probes to detect planning behavior in transformer layers"

Scoping Strategies

1. The Minimum Viable Experiment (MVE)

What's the smallest test that could invalidate your hypothesis?
Can you build a proof-of-concept in days, not months?
What assumptions can you temporarily accept to make progress?

2. Time-Boxing Research

2-day exploration: "Can this approach work at all?"
2-week prototype: "Does this show promise?"
2-month project: "Is this publication-worthy?"

3. The "Good Enough" Principle

Perfect is the enemy of done
Iterate rather than optimize
Publication ≠ Problem solved

Common Pitfalls

1. Scope Creep

Symptom: "But we also need to consider..." Solution: Write down out-of-scope items for future work

2. Analysis Paralysis

Symptom: Endless literature review without experimentation Solution: Set a deadline for moving to implementation

3. Premature Optimization

Symptom: Building elaborate infrastructure before validating the idea Solution: Start with hacky prototypes

Practical Techniques

1. The Research Question Generator

Turn vague concerns into specific questions:

"I'm worried about X" → "Under what conditions does X occur?"
"We need Y" → "What are the minimum requirements for Y?"
"Z is important" → "How do we measure Z?"

2. The Assumption Stack

List all assumptions your research makes, ordered by:

How critical they are to your conclusions
How likely they are to be wrong
How hard they are to test

3. Success Criteria Template

Before starting any project, define:

Minimum Success: What's the least that makes this worthwhile?
Expected Success: What do you realistically hope to achieve?
Exceptional Success: What would exceed expectations?

Case Study: From "AI Deception" to Research Project

Starting Point: "We need to solve the problem of AI systems being deceptive"

Decomposition Process:

Define Deception: Intentional vs. emergent, active vs. passive
Identify Subtypes: Lies, misdirection, concealment, paltering
Choose Focus: Concealment of capabilities in evaluation
Operationalize: Models performing worse on tests than their true ability
Design Experiment: Compare model performance with/without knowledge of evaluation
Build Prototype: Simple grid world where agents can hide their planning ability

Result: A 2-week project that provides empirical evidence and publishable insights

Tools and Frameworks

Research Canvas

Create a one-page overview:

Research Question (one sentence)
Why It Matters (three bullets)
Method (paragraph)
Success Metrics (three concrete measures)
Timeline (key milestones)
Dependencies (what you need)

The "Five Whys" for Research

Why is this problem important?
Why hasn't it been solved?
Why might your approach work?
Why should you be the one to do it?
Why now?

Action Items

This Week: Take one of your "big ideas" and decompose it into 5-10 specific research questions
This Month: Choose one question and design a minimum viable experiment
This Quarter: Complete one small-scale idea

Remember: The goal isn't to solve everything at once. It's to make consistent, measurable progress on hard problems.

Resources

@article@How to do Research At the MIT AI Lab (1988) - Classic advice that still holds
@video@Richard Hamming: You and Your Research - On choosing important, tractable problems
@article@An Opinionated Guide to ML Research - John Schulman on research strategy
@article@How to do good research, get it published in top venues, and have real-world impact - Practical academic advice

← Back to Module

Loading...

⚡Pre-rendered at build time (instant load)