Problem Decomposition & Scoping
Breaking down complex AI safety problems into tractable research questions
Problem Decomposition & Scoping
Core Principles
1. Start with the End Goal
Before decomposing, clearly articulate:
- What does success look like?
- How will we know when we've made progress?
- What are the observable outcomes?
2. The Decomposition Process
Level 1: Grand Challenge Example: "Ensure AI systems remain aligned with human values as they become more capable"
Level 2: Research Areas
- Value learning and specification
- Robustness to distribution shift
- Interpretability of learned values
- Corrigibility mechanisms
Level 3: Concrete Questions
- "Can we detect when a model's behavior diverges from training?"
- "How do reward models generalize to new domains?"
- "What features correlate with deceptive behavior?"
Level 4: Tractable Experiments
- "Build a toy environment where we can induce goal misgeneralization"
- "Compare reward model outputs on held-out ethical dilemmas"
- "Train probes to detect planning behavior in transformer layers"
Scoping Strategies
1. The Minimum Viable Experiment (MVE)
- What's the smallest test that could invalidate your hypothesis?
- Can you build a proof-of-concept in days, not months?
- What assumptions can you temporarily accept to make progress?
2. Time-Boxing Research
- 2-day exploration: "Can this approach work at all?"
- 2-week prototype: "Does this show promise?"
- 2-month project: "Is this publication-worthy?"
3. The "Good Enough" Principle
- Perfect is the enemy of done
- Iterate rather than optimize
- Publication ≠ Problem solved
Common Pitfalls
1. Scope Creep
Symptom: "But we also need to consider..." Solution: Write down out-of-scope items for future work
2. Analysis Paralysis
Symptom: Endless literature review without experimentation Solution: Set a deadline for moving to implementation
3. Premature Optimization
Symptom: Building elaborate infrastructure before validating the idea Solution: Start with hacky prototypes
Practical Techniques
1. The Research Question Generator
Turn vague concerns into specific questions:
- "I'm worried about X" → "Under what conditions does X occur?"
- "We need Y" → "What are the minimum requirements for Y?"
- "Z is important" → "How do we measure Z?"
2. The Assumption Stack
List all assumptions your research makes, ordered by:
- How critical they are to your conclusions
- How likely they are to be wrong
- How hard they are to test
3. Success Criteria Template
Before starting any project, define:
- Minimum Success: What's the least that makes this worthwhile?
- Expected Success: What do you realistically hope to achieve?
- Exceptional Success: What would exceed expectations?
Case Study: From "AI Deception" to Research Project
Starting Point: "We need to solve the problem of AI systems being deceptive"
Decomposition Process:
- Define Deception: Intentional vs. emergent, active vs. passive
- Identify Subtypes: Lies, misdirection, concealment, paltering
- Choose Focus: Concealment of capabilities in evaluation
- Operationalize: Models performing worse on tests than their true ability
- Design Experiment: Compare model performance with/without knowledge of evaluation
- Build Prototype: Simple grid world where agents can hide their planning ability
Result: A 2-week project that provides empirical evidence and publishable insights
Tools and Frameworks
Research Canvas
Create a one-page overview:
- Research Question (one sentence)
- Why It Matters (three bullets)
- Method (paragraph)
- Success Metrics (three concrete measures)
- Timeline (key milestones)
- Dependencies (what you need)
The "Five Whys" for Research
- Why is this problem important?
- Why hasn't it been solved?
- Why might your approach work?
- Why should you be the one to do it?
- Why now?
Action Items
- This Week: Take one of your "big ideas" and decompose it into 5-10 specific research questions
- This Month: Choose one question and design a minimum viable experiment
- This Quarter: Complete one small-scale idea
Remember: The goal isn't to solve everything at once. It's to make consistent, measurable progress on hard problems.
Resources
- @article@How to do Research At the MIT AI Lab (1988) - Classic advice that still holds
- @video@Richard Hamming: You and Your Research - On choosing important, tractable problems
- @article@An Opinionated Guide to ML Research - John Schulman on research strategy
- @article@How to do good research, get it published in top venues, and have real-world impact - Practical academic advice