πΊοΈHow to Create an AI Product Roadmap
AI roadmaps are different. The model layer changes monthly; the user expectations shift quarterly. Plan for compounding.
Traditional PM roadmaps lock in 6 months of work; AI roadmaps that try to do the same become stale by month 3. PMs who design AI roadmaps with compounding loops and explicit model-flexibility ship better products.
An AI roadmap has three layers: (1) Customer-job bets (what we'll enable for users), (2) Capability bets (what AI techniques we'll master), (3) Infrastructure bets (evals, agents, integrations). Plan the customer-job layer in quarters; the capability/infra layer in months. Always assume the model layer will get better than expected.
The three layers
Layer 1: Customer-job bets (quarterly). What user job will we enable AI on this quarter? Examples:
- Q1: AI-drafted follow-up emails in CRM
- Q2: AI call summarization
- Q3: AI deal risk prediction
These are product bets. They take a quarter each. Plan 3-4 ahead.
Layer 2: Capability bets (monthly). What AI techniques will we master this month? Examples:
- Month 1: build first RAG pipeline
- Month 2: add semantic search
- Month 3: build first agent
These are platform bets. They unlock multiple customer-job bets.
Layer 3: Infrastructure bets (always-on).
- Evals (every feature gets evals before launch)
- Cost monitoring (alerts on token usage)
- Model routing (pick the right model per task)
- Observability (trace every AI call)
The 'model gets better' factor
Frontier models (Claude, GPT, Gemini) improve materially every 3-6 months. Your roadmap should explicitly assume:
- Quality you can't achieve today will be possible in 6 months
- Cost-per-call will drop ~50% per year on a fixed quality
- New capabilities (image gen, agents, multimodal) will land regularly
Don't design features that need the current generation's exact capabilities. Design for the trajectory.
The eval-first roadmap
Before any feature ships, the team builds an eval suite. Before a feature is 'done,' it hits the eval score target. Evals are first-class roadmap items, not afterthoughts.
This is the discipline that separates production AI from demos. Most teams skip it; the ones that don't compound a quality advantage.
When to NOT roadmap
For exploration phases, don't roadmap. Run a 4-6 week 'discovery sprint' β vibe-code prototypes, eval them, see what works. Then decide what to put on the roadmap.
Premature roadmapping of AI features locks in the wrong bets. The model layer moves too fast.
Roadmap communication
For AI features, internal comms need extra discipline:
- Quality bar. What eval score qualifies as 'shippable'?
- Model dependency. Which model is this feature designed for? What's our plan if a better one ships?
- Cost model. What's per-user cost at scale?
- Failure modes. How will we know it's misbehaving? What's the response?
Make these explicit. AI features are higher-risk than feature parity work; the comms have to match.
Real-world examples
Linear's AI features (auto-summarization, agent-driven issue creation) shipped incrementally. The team waited until the models were good enough rather than forcing AI in early. The discipline of 'wait for the right moment' kept feature quality high.
Go deeper β recommended reading
Interview questions (1)
Q1Walk me through how you'd build an AI roadmap for your current product.ai-pmseniorβΌ
Three-layer plan.
Layer 1: Customer jobs (quarterly). Identify 3-4 user jobs where AI changes the experience qualitatively. Pick based on pain magnitude and AI fit. Plan one quarter each. For my product [example], the candidates are [job A, job B, job C].
Layer 2: Capabilities (monthly). What AI techniques do we need to master? Probably: RAG (month 1), evals (ongoing), agents (month 3), semantic search (month 2). Each capability unlocks 2-4 customer-job bets.
Layer 3: Infrastructure (always). Eval suites, cost monitoring, model routing, observability. Not exciting but load-bearing.
I'd run a 4-week discovery sprint first to validate which customer jobs are real AI opportunities. Then commit Q1 to the highest-impact one. Q2 forward stays flexible because the model layer will surprise us.
I'd also explicitly NOT lock 6 months ahead. AI roadmaps that try traditional 6-month planning get stale by month 3. Plan one quarter firmly, the rest in pencil.