Real-session tests, plan-by-plan capability matrices, and prompt templates that work — for AI tools that change too fast for old guides to keep up.