AI-native engineering
A maturity model for responsible agentic product development.
The goal is not novelty. The goal is better software delivery — stronger review, clearer context, measured quality, and team learning.
Level 1
01
Individual prompting
Engineers use AI tools independently for explanation, drafting, coding support, or debugging.
What it looks like
- Prompting varies by person.
- Outputs stay mostly private until work is reviewed.
- Value is real but hard to repeat.
Risks
- Uneven quality
- Hidden assumptions
- Over-trust in generated output
Manager responsibility
- Set basic expectations for review, privacy, and test discipline.
- Coach engineers to show assumptions and reasoning.
Metrics
- Adoption signal
- Review findings
- Defect patterns
Level 2
02
Shared prompts and reusable workflows
Teams create reusable prompts and workflow patterns for common planning, review, QA, and documentation tasks.
What it looks like
- Prompt libraries and examples are shared.
- Team members compare outputs and improve patterns.
- AI support becomes part of normal delivery conversations.
Risks
- Stale prompts
- False consistency
- Prompts detached from repository reality
Manager responsibility
- Keep workflows tied to team standards.
- Make shared examples reviewable and easy to update.
Metrics
- Workflow reuse
- Ticket readiness
- PR review cycle quality
Level 3
03
Repo-specific agent instructions and skills
Agents use repository-specific context, conventions, and quality gates before proposing code or workflow changes.
What it looks like
- Repositories include explicit agent guidance.
- Agents read relevant files before editing.
- Work aligns better with local architecture and tests.
Risks
- Outdated instructions
- Context gaps
- Automation that hides complexity
Manager responsibility
- Assign ownership for instruction maintenance.
- Use code review to validate that guidance improves outcomes.
Metrics
- Instruction freshness
- Rework rate
- Build and lint pass rate
Level 4
04
Jira/GitHub/CI-integrated agents
Agents participate in delivery systems by reviewing tickets, drafting implementation plans, preparing pull requests, and responding to CI signals.
What it looks like
- Ticket and PR workflows include agent-assisted checks.
- CI feedback is summarized into actionable remediation.
- Engineers retain ownership of final decisions.
Risks
- Noisy automation
- Permission creep
- Overloaded review surfaces
Manager responsibility
- Design clear boundaries for agent actions.
- Measure whether integrations reduce or increase delivery friction.
Metrics
- Cycle time
- CI failure recovery
- CTA and review completion rates
Level 5
05
Measured agentic SDLC with evals, observability, and governance
AI-assisted delivery is measured, governed, and continuously improved with quality signals, evals, and operational feedback.
What it looks like
- Workflows have measurable success criteria.
- Teams evaluate agent output quality over time.
- Governance protects quality, privacy, and maintainability.
Risks
- Metric gaming
- Governance theater
- Treating evals as a substitute for judgment
Manager responsibility
- Tie metrics to delivery quality and team learning.
- Keep governance lightweight, real, and aligned with risk.
Metrics
- Defect escape rate
- Review quality
- Workflow eval results
- Operational noise