Engineering Productivity Benchmarks (DORA + Delivery Metrics)
Summary
Comprehensive engineering productivity benchmarks covering the five DORA metrics (deployment frequency, lead time for changes, change failure rate, mean time to recovery, rework rate) plus cycle time, PR metrics, and throughput data. Sourced from the 2025 DORA Report (~5,000 respondents) and LinearB's analysis of 8.1M+ pull requests across 4,800 teams. The most significant finding: AI coding assistants boost individual output but organizational delivery metrics remain flat. [src1]
Data vintage: Based on 2025 DORA survey data and LinearB's 2024-2025 PR analysis from 4,800+ engineering teams across 42 countries.
Key shift: DORA expanded from 4 to 5 metrics in 2025 by adding rework rate. The framework reorganized into throughput metrics (deployment frequency, lead time, recovery time) and instability metrics (change failure rate, rework rate). The traditional elite/high/medium/low classification was replaced with archetype-based clusters. [src1][src4]
Constraints
- These benchmarks represent primarily US/Western tech industry software teams. Do not apply to hardware engineering, manufacturing, or non-software R&D organizations.
- DORA figures are self-reported survey data; measured platform data (LinearB, Plandek) shows materially different distributions. Use DORA for directional comparison, platform data for precision.
- Figures are medians unless otherwise noted. Means are skewed by outlier elite performers; use median for realistic target-setting. Percentile columns rank performance, not raw values: the 25th Pct column shows bottom-quartile performers (which for lower-is-better metrics like lead time means larger numbers), and the 75th Pct column shows top-quartile performers.
- Data collected H1-H2 2025. If more than 6 months old, search for updated DORA or LinearB reports before citing.
- Only compare teams within the same segment (team size, company stage). A 5-person startup deploying 10x/day is not comparable to a 200-person enterprise deploying daily.
Metrics
Velocity
Deployment Frequency
Definition: Number of production deployments per unit of time per team/service. Measures how often code reaches production. Counted at the service or application level, not per developer.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| Small team (2-10) | 2-3x/week | 1x/week | 1x/day | Multiple/day |
| Mid-size (11-50) | 1-2x/week | 2x/month | 3-5x/week | 1x/day |
| Large (51-200) | 1x/week | 2x/month | 2-3x/week | Daily |
| Enterprise (200+) | 2-4x/month | 1x/month | 1x/week | 2-3x/week |
Trend: Only 16.2% of organizations achieve on-demand deployment. 23.9% deploy less than once per month. Distribution is bimodal. [src1][src3]
Red flag threshold: Deploying less than once per month indicates batch-oriented delivery with high risk per deployment.
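As a minimal sketch, deployment frequency can be derived from a list of production deploy timestamps. The window normalization here is illustrative (it floors the observation window at one week), not part of any DORA specification:

```python
from datetime import datetime, timedelta

def deploys_per_week(deploy_times: list[datetime]) -> float:
    """Average production deployments per week over the observed window."""
    if len(deploy_times) < 2:
        return float(len(deploy_times))
    span = max(deploy_times) - min(deploy_times)
    weeks = max(span / timedelta(weeks=1), 1.0)  # floor the window at one week
    return len(deploy_times) / weeks
```

Count deploys at the service or application level, per the definition above, before passing timestamps in.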
Lead Time for Changes
Definition: Time from code commit to code successfully running in production. Includes code review, CI/CD pipeline execution, and any manual approval gates.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| Small team (2-10) | 1-2 days | 2-5 days | 2-6 hours | < 1 hour |
| Mid-size (11-50) | 2-5 days | 1-2 weeks | 1-2 days | < 1 day |
| Large (51-200) | 3-7 days | 1-4 weeks | 2-3 days | 1-2 days |
| Enterprise (200+) | 1-2 weeks | 1-6 months | 3-7 days | 1-3 days |
Trend: Only 9.4% of teams achieve lead times under one hour. 31.9% fall in the one-day-to-one-week range. [src1][src3]
Red flag threshold: Lead time exceeding 1 month signals severe process bottlenecks or manual gates.
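Lead time is computed per change as commit timestamp to production timestamp, then aggregated with the median (per the constraint above on medians vs. means). A minimal sketch; the tuple shape is illustrative:

```python
from datetime import datetime, timedelta
from statistics import median

def median_lead_time_hours(changes: list[tuple[datetime, datetime]]) -> float:
    """Median hours from code commit to that code running in production.

    Each tuple is (committed_at, deployed_at).
    """
    durations = [(deployed - committed).total_seconds() / 3600
                 for committed, deployed in changes]
    return median(durations)
```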
Stability
Change Failure Rate (CFR)
Definition: Percentage of deployments that cause a failure in production requiring remediation (rollback, hotfix, patch, or emergency fix).
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| Small team (2-10) | 10% | 15-20% | 5% | < 2% |
| Mid-size (11-50) | 12% | 20-25% | 5-8% | < 3% |
| Large (51-200) | 15% | 25-30% | 8-10% | < 5% |
| Enterprise (200+) | 18% | 30%+ | 10-15% | < 5% |
Trend: Only 8.5% of teams achieve ideal CFR of 0-2%. AI-assisted code changes show higher initial failure rates. [src1][src5]
Red flag threshold: CFR above 25% indicates systemic quality issues.
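CFR is a simple ratio over deployment records, provided each record flags whether remediation (rollback, hotfix, patch, or emergency fix) was required. A sketch with an illustrative field name:

```python
def change_failure_rate(deploys: list[dict]) -> float:
    """Share of deployments that required remediation in production.

    Assumes each record carries a "required_remediation" flag (illustrative name).
    """
    if not deploys:
        return 0.0
    failed = sum(1 for d in deploys if d.get("required_remediation"))
    return failed / len(deploys)
```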
Mean Time to Recovery (MTTR)
Definition: Time from detection of a production failure to full service restoration. Also called "failed deployment recovery time" in the 2025 DORA framework.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| Small team (2-10) | 1-4 hours | 4-12 hours | 30-60 min | < 15 min |
| Mid-size (11-50) | 2-8 hours | 8-24 hours | 1-2 hours | < 30 min |
| Large (51-200) | 4-12 hours | 12-48 hours | 2-4 hours | < 1 hour |
| Enterprise (200+) | 12-24 hours | 24-72 hours | 4-12 hours | < 2 hours |
Trend: Elite teams achieve MTTR under 1 hour across all segments. Teams with automated rollback recover 5-10x faster. [src1][src3]
Red flag threshold: MTTR exceeding 24 hours for non-enterprise teams indicates inadequate incident response.
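MTTR is the mean over incident durations, each measured from detection to full restoration. A minimal sketch; the tuple shape is illustrative:

```python
from datetime import datetime, timedelta
from statistics import mean

def mttr_hours(incidents: list[tuple[datetime, datetime]]) -> float:
    """Mean hours from failure detection to full service restoration.

    Each tuple is (detected_at, restored_at).
    """
    return mean((restored - detected).total_seconds() / 3600
                for detected, restored in incidents)
```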
Rework Rate (5th DORA Metric — New in 2025)
Definition: Percentage of deployments that are unplanned fixes or patches to correct user-facing defects from prior deployments.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| All segments | 8-12% | 15-20% | 4-6% | < 3% |
Trend: Increased AI adoption correlates with increased rework rate — AI-generated code ships faster but requires more post-deployment corrections. [src1][src4]
Red flag threshold: Rework rate above 15% means more time fixing than shipping planned work.
Efficiency
Cycle Time (PR Open to Merged)
Definition: Total elapsed time from pull request creation to merge into main branch. Includes pickup time, review time, revision cycles, and final approval.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| Small team (2-10) | 3-4 days | 5-7 days | 1-2 days | < 26 hours |
| Mid-size (11-50) | 5-7 days | 7-14 days | 2-4 days | < 2 days |
| Large (51-200) | 7-10 days | 10-21 days | 4-6 days | < 3 days |
| Enterprise (200+) | 10-14 days | 14-30 days | 5-8 days | < 5 days |
Trend: Average cycle time is ~7 days, with PRs sitting in review for 4 of those 7 days. Code review is the single largest bottleneck. [src2]
Red flag threshold: Cycle time exceeding 14 days for non-enterprise teams signals review process breakdown.
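Cycle time is measured per PR from creation to merge and aggregated with the median. A minimal sketch; the tuple shape is illustrative:

```python
from datetime import datetime, timedelta
from statistics import median

def median_cycle_time_days(prs: list[tuple[datetime, datetime]]) -> float:
    """Median days from pull request creation to merge.

    Each tuple is (opened_at, merged_at).
    """
    return median((merged - opened).total_seconds() / 86400
                  for opened, merged in prs)
```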
Throughput (PRs Merged per Developer per Week)
Definition: Number of pull requests merged per developer per week. Measures individual developer output normalized across team sizes.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| All segments | 2-3 PRs/week | 1-2 PRs/week | 4-5 PRs/week | 6+ PRs/week |
Trend: Teams using AI coding assistants show 15-25% improvement in PR throughput, but with higher rework rates. [src1][src2]
Red flag threshold: Sustained throughput below 1 PR/developer/week indicates blockers or oversized PRs.
Quality
PR Size
Definition: Number of code changes (additions + modifications + deletions) per pull request. The single most impactful metric for engineering velocity.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| All segments | 200-300 lines | 400-661 lines | 100-194 lines | < 100 lines |
Trend: Elite teams maintain PR sizes under 194 changes. Teams keeping PRs under 194 lines merge roughly 5x more frequently. [src2]
Red flag threshold: PRs above 500 lines correlate with 3-5x longer cycle times and higher CFR.
Merge Time
Definition: Time from final code review approval to merge into main branch.
| Segment | Median | 25th Pct | 75th Pct | Top Decile |
|---|---|---|---|---|
| All segments | 4-8 hours | 12-24 hours | 1-2 hours | < 2 hours |
Trend: Elite teams maintain merge times under 2 hours. Automated merge queues are the primary improvement driver. [src2]
Red flag threshold: Merge time exceeding 24 hours after approval indicates CI/CD bottlenecks.
Composite Metrics & Rules of Thumb
| Rule | Formula / Threshold | Interpretation |
|---|---|---|
| DORA Throughput Score | High deployment frequency + Low lead time | Both must be strong — frequent deployments paired with long lead time mean changes queue in review or approval before release |
| DORA Stability Score | Low CFR + Low MTTR + Low rework rate | All three must be healthy — low CFR with high MTTR means failures are rare but catastrophic |
| Cycle Time Ratio | Review time / Total cycle time < 50% | If review exceeds 50% of cycle time, review process is the bottleneck |
| PR Size Rule | Median PR < 200 lines | Highest-leverage metric — drives cycle time, CFR, and review quality simultaneously |
| Deploy:Rework Ratio | Planned deploys / Rework deploys > 8:1 | At 8:1, rework is one deploy in nine (~11% of all deployments); falling below the ratio means too much capacity goes to unplanned fixes |
| AI Productivity Paradox | Individual output up + Team metrics flat | AI boosts individual velocity but does not automatically improve organizational throughput |
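The two ratio rules above reduce to one-line checks. A sketch (function names are mine, not from either report):

```python
def deploy_rework_ratio(planned: int, rework: int) -> float:
    """Planned-to-rework deploy ratio; the rule of thumb is to stay above 8:1."""
    return planned / rework if rework else float("inf")

def review_is_bottleneck(review_hours: float, cycle_hours: float) -> bool:
    """Cycle Time Ratio rule: review consuming >50% of total cycle time."""
    return review_hours / cycle_hours > 0.5
```

With the industry averages cited earlier (4 of 7 cycle-time days spent in review), `review_is_bottleneck(4 * 24, 7 * 24)` returns True — the typical team's review process is the bottleneck.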
Segment Definitions
| Segment | Definition | Typical Characteristics |
|---|---|---|
| Small team (2-10 engineers) | Startup or small product team, single-service | Direct communication, minimal process overhead, trunk-based development |
| Mid-size (11-50 engineers) | Growth-stage company or business unit | Multiple squads, code ownership emerging, PR reviews required |
| Large (51-200 engineers) | Scale-up or division within enterprise | Platform teams, shared services, architecture governance |
| Enterprise (200+ engineers) | Large organization or multi-BU company | Complex CI/CD, compliance gates, change advisory boards |
Year-over-Year Trend Summary
| Metric | 2023 | 2024 | 2025 | Direction |
|---|---|---|---|---|
| Deployment frequency (% daily+) | 30% | 32% | 38% | Up 8pp over 2 years |
| Lead time (% under 1 day) | 35% | 38% | 41% | Up 6pp, steady |
| Change failure rate (median) | 12% | 14% | 15% | Up 3pp — AI contributing |
| MTTR (% under 1 hour) | 20% | 22% | 25% | Up 5pp — automation gains |
| Cycle time (average) | 8 days | 7.5 days | 7 days | Down 12.5% over 2 years |
| PR size (elite threshold) | 250 lines | 220 lines | 194 lines | Down 22% — smaller PRs |
Common Misinterpretations
- Treating deployment frequency as the primary metric: High deployment frequency without stability is counterproductive. A team deploying 10x/day with 30% CFR is worse off than one deploying daily with 3% CFR. Always evaluate throughput and stability together. [src1]
- Applying enterprise benchmarks to startups (or vice versa): A 5-person team should deploy multiple times daily; expecting that from a 300-person org with compliance requirements sets unrealistic targets.
- Equating individual AI productivity gains with team improvement: AI tools boost individual output (21% more tasks, 98% more PRs) but organizational delivery metrics remain flat. The bottleneck shifts from coding to review, testing, and deployment. [src1]
- Using DORA metrics as targets rather than diagnostics: Goodhart's Law applies — when deployment frequency becomes a target, teams game it. Use metrics for diagnosis, not incentive compensation.
- Ignoring PR size while optimizing cycle time: PR size is the single strongest predictor of cycle time. Teams that focus on review improvements without addressing oversized PRs see minimal reduction. [src2]
When This Matters
Fetch when a user asks about engineering team performance benchmarks, wants to evaluate their DORA metrics against industry peers, is setting engineering KPIs or OKRs, needs to diagnose delivery bottlenecks, or is evaluating the impact of AI coding tools on team productivity.