We do not provide evidence that: | Clarification |
---|---|
AI systems do not currently speed up many or most software developers | We do not claim that our developers or repositories represent a majority or plurality of software development work |
AI systems do not speed up individuals or groups in domains other than software development | We only study software development |
AI systems in the near future will not speed up developers in our exact setting | Progress is difficult to predict, and there has been substantial AI progress over the past five years [3] |
There are not ways of using existing AI systems more effectively to achieve positive speedup in our exact setting | Cursor does not sample many tokens from LLMs, it may not use optimal prompting/scaffolding, and domain/repository-specific training/finetuning/few-shot learning could yield positive speedup |
Factor Analysis: We investigate 20 potential factors that might explain the slowdown, finding evidence that 5 likely contribute:
We rule out many experimental artifacts—developers used frontier models, complied with their treatment assignment, didn’t differentially drop issues (e.g. dropping hard AI-disallowed issues, reducing the average AI-disallowed difficulty), and submitted similar quality PRs with and without AI. The slowdown persists across different outcome measures, estimator methodologies, and many other subsets/analyses of our data. See the paper for further details and analysis.
FAQs:
Last modified 23 August 2025