The AI Code Review Reversal: Developers Now Spend More Hours Reviewing Than Writing
A Q1 2026 survey from Digital Applied covering 2,847 developers across 320 organisations quantified a shift that many practitioners have felt but not measured: developers now report spending 11.4 hours per week reviewing AI-generated code versus 9.8 hours writing new code — a reversal from 2024 patterns when writing still dominated. Claude Code (28%) and Cursor (24%) together account for over half of primary-tool selections among respondents, with GitHub Copilot and Windsurf (now Devin Desktop) following.
Self-reported productivity gains follow a consistent curve: a median 34% improvement within the first 60 days of adoption, then a plateau. Gains are concentrated in boilerplate generation, test coverage, and tasks in unfamiliar languages — categories where the cognitive cost of writing is high relative to the creativity required. Architectural decisions and security-sensitive code show the weakest gains, with 24% of respondents reporting that AI was actively unhelpful or introduced anti-patterns in those contexts. Cost volatility and prompt injection risks ranked as the top two pain points in 2026, displacing model reliability concerns from 2024.
The review-hours reversal is the most practically significant finding. It suggests that the bottleneck in AI-augmented development is shifting from code generation to code validation — and that teams need to invest in tooling and process for AI output review with the same seriousness they applied to test coverage and code review culture over the past decade.
Read more — Digital Applied Blog
Builders, Shippers, and Coasters: How AI Is Reshaping the Developer Spectrum
The Pragmatic Engineer's April 2026 survey of 900+ engineers identifies three archetypes emerging from AI adoption, each experiencing the technology's impact in qualitatively different ways. Builders — developers focused on code quality, architecture, and long-term maintainability — report the most ambivalent relationship with AI. They benefit from AI for large-scale refactors and working in unfamiliar codebases, but struggle with reviewing what they describe as "AI slop" in pull requests and experience professional identity loss as hands-on coding time shrinks.
Shippers — engineers optimised for throughput and delivery speed — report the strongest enthusiasm for AI tools. They cite dramatic output acceleration and note that AI lets them tackle features they would previously have deferred. However, the survey surfaces a concern: shippers accumulate technical debt faster with AI than without it, and frequently build unvalidated features that later need removal. The third archetype, Coasters, uses AI primarily to advance through performance reviews with minimum effort — a pattern that produces the highest short-term output numbers while potentially masking skill atrophy.
The cost picture is sobering. Companies typically pay $100–200 per engineer per month for premium AI plan access, while individuals average $20. Survey respondents in leadership roles frequently cited concern about "unsustainable AI-tooling budgets" as usage scales across engineering organisations. Around 30% of respondents hit usage limits regularly, prompting plan upgrades, tool switching, or API adoption to maintain uninterrupted workflows — each workaround adding friction and cost complexity.
Read more — The Pragmatic Engineer
The Unreviewed PR: The Most Insidious Anti-Pattern in Agentic Engineering
Simon Willison's Agentic Engineering Patterns guide, updated March 2026, names a single behaviour as the defining anti-pattern of the current moment: submitting agent-generated code in a pull request without having personally validated that it works. Willison is precise about why this is different from a careless PR: by not running the code yourself, you are not reviewing the agent's work — you are delegating your review responsibility to your colleagues, who must now do the actual work of verifying correctness.
The anti-pattern is insidious because it is dressed in the language of productivity. Agents generate plausible-looking code quickly, and the developer submitting it has technically seen the output. But seeing and validating are different acts. Willison's standard is unambiguous: before opening a PR, you must be able to articulate what the code does, why each change is correct, and what edge cases it handles — the same standard that applies to code you wrote manually. High-quality PRs should be small, well-tested, and accompanied by documentation that demonstrates the author's thorough review.
This guidance is particularly relevant as more teams adopt agentic coding workflows where the agent handles an entire task from specification to implementation. The temptation to treat the output as "the agent's PR" rather than "my PR that the agent helped write" is structurally encouraged by the framing of these tools. Willison's point is that professional responsibility for code correctness cannot be delegated to an AI system, and that teams that allow unreviewed agent PRs will experience the kind of compounding technical debt and defect accumulation that CI/CD pipelines were designed to prevent.
Read more — Simon Willison's Weblog
Safe & Secure AI Agent Practices
OWASP: Prompt Injection Maps to Six of Ten Agentic AI Security Categories
OWASP's June 2026 State of Agentic AI Security report, covered by Help Net Security on June 11, delivers a quantified assessment of what is going wrong in production agentic deployments. Prompt injection remains the dominant attack vector, and — critically — the report shows it maps to six of the ten categories in OWASP's Top 10 for Agentic Applications. This is not a single vulnerability class; it is an architectural condition that enables a wide surface of exploitation.
The report tracks 53 active agentic projects and finds that 28 of them are coding agents, with the five fastest-growing tools all in that category. Security advisories are accumulating rapidly: Claude Code has 22 known advisories, AutoGPT 15, Dify 13, Roo-Code 11, and n8n leads the list with 57. Seven of the tracked projects ship updates daily, and one averaged a release every eight hours — a pace at which most security teams cannot perform meaningful patch validation. Researchers identify what Simon Willison has called the "lethal trifecta" as the core architectural risk condition: an agent that simultaneously has access to private data, can consume untrusted content, and can make external communications.
Two concrete defensive frameworks emerge from the report. Meta's "Agents Rule of Two" limits any autonomous agent to accessing at most two of the three lethal trifecta capabilities — no agent should be able to read sensitive data, ingest untrusted input, and communicate externally in the same session. For regulatory timelines, developers and operators should be aware that breach notification windows are tightening: DORA requires four hours, NIS2 twenty-four hours, the US RAISE Act seventy-two hours, and California SB 53 fifteen days. Agentic AI incidents that trigger these notifications demand the same incident response infrastructure as traditional security events.
Read more — Help Net Security