Ga naar de inhoud

Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks (Boaz Barak/Windows On Theory) 05-11-2025

Boaz Barak / Windows On Theory:
Research: AI’s ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks  —  METR has had a very influential work by Kwa and West et al on measuring AI’s ability to complete long tasks.


Lees verder op Tech Meme