The length of tasks that generalist frontier model agents can complete autonomously with 50% reliability has been doubling approximately every 7 months
submitted by /u/katxwoods [link] [comments]
Category Added in a WPeMatico Campaign