专栏名称: 超级科技迷
分享科技资讯,前沿技术解析。不做营销号,每日分享技术摘要都会附上来源URL。希望大家找到自己感兴趣的文章再去精读原文。加油!
今天看啥  ›  专栏  ›  超级科技迷

Hacker News 精彩评论及翻译

超级科技迷  · 公众号  ·  · 2024-12-22 10:15
    

文章预览

  Hacker News 精彩评论及翻译 OpenAI O3 breakthrough high score on ARC-AGI-PUB https://news.ycombinator.com/item?id=42473876 Efficiency is now key. ~=$3400 per single task to meet human performance on this benchmark is a lot. Also it shows the bullets as "ARC-AGI-TUNED", which makes me think they did some undisclosed amount of fine-tuning (eg. via the API they showed off last week), so even more compute went into this task. We can compare this roughly to a human doing ARC-AGI puzzles, where a human will take (high variance in my subjective experience) between 5 second and 5 minutes to solve the task. (So i'd argue a human is at 0.03USD - 1.67USD per puzzle at 20USD/hr, and they include in their document an average mechancal turker at $2 USD task in their document) Going the other direction: I am interpreting this result as human level reasoning now costs (approximately) 41k/hr to 2.5M/hr with current compute. Super exciting that OpenAI pushed the compute out this far so we could ………………………………

原文地址:访问原文地址
快照地址: 访问文章快照
总结与预览地址:访问总结与预览