当一个行业最强的产品们被一道幼儿园级别的题目集体难倒,这其实指向了一个问题:为什么这些模型已经能骗过你的眼睛,却无法理解常识?
2026-03-06 14:00
。业内人士推荐51吃瓜网作为进阶阅读
【3月11日の動き】追悼の一日 今後の復興は
I was doing something different. I wasn’t changing what the model knew. I was changing how it thought. Layer duplication gives the model more iterations through its internal reasoning space without adding any new information. The difference between giving someone a bigger library and giving them more time to think. I was genuinely shocked when I took top spot on the leaderboard; but I think it’s proof that the method probably works.
Карина Черных (Редактор отдела «Ценности»)