Comment on OpenAI releases o1, its first model with ‘reasoning’ abilities
NoiseColor@startrek.website 2 months ago
This is one stat I’ve found
Martineski@lemmy.dbzer0.com 2 months ago
simple-bench.com/index.html
I was referring to this benchmark specifically because its whole point is to measure the actual reasoning capabilities of LLMs:
Image