Comment on OpenAI releases o1, its first model with ‘reasoning’ abilities
Martineski@lemmy.dbzer0.com 2 months ago
I’m curious how it will do on the private benchmark that ai explained made. I think it was called simple bench?
Comment on OpenAI releases o1, its first model with ‘reasoning’ abilities
Martineski@lemmy.dbzer0.com 2 months ago
I’m curious how it will do on the private benchmark that ai explained made. I think it was called simple bench?
NoiseColor@startrek.website 2 months ago
Image
This is one stat I’ve found
Martineski@lemmy.dbzer0.com 2 months ago
simple-bench.com/index.html I was referring to this benchmark specifically because the point of it is to benchmark the actual reasoning capabilities of LLMs:
Image