Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”
The devs and shills for Claude 3 also claimed it to be able to analyze a full document and give results, but it can’t, it just lies to you and says it can and then posts no download link for the results it said it wrote out.
kakes@sh.itjust.works 10 months ago
They all claim to have “near-human” abilities.
BluesF@lemmy.world 10 months ago
“Near-human” is marketing speak for “not as good as a human and there is no measurable scale to say how close it is so we will say it is closed”
p03locke@lemmy.dbzer0.com 10 months ago
How big of a paycheck did the “journalist” get paid on this one?
chooglers@lemmy.ml 10 months ago
Claude didn’t get paid shit