Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”
The devs and shills for Claude 3 also claimed it could analyze a full document and give results, but it can’t. It just lies to you, says it can, and then posts no download link for the results it said it wrote out.
kakes@sh.itjust.works 8 months ago
They all claim to have “near-human” abilities.
BluesF@lemmy.world 8 months ago
“Near-human” is marketing speak for “not as good as a human, and there is no measurable scale to say how close it is, so we will say it is close.”
p03locke@lemmy.dbzer0.com 8 months ago
How big of a paycheck did the “journalist” get for this one?
chooglers@lemmy.ml 8 months ago
Claude didn’t get paid shit