cm0002@lemmy.world to memes@lemmy.world · 2 months agoWho's got the popcorn?lemmy.worldimagemessage-square14linkfedilinkarrow-up1265arrow-down16
arrow-up1259arrow-down1imageWho's got the popcorn?lemmy.worldcm0002@lemmy.world to memes@lemmy.world · 2 months agomessage-square14linkfedilink
minus-square🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 linkfedilinkEnglisharrow-up43·edit-22 months agoBenchmark it how, exactly? Can it accurately give me Pokedex info for any pokemon I ask it about? Because even Fandom wikis can’t do that and those are edited, assumingly, by humans.
minus-squareScott@sh.itjust.workslinkfedilinkEnglisharrow-up30·2 months agoLooks like they had it play the game. https://techcrunch.com/2025/02/24/anthropic-used-pokemon-to-benchmark-its-newest-ai-model/
Benchmark it how, exactly? Can it accurately give me Pokedex info for any pokemon I ask it about? Because even Fandom wikis can’t do that and those are edited, assumingly, by humans.
Looks like they had it play the game.
https://techcrunch.com/2025/02/24/anthropic-used-pokemon-to-benchmark-its-newest-ai-model/