cm0002@lemmy.world to memes@lemmy.world · 17 hours agoWho's got the popcorn?lemmy.worldimagemessage-square13fedilinkarrow-up1245arrow-down15
arrow-up1240arrow-down1imageWho's got the popcorn?lemmy.worldcm0002@lemmy.world to memes@lemmy.world · 17 hours agomessage-square13fedilink
minus-square🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 linkfedilinkEnglisharrow-up37·edit-217 hours agoBenchmark it how, exactly? Can it accurately give me Pokedex info for any pokemon I ask it about? Because even Fandom wikis can’t do that and those are edited, assumingly, by humans.
minus-squareScott@sh.itjust.workslinkfedilinkEnglisharrow-up27·17 hours agoLooks like they had it play the game. https://techcrunch.com/2025/02/24/anthropic-used-pokemon-to-benchmark-its-newest-ai-model/
Benchmark it how, exactly? Can it accurately give me Pokedex info for any pokemon I ask it about? Because even Fandom wikis can’t do that and those are edited, assumingly, by humans.
Looks like they had it play the game.
https://techcrunch.com/2025/02/24/anthropic-used-pokemon-to-benchmark-its-newest-ai-model/