schizoidman@lemm.ee to Technology@lemmy.worldEnglish · 5 days agoApple to use Chinese giant Alibaba’s AI in iPhoneswww.semafor.comexternal-linkmessage-square20fedilinkarrow-up1112arrow-down17file-textcross-posted to: technology@lemmy.mltechnology@beehaw.orgtechnology@lemmy.zip
arrow-up1105arrow-down1external-linkApple to use Chinese giant Alibaba’s AI in iPhoneswww.semafor.comschizoidman@lemm.ee to Technology@lemmy.worldEnglish · 5 days agomessage-square20fedilinkfile-textcross-posted to: technology@lemmy.mltechnology@beehaw.orgtechnology@lemmy.zip
minus-squareIndustryStandard@lemmy.worldlinkfedilinkEnglisharrow-up1·5 days agoDeepseek R1 is currently the selfhosting model to use
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up1·5 days agoSome of the distillations are trained on top of Qwen 2.5. And for some cases, FuseAI (a special merge of several thinking models), Qwen Coder, EVA-Gutenberg Qwen, or some other specialized models do a better job than Deepseek 32B in certain niches.
Deepseek R1 is currently the selfhosting model to use
Some of the distillations are trained on top of Qwen 2.5.
And for some cases, FuseAI (a special merge of several thinking models), Qwen Coder, EVA-Gutenberg Qwen, or some other specialized models do a better job than Deepseek 32B in certain niches.