kenna@lemm.eeM to star technology and research @lemm.eeEnglish · edit-21 year agoEfficient Streaming Language Models with Attention Sinks - #AIarxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10cross-posted to: technews@radiation.party
arrow-up11arrow-down1external-linkEfficient Streaming Language Models with Attention Sinks - #AIarxiv.orgkenna@lemm.eeM to star technology and research @lemm.eeEnglish · edit-21 year agomessage-square0fedilinkcross-posted to: technews@radiation.party