[视频作者] 秋刀鱼的炼丹工坊
[视频时长] 24:49
[视频类型] 计算机技术
论文题目:Retentive Network: A Successor to Transformer for Large Language Models 论文地址:http://arxiv.org/abs/2307.08621 代码:https://github.com/microsoft/torchscale - xPos: A Length-Extrapolatable Transformer http://arxiv.org/abs/2212.10554 * 本视频旨在传递一篇论文的存在推荐感兴趣的
![[图][论文速览]RetNet: A Successor to Transformer for Large Language Models[2307.08621]](https://i1.hdslb.com/bfs/archive/c10d0bbc4327d0894856c5d22199fe81411b969d.jpg)