YaRN: New Approach to Expanding Context in LLaMa-2 Up to 128k Tokens

by Altszn.com

September 4, 2023

in Metaverse, Web3

0

YaRN: New Approach to Expanding Context in LLaMa-2 Up to 128k Tokens

399

SHARES

2.3k

VIEWS

Share on Facebook Share on Twitter

A new method known as YaRN (Yet Another RoPE for Transformers) has emerged, offering the potential to extend context capabilities in large language models (LLMs) using the RoPE technique for positional coding. This approach, as detailed in a recent article, provides the means to expand context up to 64k or even 128k tokens. This innovation is particularly notable as it addresses the growing demand for models that can accommodate substantial context, such as extended texts or lengthy message histories.