The authors claim this method was used to extend Llama 2 to 128k: https://github.com/jquesnelle/yarn