arXiv preprint arXiv:2308.12950, 2023 Effective Long-Context Scaling of Foundation Models.Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, ...