Managing tokens, OpenAI, 2024 (OpenAI) - Provides essential guidance on understanding and managing token limits for OpenAI models, including concepts like context window and token estimation.
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS 2020)DOI: 10.48550/arXiv.2005.11401 - Introduces Retrieval-Augmented Generation (RAG), a method for extending LLM knowledge by retrieving relevant information from a large corpus, addressing context limitations.
Claude 2.1: New features and pricing, Anthropic, 2023 (Anthropic) - Announces Claude 2.1 with a 200K token context window, discussing its capabilities for long-form content processing and the challenges of recall in large contexts.