Paper: A Survey of Context Engineering for Large Language Models

Authors: Lingrui Mei, Jiayu Yao, Yuyao Ge, Yiwei Wang, Baolong Bi, Yujun Cai, Jiazhi Liu, Mingyu Li, Zhong-Zhi Li, Duzhen Zhang, Chenlin Zhou, Jiayi Mao, Tianze Xia, Jiafeng Guo, and Shenghua Liu.
Source: arXiv e-print (Preprint)
Date: July 2025

Reference List

Journal Articles & Preprints

Mei, L., Yao, J., Ge, Y., et al. (2025) A Survey of Context Engineering for Large Language Models. arXiv preprint [2507.13334]. Available at: https://arxiv.org/abs/2507.13334 (Accessed: 4 November 2025).
Lewis, P., Oğuz, B., Yarats, D., et al. (2020) ‘Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks’. In: Advances in Neural Information Processing Systems 33. Vancouver, Canada: NeurIPS.
Brown, T. B., Mann, B., Ryder, N., et al. (2020) ‘Language Models are Few-Shot Learners’. In: Advances in Neural Information Processing Systems 33. Vancouver, Canada: NeurIPS.

Technical Reports & Online Guides

Anthropic (2025) Effective context engineering for AI agents. Available at: https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents (Accessed: 4 November 2025).
LlamaIndex (2025) Context Engineering – What it is, and techniques to consider. Available at: https://www.llamaindex.ai/blog/context-engineering-what-it-is-and-techniques-to-consider (Accessed: 4 November 2025).

2. Notable Products and Frameworks in the Market

Context Engineering is primarily implemented using open-source frameworks and vendor platforms that specialize in building LLM applications, especially those relying on RAG and Agent architectures.

LangChain is a Development Framework

LangChain Provides a cohesive framework for building agentic workflows and RAG pipelines. It offers modules for memory management, tool calling, and chaining LLM steps, which are all fundamental to context orchestration.

LlamaIndex is Data Framework

LlamaIndex Specializes in connecting LLMs to external data sources. It is core to the Retrieval component of context engineering, offering various indexing and query transformation strategies to retrieve the most relevant information for the context window.

LlamaStack is Integrated Ecosystem

A framework and ecosystem built to specifically simplify the creation and deployment of applications using Meta's Llama models. Focuses on optimized data processing, training, and streamlined inference/deployment for Llama-based context.

Weaviate / Pinecone are Vector Databases

These are the storage backends for long-term memory and external knowledge. They allow for semantic search, enabling the Retrieval-Augmented Generation (RAG) process that feeds precise, context-specific information to the LLM.

Zep is Context Management Platform

A dedicated platform for managing complex conversational memory and state. It specifically addresses challenges related to session history, user profiles, and filtering context for long-running AI applications.

AutoGen is Multi-Agent Framework

Developed by Microsoft, it focuses on building multi-agent systems where different AI agents collaborate to solve complex tasks. Context engineering here involves defining communication protocols and isolating context for each sub-agent.

Microsoft Agent Framework is Agent Orchestration Framework

A comprehensive, open-source SDK combining the strengths of AutoGen and Semantic Kernel. It features graph-based workflows, robust state/context management, and high control over multi-agent execution paths.

Microsoft Agent Lightning is Optimization Framework

A cutting-edge framework that enables Reinforcement Learning (RL)-based training of any existing AI agent (e.g., those built with LangChain, AutoGen) by decoupling training from execution. Focuses on optimizing agent behavior and long-horizon context logic.

Google (Vertex AI & Gemini)Platform / Model Suite

Emphasizes the use of massive context windows (e.g., 1M+ tokens in Gemini models) to directly handle large amounts of multimodal context (text, video, audio) in one call. Vertex AI Agent Builder provides enterprise tools for designing and deploying agents.

Meta (Research Focus)

Research / Open SourceMeta's approach often centers on optimizing their Llama family of models for efficiency and context handling. Research highlights include frameworks for autonomous context auditing and continuous learning where an agent enriches conversation history with structured metadata/tags to manage its own knowledge base.

LangGraph Graph-based Orchestration

An extension of LangChain that allows for stateful, cyclical workflows using graphs. It provides fine-grained control over which components (RAG, memory, tool calls) are executed and what context is passed at each step of a complex multi-agent process.

Lifetime Scope Journal (LSJ) - the AI Agent Business Blog

(LSJ) Context engineering advanced

Reference List

2. Notable Products and Frameworks in the Market

LangChain is a Development Framework

LlamaIndex is Data Framework

LlamaStack is Integrated Ecosystem

Weaviate / Pinecone are Vector Databases

Zep is Context Management Platform

AutoGen is Multi-Agent Framework

Microsoft Agent Framework is Agent Orchestration Framework

Microsoft Agent Lightning is Optimization Framework

Google (Vertex AI & Gemini)Platform / Model Suite

Meta (Research Focus)

LangGraph Graph-based Orchestration

Lifetime World

Lifetime Scope Journal - The AI Business Blog

Get Updates from Us!

Lifetime Scope Journal (LSJ) - the AI Agent Business Blog

Reference List

2. Notable Products and Frameworks in the Market

LangChain is a Development Framework

LlamaIndex is Data Framework

LlamaStack is Integrated Ecosystem

Weaviate / Pinecone are Vector Databases

Zep is Context Management Platform

AutoGen is Multi-Agent Framework

Microsoft Agent Framework is Agent Orchestration Framework

Microsoft Agent Lightning is Optimization Framework

Google (Vertex AI & Gemini)Platform / Model Suite

Meta (Research Focus)

LangGraph Graph-based Orchestration

Lifetime World

Lifetime Scope Journal - The AI Business Blog

Get Updates from Us!​

Get Updates from Us!