当前位置：首页 > 资讯 > 系统环境

大模型应用开发之上下文工程（Context Engineering）原理与应用

时间：2025-11-11 18:32 作者：来源：阅读：0
扫一扫，手机访问

摘要：大模型应用开发之上下文工程（Context Engineering）原理与应用研究员：专家研究员日期： 2025年11月04日摘要本报告旨在深入探讨大型语言模型（LLM）应用开发中的一个新兴且至关重大的领域——上下文工程（Context Engineering）。随着LLM能力的不断增强和应用场景的日益复杂，如何系统性地设计、管理和优化模型的输入上下文，已成为决定应用性能、可靠性和效率的关键。本

大模型应用开发之上下文工程（Context Engineering）原理与应用

研究员： 专家研究员

日期： 2025年11月04日

摘要

本报告旨在深入探讨大型语言模型（LLM）应用开发中的一个新兴且至关重大的领域——上下文工程（Context Engineering）。随着LLM能力的不断增强和应用场景的日益复杂，如何系统性地设计、管理和优化模型的输入上下文，已成为决定应用性能、可靠性和效率的关键。本报告将第一界定上下文工程的核心概念，阐明其与传统提示工程（Prompt Engineering）的区别与演进关系。随后，报告将详细解析其核心技术原理、方法论，特别是在生产环境中优化有限上下文窗口利用率的关键技术，如上下文压缩、选择性检索等。此外，报告将结合具体应用案例和可量化的性能指标，展示上下文工程在智能客服、代码生成、复杂推理等领域的实际价值，例如带来高达18倍的准确率提升和显著的成本降低。最后，本报告将探讨包括动态检索增强生成（Dynamic RAG）和安全信息流控制在内的高级架构，并分析当前面临的挑战与未来发展趋势，为构建更强劲、更可靠的下一代AI应用提供理论基础和实践指引。

1. 引言：上下文工程的崛起与重大性

1.1 定义：超越提示工程的新范式

上下文工程（Context Engineering）是针对高级AI模型，特别是大型语言模型（LLM）的一种系统性工程方法 [3][15]。其核心是设计、构建、管理和优化提供给模型的全部输入信息，即“上下文”，以确保模型能够高效、准确地理解并执行任务 [1][12]。这个“上下文”包罗万象，不仅限于用户的直接提问，还涵盖了背景知识、历史对话记录、系统指令、可用的工具（API）、外部数据源、甚至是模型自身的中间状态等一切模型在推理时能够“看到”的信息 [6][10][15]。

与聚焦于优化单一指令或问题的“提示工程”（Prompt Engineering）不同，上下文工程是一个更宏大、更系统的概念 [17]。它被视为提示工程的自然演进和工业化升级 [1][2]。如果说提示工程是“术”，专注于雕琢与模型单次交互的话语，那么上下文工程则是“道”，关注于构建一个完整、动态、高效的信息流系统，以解决LLM在生产环境中固有的不确定性，是实现生产级AI部署的关键环节 [2][10]。

1.2 核心目标与价值

上下文工程的根本目标是通过系统性地优化信息流来提升LLM应用的整体性能 [5][23]。实践证明，LLM的输出质量极大地依赖于其在推理时获得的上下文质量 [1][14]。因此，上下文工程的价值主要体目前以下几个方面：

l 提升理解与推理精度： 通过提供准确、相关的背景信息，协助模型更深入地理解用户意图和任务的复杂性，从而提升逻辑推理能力 [7]。

l 减少“幻觉”现象： 为模型提供实际依据（Grounding），特别是通过检索外部知识库，能显著减少模型捏造实际的倾向，提高输出的可靠性和真实性 [8][112]。

l 优化性能与效率： 在有限的上下文窗口内，通过压缩和筛选信息，最大化信息密度，过滤无关噪声，从而降低API调用成本和计算延迟 [25][28][112]。

l 增强任务完成能力： 通过集成外部工具和多代理协作，使模型能够执行更复杂的、多步骤的任务，超越纯粹的文本生成 [2][13]。

众多真实世界的部署案例已经证明，卓越的上下文工程能够带来显著的投资回报，包括任务成功率提升10倍、运营成本降低40%等 [22][104]。

2. 上下文工程的核心原理与方法论

上下文工程并非单一技术，而是一套综合性的方法论，其核心在于对信息进行系统化的管理和组织。

2.1 上下文的构成要素

在LLM应用中，上下文是一个动态变化的信息集合，其主要构成要素包括：

l 用户指令（Instruction）： 用户直接下达的任务或问题。

l 历史对话（History）： 多轮对话中的交互记录，为模型提供对话背景。

l 外部知识（Knowledge）： 通过检索从知识库、数据库或实时API中获取的相关信息。

l 示例（Examples）： 少样本学习（Few-shot Learning）中提供的范例，用于引导模型输出格式和风格 [41][51]。

l 工具定义（Tool Definitions）： 允许模型调用的外部函数或API的描述。

l 系统状态（System State）： 应用当前的状态信息，如用户身份、权限等。

2.2 四大核心策略：写入、选择、压缩与隔离

根据业界专家的总结，上下文工程的实践可以归纳为四个核心策略 [8][17][23]：

1.写入（Write）：将动态变化的信息（如用户会话历史、检索到的知识）存储到外部的、可持久化的存储系统中（如向量数据库、键值存储）。这为后续的信息检索和管理奠定了基础。

2.选择（Select）：在模型处理请求之前，从外部存储中选择与当前任务最相关的信息子集。这是上下文工程中最关键的一步，直接决定了注入上下文的质量。选择性获取和优先级排序是这里的核心技术 [22][27]。

3.压缩（Compress）：对选择出的信息进行压缩，以在有限的上下文窗口内传递最大信息量。这不仅能减少噪声干扰，还能有效降低Token消耗和推理延迟 [25][29]。信息理论被认为是指导最优压缩策略的理论基础 [30]。

4.隔离（Isolate）：在复杂的应用（如多代理系统）中，需要为不同的子任务或代理（Agent）创建独立的上下文空间，以防止信息交叉污染和上下文混淆 [6][8]。

2.3 关键技术：检索增强生成（RAG）

检索增强生成（Retrieval-Augmented Generation, RAG）是上下文工程中最核心、应用最广泛的技术之一 [11][16][44]。RAG通过一个双模态架构，将信息检索系统与LLM生成器相结合 [63][71]。其基本工作流程是：当接收到用户查询时，系统第一从外部知识源（如企业文档、数据库）中检索相关信息片段，然后将这些信息片段与原始查询一同作为增强的上下文，输入给LLM，从而生成更准确、更具实际依据的回答 [123][137]。

RAG技术是实现“选择”策略的主要手段，它有效地将模型的参数化知识（训练数据中学习到的）与非参数化知识（外部实时数据）结合起来，极大地提升了模型的知识获取能力和答案准确性 [123]。

2.4 面临的挑战

尽管上下文工程极为强劲，但在实践中也面临诸多挑战，例如“上下文中毒”（Context Poisoning，恶意注入误导性信息）、“上下文分散”（Attention Distraction，不相关信息分散模型注意力）、“上下文混淆”（Context Confusion）和“上下文冲突”（Conflicting Information）等问题 [8][19]。解决这些问题需要更精细化的信息评估和过滤机制。

3. 生产系统中的上下文窗口优化

LLM的上下文窗口（Context Window）指的是模型一次性可以处理的Token数量上限 [38]。虽然现代模型的上下文窗口越来越大，但其增长并非没有代价：计算复杂度和内存占用一般会随之呈二次方增长 [36][153]。因此，在成本和延迟敏感的生产环境中，高效利用上下文窗口是上下文工程的核心任务。

3.1 上下文窗口的瓶颈

l 成本： 大多数商业LLM的定价与输入和输出的Token数量直接相关，过长的上下文意味着更高的单次调用成本。

l 延迟： 处理更长的上下文序列需要更多的计算时间，导致用户响应延迟增加。

l 性能： 研究表明，在超级长的上下文中，模型可能会出现“中间遗忘”（Lost in the Middle）现象，即对上下文两端信息的关注度高于中间部分，导致性能下降。

3.2 关键优化技术

3.2.1 上下文压缩与过滤

上下文压缩旨在用更少的Token表达一样或更核心的信息。这是一种在信息保真度和成本效率之间进行权衡的艺术 [28]。

l 算法实现： 像LLMLinguaCompressor这样的工具，可以通过设置压缩率（compression_rate）或动态压缩比，依据信息的重大性对上下文进行有损或无损压缩 [143][263]。

l 过滤冗余： LangChain中的
ContextualCompressionRetriever就是一个典型的实现，它包裹一个基础检索器，并通过一个文档压缩器（如LLMChainExtractor）对检索结果进行后处理，仅提取与查询最相关的句子或片段，从而过滤掉大量无关内容 [141][160][201]。

l 智能分块（Chunking）： 在RAG流程中，将文档预处理成大小合适的块（chunks）至关重大。块太小可能丢失关键上下文，块太大则会引入噪声并增加成本。优化分块大小本身就是上下文工程的一项重大工作 [261][268]。

3.2.2 上下文选择与优先级排序

在将信息送入上下文窗口之前，必须进行智能选择。

l 选择性检索： RAG系统通过向量类似度搜索等方法，从海量数据中仅检索出top-k个最相关的文档块，这是最基础的选择机制 [159]。

l 优先级评分： 更高级的系统会对检索到的信息、历史对话、工具定义等所有潜在的上下文元素进行相关性评分和优先级排序，确保最有价值的信息被优先包含在上下文窗口内 [22][32]。

l 动态工具选择： 在Agent应用中，系统可以根据当前任务动态地从大量可用工具中选择一小部分最相关的工具定义放入上下文，而不是一次性全部提供 [31]。

3.3 框架实现：LangChain与LlamaIndex

LangChain和LlamaIndex是目前主流的两个用于构建LLM应用的开源框架，它们都为上下文工程提供了强劲的工具集。

l LlamaIndex： 专注于数据密集型应用的上下文增强，尤其擅长RAG流程中的数据摄取、索引和查询 [82][83]。它提供了丰富的索引结构和高效的检索策略，并支持索引压缩技术以减少存储和加快检索速度 [88][148]。在需要高效处理海量数据、构建复杂查询引擎的场景中，LlamaIndex一般表现更佳 [81][147]。

l LangChain： 更侧重于构建复杂的、由LLM驱动的代理（Agent）和链式（Chain）工作流 [81][90]。它提供了强劲的上下文管理和记忆（Memory）模块，能够轻松维持多轮对话的连贯性 [86]。其
ContextualCompressionRetriever是实现上下文压缩和选择的标准化组件 [209]。

在生产实践中，开发者常将两者结合使用，例如利用LlamaIndex进行高效的数据索引和检索，然后将结果整合到LangChain构建的复杂Agent工作流中 [89][149]。

4. 实践应用与量化性能提升

上下文工程的价值最终体目前其为实际应用带来的可量化性能提升上。

4.1 典型应用案例

l 案例一：智能客服与电商平台

在智能客服系统中，通过上下文工程架构，可以将用户的个人信息、历史购买记录、浏览行为以及知识库中的产品信息和常见问题解答（FAQ）动态整合到上下文中 [46]。这使得AI客服能够提供高度个性化且准确的解答，显著提升了交互效率和用户满意度 [47]。有大型电商平台利用LangChain工具链，成功构建了动态交互的客服系统，取得了良好的性能表现 [47]。

l 案例二：AI编程助手

微软部署的AI代码助手是一个典型的成功案例。通过整合代码库的上下文、相关API文档和用户的编程意图，该助手在任务完成率和代码质量上实现了显著提升，同时错误率减少了65% [113]。

l 案例三：复杂推理任务

在需要多步逻辑推理的领域（如数学竞赛），单纯的LLM往往表现不佳。IBM的研究人员通过为GPT-4.1模型提供一套“认知工具”作为上下文（包括理解问题、回忆信息、检查答案等步骤），使其在AIME 2024数学竞赛基准测试中的pass@1准确率从26.7%飙升至43.3% [102][166][317]。这充分证明了通过结构化上下文引导模型进行复杂推理的有效性。

4.2 可量化的性能指标与成效

上下文工程的有效性可以通过一系列关键绩效指标（KPIs）来衡量 [103][110]：

l 准确率与成功率的大幅提升：

l RAG：在特定任务中，如文本导航，RAG技术被证实可带来18倍的准确率提升，成功率高达94% [41][108][221]。

l 少样本学习：通过在上下文中提供少量高质量示例，代码摘要任务的BLEU-4得分提升了9.90%，错误修复任务的准确率提升了175.96% [41][51][311]。

l 整体改善：领先组织通过系统性的上下文优化，每月可以观察到AI准确性持续提升15-20% [105]。

l 成本、效率与可靠性优化：

l 减少幻觉： 精心设计的上下文工程系统能够将模型的幻觉现象减少40-60% [112]。

l 降低成本： 通过上下文压缩和选择，可以优化Token使用，将API调用成本降低20-30% [112]。

l 提升效率： 在特定应用中，上下文工程被证实可以节省**75-99%**的时间 [104]。

这些数据清晰地表明，上下文工程不仅是理论上的概念，更是能够为企业带来真实、可观业务价值的工程实践。

5. 高级架构：动态信息流与安全集成

随着应用复杂度的提升，上下文工程正在向更动态、更安全的架构演进。

5.1 动态RAG：从静态检索到自适应信息流

传统的RAG系统一般在生成过程开始前进行一次性检索。不过，高级的上下文工程系统正在采用更动态的模式 [128]。

l 自适应/递归RAG（Adaptive/Recursive RAG）： 这类系统允许模型在生成过程中根据需要多次、迭代地进行检索 [132][134]。例如，模型在生成答案时发现信息不足，可以主动触发新的检索查询来补充上下文。这形成了一种动态的信息流，使系统能够处理需要多跳转、多信息源的复杂问题。学界已提出如DRAG (Dynamic Retrieval-Augmented Generation) 和 ARAG (Adaptive Retrieval-Augmented Generation) 等概念框架 [124][128]。

这种动态信息流控制机制是上下文工程走向更高智能水平的关键，它模仿了人类在解决问题时“边想边查”的认知过程。

5.2 安全集成：在RAG中实现信息流控制

当RAG系统接入包含敏感信息的企业内部知识库时，安全性和隐私保护成为首要问题 [231][241]。此时，上下文工程必须与信息流控制（Information Flow Control）机制紧密结合。

l 访问控制集成： 一个生产就绪的RAG架构必须集成严格的访问控制。这一般通过基于角色的访问控制（RBAC）实现 [231][291]。在RAG流程的检索阶段，系统必须验证当前用户是否有权限访问被检索到的文档或数据片段。只有通过权限检查的信息才能被“选择”并注入到LLM的上下文中 [233][293]。

l 上下文感知过滤： 除了基于用户的权限控制，系统还可以实现上下文感知的过滤策略。例如，根据当前对话的敏感级别，动态调整信息检索的范围和输出内容的详尽程度 [291]。

l 输出控制与验证： 在生成最终响应后，需要有一个验证层来检查生成内容是否意外泄露了敏感信息，这构成了信息流的最后一道防线 [291]。

通过将这些安全机制深度集成到RAG的每个环节，可以构建一个既强劲又安全的上下文工程系统，有效防止数据泄露和隐私侵犯 [237][237]。

5.3 生产就绪的实现模式与考量

一个生产就绪的上下文工程系统一般是模块化的，包含数据摄入、索引、检索、上下文构建、LLM推理和后处理等多个阶段 [121][136]。在部署时，需要重点思考可扩展性、延迟、成本和可维护性（即RAGOps）[132][132]。云原生和容器化技术常被用于部署和管理这些复杂的流水线 [236]。

6. 挑战、评估与未来展望

6.1 当前面临的主要挑战

上下文工程虽然前景广阔，但仍面临一系列严峻挑战 [50][51][55]：

l 复杂性与维护成本： 设计和维护一个高效的上下文工程系统超级复杂，需要跨领域的专业知识。

l 评估困难： 衡量上下文工程的有效性极具挑战。传统的NLP指标（如BLEU、ROUGE）无法全面评估系统的推理能力、实际一致性和抗幻觉能力 [104][271][311]。

l 长上下文处理： 尽管模型上下文窗口在增长，但如何高效、无损地利用超长上下文依旧是一个开放的研究问题。

l 多模态融合： 如何将文本、图像、音频、结构化数据等多种模态的信息无缝融合到统一的上下文中，是未来的一个重大方向 [46][50]。

l 安全与伦理： 如何防止上下文注入攻击，以及如何确保系统的公平、无偏见，是必须解决的伦理问题。

6.2 评估方法的演进

鉴于传统评估指标的局限性，社区正在探索新的评估范式 [108][221]。未来的评估方法将更加注重：

l 多维度评估： 不仅评估准确性，还需评估上下文相关性、信息完整性、响应连贯性、实际一致性、鲁棒性等多个维度 [103][104]。

l 反馈驱动评估： 建立包含用户反馈、模型自我批评和A/B测试的闭环评估系统，实现持续的自动化改善 [22][35]。

l 任务导向评估： 评估的核心应是AI系统是否真正协助用户更快、更好地完成了任务，而不是孤立地看某个技术指标 [106]。

6.3 未来发展方向：迈向自适应与自我改善的AI

上下文工程是推动AI从“被动响应”向“主动适应”和“自我改善”演进的关键驱动力 [313][315]。未来的发展将聚焦于：

l 基础理论研究： 建立更完善的数学框架和优化原则来指导上下文的设计与管理 [30][50]。

l 技术创新： 发展更高效的上下文压缩算法、更智能的动态检索策略，以及更强劲的长期记忆管理机制 [169]。

l 自动化与自适应： 构建能够根据任务和用户反馈自动优化其上下文策略的AI系统，例如通过进化算法自动生成和优化提示链与上下文结构 [169]。

结论

上下文工程代表了大模型应用开发领域的一次深刻范式转移。它从系统工程的视角出发，将开发者的关注点从雕琢孤立的“提示”扩展到构建和管理一个完整、动态、高效的“信息生态系统”。通过综合运用检索增强生成（RAG）、上下文压缩、选择性注入和动态信息流控制等一系列技术，上下文工程已经证明其在提升模型准确性、可靠性和效率方面的巨大潜力，并带来了可量化的商业价值。

尽管在实现复杂性、评估方法和安全性等方面仍面临挑战，但随着理论研究的深入、技术工具的成熟以及最佳实践的沉淀，上下文工程无疑将成为释放大型语言模型全部潜能、构建下一代智能应用的核心基石。对于所有致力于开发生产级AI应用的团队而言，掌握并精通上下文工程，将不再是可选项，而是必需的核心竞争力。

大模型应用开发之上下文工程（Context Engineering）原理与应用

参考资料

1. A Survey of Context Engineering for Large Language Models

2. Context Engineering

3. Context Engineering：从Prompt Engineering到上下文工程的演进

4. Tom B. Brown, Benjamin Mann et al. “Language Models are Few-Shot Learners.” ArXiv (2020).

5. 基于 1400+ 论文的 Context Engineering 系统综述

6. Context Engineering(上下文工程)的核心概念与实践方法

7. 在大语言模型（LLM）驱动的 AI Agent 开发时代，数据的组织、存储和应用方式发生了根本性变革。本文将探讨数据库、语料库、知识库和大语言模型应用之间的关系，及 AI Agent 技术架构。

8. Context Engineering应用的目标和手段

9. Colin Raffel, Noam M. Shazeer et al. “Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer.” J. Mach. Learn. Res. (2019).

10. 别再只会写‘一句话指令’！用Context Engineering打造企业级AI应用

11. 大型語言模型上下文工程綜述

12. Context Engineering Concepts

13. The Complete Guide to Context Engineering Framework for Large Language Models

14. Introduction to Context Engineering

15. Context Engineering: The New Frontier in Large Language Model Development

16. Advanced AI techniques in this Week's Chunk

17. Context Engineering について

18. Hugo Touvron, Thibaut Lavril et al. “LLaMA: Open and Efficient Foundation Language Models.” ArXiv (2023).

19. Context Engineering（上下文工程）

20. Context Engineering: Optimizing Enterprise AI

21. Context Engineering Part 2: Advanced Techniques for Using AI in Production

22. Context Engineering for AI Agents: Token Economics and Production Optimization Strategies

23. Agents need context (e.g., instructions, external knowledge, tool feedback) to perform tasks. Context engineering is the art and science of filling the context window with just the right information at each step of an agent’s trajectory.

24. Context Engineering Webinar: Manus + LangChain

25. Context Engineering: How MLOps Accelerates AI Model Deployment

26. Context Engineering for AI Agents: Mastering the Art of Feeding LLMs the Right Information

27. The Rise of Context Engineering: Building the Foundation for Next-Generation AI Agents

28. Context Engineering Guide

29. Context Engineering: Why Most AI Systems Fail

30. A Survey of Context Engineering for Large Language Models

31. Context Engineering in AI: Complete Implementation Guide

32. Context Engineering: Going Beyond Prompt Engineering and RAG

33. Context Engineering with Manus AI Agent

34. The Emergence of Context Engineering as a Critical Discipline in AI Development

35. Context Engineer - Master the Next Evolution Beyond Prompt Engineering

36. Context Engineering

37. Context Engineering: Strategies for Managing AI Agents' Context Windows

38. Prompt Engineering vs. Context Engineering: Crafting AI for Precision

39. Context engineering: Best practices for an emerging discipline

41. 上下文工程：大语言模型的信息管理与系统优化

42. Jacob Devlin, Ming-Wei Chang et al. “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.” North American Chapter of the Association for Computational Linguistics

43. Tom B. Brown, Benjamin Mann et al. “Language Models are Few-Shot Learners.” ArXiv (2020).

44. 上下文工程 Context Engineering

45. Context engineering: Best practices for an emerging discipline

46. 上下文工程：大语言模型信息载荷的系统性设计与优化

47. 上下文工程：提升大型语言模型任务执行效率与准确性的关键路径

48. Hugo Touvron, Thibaut Lavril et al. “LLaMA: Open and Efficient Foundation Language Models.” ArXiv (2023).

49. 上下文工程：系统化优化大语言模型的上下文信息

50. 大语言模型的上下文工程（Context Engineering for Large Language Models）

51. A Survey of Context Engineering for Large Language Models

52. 7 大实战策略：高效提升大语言模型在生产环境中的上下文表现

53. Aakanksha Chowdhery, Sharan Narang et al. “PaLM: Scaling Language Modeling with Pathways.” J. Mach. Learn. Res. (2022).

54. 上下文工程的一些常见策略

55. 大语言模型的上下文工程调研

56. 过去两年，大模型的使用范式正悄悄从「提示词（Prompt）工程」过渡到「上下文（Context）工程」

61. Architecture-Independent Dynamic Information Flow Tracking *

62. A Better Facet of Dynamic Information Flow Control

63. 基于检索增强生成（RAG）的企业知识管理系统构建与实践

64. 信息流控制研究进展

65. From dynamic to static and back: Riding the roller coaster of information-flow control research

66. From Fine- to Coarse-Grained Dynamic Information Flow Control and Back

67. Smart Manufacturing

68. Mandatory access control and information flow control

69. CS 5430 Dynamic Information-Flow Control

70. Practical, Flexible Programming with Information Flow Control

71. 增强生成（RAG）技术：原理、架构与前沿实践

72. 检索增强生成（Retrieval-Augmented Generation, RAG）

73. Marco Vassena, Alejandro Russo et al. “From fine- to coarse-grained dynamic information flow control and back.” Proceedings of the ACM on Programming Languages (2019).

74. Automated optimization under dynamic flow conditions

75. 检索增强生成(RAG)技术正在重塑企业知识管理的方式

81. LlamaIndex and LangChain: A Comparative Analysis of Two Frameworks for Large Language Models

82. LangChain for Life Sciences and Healthcare: Innovation Through LLMs and Generative AI Agents

83. LangChain 在生命科学与医疗保健领域的应用——面向企业 ...

84. LlamaIndex is an open source data orchestration framework for building large language model (LLM) applications

85. Les plateformes LlamaIndex et LangChain facilitent la création et la mise en œuvre des systèmes de génération augmentée par récupération (RAG)

86. What Is The Difference Between LlamaIndex vs LangChain? How To Choose?

87. Context Engineering

88. LlamaIndex vs LangChain：AI原生应用开发工具深度评测与场景化选择指南

89. 开发基于大型模型的应用时，选择合适的应用框架

90. Comparing LlamaIndex vs LangChain: Building Search and Retrieval Applications with Large Language Models

91. Proceedings of the 46th Annual conference of the South African Institute of Computer Scientists and Information Technologists (SAICSIT 2025)

92. 两大RAG框架LangChain与LlamaIndex深度对比，助你选择最适合项目的工具

93. LlamaIndex 和 LangChain 的比较与用例分析

94. Retrieval Augmented Generation: A Simple Introduction

95. Choosing Between LlamaIndex and LangChain

96. Building AI-driven applications with gen AI

97. LlamaIndex and LangChain: Orchestrating LLMs

98. DESARROLLO DE UN SISTEMA DE CLASIFICACIÓN Y BÚSQUEDA DE ARTÍCULOS ACADÉMICOS MEDIANTE CADENAS CON MODELOS DE LENGUAJE Y CONTEXTUALIZACIÓN POR MODELOS RAG

99. LangChain: A Framework for Simplifying LLM App Development

100. LlamaIndex

101. Knowledge Graph-Driven Real-Time Data Engineering for Context-Aware Machine Learning Pipelines

102. A Survey of Context Engineering - July 2025

103. 7. Evaluation Metrics and Benchmarking for Context Engineering Effectiveness

104. Context Engineering

105. Why Context Engineering Matters Now

106. Context Engineering: The Bottleneck to Better Continuous AI

107. Agentic AI and Context Engineering

108. A Survey of Context Engineering for Large Language Models

109. Context Engineering for AI [2025 Guide]

110. Context Engineering for AI Agents: Token Economics and Production Optimization Strategies

111. Context Engineering for AI Agents: Mastering the Art of Feeding LLMs the Right Information

112. Context Engineering and MCP Toolbox: The Hidden Backbone of Modern AI You Must Know

113. Context engineering has become a transformative force in moving from experimental AI demos to robust, production-grade systems across various industries.

114. Benchmarking AI Inference at the Edge: Measuring Performance and Efficiency for Real-World Deployments

115. DSBC : Data Science task Benchmarking with Context engineering

116. ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions

121. Retrieval-Augmented Generation (RAG) System Design and Implementation

122. 深度解析 RAG（Retrieval-Augmented Generation）技术原理

123. Recent Innovations in Cloud-Optimized Retrieval-Augmented Generation Architectures for AI-Driven Decision Systems

124. Dynamic Retrieval-Augmented Generation

125. An advanced retrieval-augmented generation system for manufacturing quality control

126. Bell developed a sophisticated hybrid RAG (Retrieval Augmented Generation) system combining batch and incremental processing to handle both static and dynamic knowledge bases.

127. Ein praktischer Leitfaden zur Implementierung von Generativer KI für Retrieval-Augmented Generation

128. CTRLA: ADAPTIVE RETRIEVAL-AUGMENTED GENERATION VIA INHERENT CONTROL

129. A Practical Blueprint for Implementing Generative AI Retrieval-Augmented Generation

130. Retrieval augmented generation system for dynamic document tagging and query-driven retrieval

131. R²AG: Incorporating Retrieval Information into Retrieval Augmented Generation

132. RAGOps: Operating and Managing Retrieval-Augmented Generation Pipelines

133. GenFlow: Interactive Modular System for Image Generation

134. DioR: Adaptive Cognitive Detection and Contextual Retrieval Optimization for Dynamic Retrieval-Augmented Generation

135. A Retrieval-Augmented Generation (RAG) system combines data retrieval with AI-generated responses

136. RAGFlow Explained: The Ultimate Guide to Next-Gen Retrieval-Augmented Generation

137. Real-Time AI Data Pipelines Using APIs and LLMs

138. Accelerating Retrieval-Augmented Generation

141. LangChain 与 LlamaIndex 深度对比与选型指南

142. LlamaIndex and LangChain: A Comparative Analysis of Two Frameworks for Large Language Models

143. Marathon: A Race Through the Realm of Long Context with Large Language Models

144. LangChain：用开发LLM的通用框架。

145. LlamaIndex is an open source data orchestration framework for building large language model (LLM) applications

146. LangChain 在生命科学与医疗保健领域的应用——面向企业 ...

147. An In-Depth Evaluation of Three Leading LLM Frameworks: LangChain, LlamaIndex, and Haystack

148. LlamaIndex vs LangChain：AI原生应用开发工具深度评测与场景化选择指南

149. 开发基于大型模型的应用时，选择合适的应用框架

150. LlamaIndex and LangChain: Orchestrating LLMs

151. LlamaIndex 与 LangChain：核心定位与生态概览

152. Algoritmos Avançados de GEN AI: LangChain

153. Context Engineering

154. LangChain与LlamaIndex，作为两大AI应用开发框架

155. Building Data-Driven Applications with LlamaIndex

156. How SAP/ERP AI Chatbots Can Boost Your Sales and Customer Satisfaction

157. LlamaIndex 与 LangChain 的最佳用例

158. Evaluate RAG responses with Amazon Bedrock, LlamaIndex and RAGAS

159. 当前 Python 生态中最主流、应用最广泛的两大 Agent 开发框架：LangChain 和 LlamaIndex

160. PROGETTAZIONE E IMPLEMENTAZIONE DI UN CHATBOT INTELLIGENTE TRAMITE PIATTAFORMA LANGCHAIN: STUDIO E VALUTAZIONE DEI VECTORDB

161. THE CONTEXT ENGINEERING APPROACH: A Framework for Information Systems Development

162. A Survey of Context Engineering for Large Language Models

163. Context engineering has become a transformative force in moving from experimental AI demos to robust, production-grade systems across various industries.

164. Reproducible Performance Analysis & Engineering of Large-Scale IT Infrastructures

165. Context Engineering：从Prompt Engineering到上下文工程的演进

166. A Survey of Context Engineering - July 2025

167. Context Engineering：概念与技术实现深度解析

168. Context Engineering

169. 火爆硅谷的Context Engineering

170. 果比AI日报

181. Integrating Retrieval with Generation: A New Approach in Augmented AI

182. Retrieval Augmented Generation (RAG)

183. RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems

184. Ein praktischer Leitfaden zur Implementierung von Generativer KI für Retrieval-Augmented Generation

185. Recent Innovations in Cloud-Optimized Retrieval-Augmented Generation Architectures for AI-Driven Decision Systems

186. Integrating Access Control with Retrieval-Augmented Generation: A Proof of Concept for Managing Sensitive Patient Profiles

187. Production insights from implementing retrieval augmented generation at OpenAI, Docker, Langchain, CircleCI, and more.

188. A Retrieval-Augmented Generation (RAG) system combines data retrieval with AI-generated responses

189. The Retrieval-Augmented Generation (RAG) Pattern

190. Retrieval-Augmented Generation (RAG) is a powerful GenAI implementation pattern that enhances generative models by incorporating corporate information via data retrieval mechanisms without additional model training.

191. Retrieval-augmented generation (RAG) techniques

192. CTRLA: ADAPTIVE RETRIEVAL-AUGMENTED GENERATION VIA INHERENT CONTROL

193. Retrieval Augmented Generation (RAG) Workflows

194. GenFlow: Interactive Modular System for Image Generation

195. Privacy Issues and Privacy-preserving Mechanisms in Retrieval-Augmented Generation Systems

196. Integrating Retrieval-Augmented Generation with Large Language Models for Supply Chain Strategy Optimization

197. R²AG: Incorporating Retrieval Information into Retrieval Augmented Generation

198. Integrate any data with and for LLMs

199. Retrieval Augmented Generation (RAG) – 5 Use Cases

201. LangChain and RAG best practices

202. Contextual compression for RAG-based applications

203. LlamaIndex and LangChain: A Comparative Analysis of Two Frameworks for Large Language Models

204. ContextualCompressionRetriever

205. Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG

206. LangChain性能优化系统设计与实现

207. An In-Depth Evaluation of Three Leading LLM Frameworks: LangChain, LlamaIndex, and Haystack

208. LlamaIndex：a data framework for your LLM applications，especially for RAG

209. 高效信息检索新方法：LangChain中Retriever的多种高级策略

210. How to Optimize LLM Applications With Prompt Compression Using LLMLingua and LangChain

211. LlamaIndex (GPT Index)

212. Contextual compression

213. Building Data-Driven Applications with LlamaIndex

214. Proceedings of the 46th Annual conference of the South African Institute of Computer Scientists and Information Technologists (SAICSIT 2025)

215. PROGETTAZIONE E IMPLEMENTAZIONE DI UN CHATBOT INTELLIGENTE TRAMITE PIATTAFORMA LANGCHAIN: STUDIO E VALUTAZIONE DEI VECTORDB

216. LlamaIndex vs LangChain：AI原生应用开发工具深度评测与场景化选择指南

217. 面向开发者的 LLM 入门课程

218. LangChain 在生命科学与医疗保健领域的应用——面向企业 ...

221. A Survey of Context Engineering for Large Language Models

222. Dylan Pfeifer, A. Gerstlauer. “Expression-Level Parallelism for Distributed Spice Circuit Simulation.” 2011 IEEE/ACM 15th International Symposium on Distributed Simulation and Real Time Applications (2011).

223. Context Engineering

224. 18xсно╥

225. Context Engineering 驱动的新一代AI编程助手生态系统

226. Context Engineering: Beyond Prompt Engineering

227. Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

228. GPS 18x TECHNICAL SPECIFICATIONS

231. Secure Framework for Retrieval-Augmented Generation: Challenges and Solutions

232. Common architecture pattern of RAG systems for enterprise

233. Integrating Access Control with Retrieval-Augmented Generation: A Proof of Concept for Managing Sensitive Patient Profiles

234. Retrieval Augmented Generation (RAG)

235. Privacy Issues and Privacy-preserving Mechanisms in Retrieval-Augmented Generation Systems

236. Recent Innovations in Cloud-Optimized Retrieval-Augmented Generation Architectures for AI-Driven Decision Systems

237. Securing Retrieval-Augmented Generation Pipelines: A Comprehensive Framework

238. Accelerating Retrieval-Augmented Generation

239. Modular RAG: A Revolutionary Approach to Building Retrieval-Augmented Generation Systems

240. RAG Architecture Design Patterns Balancing Retrieval Depth and Generative Coherence

241. Optimizing Retrieval-Augmented Generation with Multi-Agent Hybrid Retrieval

242. Retrieval-augmented generation (RAG) techniques

243. A Retrieval-Augmented Generation (RAG) system combines data retrieval with AI-generated responses

251. LangChain and RAG best practices

252. Integration for ContextCrunch in a LangChain pipeline.

253. Contextual compression for RAG-based applications

254. LlamaIndex and LangChain: A Comparative Analysis of Two Frameworks for Large Language Models

255. ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

256. 高效信息检索新方法：LangChain中Retriever的多种高级策略

257. LangChain性能优化系统设计与实现

258. Proceedings of the 46th Annual conference of the South African Institute of Computer Scientists and Information Technologists (SAICSIT 2025)

259. Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG

260. An In-Depth Evaluation of Three Leading LLM Frameworks: LangChain, LlamaIndex, and Haystack

261. Efficient Chunk Size Optimization for RAG Pipelines with LlamaCloud

262. COMPLLM: COMPRESSION FOR LONG CONTEXT Q&A

263. How to Optimize LLM Applications With Prompt Compression Using LLMLingua and LangChain

264. Lighter And Better: Towards Flexible Context Adaptation For Retrieval Augmented Generation

265. PROGETTAZIONE E IMPLEMENTAZIONE DI UN CHATBOT INTELLIGENTE TRAMITE PIATTAFORMA LANGCHAIN: STUDIO E VALUTAZIONE DEI VECTORDB

266. Provence: efficient and robust context pruning for retrieval-augmented generation

267. 面向开发者的 LLM 入门课程

268. Advanced RAG: Techniques for Better Retrieval and Generation

271. A Survey of Context Engineering for Large Language Models

272. Context-Aware Text Recognition with Enhanced Feature Extraction and Decoding

273. Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

274. 火爆硅谷的Context Engineering

275. Context Engineering

276. What Is Context Engineering? A Guide for AI & LLMs

277. 果比AI日报

278. Haystack Engineering: Context Engineering Meets the Long-Context Challenge in Large Language Models

279. Context engineering has become a transformative force in moving from experimental AI demos to robust, production-grade systems across various industries.

280. A. R. Salama, Özge Alaçam et al. “Text Completion using Context-Integrated Dependency Parsing.” Rep4NLP@ACL (2018).

291. Secure Framework for Retrieval-Augmented Generation: Challenges and Solutions

292. Common architecture pattern of RAG systems for enterprise

293. Integrating Access Control with Retrieval-Augmented Generation: A Proof of Concept for Managing Sensitive Patient Profiles

294. Retrieval Augmented Generation (RAG)

295. Privacy Issues and Privacy-preserving Mechanisms in Retrieval-Augmented Generation Systems

296. Recent Innovations in Cloud-Optimized Retrieval-Augmented Generation Architectures for AI-Driven Decision Systems

297. Accelerating Retrieval-Augmented Generation

298. Modular RAG: A Revolutionary Approach to Building Retrieval-Augmented Generation Systems

299. Understanding the Design Decisions of Retrieval-Augmented Generation Systems

300. Securing Retrieval-Augmented Generation Pipelines: A Comprehensive Framework

301. RAG Architecture Design Patterns Balancing Retrieval Depth and Generative Coherence

302. Retrieval-augmented generation (RAG) techniques

303. A Retrieval-Augmented Generation (RAG) system combines data retrieval with AI-generated responses

311. A Survey of Context Engineering for Large Language Models

312. Context-Aware Text Recognition with Enhanced Feature Extraction and Decoding

313. 火爆硅谷的Context Engineering

314. What Is Context Engineering? A Guide for AI & LLMs

315. Context Engineering

316. 基于 1400+ 论文的 Context Engineering 系统综述

317. 果比AI日报

318. Haystack Engineering: Context Engineering Meets the Long-Context Challenge in Large Language Models

319. Context Engineering: Beyond Prompt Engineering

全部评论(0)

上一篇：这大模型项目你就练吧...一练一个不吱声
下一篇：给大家普及一下大模型transformer面试需要达到的学习强度

真快激活码

店铺

推荐商品