Agent

"Agent"

2026-05-14 Agent-as-Tool 把一個專責 agent 包成可被另一個 agent 呼叫的 tool，形成跨 agent 的責任邊界
2026-05-14 Beyond LLM: Enhancing LLM Applications (Stanford CS230) Stanford CS230 Deep Learning 講座整理：從 prompt engineering、fine-tuning、RAG 到 agentic workflow、evals、multi-agent system 的全景 survey。保留英文原文。
2026-05-14 Case Study：customer support agent 從 task decomposition 到 eval 把模組四原理串成端到端案例：observe → decompose → design workflow → instrument trace → design eval → iterate。每段標出引用哪章。
2026-05-14 Context Drift Agent 長任務中累積上下文逐步偏離原始目標，導致後續行動看似合理但整體跑偏
2026-05-14 Goal Drift Agent 把子目標誤當成整體目標，提早停止或朝錯誤完成條件前進的失敗模式
2026-05-14 Multi-agent system 多個 LLM agent 協作的系統、跟 multi-call workflow 的差異在控制流跟責任邊界、三種拓樸 flat / hierarchical / agent-as-tool
2026-05-14 Tool Result Misread Agent 誤讀工具輸出，把錯誤、空結果或部分成功當成成功，導致後續步驟建立在錯誤狀態上
2026-05-12 Agent Loop LLM agent 自我循環的工作流：LLM 規劃下一步、執行 tool、看結果、再規劃下一步、直到任務完成或停止條件觸發
2026-05-12 Agent Memory Agent 在 context window 之外管理長期狀態的設計、五個層次：working / short-term / long-term episodic / semantic / procedural
2026-05-12 Tool Use LLM 透過結構化呼叫外部工具（讀檔、查資料庫、發 API request）來擴展能力的設計、function calling 跟 MCP 是常見實作
2026-05-12 6.2 tool use 與 MCP server 的權限模型個人 dev 場景下 tool use / MCP server 的副作用權限：檔案系統 / shell / 網路存取邊界、第三方 MCP 信任、副作用的可逆性
2026-05-11 4.4 Agent 架構原理 Agent loop 結構、失敗模式、什麼任務適合 vs 不適合、跟人類審查的協作模型
2026-05-14 4.8 Multi-Agent 拓樸：flat / hierarchical / agent-as-tool 從 multi-call workflow 走到 multi-agent system 的判讀、flat vs hierarchical 拓樸、agent-as-tool 的 MCP 視角、specialization 跟 orchestration overhead 的取捨
2026-06-22 Datadog 成本治理與 Agent 配置說明 Datadog 的計價模型、custom metrics 成本控制、Agent 部署配置與常見故障模式
2026-05-12 4.19 Agent memory 分層架構 Agent 在 context window 之外管理長期狀態的設計：working / short-term / long-term episodic / semantic / procedural 五個層次、寫入時機、retrieval 設計、失敗模式
2026-05-18 Background Agent 平行研究：main context 節省的量化效應用 background agent 平行做同類研究任務、主 context 只收 finding summary、節省 ~80% context 用量的工作方法
2026-05-12 LLM Agent Prompt Injection 後果治理 production LLM agent 場景的 prompt injection 後果：tool spec 設計、agent loop 限制、review checkpoint、跟 incident workflow 的接合