<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Reddit on Tarragon</title><link>https://tarrragon.github.io/blog/backend/08-incident-response/cases/reddit/</link><description>Recent content in Reddit on Tarragon</description><generator>Hugo -- gohugo.io</generator><language>zh-TW</language><copyright>Tarragon (CC BY 4.0)</copyright><lastBuildDate>Fri, 01 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://tarrragon.github.io/blog/backend/08-incident-response/cases/reddit/index.xml" rel="self" type="application/rss+xml"/><item><title>Reddit：2023 Kubernetes 升級事故</title><link>https://tarrragon.github.io/blog/backend/08-incident-response/cases/reddit/2023-kubernetes-upgrade-incident/</link><pubDate>Thu, 07 May 2026 00:00:00 +0000</pubDate><guid>https://tarrragon.github.io/blog/backend/08-incident-response/cases/reddit/2023-kubernetes-upgrade-incident/</guid><description>&lt;p>這起案例的核心責任是把平台升級納入事故流程。升級事件不是純部署問題，會直接影響事件分級、回退與通訊節奏。&lt;/p>
&lt;h2 id="判讀訊號">判讀訊號&lt;/h2>
&lt;table>
 &lt;thead>
 &lt;tr>
 &lt;th>訊號&lt;/th>
 &lt;th>判讀重點&lt;/th>
 &lt;th>回寫章節&lt;/th>
 &lt;/tr>
 &lt;/thead>
 &lt;tbody>
 &lt;tr>
 &lt;td>post-upgrade error burst&lt;/td>
 &lt;td>變更後退化是否快速擴散&lt;/td>
 &lt;td>&lt;a href="https://tarrragon.github.io/blog/backend/08-incident-response/incident-severity-trigger/" data-link-title="8.1 事故分級與啟動條件" data-link-desc="建立統一分級標準與事故啟動門檻">8.1&lt;/a>&lt;/td>
 &lt;/tr>
 &lt;tr>
 &lt;td>rollback decision delay&lt;/td>
 &lt;td>回退決策是否過慢&lt;/td>
 &lt;td>&lt;a href="https://tarrragon.github.io/blog/backend/08-incident-response/incident-decision-log/" data-link-title="8.19 Incident Decision Log" data-link-desc="把事中假設、決策、證據、回退條件與責任人留下可復盤紀錄">8.19&lt;/a>&lt;/td>
 &lt;/tr>
 &lt;tr>
 &lt;td>service recovery slope&lt;/td>
 &lt;td>恢復是否分批收斂&lt;/td>
 &lt;td>&lt;a href="https://tarrragon.github.io/blog/backend/08-incident-response/containment-recovery-strategy/" data-link-title="8.3 止血、降級與回復策略" data-link-desc="把短期止血與正式回復拆成可執行步驟">8.3&lt;/a>&lt;/td>
 &lt;/tr>
 &lt;/tbody>
&lt;/table>
&lt;h2 id="邊界判讀">邊界判讀&lt;/h2>
&lt;p>這個案例的邊界是「平台升級變更」與「事故分級決策」要共用同一套欄位。主要風險是把升級當例行操作，延後回退判斷。&lt;/p>
&lt;h2 id="下一步路由">下一步路由&lt;/h2>
&lt;p>把升級變更與事故決策共用欄位，並在 &lt;a href="https://tarrragon.github.io/blog/backend/06-reliability/release-gate/" data-link-title="6.8 Release Gate 與變更節奏" data-link-desc="把驗證、migration、相容性納入放行判準">6.8&lt;/a> 加入升級專屬 gate。事故收斂後回寫 &lt;a href="https://tarrragon.github.io/blog/backend/08-incident-response/incident-decision-log/" data-link-title="8.19 Incident Decision Log" data-link-desc="把事中假設、決策、證據、回退條件與責任人留下可復盤紀錄">8.19&lt;/a>。&lt;/p></description><content:encoded><![CDATA[<p>這起案例的核心責任是把平台升級納入事故流程。升級事件不是純部署問題，會直接影響事件分級、回退與通訊節奏。</p>
<h2 id="判讀訊號">判讀訊號</h2>
<table>
  <thead>
      <tr>
          <th>訊號</th>
          <th>判讀重點</th>
          <th>回寫章節</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td>post-upgrade error burst</td>
          <td>變更後退化是否快速擴散</td>
          <td><a href="/blog/backend/08-incident-response/incident-severity-trigger/" data-link-title="8.1 事故分級與啟動條件" data-link-desc="建立統一分級標準與事故啟動門檻">8.1</a></td>
      </tr>
      <tr>
          <td>rollback decision delay</td>
          <td>回退決策是否過慢</td>
          <td><a href="/blog/backend/08-incident-response/incident-decision-log/" data-link-title="8.19 Incident Decision Log" data-link-desc="把事中假設、決策、證據、回退條件與責任人留下可復盤紀錄">8.19</a></td>
      </tr>
      <tr>
          <td>service recovery slope</td>
          <td>恢復是否分批收斂</td>
          <td><a href="/blog/backend/08-incident-response/containment-recovery-strategy/" data-link-title="8.3 止血、降級與回復策略" data-link-desc="把短期止血與正式回復拆成可執行步驟">8.3</a></td>
      </tr>
  </tbody>
</table>
<h2 id="邊界判讀">邊界判讀</h2>
<p>這個案例的邊界是「平台升級變更」與「事故分級決策」要共用同一套欄位。主要風險是把升級當例行操作，延後回退判斷。</p>
<h2 id="下一步路由">下一步路由</h2>
<p>把升級變更與事故決策共用欄位，並在 <a href="/blog/backend/06-reliability/release-gate/" data-link-title="6.8 Release Gate 與變更節奏" data-link-desc="把驗證、migration、相容性納入放行判準">6.8</a> 加入升級專屬 gate。事故收斂後回寫 <a href="/blog/backend/08-incident-response/incident-decision-log/" data-link-title="8.19 Incident Decision Log" data-link-desc="把事中假設、決策、證據、回退條件與責任人留下可復盤紀錄">8.19</a>。</p>
]]></content:encoded></item></channel></rss>