Data topology 是 process content 的第 6 audit 維度

2026-05-19

結論

Process content 的 diff dimension audit 原本 5 維 — schema / operational / paradigm / components / application change — 漏了 data topology 這軸。Topology 是 資料在 cluster / partition / region 之間的分佈拓樸、跟既有 5 維任一個都不對等：

維度	處理對象	對 topology 的關係
Schema / API	資料結構（column / type / index）	不同層、schema 不變 topology 可能變
Operational model	運維 stack（HA / backup / monitoring）	topology 可能影響 ops、但不是同一概念
Paradigm	核心抽象（OLTP / log / pub-sub）	同 paradigm 內 topology 可變
Components	元件數量（1 vs N）	同 component 數可有不同 topology
Application change	application code 改動量	topology 變不必然 application 改
Data topology	slot / shard / partition / region 分佈	本卡新增的第 6 維

Data topology 是 資料分佈 層級的概念 — 跟資料結構（schema）、運維機制（operational）、抽象模型（paradigm）、組件數量（components）、application code 改動量（application change）並列為第 6 軸；topology 變動時其他 5 維可能完全不變、但 資料在 cluster / partition / region 之間的擺放方式 改變、需要獨立的結構處理。

擴 audit 到 6 維、新增 Type F「Topology re-layout」結構對映 topology 高差異 的 process content。

Topology 的 5 個 sub-dimension

不同 source/target 配對對 topology 的影響不同、用 5 sub-dimension 描述具體變化：

Sub-dimension	內容	例
Sharding strategy	Slot / hash / range / consistent hash / key-based	Redis cluster slot 重分配
Partition strategy	Declarative / range / list / hash / sub-partition	PostgreSQL monthly → daily partition
Replication topology	Single primary / multi-master / star / hub-spoke / mesh	Single primary → multi-master 切換、或加 logical replication subscriber
Region distribution	Single / multi-AZ / multi-region / global	Cassandra single DC → multi-DC
Co-location / locality	Locality-aware queries / row-level region pinning	CockroachDB region 強制 row 對應

任一 sub-dimension 變動就構成 topology layout 變動；多個 sub-dimension 同時變更（如「sharding strategy + region distribution 同時改」）是 complex topology migration、結構複雜度高。

為什麼 topology 不能塞進既有 5 維

Reviewer 質疑：為什麼不直接歸進 operational 或 paradigm？三個拒絕理由：

Schema 不變但 topology 變：PostgreSQL partition strategy 改（monthly → daily）— schema 完全相同、partition boundary 重劃；歸 Schema 維度錯位
Operational stack 不變但 topology 變：Redis cluster 加 node 重分 slot — Sentinel / monitoring / backup 不變、純粹是 slot mapping 重劃；歸 Operational 維度太寬
Paradigm 不變但 topology 變：Cassandra 從 single DC 加到 multi-DC — 同 distributed DB paradigm、co-location / replication topology 變；歸 Paradigm 維度誤導
Components 不變但 topology 變：Kafka topic re-partition（10 partitions → 100）— 同 1 個 cluster、partition count 變；歸 Components 維度錯位

Topology 是 獨立的問題軸、5 維 audit 漏掉時會誤判結構。

觸發 Type F 的情境

情境	Topology 變化	是否同 vendor
Cluster re-sharding	Slot / shard 重分配	yes
Partition redesign	Partition boundary / strategy 重劃	yes
Single-region → multi-region	Region distribution + replication topology 雙變	多數 yes（同 vendor 加 region）
Multi-master rollout	Replication topology 從 single primary 變 multi-master	yes
DynamoDB GSI / global tables	Sharding + replication 雙變	yes
Kafka topic re-partitioning	Sharding strategy 變	yes
Cassandra keyspace re-balance	Replication factor（sub-dim 3）+ token range（sub-dim 1）雙變	yes
MongoDB sharded cluster 加 shard	Sharding 重分布	yes

多數 Type F 場景是 同 vendor — 跟 #127 Type A-E 預設「跨 vendor」對應、Type F 是 同 vendor 內 topology 重劃。

6 維 audit decision rule（updated）

擴 audit 到 6 維後、type 對映規則更新：

維度組合	對映 type
Schema = High（其他 Low）	Type A phased rule translation
全 Low	Type B drop-in
Operational = High（其他 Low）	Type C operational redesign hybrid
Components = High	Type D parallel streams
Paradigm = High	Type E partial + 混合架構
Topology = High（其他 Low）	Type F topology re-layout（本卡新增）
多軸 High	按 #127 多重歸類規則

主導維度判讀的優先序也擴張：Schema > Paradigm > Operational > Topology > Components。Topology 在 schema / paradigm / operational 之後、components 之前 — 因為 topology 對讀者 conceptual impact 通常比 components 拆分大、但比 schema / paradigm 小。

Type F「Topology re-layout」結構 anatomy

從 Redis cluster re-sharding 抽出的標準形態：

11. 為什麼 re-layout（4-N 種 driver）
22. 結構 differentiator（re-layout 不是 migration）
33. Pre-layout analysis（current topology audit / hot key / slot 分佈）
44. Re-layout 機制（slot migration / partition split / shard rebalance）
55. Execution flow（per-step、含 rollback boundary）
66. Production 故障演練
77. Capacity / cost
88. 整合 / 下一步

7-9 章節、200-260 行。三個 新元素 是 Type F 的核心承擔：

Pre-layout analysis 段：在執行前列出當前 topology（slot 分佈 / hot key / replica lag / partition imbalance）、決定 re-layout 的範圍跟順序；缺這段、後續執行階段沒 baseline 可比、failure 偵測延遲
Re-layout 機制段：解釋 vendor 的 slot migration / partition split / shard rebalance protocol —讀者要理解 vendor 內部機制才能預估 latency / locking / atomicity 邊界
Execution flow per-step + rollback boundary：跟 Type A 的 phased 對照、Type F per-step 粒度更細（單 slot migration vs 整個 phase）、每 step 都要明示 能否回退、回退時資料狀態

跟 Type B 對照、Type F 多了「topology audit」段、Step-by-step 比 Type B 細（per-step 不是 per-cutover）；跟 Type A phased 對照、Type F 多數情境不需要 schema translation / parallel run / cleanup phase（source / target 同 cluster）；但 multi-region rollout 子情境例外、仍需 parallel run（兩 region 同跑後切流量）— 此時 Type F + Type A parallel run 段組合應用、見「多重歸類」規則。

注意 anatomy 列 8 row 是 規範形態、不是強制機械對映 — 實作上「結構 differentiator」+「pre-layout analysis」段可 inline 到開頭 audit 段（如 Redis cluster re-sharding 的「Source = Target，但 topology 重劃」段內聯處理）、實作 H2 數可能比 anatomy 列 row 少 1-2 個。

Production 反模式

反模式	後果
把 re-sharding 套 Type B drop-in	漏掉 slot migration 機制段、cluster busy 跟 stale client cache 沒被處理
把 multi-region rollout 套 Type C	漏掉 locality-aware queries 跟 replication topology 設計
Topology 變化只列在「容量」段	讀者把 topology 當 capacity 子議題、忽略結構影響
多 sub-dimension 同時變、只寫一個	例：Cassandra 加 DC 同時改 replication factor、只寫前者
Type F 套錯場景（topology 沒變的 migration）	強迫 phased per-step、phase 空白

跟其他抽象層原則的關係

原則	關係
#127 Process content 結構由最大差異維度決定	父卡 — 本卡擴 #127 的 audit 框架從 5 維到 6 維、新增 Type F；#127 的 5 type 仍適用、本卡加第 6 type
#125 Collapse 是隱形預設	同骨 — 5 維 audit 漏 topology 是「結構分類 collapse 掉 topology 軸」、是 #125 在 audit dimension surface 的子實例
#118 Standard-driven vs case-driven domain judgment	Sibling — 兩卡都是寫作前的 domain audit、#118 判 case-driven vs standard-driven、本卡判 topology 是否需要 Type F
#122 Cadence 同質化是模板的隱形維度	同骨 — 模板有「內容欄位 / cadence」兩維度（#122）vs audit 有「6 維 / topology」兩 layer；都是「初始框架漏軸、用實證浮現補位」

判讀徵兆

訊號	該做的事
寫到一半發現 5 維 audit 都 Low、但內容跟 Type B drop-in 不一樣	Topology 可能是漏掉的維度、補 6 維 audit
「容量規劃」段比實作段還複雜	Topology 變動被誤歸 capacity、應該獨立段
Sharding / partition / region 任一變動	跑 topology audit、評估是否 Type F
同 vendor 內升級 / re-layout	大概率不是 5 type、檢查 topology 是否變
Type B 結構寫不下實際內容	可能是 Type F 而非 Type B
多個 sub-dimension 同時變	Complex topology migration、結構複雜度 +1 階

核心：5 維 audit 漏 topology 是初始框架的盲點；topology 是 資料分佈 而非 資料結構 / 元件 / 抽象、需要獨立 audit 軸。Type F「Topology re-layout」對映 topology = High 的 process content、跟 Type A-E 並列；多軸 High 配對按 #127 多重歸類規則處理。

Self-aware limitation：本卡的 6 個未解結構性質疑

第二輪 4-reviewer audit 揭露 6 項結構性 issue、本卡選擇 meta-acknowledgment（記錄）而非 substantive restructure（重寫）— 跟 #127 self-aware limitation spirit 一致：

6 維仍可能漏類：reviewer 提 identity / authorization / consistency / transactional / data residency 三軸候選；本卡確認 6 維是 current best understanding、不是窮盡；下一輪 batch 跑前優先驗證這些候選軸是否真的獨立
Type F 跟 Type B 結構重疊度高：anatomy 8 row 中 6 row 跟 Type B 對齊、實質差異在「pre-layout analysis + re-layout 機制」兩段；可能下次 evolution 是 Type B 的 variant 而非並列 type；保留現狀因為「同 cluster」邊界對讀者區分有用
「不需要 parallel run」claim 部分不成立：multi-region rollout 子情境仍需 parallel run（兩 region 同跑然後切流量）— anatomy 已加註此例外、跟「多重歸類」規則組合應用
主導維度優先序是 audience-dependent heuristic：DBA 視角 Topology 可能 > Operational、application developer 視角 Schema > Paradigm；當前 Schema > Paradigm > Operational > Topology > Components 預設是「跨 audience 平均」、非 universal；reviewer 識別此 stipulation 性質
「topology 不能塞進既有 5 維」拒絕理由的窄定義依賴：4 個拒絕點都靠 narrow 既有 5 維定義成立；換個合理定義（如「component = 任何 cluster-internal primitive、包含 partition」）topology 跟 components 邊界會 collapse；保留現狀因為當前定義對寫作判讀有用
既有 5 篇 playbook 沒 retroactive audit：6 維框架 retroactively 對既有 Type A-E 文章未重審；Splunk → Elastic / Datadog → Grafana / Postgres → Aurora 按 6 維可能變 multi-axis；這是已知 silent grandfathering、不是清白「擴張」

下一輪 batch trigger：

寫 1-2 篇 Type F dogfood 驗證 anatomy 通用性（Cassandra re-balance / PG partition redesign 是候選）
若浮現 Type F 跟 Type B 結構真同構、考慮降級為 variant
若浮現 identity / consistency / residency 真的獨立軸、再擴 audit 到 7 維
既有 5 篇 retroactive audit 在累積到 10+ migration playbook 後做、單獨成 retrospective report

Update（2026-05-19 第三輪 migration batch 後）：4 條 tripwire 全驗證

第三輪 migration batch（5 篇）執行了上述 4 條 trigger、各自結果：

Tripwire 預測	第三輪結果
Type F dogfood × 2 驗證 anatomy 通用性	完成：PG partition redesign + MongoDB shard+multi-DC；anatomy 在 PG / MongoDB 上仍適用、跟 Redis re-sharding 對齊
Type F vs Type B 結構同構驗證	部分浮現：PG partition / Redis re-sharding 不需 parallel run、MongoDB multi-DC 需要；建議 Type F 拆 F-cluster（單 cluster 內、不需 parallel run）+ F-multi-region（跨 region、需 parallel run）兩 sub-type、未來累積更多 case 後 commit
Identity / consistency / residency 三軸候選驗證	三軸各 1 case 驗證、工作量分佈支持獨立軸：Vault → AWS Secrets Manager（identity、45% 工作量）/ DynamoDB consistency（consistency、85% 工作量）/ PG GDPR multi-region（residency、40% 工作量）；累積到 3-5 case / 軸後 commit 升 7-9 維 audit
既有 5 篇 retroactive audit	暫不執行、累積到 10+ migration playbook 後再做（當前共 10 篇 migration、剛達 trigger threshold、留下輪 retrospective 處理）

3 軸候選驗證 detail：

Identity axis：Vault → AWS Secrets Manager 45% 工作量在 identity model 對位（Vault token vs IAM principal）、不歸 schema / operational / application change；驗證 identity 可獨立發生 + 帶獨立工作量
Consistency axis：DynamoDB strong → eventual 85% 工作量在 per-call-site contract review、不歸 paradigm / application change；驗證 consistency 可獨立發生 + 帶獨立工作量
Residency axis：GDPR multi-region 40% 工作量在 compliance（DPIA / evidence collection / DPO sign-off）、reverse-constrain topology + operational + application；驗證 residency 不只是 driver、是 cross-cutting constraint

新浮現議題（不在原 tripwire 內）：

Residency 是 cross-cutting constraint vs 獨立軸：reviewer 把 residency 歸為 driver、實證上是 cross-cutting constraint — 反向約束其他維度 + 帶獨立合規工作量；可能需要 constraint layer 概念跟 axis 並列
Type F sub-type 浮現：multi-region rollout 跟 cluster re-sharding 是不同 sub-type；前者需 parallel run、後者不需；anatomy 在 sub-type 之間有差異

#report #事後檢討 #工程方法論 #原則 #抽象層 #content-design #process-writing #audit-dimension