PostgreSQL Replication Slot Management：Physical / Logical / Failover Slot 治理

2026-05-19

本文是 PostgreSQL overview 的 implementation-layer deep article。Overview 已說明 PG 在 OLTP 譜系的定位、本文聚焦 replication slot management — physical / logical / failover slot 三類治理。

Replication Slot 兩大類

PG 兩種 replication slot：

Physical Replication Slot

對應 streaming replication（physical WAL byte-level）：

1SELECT pg_create_physical_replication_slot('standby1_slot');

用於：

Streaming replication standby（Replication Topology）
pg_basebackup 用 slot 防 WAL 清理
高 lag standby 防 WAL premature deletion

Logical Replication Slot

對應 logical replication / logical decoding：

1SELECT pg_create_logical_replication_slot('my_slot', 'pgoutput');
2-- 或用 wal2json plugin
3SELECT pg_create_logical_replication_slot('debezium_slot', 'wal2json');

用於：

PG-to-PG logical replication（publication / subscription）
CDC（Debezium / Maxwell / pg_logical_emitter）
Multi-master replication（BDR / pgEdge / Spock）

logical slot 跟 physical slot 共存、各自獨立 retention。

Slot Lifecycle

1建立 → active（有 consumer）→ inactive（consumer 失聯）→ drop
2                                    ↓
3                              WAL 持續累積（直到推進 LSN 或 drop）

狀態查詢：

1SELECT slot_name,
2       slot_type,
3       active,
4       restart_lsn,
5       confirmed_flush_lsn,
6       pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) AS retained_wal
7FROM pg_replication_slots;

關鍵欄位：

slot_type：physical / logical
active：true / false（consumer 是否連著）
restart_lsn：slot 起點 LSN、primary 必須保留這以後的 WAL
confirmed_flush_lsn：logical slot 已 confirm flush 的 LSN
retained_wal：當前因 slot 累積的 WAL

Failover Slot Synchronization (PG 17+)

PG 17 之前的痛點：logical replication slot 是 primary 上的 state、failover 後 新 primary 沒這個 slot、CDC consumer 失聯、需要重建（大工程）。

PG 17 加 failover slot synchronization：

 1-- PG 17+：標 slot 為 failover-tracked
 2-- signature: pg_create_logical_replication_slot(slot_name, plugin, temporary, two_phase, failover)
 3SELECT pg_create_logical_replication_slot('my_slot', 'pgoutput', false, false, true);
 4--                                                                          ↑
 5--                                                                     failover=true（第 5 個參數）
 6-- 注意：第 4 個參數是 two_phase（這裡 false）、第 5 個才是 failover
 7
 8-- Standby 上 enable sync_replication_slots
 9ALTER SYSTEM SET sync_replication_slots = on;
10SELECT pg_reload_conf();

sync_replication_slots = on 後、physical replication 同步 slot state 到 standby。Failover promote standby 後、logical slot 仍可用、CDC consumer 重連即可。

PG 17 之前用 pgEdge / pglogical 等 extension 提供類似功能、現在 PG core 內建。

Orphan Slot 治理

active = false 的 slot 持續累積 WAL、disk 爆是 PG production 經典事故。

監控 orphan slot

1-- 找 inactive 太久的 slot
2SELECT slot_name, active, restart_lsn,
3       pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) AS retained_wal
4FROM pg_replication_slots
5WHERE NOT active
6  AND pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn) > 1024 * 1024 * 1024;  -- > 1 GB

自動 invalidate slot（PG 13+）

1-- postgresql.conf
2ALTER SYSTEM SET max_slot_wal_keep_size = '50GB';  -- slot 累積 > 50GB 自動 invalidate

當 slot 累積 WAL 超過 max_slot_wal_keep_size、PG 自動 invalidate slot（active=false 且不再保留 WAL）。Consumer 重連會 fail、必須重建（base backup + new slot）。

這是 trade-off：

設 limit → 保護 disk、但 consumer 失聯 → 大重建工作
不設 limit → consumer 失聯 OK、但 disk 爆

實務多數設 max_slot_wal_keep_size 給 disk capacity 50%、避免徹底 disk full。

手動 drop orphan slot

1-- 確認 slot 真的不需要
2SELECT * FROM pg_replication_slots WHERE slot_name = 'old_standby_slot';
3
4-- Drop
5SELECT pg_drop_replication_slot('old_standby_slot');

DR runbook 必須包含 standby 退役流程：先 standby fence、再 primary drop slot。

5 個 Production 踩雷

1. Orphan slot disk 爆

最經典 PG 事故：standby decomission 沒 drop slot、primary 持續保留 WAL、pg_wal/ 累積到 disk full、primary 也掛。

修法：

監控 pg_replication_slots + pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) retained_wal
設 max_slot_wal_keep_size（PG 13+）— hard limit
Standby 退役 runbook 強制 先 fence、再 drop slot
Cron job 自動 alert orphan slot

2. Logical slot lag — CDC consumer 跟不上

Logical decoding 比 physical replication 慢（per-transaction logical event 重組）。CDC consumer（Debezium）跟不上 → slot lag 累積。

修法：

監控 pg_replication_slots.confirmed_flush_lsn 跟 primary pg_current_wal_lsn() 對比
CDC consumer 性能調整（throughput / batch size）
Throttle source writes（如果不能升 consumer）
對 hot table 拆 publication / subscription、避免單 slot 處理所有變更

詳見 Logical Replication + Debezium。

3. Failover 後 logical slot 丟（PG 16 之前）

PG 16 之前、failover promote standby、新 primary 沒有原 logical slot。CDC consumer 試連、ERROR: replication slot "xxx" does not exist。

修法（PG 17+）：

用 failover slot synchronization（如上）
pg_create_logical_replication_slot(..., failover := true)
Standby sync_replication_slots = on

修法（PG 16-）：

用 pglogical 或 pgEdge extension
Failover runbook 包含 新 primary 重建 logical slot（CDC consumer 重 snapshot）
Pre-create slot on standby + manual sync（早期 workaround）

4. `wal_keep_size` 跟 slot 衝突

wal_keep_size（PG 13+）/ wal_keep_segments（< 13）跟 slot 都會保留 WAL：

wal_keep_size：固定 minimum WAL 保留量
Slot：動態保留直到 consumer 推進

兩者一起 set 時：實際保留 WAL = max(wal_keep_size, slot 需要的量)。

修法：

wal_keep_size 設小（如 1-2 GB）作 minimum backup
主要靠 slot 動態保留 — 給 active consumer
監控 pg_wal/ 大小 + 拆解 retention source（wal_keep_size vs slot 各佔多少）

5. Slot 數量上限

max_replication_slots 預設 10、不夠時新 slot 建不出來、報錯。

修法：

Production 大 cluster 設 max_replication_slots = 50 或更多
對 standby + logical replication + CDC consumer 同時跑、計算需要的 slot 數
監控 SELECT count(*) FROM pg_replication_slots 接近 limit 時告警

Slot Naming Convention

Production 大 cluster 多 slot、命名 convention 重要：

1<consumer-type>_<consumer-name>_<purpose>
2例：
3- physical_standby1_replication
4- physical_standby2_replication
5- logical_debezium_orders_cdc
6- logical_pgedge_node2_subscription
7- physical_pgbasebackup_temp（base backup 用、completed 後 drop）

清楚命名讓 看 slot 名 就知道用途、誰負責、能不能 drop。

跟其他模組整合

Replication Topology：physical slot 給 streaming replication 用
Logical Replication + Debezium：logical slot 給 CDC
BDR / Multi-Master：multi-master 大量用 logical slot
PITR + WAL Archiving：WAL archive 跟 slot 是兩種 WAL retention 機制、可並行

監控 metric

Production 持續監控：

pg_replication_slots.active — 失聯 slot
pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn) — slot 累積 WAL
pg_replication_slots.confirmed_flush_lsn vs pg_current_wal_lsn() — logical slot lag
pg_ls_waldir() 看 pg_wal/ 目錄大小
count(*) FROM pg_replication_slots 對 max_replication_slots 比例

把這些丟進 Datadog / Prometheus + alert。

Tarragon

PostgreSQL Replication Slot Management：Physical / Logical / Failover Slot 治理

Replication Slot 兩大類

Physical Replication Slot

Logical Replication Slot

Slot Lifecycle

Failover Slot Synchronization (PG 17+)

Orphan Slot 治理

監控 orphan slot

自動 invalidate slot（PG 13+）

手動 drop orphan slot

5 個 Production 踩雷

1. Orphan slot disk 爆

2. Logical slot lag — CDC consumer 跟不上

3. Failover 後 logical slot 丟（PG 16 之前）

4. `wal_keep_size` 跟 slot 衝突

5. Slot 數量上限

Slot Naming Convention

跟其他模組整合

監控 metric

相關連結

PostgreSQL Replication Slot Management：Physical / Logical / Failover Slot 治理

Replication Slot 兩大類

Physical Replication Slot

Logical Replication Slot

Slot Lifecycle

Failover Slot Synchronization (PG 17+)

Orphan Slot 治理

監控 orphan slot

自動 invalidate slot（PG 13+）

手動 drop orphan slot

5 個 Production 踩雷

1. Orphan slot disk 爆

2. Logical slot lag — CDC consumer 跟不上

3. Failover 後 logical slot 丟（PG 16 之前）

4. wal_keep_size 跟 slot 衝突

5. Slot 數量上限

Slot Naming Convention

跟其他模組整合

監控 metric

相關連結

4. `wal_keep_size` 跟 slot 衝突