Tag: Replication
Replication failures, lag, or divergence in stateful systems.
| ID | Title | Description | Category | Technology | Tags |
|---|---|---|---|---|---|
| CRE-2025-0020 High Impact: 10/10 Mitigation: 6/10 | Self-hosted PostgreSQL HA: WAL Streaming & HA Controller Crisis (Replication Slot Loss, Disk Full, Etcd Quorum Failure) | Detects high-severity failures in self-hosted PostgreSQL high-availability clusters managed by Patroni, Zalando, or similar HA controllers. This rule targets catastrophic conditions that break replication or cluster consensus: - WAL streaming failures due to missing replication slots (usually after disk full or crash events) - Persistent errors resolving HA controller endpoints (etcd/consul) and loss of HA controller quorum - Disk saturation leading to WAL write errors and replication breakage | PostgreSQL High Availability | postgresql | High AvailabilityPatroniZalandoEtcdReplicationWALStorageQuorumCrashData LossTimeout |
| CRE-2025-0070 Critical Impact: 10/10 Mitigation: 6/10 | Kafka Under-Replicated Partitions Crisis | Critical Kafka cluster degradation detected: Multiple partitions have lost replicas due to broker failure, resulting in an under-replicated state. This pattern indicates a broker has become unavailable, causing partition leadership changes and In-Sync Replica (ISR) shrinkage across multiple topics. | Message Queue Problems | kafka | KafkaReplicationData LossHigh AvailabilityBroker FailureCluster Degradation |
| CRE-2025-0140 Medium Impact: 6/10 Mitigation: 5/10 | Supabase Self-Hosted: Realtime Service Crash Due to Invalid Configuration | Detects when Supabase Realtime service fails to start or crashes due to invalid configuration parameters. This affects WebSocket connections, real-time subscriptions, and live data streaming capabilities. Common issues include invalid replication modes, missing database permissions, or incorrect environment variables. | Realtime Problems | realtime | SupabaseRealtimeConfigurationReplicationConnectionSelf-HostedConfiguration FailurePublic |
| CRE-2025-0175 Critical Impact: 8/10 Mitigation: 6/10 | Redis Master-Replica Synchronization Failure | Detects failures in Redis master-replica synchronization including broken replication links, sync timeouts, and full resync loops. These issues compromise data consistency and high availability in Redis deployments. | In-Memory Database Problems | redis | RedisReplicationMaster-ReplicaSyncPartial Sync |
| CRE-2025-0178 Critical Impact: 5/10 Mitigation: 9/10 | Redis Read-Only Replica Write Attempt Error | Detects attempts to perform write operations on Redis read-only replicas. This error indicates application misconfiguration where clients are incorrectly routing write commands to replica instances instead of the master. | In-Memory Database Problems | redis | RedisREADONLYReplicaReplicationWrite Error |