Tag: Redpanda

Issues related to the Redpanda streaming data platform

ID | Title | Description | Category | Technology | Tags
CRE-2025-0080
High
Impact: 0/10
Mitigation: 9/10
Title: Redpanda High Severity Issues
Description: Detects when Redpanda hits any of these on startup or early runtime:
1. Fails to create its crash_reports directory (POSIX error 13).
2. Heartbeat or node-status RPC failures indicating a broker is down.
3. Raft group failure.
4. Data center failure.
Category: Data Streaming Platforms
Technology: redpanda
Tags: Redpanda, Startup Failure, Permission Failure, RPC, Raft, Node Down, Cluster Degradation, Data Availability, Database Corruption
CRE-2025-0091
Critical
Impact: 10/10
Mitigation: 7/10
Title: Redpanda Consumer Mass Disconnect → Coordinator Failure
Description: Detects high-severity failure when mass consumer disconnections overwhelm Redpanda's group coordinator.
- Multiple consumers simultaneously leave consumer groups
- Coordinator becomes unresponsive (NodeNotReadyError)
- MemberIdRequiredError indicates coordinator state corruption
- Can lead to complete message processing halt
Category: Distributed Messaging Connectivity Issues
Technology: consumer
Tags: Redpanda, Consumer Groups, Coordinator Failure, Mass Disconnect, Kafka Compatibility, Message Processing Halt
CRE-2025-0092
High
Impact: 0/10
Mitigation: 9/10
Title: Redpanda Quorum Loss
Description: Detects when a Redpanda node becomes isolated (heartbeats fail) and triggers a Raft re-election, indicating quorum loss.
Category: Redpanda High Availability
Technology: redpanda
Tags: Redpanda, Raft, Quorum, Leader Election
CRE-2025-0099
High
Impact: 8/10
Mitigation: 7/10
Title: Redpanda Crash Due to Memory Exhaustion and Startup Failures
Description: Redpanda streaming platform crashes due to a combination of system-level failures including permission denied errors for performance monitoring subsystems, missing critical configuration files, and memory allocation failures.
Category: Data Streaming Platforms
Technology: application-logs
Tags: Redpanda, Container Crash, Memory Exhaustion, Configuration Failure, Streaming Platform, Kafka Compatible, Permission Denied, SIGKILL
CRE-2025-0102
High
Impact: 0/10
Mitigation: 0/10
Title: Redpanda Cluster Critical Failure - Node Loss, Quorum Lost, and Data Availability Impacted
Description:
- The Redpanda streaming data platform is experiencing a severe, cascading failure.
- This typically involves critical errors on one or more nodes (e.g., storage failures), leading to nodes becoming unresponsive or shutting down.
- Subsequently, this can cause loss of controller quorum, leadership election problems for partitions, and a significant degradation in overall cluster health and data availability.
Category: Redpanda Problems
Technology: redpanda
Tags: Redpanda, Streaming Data, Cluster Failure, Node Down, Quorum Loss, Data Availability, Errors, Distributed System
CRE-2025-0107
Medium
Impact: 3/10
Mitigation: 6/10
Title: Redpanda Node Missing State Files on Startup
Description: Detects when a Redpanda node starts up but cannot find key state files, such as the key-value store snapshot or configuration cache. This is normal behavior for a brand-new node starting for the first time but can indicate a problem (like a cleared or misconfigured volume) if it occurs on an existing node that is expected to have state.
Category: Redpanda Problems
Technology: redpanda
Tags: Redpanda, Redpanda Startup, Redpanda State Missing, Snapshot
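
Two of the entries above (CRE-2025-0092 and CRE-2025-0102) describe Raft quorum loss. As a rough illustration of the majority rule behind that term, the sketch below computes how many voters a Raft group needs to make progress and how many failures it can absorb. The function names are hypothetical and are not part of Redpanda or of the CRE rules; the sketch assumes a plain majority quorum over a fixed set of voters.

```python
def quorum_size(voters: int) -> int:
    """Smallest number of voters that forms a strict majority."""
    if voters < 1:
        raise ValueError("a Raft group needs at least one voter")
    return voters // 2 + 1


def has_quorum(voters: int, reachable: int) -> bool:
    """True when the reachable voters (counting the local node) form a majority."""
    return reachable >= quorum_size(voters)


def tolerated_failures(voters: int) -> int:
    """Number of voters that can fail while a majority still remains."""
    return voters - quorum_size(voters)


if __name__ == "__main__":
    # Hypothetical cluster sizes, chosen only for illustration.
    for n in (1, 3, 5):
        print(n, quorum_size(n), tolerated_failures(n))
```

Under this rule a three-voter group keeps quorum with one voter down but loses it once a node can reach only itself, which is the isolated-node situation the Quorum Loss entry describes.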