[CONTROLLER-1891] Installation of snapshot on a follower can get stuck indefinitely Created: 30/Apr/19 Updated: 07/May/19 Resolved: 07/May/19 |
|
| Status: | Resolved |
| Project: | controller |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Sodium, Neon SR1, Fluorine SR3 |
| Type: | Bug | Priority: | Medium |
| Reporter: | Tomas Cere | Assignee: | Tomas Cere |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Description |
|
If a follower restarts, crashes, or otherwise goes down after it has received the last snapshot chunk but before it has sent the final InstallSnapshotReply back to the leader, the leader gets stuck believing the snapshot has been fully sent and never attempts to install it again. Once the follower is back up, the leader and follower endlessly exchange unsuccessful AppendEntries messages, and the follower never catches up. |
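
The stall described above can be illustrated with a small, self-contained simulation. This is only a sketch under stated assumptions, not the controller's actual code: the class `SnapshotStallSketch`, the method `roundsUntilCaughtUp`, the flag `restartStalledInstall`, and the index values are all hypothetical. The flag exists purely to contrast the stuck behaviour with a leader that discards its stale install state. The ticket does not say how the issue was actually fixed.

```java
// Hypothetical model of the leader/follower exchange after the follower restarts.
class SnapshotStallSketch {
    // Assumed index covered by the snapshot; the leader's log is trimmed up to here.
    static final long SNAPSHOT_INDEX = 100;

    /**
     * Returns the round in which the follower accepts an append-entries message,
     * or -1 if it never catches up within maxRounds.
     */
    static int roundsUntilCaughtUp(boolean restartStalledInstall, int maxRounds) {
        long followerLastIndex = 10;   // follower state after restart; snapshot was never applied
        boolean installTracked = true; // leader still holds state for the earlier install
        boolean allChunksSent = true;  // the last chunk went out before the follower went down

        for (int round = 1; round <= maxRounds; round++) {
            // The leader can only offer entries that follow the snapshot, so the
            // follower rejects them until it holds SNAPSHOT_INDEX.
            if (followerLastIndex >= SNAPSHOT_INDEX) {
                return round;
            }
            if (installTracked && allChunksSent && restartStalledInstall) {
                // Assumed remedy: drop the stale state and send the snapshot again.
                installTracked = false;
                followerLastIndex = SNAPSHOT_INDEX; // assume the new install succeeds
            }
            // Otherwise the leader keeps waiting for a reply that will never
            // arrive, and the next round repeats the same rejected exchange.
        }
        return -1;
    }

    public static void main(String[] args) {
        System.out.println(roundsUntilCaughtUp(false, 1000)); // prints -1 (stuck)
        System.out.println(roundsUntilCaughtUp(true, 1000));  // prints 2 (caught up)
    }
}
```

With the flag off, every round ends in a rejected exchange, matching the endless back-and-forth reported here. With it on, the follower catches up one round after the snapshot is re-sent.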