Uploaded image for project: 'controller'
  1. controller
  2. CONTROLLER-1589

Current raft implementation seems to be unstable when dynamically adding peers when new nodes come up.

    XMLWordPrintable

Details

    • Bug
    • Status: Resolved
    • Resolution: Cannot Reproduce
    • None
    • None
    • clustering
    • None
    • Operating System: All
      Platform: All

    • 7696

    Description

      Scenario were trying to do:

      We want to have a shard started with replicas on all cluster members(even ones added at any time in the future).

      Currently the raft implementation seems to quite fragile when dynamically adding peers, when you send an AddServer message to the actor on MemberUp/Reachable events the followers seem to always ignore it leading to each peer having different set of peers which leads to never ending elections.
      The current implementation seems to rely either on having hardcoded replicas in the module conf. If we went similar route like the ClusterAdminService addReplica does this also seems to only update peers on the leader which leads to the neverending elections due to different peers for each actor.

      Attachments

        Issue Links

          No reviews matched the request. Check your Options in the drop-down menu of this sections header.

          Activity

            People

              Unassigned Unassigned
              tcere Tomas Cere
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: