[CONTROLLER-1478] Switch reconciliation is not happening after leader node restart in 3 node cluster Created: 29/Jan/16 Updated: 19/Oct/17 Resolved: 25/Feb/16 |
|
| Status: | Resolved |
| Project: | controller |
| Component/s: | clustering |
| Affects Version/s: | Beryllium |
| Fix Version/s: | None |
| Type: | Bug | ||
| Reporter: | Anil Gujele | Assignee: | Unassigned |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Environment: |
Operating System: All |
||
| Attachments: |
|
| External issue ID: | 5135 |
| Description |
|
Build used : Test Type : Objective of test : Test Steps : Note: Controllers (to cross-check logs): Enclosed Logs: |
| Comments |
| Comment by Anil Gujele [ 29/Jan/16 ] |
|
Attachment resyncFailed3SwitchLeaderRestart.rar has been added with description: attached logs from c1, c2,c3 node. |
| Comment by Anil Gujele [ 29/Jan/16 ] |
|
In step-8, flow is pushed from node c3. |
| Comment by Luis Gomez [ 31/Jan/16 ] |
|
I can easily reproduce issue in 13 by: 1) Push a normal switch flow (do not connect a switch for now) 2) Kill and recover the inventory shard leader 3) Connect switch to old shard leader - Flow is not programmed |
| Comment by Luis Gomez [ 31/Jan/16 ] |
|
Anil, is this the cluster RPC issue you commented to me? |
| Comment by Anil Vishnoi [ 31/Jan/16 ] |
|
Yes luis. |
| Comment by Anil Gujele [ 02/Feb/16 ] |
|
I have verified this defect with build from latest ODL Berrilium master code, reconciliation is working in this scenario. I see below log messages in other two nodes in every 5 seconds once leader node is down. 2016-02-02 03:51:05,862 | WARN | ds-oper-thread-0 | OperationLimiter | 143 - org.opendaylight.controller.sal-distributed-datastore - 1.3.0.SNAPSHOT | Failed to acquire operation permit for transaction member-3-chn-11-txn-2021 |
| Comment by Ryan Goulding [ 09/Feb/16 ] |
|
So this works? |
| Comment by Anil Vishnoi [ 09/Feb/16 ] |
|
Hi Anil, Can you please test with the latest stable/beryllium, it should be fixed with https://git.opendaylight.org/gerrit/#/c/34115/ Thanks |
| Comment by Anil Gujele [ 11/Feb/16 ] |
|
Hi Anil, I have tested it with your patch and it again happened in second attempt. Thanks |
| Comment by Ryan Goulding [ 16/Feb/16 ] |
|
Is this actually fixed? Is this still a blocker? If this is still causing issues, cna you please reopen. |
| Comment by Tom Pantelis [ 25/Feb/16 ] |
|
I haven't heard anything wrt this still being an issue after Anil's patch. Closing... |