[BGPCEP-442] BGP Route injection fails after controller restart Created: 18/Apr/16 Updated: 03/Mar/19 Resolved: 15/Aug/16 |
|
| Status: | Resolved |
| Project: | bgpcep |
| Component/s: | BGP |
| Affects Version/s: | Bugzilla Migration |
| Fix Version/s: | Bugzilla Migration |
| Type: | Bug | ||
| Reporter: | Ajay Chhabria | Assignee: | Claudio David Gasparini |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Environment: |
Operating System: All |
||
| Attachments: |
|
| External issue ID: | 5742 |
| Description |
|
SYMPTOM: BGP route injection fails through REST after all the 3 nodes clustered controllers are restarted. Please refer the attached text files for error messages. CONDITIONS: 3-node clustering environment. WORK-AROUND: Re-install the controllers and inject routes. |
| Comments |
| Comment by Ajay Chhabria [ 18/Apr/16 ] |
|
Attachment Delete_from_datastore_and_re-inject.txt has been added with description: Delete from Datastore and re-inject |
| Comment by Ajay Chhabria [ 18/Apr/16 ] |
|
Attachment First_Route_injection_after_restart.txt has been added with description: First route injection after restart |
| Comment by Milos Fabian [ 19/Apr/16 ] |
|
Ajay, are there any exceptions/errors while closing the controllers? Could you please share full log file from all three nodes? |
| Comment by Ajay Chhabria [ 21/Apr/16 ] |
|
Attachment karaf_controller_1.log has been added with description: Karaf Logs on the first node |
| Comment by Ajay Chhabria [ 21/Apr/16 ] |
|
Attachment karaf_controller_2.log has been added with description: Karaf Logs on the second node |
| Comment by Ajay Chhabria [ 21/Apr/16 ] |
|
Attachment karaf_controller_3.log has been added with description: Karaf Logs on the third node |
| Comment by Milos Fabian [ 21/Apr/16 ] |
|
Those shutdown errors looks familiar - https://bugs.opendaylight.org/show_bug.cgi?id=4366 |
| Comment by Milos Fabian [ 25/Apr/16 ] |
|
I was able to reproduce the bug with one node ODL setup. |
| Comment by Claudio David Gasparini [ 14/Jul/16 ] |
|
No able to reproduce with Be SR2 or master branch. Steps done : I followed the next step trying to replicate the BUG, without success. Ajay, Could you try it with latest version and confirm if you still able to replicate. And attach the steps you did. Thank you |
| Comment by Ajay Chhabria [ 14/Jul/16 ] |
|
Hi Claudio, I have seen this issue in a 3-node clustering environment with BGP_RIB shard replicated across all the controller nodes. Following are the steps I did to reproduce it: 1. Got 3-node cluster UP. 2. Inject a BGP route. 3. Restart all the 3 controller nodes. 4. Inject the route again. You could try the same steps with 1 node controller also. Let me know if you need any further information regarding the same. Thanks, |
| Comment by Claudio David Gasparini [ 20/Jul/16 ] |
|
The issue is not reproducible in 1 node controller. Issue is seen when application peers goes up after restart. 2016-07-20 11:47:38,654 | WARN | ult-dispatcher-6 | ConcurrentDOMDataBroker | 174 - org.opendaylight.controller.sal-distributed-datastore - 1.3.3.SNAPSHOT | Tx: DOM-CHAIN-35-0 Error during phase CAN_COMMIT, starting Abort |
| Comment by Milos Fabian [ 25/Jul/16 ] |
|
stable/beryllium: https://git.opendaylight.org/gerrit/#/c/42295/ |
| Comment by Claudio David Gasparini [ 25/Jul/16 ] |
| Comment by Milos Fabian [ 29/Jul/16 ] |
|
Re-opening as the bug fix patch was reverted. |
| Comment by Claudio David Gasparini [ 15/Aug/16 ] |