[NEUTRON-204] networking-odl gives up on websocket, stops retrying Created: 08/Nov/18 Updated: 10/Jan/19 |
|
| Status: | Open |
| Project: | neutron |
| Component/s: | northbound-api |
| Affects Version/s: | None |
| Fix Version/s: | Fluorine-SR2, Neon |
| Type: | Bug | Priority: | High |
| Reporter: | Jamo Luhrsen | Assignee: | Josh Hershberg |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | csit:3node |
| Remaining Estimate: | Not Specified |
| Time Spent: | Not Specified |
| Original Estimate: | Not Specified |
| Issue Links: |
|
| Description |
|
After bouncing all three nodes of an ODL cluster, websocket registration fails and networking-odl gives up. From that point forward, the netvirt workflow is broken. |
| Comments |
| Comment by Jamo Luhrsen [ 08/Nov/18 ] |
|
Is NEUTRON the right project for this? |
| Comment by Jamo Luhrsen [ 08/Nov/18 ] |
|
copy paste from an email:
I think we are really close with everything, but I want to get to the bottom
Essentially, what's happening is that networking-odl just quits trying to
2018-11-07 19:35:45.116 sERROR networking_odl.common.websocket_client None req-4b2fa066-886c-4c1b-a836-8ea50cafa8ae None None websocket irrecoverable error
which is from here [1]
I think what's happening is that the /restconf call to make the registration
I'm not sure that retrying would help anyway. Something seems
Now, in each of the karaf logs [3][4][5] you can see that our haproxy
So, where is that registration getting lost. The one from [1]. Oh,
That's as far as I got for now. Any ideas to run with?
While I was writing this, one of my oxygen jobs [7] also seems to have
Here's a JIRA for this one:
Thanks,
[0] https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/builder-copy-sandbox-logs/499/jamo-netvirt-csit-3node-0cmb-1ctl-2cmp-openstack-queens-upstream-stateful-fluorine/14 |
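For illustration only, here is a minimal sketch of the kind of retry behaviour being asked about: keep attempting the registration with a capped exponential backoff instead of treating the first failure as irrecoverable. This is not the networking-odl code. `RegistrationError`, `backoff_delays`, `register_with_retry`, and the `register` callable are hypothetical names invented for this example, and it assumes the registration failure surfaces as an exception that the caller can catch.

```python
import itertools
import time


class RegistrationError(Exception):
    """Hypothetical error raised when the stream registration call fails."""


def backoff_delays(initial=1.0, factor=2.0, maximum=60.0):
    """Yield an endless sequence of delays that grows by `factor` up to `maximum`."""
    delay = initial
    while True:
        yield delay
        delay = min(delay * factor, maximum)


def register_with_retry(register, sleep=time.sleep, delays=None, max_attempts=None):
    """Call `register()` until it succeeds and return its result.

    `register` is a hypothetical zero-argument callable standing in for the
    registration request. With `max_attempts=None` the loop never gives up;
    otherwise the last RegistrationError is re-raised after that many attempts.
    """
    delays = backoff_delays() if delays is None else delays
    for attempt in itertools.count(1):
        try:
            return register()
        except RegistrationError:
            if max_attempts is not None and attempt >= max_attempts:
                raise
            # Wait before the next attempt so a restarting cluster is not hammered.
            sleep(next(delays))
```

Whether retrying alone would be enough is an open question in this thread, since the registration itself may be getting lost on the ODL side. The sketch only shows how the client thread could stay alive across a cluster bounce.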
| Comment by Jamo Luhrsen [ 13/Dec/18 ] |
|
jhershbe, did we ever get a patch in networking-odl to not actually die in the thread that |