[NETVIRT-512] CSIT Sporadic failures - tempest.scenario.test_network_advanced_server_ops.TestNetworkAdvancedServerOps.test_server_connectivity_stop_start Created: 06/Mar/17 Updated: 27/Nov/19 |
|
| Status: | Confirmed |
| Project: | netvirt |
| Component/s: | General |
| Affects Version/s: | Oxygen |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Medium |
| Reporter: | Jamo Luhrsen | Assignee: | Srinivas Rachakonda |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Environment: |
Operating System: All |
||
| Attachments: |
|
| External issue ID: | 7909 |
| Description |
|
|
| Comments |
| Comment by Jamo Luhrsen [ 06/Jul/17 ] |
| Comment by Jamo Luhrsen [ 18/Jul/17 ] |
| Comment by Jamo Luhrsen [ 27/Sep/17 ] |
|
no longer seen in CSIT |
| Comment by Jamo Luhrsen [ 07/Mar/18 ] |
|
This is back now, and we need to track it. Starting it out as an Oxygen Blocker since it was introduced recently and very consistent. |
| Comment by Jamo Luhrsen [ 07/Mar/18 ] |
|
it's consistent in queens as well as pike. Pike is easier to debug because the openstack services have their logs some clues:
|
| Comment by Daniel Farrell [ 08/Mar/18 ] |
|
k.faseela - Any update on this? Is someone actively working on it? Friendly reminder that we need to get this resolved ASAP as it's blocking the imminent Oxygen release. |
| Comment by Faseela K [ 08/Mar/18 ] |
|
Jamo, the new failure is not anything on the flow programming side, rather the VM is not coming up, right? I don't think I am the right person to debug this, some openstack expertise will be needed. Vishal or Vivekanandan might be able to help? |
| Comment by Faseela K [ 08/Mar/18 ] |
|
shague jluhrsen : Had a chat with Vishal, he is telling this needs to be looked upon by someone who has nova/neutron expertise. He was suggesting to check with yamahata |
| Comment by Daniel Farrell [ 09/Mar/18 ] |
|
Has someone started digging into this? yamahata? Maybe jhershbe could help? |
| Comment by Jamo Luhrsen [ 10/Mar/18 ] |
|
I've dug around a lot, looking for something obvious, but not really finding anything blatant. There is a difference in some of the nova logs, when I compare a passing job to a failing job. in the passing job (timestamp Feb 08 07:03:54) you will see that the instance had the lifecycle I'm trying to think of ways to rule out OpenDaylight as the culprit, so we can move this from
|
| Comment by Jamo Luhrsen [ 11/Mar/18 ] |
|
I was wondering if this could somehow be a bug outside of ODL, maybe in the openstack side of things. However, I did run Oxygen with the Ocata branch of openstack and the bug did not show up. log here I'm not sure what to make of this yet. |
| Comment by Josh Hershberg [ 12/Mar/18 ] |
|
So I also suspect that this might be an openstack issue or even a libvirt issue. When I looked at this last week I noticed this test was failing on the start (unpause) of the VM after it was stopped. Turns out, neutron and ODL are not involved in that process at all. |
| Comment by Daniel Farrell [ 12/Mar/18 ] |
|
jluhrsen - Is pushing on this hard, looking like it's an OpenStack bug so far, will update by EOD PT. |
| Comment by Jamo Luhrsen [ 13/Mar/18 ] |
It took me all day to get my local setup working, and did not get this specific test to run yet. I'll hopefully get some time in my |
| Comment by Sam Hague [ 13/Mar/18 ] |
|
There are earlier jobs with the failure back to 2/8/18:
|
| Comment by Jamo Luhrsen [ 14/Mar/18 ] |
|
This is not a bug with OpenDaylight itself. I have removed this from blocker, but leaving the bug open for now It appears this patch in nova is what introduced the failure. However, it's possible (and theorized by Mohamed Naser) that We've also noticed that non-voting tempest tests that run against networking-odl also started failing around the same time |
| Comment by Sam Hague [ 05/Apr/18 ] |
|
https://review.openstack.org/553035 was merged to revert the original nova patch that was causing the problem. running new gates to reenable the test and see if it is fixed. |
| Comment by Sam Hague [ 10/Apr/18 ] |
|
New patch to fix original issue in nova: https://review.openstack.org/558001 |
| Comment by Abhinav Gupta [ 25/Nov/19 ] |
|
any update here? |
| Comment by Jamo Luhrsen [ 25/Nov/19 ] |
|
I'm not keeping track of netvirt jobs any more. Can we get someone to go through the tempest jobs and see if this test case is still failing sporadically or not? actually it's not that hard to figure out. here is a link for the sodium job that shows that it does still fail sporadically. |