[VTN-80] VTN Coordinator returns error in CSIT Created: 16/Jul/15  Updated: 19/Oct/17  Resolved: 07/Oct/15

Status: Resolved
Project: vtn
Component/s: ODC Driver
Affects Version/s: unspecified
Fix Version/s: None

Type: Bug
Reporter: Venkatrangan Govindarajan Assignee: Unassigned
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Operating System: Linux
Platform: All


External issue ID: 4008

 Description   

When the CSIT Test cases are executed in sandbox, It is noted that the VTN Coordiantor returns 503 in some scenarios.

the current order of test cases is as follows

a. add controller
b. wait until the controller status is "up" to ensure that audit is complete

c. Continue to create vtn+vbridge+interface and portmaps to ensure some successful ping test using mininet.

The problem here is when the (c) executes, sometimes Coordinaotr returns 503 causing the tests to fail.

It needs to be analyzed what blocking operation occurs after the status is "up" as AUDIT is complete.



 Comments   
Comment by Karthik Sivasamy [ 06/Aug/15 ]

When we increase waiting time of controller status from 32 to 40s, the issue 503 error code is not reproducible.

Comment by Hideyuki Tai [ 06/Aug/15 ]

(In reply to Karthik Sivasamy from comment #1)
> When we increase waiting time of controller status from 32 to 40s, the issue
> 503 error code is not reproducible.

Did you submit the change to the vtn.git?

Could you add more explanation?

Comment by Karthik Sivasamy [ 07/Aug/15 ]

(In reply to Hideyuki Tai from comment #2)
> (In reply to Karthik Sivasamy from comment #1)
> > When we increase waiting time of controller status from 32 to 40s, the issue
> > 503 error code is not reproducible.
>
> Did you submit the change to the vtn.git?
>
> Could you add more explanation?

> Analysing the issue why VTN coordinator returns 503 when wait time is 32s, as controller status is UP, test case fail in creating VTN configurations i.e, (503 service unavailable) error returns.

Comment by Karthik Sivasamy [ 27/Aug/15 ]

Observation:
1.There is a fraction of delay between the controller status becoming “up” and then AUDIT operation actually completing. Between, next REST API gets executed in this time.
2. This causes higher layer module to fail the commit operation by returning SYSTEM BUSY(503) error as AUDIT is yet to complete with all components.

Workaround:
a.Create Controller with “Audit Disable”
b.Wait for atleast 10sec for controller to become ready for Audit from down state.
c.Manually trigger AUDIT using the AUDIT Controller REST API
d.Then continue the rest of TESTS.

Comment by Vratko Polak [ 27/Aug/15 ]

> b.Wait for atleast 10sec for controller to become ready for Audit from down
state.

Previously it took longer. My brain pattern-matched the change to a change in test setup: value of odcdrv_ping_interval in /usr/local/vtn/modules/odcdriver.conf was changed from 30 (seconds) to 10. As ping itself takes some time, actual test waits up to 12 seconds.

To me, it looks like the system under test waits for a ping to succeed (perhaps two pings separated by the interval?).

As a committer on Integration project, I am willing to accept the new test (which hides the 503 issue) as soon as test case documentation points to this Bug discussion.

Comment by Venkatrangan Govindarajan [ 27/Aug/15 ]

Hi Vratko,

Actually it is a little different. The issue reported here is about the time taken for the "AUDIT" operation to complete. The ping interval was made as 10 seconds for a different scneario. This paramter controls the frequency in which the vtn coordinator tries to chck if the ODL is connected. It is for the "down" state checking when setting wrong IP address. We are checking the same.

Comment by Karthik Sivasamy [ 07/Oct/15 ]

This issue occurs only when testing vtn coordinator in csit.

Occurance of the issue:
1) Add a controller.
2) Check controller status if "UP"
3) Add vtn, vbr (Here it returns error as SYSTEM BUSY).

Reason of the issue:
1.There is a fraction of delay between the controller status becoming “up” and then AUDIT operation actually completing. Between, next REST API gets (VTN and VBR creation)executed in this time.
2. This causes higher layer module to fail the commit operation by returning SYSTEM BUSY(503) error as AUDIT is yet to complete with all components.And it is confirmed it is know behaviour to overcome this we need to do trigger manual audit in the test case after adding controller.

This issue is ressolved by fixing manually trigerring audit in the CSIT test cases.

The patch for this bug is fixed in CSIT test cases of vtn and it is merged in "integration branch". Please find the below link for the same,
https://git.opendaylight.org/gerrit/#/c/24860/

Generated at Wed Feb 07 20:47:58 UTC 2024 using Jira 8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d.