<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 20:22:28 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[NETVIRT-788] CSIT Sporadic failures - ElanService suite - openstack instances failing to get dhcp address for more than 3m</title>
                <link>https://jira.opendaylight.org/browse/NETVIRT-788</link>
                <project id="10144" key="NETVIRT">netvirt</project>
                    <description>&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/336/log.html.gz#s1-s6-s1-k1-k2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/336/log.html.gz#s1-s6-s1-k1-k2&lt;/a&gt;&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="20709">NETVIRT-788</key>
            <summary>CSIT Sporadic failures - ElanService suite - openstack instances failing to get dhcp address for more than 3m</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="k.faseela">Faseela K</assignee>
                                    <reporter username="jluhrsen">Jamo Luhrsen</reporter>
                        <labels>
                    </labels>
                <created>Mon, 17 Jul 2017 18:00:56 +0000</created>
                <updated>Mon, 30 Oct 2017 19:52:46 +0000</updated>
                            <resolved>Wed, 9 Aug 2017 01:47:04 +0000</resolved>
                                    <version>Carbon</version>
                                                    <component>General</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                                                                <comments>
                            <comment id="38122" author="jluhrsen" created="Mon, 17 Jul 2017 18:06:46 +0000"  >&lt;p&gt;some discussion here:&lt;br/&gt;
&lt;a href=&quot;https://lists.opendaylight.org/pipermail/netvirt-dev/2017-July/005001.html&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://lists.opendaylight.org/pipermail/netvirt-dev/2017-July/005001.html&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;but the gist of this bug is that it&apos;s coming periodically and when we hit&lt;br/&gt;
it, the openstack instances that are created don&apos;t get their dhcp ip address&lt;br/&gt;
in at least 3m &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;, which is too long. In a passing case example &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; it still&lt;br/&gt;
is taking over 2m which may be considered too long as well. Potentially we&lt;br/&gt;
have some performance issue in programming flows in time?&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/336/log.html.gz#s1-s6-s1-k1-k2-k3-k12&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/336/log.html.gz#s1-s6-s1-k1-k2-k3-k12&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-ocata-upstream-learn-carbon/67/log.html.gz#s1-s6-s1-k1-k2-k3-k12&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-ocata-upstream-learn-carbon/67/log.html.gz#s1-s6-s1-k1-k2-k3-k12&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38123" author="thapar" created="Tue, 18 Jul 2017 05:42:12 +0000"  >&lt;p&gt;Details of analysis captured in &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; and this looks like OpenStack issue.&lt;/p&gt;

&lt;p&gt;tl;dr version: &lt;br/&gt;
The test fails coz &apos;nameserver&apos; is not configured, which comes from Metadata service. VMs get IP fom DHCP, but not ssh keys, nameserver etc. Metadata proxy is not sending information about the VMs. &lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://lists.opendaylight.org/pipermail/netvirt-dev/2017-July/005001.html&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://lists.opendaylight.org/pipermail/netvirt-dev/2017-July/005001.html&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38124" author="jluhrsen" created="Tue, 18 Jul 2017 22:44:23 +0000"  >&lt;p&gt;for reference, this is an email exchange with some debugging info:&lt;br/&gt;
-------------------------------------------------------------------&lt;/p&gt;


&lt;p&gt;Vishal,&lt;/p&gt;

&lt;p&gt;I might be confused, but I think we had two issues last week, and the Elan Suite&lt;br/&gt;
issue is not the metadata issue where we got the dhcp lease but not the nameserver.&lt;br/&gt;
I think that was just a nitro issue in a gate job. again, high chance that I&apos;m&lt;br/&gt;
just confused &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.opendaylight.org/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/p&gt;

&lt;p&gt;but, check this link &lt;span class=&quot;error&quot;&gt;&amp;#91;i&amp;#93;&lt;/span&gt;. That&apos;s the console log of a VM in the elan suite after&lt;br/&gt;
3m of polling for it&apos;s dhcp lease. It looks like dhcp just failed.&lt;/p&gt;

&lt;p&gt;snippet:&lt;/p&gt;

&lt;p&gt;Starting network... &lt;br/&gt;
udhcpc (v1.20.1) started &lt;br/&gt;
Sending discover... &lt;br/&gt;
Sending discover... &lt;br/&gt;
Sending discover... &lt;br/&gt;
Usage: /sbin/cirros-dhcpc &amp;lt;up|down&amp;gt; &lt;br/&gt;
No lease, failing &lt;br/&gt;
WARN: /etc/rc3.d/S40-network failed &lt;br/&gt;
cirros-ds &apos;net&apos; up at 187.58 &lt;br/&gt;
checking &lt;a href=&quot;http://169.254.169.254/2009-04-04/instance-id&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://169.254.169.254/2009-04-04/instance-id&lt;/a&gt; &lt;br/&gt;
failed 1/20: up 187.79. request failed&lt;/p&gt;


&lt;p&gt;Also, I spent some time trying to look for clues in the various logs. I didn&apos;t&lt;br/&gt;
find anything in the q-* logs which includes the metadata log. The karaf log&lt;br/&gt;
has some interesting things. I&apos;m comparing with a job that passed this Elan&lt;br/&gt;
Suite Setup (#335 whereas &lt;span class=&quot;error&quot;&gt;&amp;#91;i&amp;#93;&lt;/span&gt; is #337). The failing case has a lot of these&lt;br/&gt;
retry and &apos;timed out&apos; logs around ACL rule programming it seems. stuff like&lt;br/&gt;
this:&lt;/p&gt;

&lt;p&gt;LockManager                      | 318 - org.opendaylight.genius.lockmanager-impl - 0.2.2.SNAPSHOT | Waiting for the lock acl.flow.priorities.pool.154761518883256.243.PERMITTCP_DESTINATION_1_0 is timed out. retrying again&lt;/p&gt;

&lt;p&gt;LockManager                      | 318 - org.opendaylight.genius.lockmanager-impl - 0.2.2.SNAPSHOT | Already locked for acl.flow.priorities.pool.154761518883256.243.PERMITTCP_DESTINATION_1_0 after waiting 1000ms, try 12&lt;/p&gt;

&lt;p&gt;that general pattern happens a few times once the Elan suite starts, and I didn&apos;t&lt;br/&gt;
see it in the passing case.&lt;/p&gt;

&lt;p&gt;Also, I saw a bunch of the mdsalutil-api threads getting terminated in a row due to&lt;br/&gt;
some NPE, which was not there in the passing case. example:&lt;/p&gt;

&lt;p&gt;ERROR | eChangeHandler-1 | AsyncDataTreeChangeListenerBase  | 292 - org.opendaylight.genius.mdsalutil-api - 0.2.2.SNAPSHOT | Thread terminated due to uncaught exception: AsyncDataTreeChangeListenerBase-DataTreeChangeHandler-1&lt;br/&gt;
java.lang.NullPointerException&lt;/p&gt;



&lt;p&gt;Having said that, I want to point out that the passing case has it&apos;s own set&lt;br/&gt;
(a lot) of exceptions ERRORS that are occuring during this Elan Suite Setup,&lt;br/&gt;
and since it passes it makes me doubt that the errors listed above are very&lt;br/&gt;
helpful.&lt;/p&gt;

&lt;p&gt;I&apos;ll pasted this thread in to the BZ in case others are following there.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
JamO&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;i&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/337/log.html.gz#s1-s6-s1-k1-k2-k3-k12-k1-k2-k2-k15&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/337/log.html.gz#s1-s6-s1-k1-k2-k3-k12-k1-k2-k2-k15&lt;/a&gt;&lt;/p&gt;



&lt;p&gt;On 07/17/2017 10:47 PM, Vishal Thapar wrote:&lt;br/&gt;
&amp;gt; Will take a look at it and see if it is perf degradation. I updated &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-788&quot; title=&quot;CSIT Sporadic failures - ElanService suite - openstack instances failing to get dhcp address for more than 3m&quot; class=&quot;issue-link&quot; data-issue-key=&quot;NETVIRT-788&quot;&gt;&lt;del&gt;NETVIRT-788&lt;/del&gt;&lt;/a&gt; with analysis.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 8859 is an OpenStack issue as of now, metadata service to be more specific. So would appreciate help from folks more familiar with it as am clueless when it comes to metadata service.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Regards,&lt;br/&gt;
&amp;gt; Vishal.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; ----&lt;del&gt;Original Message&lt;/del&gt;----&lt;br/&gt;
&amp;gt; From: Jamo Luhrsen &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;mailto:jluhrsen@gmail.com&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;jluhrsen@gmail.com&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.opendaylight.org/images/icons/mail_small.gif&quot; height=&quot;12&quot; width=&quot;13&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; &lt;br/&gt;
&amp;gt; Sent: 18 July 2017 11:04&lt;br/&gt;
&amp;gt; To: Vishal Thapar &amp;lt;vishal.thapar@ericsson.com&amp;gt;; netvirt-dev@lists.opendaylight.org&lt;br/&gt;
&amp;gt; Subject: Re: &lt;span class=&quot;error&quot;&gt;&amp;#91;netvirt-dev&amp;#93;&lt;/span&gt; netvirt daily 1node CSIT report (7/17)&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Vishal,&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; every job has this &lt;span class=&quot;error&quot;&gt;&amp;#91;a&amp;#93;&lt;/span&gt; which shows runtime for the full job. Also, we can drill down to each test case to see it&apos;s duration, like this one &lt;span class=&quot;error&quot;&gt;&amp;#91;b&amp;#93;&lt;/span&gt; that is checking that VMs get IP addresses in our L2 connectivity suite. That&apos;s interested, actually, as it seems that around build 814 (5 days ago) that test case had an increase in run time from aprox 1:20 to 1:50 (30s increase). not sure what to make of that...&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Thanks,&lt;br/&gt;
&amp;gt; JamO&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;a&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/user/jluhrsen/my-views/view/netvirt%20csit/job/netvirt-csit-1node-openstack-newton-nodl-v2-upstream-stateful-carbon/buildTimeTrend&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/user/jluhrsen/my-views/view/netvirt%20csit/job/netvirt-csit-1node-openstack-newton-nodl-v2-upstream-stateful-carbon/buildTimeTrend&lt;/a&gt;&lt;br/&gt;
&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;b&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/user/jluhrsen/my-views/view/netvirt%20csit/job/netvirt-csit-1node-openstack-newton-nodl-v2-upstream-stateful-carbon/839/robot/netvirt-1node-openstack.txt/Connectivity/01%20L2%20Tests/Check%20Vm%20Instances%20Have%20Ip%20Address/durationGraph?maxBuildsToShow=0&amp;amp;hd=true&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/user/jluhrsen/my-views/view/netvirt%20csit/job/netvirt-csit-1node-openstack-newton-nodl-v2-upstream-stateful-carbon/839/robot/netvirt-1node-openstack.txt/Connectivity/01%20L2%20Tests/Check%20Vm%20Instances%20Have%20Ip%20Address/durationGraph?maxBuildsToShow=0&amp;amp;hd=true&lt;/a&gt;&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; On 07/17/2017 10:11 PM, Vishal Thapar wrote:&lt;br/&gt;
&amp;gt;&amp;gt; Jamo will do.&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; One question, we recently had a few changes to fix CSIT exceptions. I expected a knock-on impact where things move a bit faster but couldn&apos;t figure out how. Is it possible to have a dashboard/graph for test runtimes, similar to fail/success one?&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; Regards,&lt;br/&gt;
&amp;gt;&amp;gt; Vishal.&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; ----&lt;del&gt;Original Message&lt;/del&gt;----&lt;br/&gt;
&amp;gt;&amp;gt; From: netvirt-dev-bounces@lists.opendaylight.org &lt;br/&gt;
&amp;gt;&amp;gt; &lt;span class=&quot;nobr&quot;&gt;&lt;a href=&quot;mailto:netvirt-dev-bounces@lists.opendaylight.org&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;netvirt-dev-bounces@lists.opendaylight.org&lt;sup&gt;&lt;img class=&quot;rendericon&quot; src=&quot;https://jira.opendaylight.org/images/icons/mail_small.gif&quot; height=&quot;12&quot; width=&quot;13&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/sup&gt;&lt;/a&gt;&lt;/span&gt; On Behalf Of Jamo &lt;br/&gt;
&amp;gt;&amp;gt; Luhrsen&lt;br/&gt;
&amp;gt;&amp;gt; Sent: 17 July 2017 23:41&lt;br/&gt;
&amp;gt;&amp;gt; To: netvirt-dev@lists.opendaylight.org&lt;br/&gt;
&amp;gt;&amp;gt; Subject: &lt;span class=&quot;error&quot;&gt;&amp;#91;netvirt-dev&amp;#93;&lt;/span&gt; netvirt daily 1node CSIT report (7/17)&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; 75% of our jobs passed 100% in their last run, which is as good as I&apos;ve seen it in a long time.&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; for today:&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; opened 8859&lt;br/&gt;
&amp;gt;&amp;gt; re-opened 7885&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; - &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-506&quot; title=&quot;CSIT Sporadic failures - tempest.scenario.test_port_security_macspoofing_port&quot; class=&quot;issue-link&quot; data-issue-key=&quot;NETVIRT-506&quot;&gt;&lt;del&gt;NETVIRT-506&lt;/del&gt;&lt;/a&gt; - test_port_security_macspoofing_port&lt;br/&gt;
&amp;gt;&amp;gt;    &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; - &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-520&quot; title=&quot;CSIT Sporadic failures - Flow(s) missing in VPNService suite on compute node&quot; class=&quot;issue-link&quot; data-issue-key=&quot;NETVIRT-520&quot;&gt;&lt;del&gt;NETVIRT-520&lt;/del&gt;&lt;/a&gt; - Flow(s) missing in VPNService suite on compute node&lt;br/&gt;
&amp;gt;&amp;gt;    &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; - &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-788&quot; title=&quot;CSIT Sporadic failures - ElanService suite - openstack instances failing to get dhcp address for more than 3m&quot; class=&quot;issue-link&quot; data-issue-key=&quot;NETVIRT-788&quot;&gt;&lt;del&gt;NETVIRT-788&lt;/del&gt;&lt;/a&gt; - ElanService suite - openstack instances failing to get dhcp address for more than 3m&lt;br/&gt;
&amp;gt;&amp;gt;    &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt;&amp;gt;       * for this one, it&apos;s not coming every time obviously but I am wondering if by fixing&lt;br/&gt;
&amp;gt;&amp;gt;         it we&apos;ll also be fixing other random failures. Even in the passing case it&apos;s taking&lt;br/&gt;
&amp;gt;&amp;gt;         2+ minutes to get ips, which is irritatingly long (as a user). Maybe something is&lt;br/&gt;
&amp;gt;&amp;gt;         bogged down internally and if we figure that out other random CSIT issues could go&lt;br/&gt;
&amp;gt;&amp;gt;         away.&lt;br/&gt;
&amp;gt;&amp;gt;       @Vishal, I know you did more debugging on this last week. I put a link to one email&lt;br/&gt;
&amp;gt;&amp;gt;                thread in the bug. Can you update the bug with any other details you know?&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; - &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-780&quot; title=&quot;CSIT Sporadic failures - v6 instance connectivity failures in vpnservice suite&quot; class=&quot;issue-link&quot; data-issue-key=&quot;NETVIRT-780&quot;&gt;&lt;del&gt;NETVIRT-780&lt;/del&gt;&lt;/a&gt; - v6 instance connectivity failures in vpnservice suite&lt;br/&gt;
&amp;gt;&amp;gt;    * all &lt;del&gt;learn&lt;/del&gt; jobs&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; Thanks,&lt;br/&gt;
&amp;gt;&amp;gt; JamO&lt;br/&gt;
&amp;gt;&amp;gt;&lt;br/&gt;
&amp;gt;&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;&lt;br/&gt;
&amp;gt;&amp;gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-ope&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-ope&lt;/a&gt;&lt;br/&gt;
&amp;gt;&amp;gt; nstack-newton-nodl-v2-upstream-learn-carbon/333/log.html.gz&lt;br/&gt;
&amp;gt;&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;br/&gt;
&amp;gt;&amp;gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-ope&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-ope&lt;/a&gt;&lt;br/&gt;
&amp;gt;&amp;gt; nstack-newton-upstream-learn-carbon/336/log.html.gz&lt;br/&gt;
&amp;gt;&amp;gt; _______________________________________________&lt;br/&gt;
&amp;gt;&amp;gt; netvirt-dev mailing list&lt;br/&gt;
&amp;gt;&amp;gt; netvirt-dev@lists.opendaylight.org&lt;br/&gt;
&amp;gt;&amp;gt; &lt;a href=&quot;https://lists.opendaylight.org/mailman/listinfo/netvirt-dev&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://lists.opendaylight.org/mailman/listinfo/netvirt-dev&lt;/a&gt;&lt;br/&gt;
&amp;gt;&amp;gt;&lt;/p&gt;</comment>
                            <comment id="38125" author="jluhrsen" created="Sun, 23 Jul 2017 22:50:46 +0000"  >&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-carbon/467/log.html.gz#s1-s6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-carbon/467/log.html.gz#s1-s6&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38126" author="jluhrsen" created="Mon, 24 Jul 2017 19:21:05 +0000"  >&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/342/log.html.gz#s1-s6-s1-k1-k2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-learn-carbon/342/log.html.gz#s1-s6-s1-k1-k2&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38127" author="jluhrsen" created="Wed, 26 Jul 2017 23:17:04 +0000"  >&lt;p&gt;here is a new example now that we have extra DEBUG logs introduced and&lt;br/&gt;
enabled:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-ocata-upstream-stateful-carbon/76/log.html.gz#s1-s6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-ocata-upstream-stateful-carbon/76/log.html.gz#s1-s6&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;attached is the karaf.log from the start of the ELAN suite, but the above&lt;br/&gt;
logs link can be used to get to the full karaf.log if needed&lt;/p&gt;</comment>
                            <comment id="38137" author="jluhrsen" created="Wed, 26 Jul 2017 23:18:14 +0000"  >&lt;p&gt;Attachment odl1_karaf.log.gz has been added with description: karaf log from the start of the Elan suite&lt;/p&gt;</comment>
                            <comment id="38128" author="jluhrsen" created="Wed, 26 Jul 2017 23:30:45 +0000"  >&lt;p&gt;(In reply to Jamo Luhrsen from comment #6)&lt;br/&gt;
&amp;gt; here is a new example now that we have extra DEBUG logs introduced and&lt;br/&gt;
&amp;gt; enabled:&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-&lt;/a&gt;&lt;br/&gt;
&amp;gt; ocata-upstream-stateful-carbon/76/log.html.gz#s1-s6&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; attached is the karaf.log from the start of the ELAN suite, but the above&lt;br/&gt;
&amp;gt; logs link can be used to get to the full karaf.log if needed&lt;/p&gt;

&lt;p&gt;Ignore this please. The Elan Suite failed because of a mistake I had&lt;br/&gt;
in the code to disable DEBUG logging. I have a patch to fix that.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/60794/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/60794/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38129" author="jluhrsen" created="Fri, 28 Jul 2017 23:54:29 +0000"  >&lt;p&gt;I think we now have an example to look at after our better debugs have&lt;br/&gt;
been enabled in CSIT and the controller.&lt;/p&gt;

&lt;p&gt;robot log:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/125/log.html.gz#s1-s6-s1-k1-k2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/125/log.html.gz#s1-s6-s1-k1-k2&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;karaf log:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/125/odl1_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/125/odl1_karaf.log.gz&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;you can see a lot of those WARNing messages about waiting for locks&lt;br/&gt;
and retrying after the Elan Suite starts. If you grep for ROBOT|DEBUG in&lt;br/&gt;
the karaf log you will see a bunch of flow removal lines, but only a &lt;br/&gt;
single install message in the timeframe of the Elan Suite. I am stil &lt;br/&gt;
wondering if we are stuck (locked and retrying) so nothing gets set up.&lt;/p&gt;</comment>
                            <comment id="38130" author="jluhrsen" created="Thu, 3 Aug 2017 23:41:39 +0000"  >&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-carbon/478/log.html.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-carbon/478/log.html.gz&lt;/a&gt;&lt;/p&gt;


&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/130/log.html.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/netvirt-csit-1node-openstack-newton-upstream-stateful-snat-conntrack-carbon/130/log.html.gz&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38131" author="faseela.k@ericsson.com" created="Mon, 7 Aug 2017 17:37:21 +0000"  >&lt;p&gt;Fixing wrong table 200 flows with the below patch :&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/61197/6&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/61197/6&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="38132" author="faseela.k@ericsson.com" created="Mon, 7 Aug 2017 17:37:51 +0000"  >&lt;ul&gt;
	&lt;li&gt;table 220&lt;/li&gt;
&lt;/ul&gt;
</comment>
                            <comment id="38133" author="shague@redhat.com" created="Tue, 8 Aug 2017 14:40:28 +0000"  >&lt;p&gt;There are multiple aspects to this. It can be any of the following, but might not be always due to the same issue, but you can let me know if any of the following symptoms are seen :&lt;br/&gt;
Any issues in Table 17 or Table 220 programming. For example last time when Manu debugged, the flows in Table 220 were something like below :&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=127.462s, table=220, n_packets=28, n_bytes=2096, priority=6,reg6=0x8200 actions=load:0x70008200-&amp;gt;NXM_NX_REG6[],write_metadata:0x138f000000/0xfffffffffe,goto_table:241&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;cookie=0x8000007, duration=127.523s, table=220, n_packets=0, n_bytes=0, priority=9,reg6=0x90008200 actions=output:42&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=126.910s, table=241, n_packets=1, n_bytes=370, priority=63010,udp,reg6=0x8200/0xfffff00,tp_src=67,tp_dst=68 actions=resubmit(,220)&lt;br/&gt;
For the same interface-tag -&amp;gt; 8200, the higher priority service is loading service-index=7, but the next priority service is matching for service-index=9, which will never match. Flows in Table 17 and Table 220 works on this service-binding logic, that for the same interface-tag,&lt;br/&gt;
There will be &#8220;n&#8221; entries in the tables where n = no of services bound on the interface. And the highest priority service will set the next priority in service-index before sending the packet to the service pipeline, so that on a resubmit the packet matches the next service-index in Table 17/220.&lt;br/&gt;
2.       ConflictingModificationException for urn:opendaylight:inventory?revision=2013-08-19)nodes/node/node where the flow-id has &#8220;DPNID:TABLE-ID:INTERFACE-NAME:SERVICE-INDEX&#8221; format, where TABLE-ID = 17 or 220&lt;br/&gt;
ConflcitingModificationException for urn:opendaylight:genius:bound-services-state&lt;/p&gt;</comment>
                            <comment id="38134" author="shague@redhat.com" created="Tue, 8 Aug 2017 14:42:02 +0000"  >&lt;p&gt;There are multiple aspects to this. It can be any of the following, but might not be always due to the same issue, but you can let me know if any of the following symptoms are seen :&lt;/p&gt;

&lt;p&gt;1. Any issues in Table 17 or Table 220 programming. For example last time when Manu debugged, the flows in Table 220 were something like below :&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=127.462s, table=220, n_packets=28, n_bytes=2096, priority=6,reg6=0x8200 actions=load:0x70008200-&amp;gt;NXM_NX_REG6[],write_metadata:0x138f000000/0xfffffffffe,goto_table:241&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;cookie=0x8000007, duration=127.523s, table=220, n_packets=0, n_bytes=0, priority=9,reg6=0x90008200 actions=output:42&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=126.910s, table=241, n_packets=1, n_bytes=370, priority=63010,udp,reg6=0x8200/0xfffff00,tp_src=67,tp_dst=68 actions=resubmit(,220)&lt;br/&gt;
For the same interface-tag -&amp;gt; 8200, the higher priority service is loading service-index=7, but the next priority service is matching for service-index=9, which will never match. Flows in Table 17 and Table 220 works on this service-binding logic, that for the same interface-tag,&lt;br/&gt;
There will be &#8220;n&#8221; entries in the tables where n = no of services bound on the interface. And the highest priority service will set the next priority in service-index before sending the packet to the service pipeline, so that on a resubmit the packet matches the next service-index in Table 17/220.&lt;/p&gt;

&lt;p&gt;2. ConflictingModificationException for urn:opendaylight:inventory?revision=2013-08-19)nodes/node/node where the flow-id has &#8220;DPNID:TABLE-ID:INTERFACE-NAME:SERVICE-INDEX&#8221; format, where TABLE-ID = 17 or 220&lt;/p&gt;

&lt;p&gt;3. ConflictingModificationException for urn:opendaylight:genius:bound-services-state&lt;/p&gt;</comment>
                            <comment id="38135" author="jluhrsen" created="Tue, 8 Aug 2017 19:16:58 +0000"  >&lt;p&gt;(In reply to Sam Hague from comment #14)&lt;br/&gt;
&amp;gt; There are multiple aspects to this. It can be any of the following, but&lt;br/&gt;
&amp;gt; might not be always due to the same issue, but you can let me know if any of&lt;br/&gt;
&amp;gt; the following symptoms are seen :&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 1. Any issues in Table 17 or Table 220 programming. For example last time&lt;br/&gt;
&amp;gt; when Manu debugged, the flows in Table 220 were something like below :&lt;br/&gt;
&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=127.462s, table=220, n_packets=28,&lt;br/&gt;
&amp;gt; n_bytes=2096, priority=6,reg6=0x8200&lt;br/&gt;
&amp;gt; actions=load:0x70008200-&amp;gt;NXM_NX_REG6[],write_metadata:0x138f000000/&lt;br/&gt;
&amp;gt; 0xfffffffffe,goto_table:241&lt;br/&gt;
&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;cookie=0x8000007, duration=127.523s, table=220, n_packets=0, n_bytes=0,&lt;br/&gt;
&amp;gt; priority=9,reg6=0x90008200 actions=output:42&lt;br/&gt;
&amp;gt; &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt;cookie=0x6900000, duration=126.910s, table=241, n_packets=1, n_bytes=370,&lt;br/&gt;
&amp;gt; priority=63010,udp,reg6=0x8200/0xfffff00,tp_src=67,tp_dst=68&lt;br/&gt;
&amp;gt; actions=resubmit(,220)&lt;br/&gt;
&amp;gt; For the same interface-tag -&amp;gt; 8200, the higher priority service is loading&lt;br/&gt;
&amp;gt; service-index=7, but the next priority service is matching for&lt;br/&gt;
&amp;gt; service-index=9, which will never match. Flows in Table 17 and Table 220&lt;br/&gt;
&amp;gt; works on this service-binding logic, that for the same interface-tag,&lt;br/&gt;
&amp;gt; There will be &#8220;n&#8221; entries in the tables where n = no of services bound on&lt;br/&gt;
&amp;gt; the interface. And the highest priority service will set the next priority&lt;br/&gt;
&amp;gt; in service-index before sending the packet to the service pipeline, so that&lt;br/&gt;
&amp;gt; on a resubmit the packet matches the next service-index in Table 17/220.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 2. ConflictingModificationException for&lt;br/&gt;
&amp;gt; urn:opendaylight:inventory?revision=2013-08-19)nodes/node/node where the&lt;br/&gt;
&amp;gt; flow-id has &#8220;DPNID:TABLE-ID:INTERFACE-NAME:SERVICE-INDEX&#8221; format, where&lt;br/&gt;
&amp;gt; TABLE-ID = 17 or 220&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 3. ConflictingModificationException for&lt;br/&gt;
&amp;gt; urn:opendaylight:genius:bound-services-state&lt;/p&gt;

&lt;p&gt;the problem is that currently, we only see the symptom of this bug in automation.&lt;br/&gt;
the only check we have to catch this issue or similar issues, is higher up at the&lt;br/&gt;
level where VMs don&apos;t get IPs. Ideally, we can add checks for all 3 items above in&lt;br/&gt;
our CSIT code, but I&apos;m not sure who can take that on. There are still many other&lt;br/&gt;
pending priorities for CSIT work.&lt;/p&gt;</comment>
                            <comment id="38136" author="faseela.k@ericsson.com" created="Wed, 9 Aug 2017 01:47:04 +0000"  >&lt;p&gt;No need to add any more tests, if a test fails, on urther triaging any of the above symptoms are seen, assign it directly to me &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.opendaylight.org/images/icons/emoticons/wink.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                            <attachment id="12553" name="odl1_karaf.log.gz" size="157074" author="jluhrsen" created="Wed, 26 Jul 2017 23:18:14 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8859</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=8859]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i01tyn:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10203" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>Status Whiteboard</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>csit:sporadic_failures</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                        </customfields>
    </item>
</channel>
</rss>