<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 20:32:09 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[OPNFLWPLUG-322] Need to close the ODL Denial of Service interface</title>
                <link>https://jira.opendaylight.org/browse/OPNFLWPLUG-322</link>
                <project id="10155" key="OPNFLWPLUG">OpenFlowPlugin</project>
                    <description>&lt;p&gt;In some of my testing I&apos;ve discovered an external message and code path that will cause ODL to fail.  If left unfixed, I am concerned that this could be used for a Denial of Service attack&lt;/p&gt;

&lt;p&gt;Internally, we us &apos;monit&apos; to ensure all core processes are up.  The following line was added last month to our monit configuration:&lt;br/&gt;
   if failed host 127.0.0.1 with port 6653 type TCP for 3 times within 3 cycles then restart&lt;/p&gt;

&lt;p&gt;This line causes monit to connect to ODL (via the loop-back interface) and then disconnect.  After this line was added, we discovered that our ODL installations would fail after a few days of running. &lt;/p&gt;

&lt;p&gt;I opened &lt;a href=&quot;https://bugs.opendaylight.org/show_bug.cgi?id=2394&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://bugs.opendaylight.org/show_bug.cgi?id=2394&lt;/a&gt; to track a specific problem associated with this (OFHandshake threads building up).&lt;/p&gt;

&lt;p&gt;When an external entity opens an OpenFlow connection, but does not complete the handshake, I see log messages along the lines of:&lt;/p&gt;

&lt;p&gt;2014-11-24 15:17:42.200 UTC &lt;span class=&quot;error&quot;&gt;&amp;#91;OFHandshake-7-0&amp;#93;&lt;/span&gt; WARN  o.o.o.o.m.c.ErrorHandlerSimpleImpl - exception -&amp;gt; FIRST HELLO sending failed because of connection issue., session -&amp;gt; null&lt;br/&gt;
org.opendaylight.openflowplugin.ConnectionException: FIRST HELLO sending failed because of connection issue.&lt;br/&gt;
        at org.opendaylight.openflowplugin.openflow.md.core.HandshakeManagerImpl.sendHelloMessage(HandshakeManagerImpl.java:297) ~&lt;span class=&quot;error&quot;&gt;&amp;#91;bundlefile:na&amp;#93;&lt;/span&gt;&lt;br/&gt;
        at org.opendaylight.openflowplugin.openflow.md.core.HandshakeManagerImpl.shake(HandshakeManagerImpl.java:95) ~&lt;span class=&quot;error&quot;&gt;&amp;#91;bundlefile:na&amp;#93;&lt;/span&gt;&lt;br/&gt;
        at org.opendaylight.openflowplugin.openflow.md.core.HandshakeStepWrapper.run(HandshakeStepWrapper.java:47) &lt;span class=&quot;error&quot;&gt;&amp;#91;bundlefile:na&amp;#93;&lt;/span&gt;&lt;br/&gt;
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) &lt;span class=&quot;error&quot;&gt;&amp;#91;na:1.7.0_71&amp;#93;&lt;/span&gt;&lt;br/&gt;
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) &lt;span class=&quot;error&quot;&gt;&amp;#91;na:1.7.0_71&amp;#93;&lt;/span&gt;&lt;br/&gt;
        at java.lang.Thread.run(Thread.java:745) &lt;span class=&quot;error&quot;&gt;&amp;#91;na:1.7.0_71&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;Resources are not freed and ODL eventually locks up.&lt;/p&gt;

&lt;p&gt;I&apos;ve patched by code with my patch for 2394, but I am concerned that there are other code paths that I haven&apos;t investigated.&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="27590">OPNFLWPLUG-322</key>
            <summary>Need to close the ODL Denial of Service interface</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="michal.rehak">Michal Rehak</assignee>
                                    <reporter username="jimw@a-bb.net">Jim West</reporter>
                        <labels>
                    </labels>
                <created>Mon, 24 Nov 2014 15:48:26 +0000</created>
                <updated>Mon, 27 Sep 2021 09:01:23 +0000</updated>
                            <resolved>Tue, 13 Oct 2015 21:15:17 +0000</resolved>
                                                                    <component>General</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>8</watches>
                                                                                                                <comments>
                            <comment id="56328" author="abhijit2511" created="Tue, 24 Feb 2015 15:23:48 +0000"  >&lt;p&gt;Jim,&lt;/p&gt;

&lt;p&gt;Do you have a patch for this? In case not then we will need to assign this bug to someone.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
Abhijit&lt;/p&gt;</comment>
                            <comment id="56329" author="jimw@a-bb.net" created="Tue, 24 Feb 2015 16:07:04 +0000"  >&lt;p&gt;Hi Abhijit&lt;/p&gt;

&lt;p&gt;I have a patch associated with 2394.  With this patch, I don&apos;t see the problem any longer.  I&apos;m not sure that the patch for 2394 covers all the code patches (that&apos;s the reason I opened this bug)&lt;/p&gt;

&lt;p&gt;(In reply to Abhijit Kumbhare from comment #1)&lt;br/&gt;
&amp;gt; Jim,&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Do you have a patch for this? In case not then we will need to assign this&lt;br/&gt;
&amp;gt; bug to someone.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Thanks,&lt;br/&gt;
&amp;gt; Abhijit&lt;/p&gt;</comment>
                            <comment id="56330" author="abhijit2511" created="Wed, 6 May 2015 16:49:25 +0000"  >&lt;p&gt;Can we lower this from critical to major or normal?&lt;/p&gt;</comment>
                            <comment id="56331" author="abhijit2511" created="Wed, 6 May 2015 16:55:04 +0000"  >&lt;p&gt;This needs retest - as handshake thread model was changed in Li codebase &amp;amp; this is to do with  close and cleanup after handshake failure.&lt;/p&gt;</comment>
                            <comment id="56332" author="jimw@a-bb.net" created="Wed, 6 May 2015 16:57:53 +0000"  >&lt;p&gt;What&apos;s the reasoning for lowering the Importance?  Is it because we think it&apos;s fixed?&lt;/p&gt;

&lt;p&gt;(In reply to Abhijit Kumbhare from comment #3)&lt;br/&gt;
&amp;gt; Can we lower this from critical to major or normal?&lt;/p&gt;</comment>
                            <comment id="56333" author="abhijit2511" created="Wed, 6 May 2015 17:11:02 +0000"  >&lt;p&gt;(In reply to Jim West from comment #5)&lt;br/&gt;
&amp;gt; What&apos;s the reasoning for lowering the Importance?  Is it because we think&lt;br/&gt;
&amp;gt; it&apos;s fixed?&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; (In reply to Abhijit Kumbhare from comment #3)&lt;br/&gt;
&amp;gt; &amp;gt; Can we lower this from critical to major or normal?&lt;/p&gt;

&lt;p&gt;Reason: you no longer see the issue with your patch - and the developers may need to concentrate on the other higher priority bugs at this moment (during the bug fix cycle). Any case - we need a retest of this (see my previous comment) - can you please retest?&lt;/p&gt;</comment>
                            <comment id="56334" author="jluhrsen" created="Thu, 7 May 2015 20:30:00 +0000"  >&lt;p&gt;This is an easy issue to reproduce with the &quot;nc&quot; linux tool.&lt;/p&gt;

&lt;p&gt;steps:&lt;br/&gt;
1.  while 1&amp;gt;0; do nc -w 1 172.17.7.22 6633&lt;br/&gt;
2.  get coffee&lt;/p&gt;

&lt;p&gt;with a profiler (e.g. jvisual) you can watch the thread count rise, or this linux CLI will work too:&lt;/p&gt;

&lt;p&gt;watch -n1 ps -o nlwp $ODL_PID &lt;/p&gt;

&lt;p&gt;I&apos;ve recreated the thread count growth on the Helium release bits, as well as using this distro:&lt;br/&gt;
distribution-karaf-0.3.0-20150506.171255-1469.zip&lt;/p&gt;

&lt;p&gt;around aprox 30k working threads, there may be some trouble with OOM or file descriptors.  In every case, the controller proves to be useless.&lt;/p&gt;</comment>
                            <comment id="56335" author="abhijit2511" created="Sat, 9 May 2015 00:10:14 +0000"  >&lt;p&gt;We should not reduce the priority of this.&lt;/p&gt;</comment>
                            <comment id="56336" author="jluhrsen" created="Mon, 11 May 2015 18:03:50 +0000"  >&lt;p&gt;This issue is &lt;b&gt;NOT&lt;/b&gt; seen in Helium SR3, but is seen in all Helium releases prior to it.  I have manually checked this on Helium, Helium-SR1 and Helium-SR2.&lt;/p&gt;</comment>
                            <comment id="56337" author="vishnoianil@gmail.com" created="Mon, 11 May 2015 18:06:41 +0000"  >&lt;p&gt;Yes, this issue was fixed in helium SR3, so this is expected. It means this is regression from the master branch.&lt;/p&gt;</comment>
                            <comment id="56338" author="jgloncak" created="Tue, 12 May 2015 09:34:09 +0000"  >&lt;p&gt;This patch should fix problem in Li codebase&lt;br/&gt;
&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/20097/1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/20097/1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="56339" author="mirehak@cisco.com" created="Thu, 14 May 2015 08:12:11 +0000"  >&lt;p&gt;Please retest ans eventually mark resolved.&lt;/p&gt;</comment>
                            <comment id="56340" author="jluhrsen" created="Thu, 14 May 2015 17:29:19 +0000"  >&lt;p&gt;This is resolved, and automation is there to show this.  Search the &lt;br/&gt;
flow-services robot suites for the test case with title &quot;Bug Validation.2429&quot;&lt;/p&gt;</comment>
                            <comment id="56341" author="pgubka@cisco.com" created="Thu, 25 Jun 2015 08:51:54 +0000"  >&lt;p&gt;the test created for this bug start failing for lithium redesigned openflow plugin &lt;/p&gt;

&lt;p&gt;in stable/lithium &lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cds-flow-services-lithium-redesign-only-stable-lithium/47&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cds-flow-services-lithium-redesign-only-stable-lithium/47&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;and master&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cds-flow-services-lithium-redesign-only-master/417&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cds-flow-services-lithium-redesign-only-master/417&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="56342" author="mirehak@cisco.com" created="Thu, 25 Jun 2015 15:01:46 +0000"  >&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/23279/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/23279/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="56343" author="jluhrsen" created="Fri, 26 Jun 2015 18:00:19 +0000"  >&lt;p&gt;I&apos;ve fixed the test now:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/23285/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/23285/&lt;/a&gt;&lt;/p&gt;


&lt;p&gt;Lithium (the release distro) does not fail this test case, and I was able to use the sandbox&lt;br/&gt;
to verify the bug &lt;b&gt;was&lt;/b&gt; there in a distro from 6/18 (before the fix for &lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWPLUG-507&quot; title=&quot;OFHandshake thread leak leads to OOM&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWPLUG-507&quot;&gt;&lt;del&gt;OPNFLWPLUG-507&lt;/del&gt;&lt;/a&gt; was put in&lt;br/&gt;
on 6/19)&lt;/p&gt;

&lt;p&gt;marking this RESOLVED/FIXED.&lt;/p&gt;</comment>
                            <comment id="56344" author="jluhrsen" created="Fri, 26 Jun 2015 18:00:42 +0000"  >&lt;p&gt;I&apos;ve fixed the test now:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/23285/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/23285/&lt;/a&gt;&lt;/p&gt;


&lt;p&gt;Lithium (the release distro) does not fail this test case, and I was able to use the sandbox&lt;br/&gt;
to verify the bug &lt;b&gt;was&lt;/b&gt; there in a distro from 6/18 (before the fix for &lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWPLUG-507&quot; title=&quot;OFHandshake thread leak leads to OOM&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWPLUG-507&quot;&gt;&lt;del&gt;OPNFLWPLUG-507&lt;/del&gt;&lt;/a&gt; was put in&lt;br/&gt;
on 6/19)&lt;/p&gt;

&lt;p&gt;marking this RESOLVED/FIXED.&lt;/p&gt;</comment>
                            <comment id="56345" author="jluhrsen" created="Fri, 25 Sep 2015 22:48:58 +0000"  >&lt;p&gt;This issue still seems to be there.  It&apos;s not showing up in CI for some reason,&lt;br/&gt;
but it did happen recently when the releng folks were trying to upgrade their&lt;br/&gt;
CI robot VM to use centOS7.  with the centOS6 vm this test is passing, so my &lt;br/&gt;
guess is that something is broken in that test vm such that this bug is not being&lt;br/&gt;
caught by CSIT.&lt;/p&gt;

&lt;p&gt;I&apos;ve reproduced this with the robot suite locally.  It has failed with both&lt;br/&gt;
the Lithium release as well as the latest Beryllium distro from 9/22.&lt;/p&gt;</comment>
                            <comment id="56346" author="ecelgp" created="Fri, 25 Sep 2015 23:22:52 +0000"  >&lt;p&gt;Very good Jamo &lt;img class=&quot;emoticon&quot; src=&quot;https://jira.opendaylight.org/images/icons/emoticons/smile.png&quot; height=&quot;16&quot; width=&quot;16&quot; align=&quot;absmiddle&quot; alt=&quot;&quot; border=&quot;0&quot;/&gt;&lt;/p&gt;</comment>
                            <comment id="56347" author="abhijit2511" created="Fri, 9 Oct 2015 17:46:35 +0000"  >&lt;p&gt;Michal,&lt;/p&gt;

&lt;p&gt;Can you look at this one before the next SR? I think next SR is in a month or 2.&lt;/p&gt;

&lt;p&gt;Abhijit&lt;/p&gt;</comment>
                            <comment id="56348" author="abhijit2511" created="Fri, 9 Oct 2015 17:47:59 +0000"  >&lt;p&gt;Luis says fails in both He &amp;amp; Li designs - is this related to OF Java Michal Polkorab?&lt;/p&gt;</comment>
                            <comment id="56349" author="mirehak@cisco.com" created="Mon, 12 Oct 2015 11:09:16 +0000"  >&lt;p&gt;Hi all,&lt;br/&gt;
I just tested stable/lithium HEAD locally and by both - &lt;/p&gt;
{He,Li}
&lt;p&gt;-design I got no issues:&lt;/p&gt;

&lt;p&gt;for (( i=0; i&amp;lt;1000; i++ )); do &lt;br/&gt;
  nc -w 1 localhost 6633 | hexdump -C &amp;amp;  &lt;br/&gt;
  sleep 0.1&lt;br/&gt;
  pkill -f &apos;nc -w&apos;&lt;br/&gt;
done&lt;/p&gt;


&lt;p&gt;netstat -pn | grep -E &apos;66&lt;span class=&quot;error&quot;&gt;&amp;#91;35&amp;#93;&lt;/span&gt;3&apos; | nl&lt;/p&gt;

&lt;p&gt;Amount of threads used by controller was like 150. When running this cycle then the thread amount was always less then 160.&lt;br/&gt;
Both designs behaved the same.&lt;br/&gt;
With commented out the sleep line I got many opened tcp connections waiting for close. &lt;br/&gt;
When sleep line was effective then connections got closed correctly.&lt;/p&gt;</comment>
                            <comment id="56350" author="jluhrsen" created="Tue, 13 Oct 2015 21:15:17 +0000"  >&lt;p&gt;(In reply to michal rehak from comment #22)&lt;br/&gt;
&amp;gt; Hi all,&lt;br/&gt;
&amp;gt; I just tested stable/lithium HEAD locally and by both - &lt;/p&gt;
{He,Li}
&lt;p&gt;-design I got&lt;br/&gt;
&amp;gt; no issues:&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; for (( i=0; i&amp;lt;1000; i++ )); do &lt;br/&gt;
&amp;gt;   nc -w 1 localhost 6633 | hexdump -C &amp;amp;  &lt;br/&gt;
&amp;gt;   sleep 0.1&lt;br/&gt;
&amp;gt;   pkill -f &apos;nc -w&apos;&lt;br/&gt;
&amp;gt; done&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; netstat -pn | grep -E &apos;66&lt;span class=&quot;error&quot;&gt;&amp;#91;35&amp;#93;&lt;/span&gt;3&apos; | nl&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; Amount of threads used by controller was like 150. When running this cycle&lt;br/&gt;
&amp;gt; then the thread amount was always less then 160.&lt;br/&gt;
&amp;gt; Both designs behaved the same.&lt;br/&gt;
&amp;gt; With commented out the sleep line I got many opened tcp connections waiting&lt;br/&gt;
&amp;gt; for close. &lt;br/&gt;
&amp;gt; When sleep line was effective then connections got closed correctly.&lt;/p&gt;


&lt;p&gt;Yes, you are right.  The test was working quickly enough to accidentally&lt;br/&gt;
count the closing sockets as and mark it as a failure.  I&apos;ve pushed a fix&lt;br/&gt;
in the test:&lt;/p&gt;

&lt;p&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/28356&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/28356&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;I verified manually that this new test code will fail on Helium SR2 where&lt;br/&gt;
the bug was known to exist, but the test will pass with Lithium SR2.&lt;/p&gt;

&lt;p&gt;I also ran extra manual tests on Lithium SR2 to ensure I don&apos;t see this &lt;br/&gt;
issue, so re-closing the bug now.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>2429</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=2429]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10204" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>ODL SR Target Milestone</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10327"><![CDATA[Lithium-1]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                    <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i030fr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>