<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 20:33:13 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[OPNFLWPLUG-727] cbench throughput test does not work in Boron</title>
                <link>https://jira.opendaylight.org/browse/OPNFLWPLUG-727</link>
                <project id="10155" key="OPNFLWPLUG">OpenFlowPlugin</project>
                    <description>&lt;p&gt;Both plugins (He + Li) show the following:&lt;/p&gt;

&lt;p&gt;cbench -c 10.29.8.203 -t -m 12000 -M 10000 -s 8 -l 10&lt;/p&gt;

&lt;p&gt;cbench: controller benchmarking tool&lt;br/&gt;
   running in mode &apos;throughput&apos;&lt;br/&gt;
   connecting to controller at 10.29.8.203:6633 &lt;br/&gt;
   faking 8 switches offset 1 :: 10 tests each; 12000 ms per test&lt;br/&gt;
   with 10000 unique source MACs per switch&lt;br/&gt;
   learning destination mac addresses before the test&lt;br/&gt;
   starting test with 0 ms delay after features_reply&lt;br/&gt;
   ignoring first 1 &quot;warmup&quot; and last 0 &quot;cooldown&quot; loops&lt;br/&gt;
   connection delay of 0ms per 1 switch(es)&lt;br/&gt;
   debugging info is off&lt;br/&gt;
controller msgbuf_read() = -1:  msgbuf_read: Connection reset by peer&lt;br/&gt;
... exiting&lt;/p&gt;

&lt;p&gt;My interpretation is that controller closes the switches connections and therefore we get following report:&lt;/p&gt;

&lt;p&gt;05:52:56.797 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms &lt;br/&gt;
05:53:08.897 8   switches: flows/sec:  0  0  0  0  0  0  20000  0   total = 1.666627 per ms &lt;br/&gt;
05:53:20.998 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms &lt;br/&gt;
05:53:33.098 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms &lt;br/&gt;
05:53:45.222 8   switches: flows/sec:  0  0  0  0  0  776  0  0   total = 0.064542 per ms &lt;br/&gt;
05:53:57.323 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms &lt;br/&gt;
05:54:09.423 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="27995">OPNFLWPLUG-727</key>
            <summary>cbench throughput test does not work in Boron</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="tomas.slusny@pantheon.tech">Tomas Slusny</assignee>
                                    <reporter username="ecelgp">Luis Gomez</reporter>
                        <labels>
                    </labels>
                <created>Fri, 8 Jul 2016 21:13:07 +0000</created>
                <updated>Mon, 27 Sep 2021 09:01:51 +0000</updated>
                            <resolved>Mon, 21 Nov 2016 21:22:08 +0000</resolved>
                                                                    <component>General</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                                                                <comments>
                            <comment id="58015" author="jozef.bacigal@pantheon.tech" created="Mon, 1 Aug 2016 12:52:33 +0000"  >&lt;p&gt;Already solved.&lt;/p&gt;</comment>
                            <comment id="58016" author="ecelgp" created="Thu, 18 Aug 2016 02:21:05 +0000"  >&lt;p&gt;Reopening, issue shows again and it is tracked here:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;BR/Luis&lt;/p&gt;</comment>
                            <comment id="58017" author="ecelgp" created="Thu, 18 Aug 2016 21:12:38 +0000"  >&lt;p&gt;I easily reporduced locally:&lt;/p&gt;

&lt;p&gt;mininet@mininet-vm:~\&amp;gt; cbench -c 192.168.0.1 -t -m 12000 -M 10000 -s 8 -l 10&lt;br/&gt;
cbench: controller benchmarking tool&lt;br/&gt;
   running in mode &apos;throughput&apos;&lt;br/&gt;
   connecting to controller at 192.168.0.1:6633 &lt;br/&gt;
   faking 8 switches offset 1 :: 10 tests each; 12000 ms per test&lt;br/&gt;
   with 10000 unique source MACs per switch&lt;br/&gt;
   learning destination mac addresses before the test&lt;br/&gt;
   starting test with 0 ms delay after features_reply&lt;br/&gt;
   ignoring first 1 &quot;warmup&quot; and last 0 &quot;cooldown&quot; loops&lt;br/&gt;
   connection delay of 0ms per 1 switch(es)&lt;br/&gt;
   debugging info is off&lt;br/&gt;
14:04:25.711 8   switches: flows/sec:  11006  0  0  16669  0  0  0  0   total = 2.306181 per ms &lt;br/&gt;
14:04:37.815 8   switches: flows/sec:  0  0  0  3331  0  0  0  11903   total = 1.269233 per ms &lt;br/&gt;
controller msgbuf_read() = -1:  msgbuf_read: Connection reset by peer&lt;br/&gt;
... exiting&lt;/p&gt;

&lt;p&gt;Not only the above but when the test stops the controller stays with 100% CPU with no switch connected so there is really an issue here.&lt;/p&gt;

&lt;p&gt;To install cbench on ubuntu just look at the cbench section of this script:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/gitweb?p=releng/builder.git;a=blob;f=packer/provision/mininet.sh&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/gitweb?p=releng/builder.git;a=blob;f=packer/provision/mininet.sh&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="58018" author="ecelgp" created="Thu, 18 Aug 2016 21:13:53 +0000"  >&lt;p&gt;Rising to critical as people in the field will try cbench for sure.&lt;/p&gt;</comment>
                            <comment id="58019" author="miroslav.macko" created="Fri, 26 Aug 2016 12:51:12 +0000"  >&lt;p&gt;Hi Luis,&lt;/p&gt;

&lt;p&gt;Tested on master.&lt;/p&gt;

&lt;p&gt;We have found some blocked threads, because of logging.&lt;/p&gt;

&lt;p&gt;I have tried to turn logging off, but anyway the best I&apos;ve got locally is this:&lt;/p&gt;

&lt;p&gt;cbench: controller benchmarking tool&lt;br/&gt;
   running in mode &apos;throughput&apos;&lt;br/&gt;
   connecting to controller at localhost:6633 &lt;br/&gt;
   faking 8 switches offset 1 :: 9 tests each; 12000 ms per test&lt;br/&gt;
   with 10000 unique source MACs per switch&lt;br/&gt;
   learning destination mac addresses before the test&lt;br/&gt;
   starting test with 1000 ms delay after features_reply&lt;br/&gt;
   ignoring first 1 &quot;warmup&quot; and last 0 &quot;cooldown&quot; loops&lt;br/&gt;
   connection delay of 0ms per 1 switch(es)&lt;br/&gt;
   debugging info is off&lt;br/&gt;
13:43:16.118 8   switches: flows/sec:  0  19999  17883  19165  0  0  0  0   total = 4.751830 per ms &lt;br/&gt;
13:43:28.220 8   switches: flows/sec:  14254  0  0  0  0  3916  0  14849   total = 2.751527 per ms &lt;br/&gt;
13:43:40.320 8   switches: flows/sec:  0  0  0  0  7962  0  0  0   total = 0.663450 per ms &lt;br/&gt;
13:43:52.427 8   switches: flows/sec:  0  0  0  0  0  19999  20000  0   total = 3.331661 per ms &lt;br/&gt;
13:44:19.436 8   switches: flows/sec:  0  0  0  0  12038  0  0  0   total = 0.447354 per ms &lt;br/&gt;
13:44:31.544 8   switches: flows/sec:  0  0  0  0  0  0  0  0   total = 0.000000 per ms &lt;br/&gt;
13:44:43.645 8   switches: flows/sec:  20000  51877  56720  29757  0  60000  0  60000   total = 23.195637 per ms &lt;br/&gt;
13:44:55.746 8   switches: flows/sec:  78687  56798  71107  60000  0  55125  0  94532   total = 34.687287 per ms &lt;br/&gt;
13:45:08.235 8   switches: flows/sec:  45838  80000  53229  60000  0  74013  0  65008   total = 30.518821 per ms &lt;br/&gt;
RESULT: 8 switches 8 tests min/max/avg/stdev = 0.00/34687.29/11949.47/13917.78 responses/s&lt;/p&gt;

&lt;p&gt;There could be some issue with that, but it will be probably not the only one.&lt;/p&gt;

&lt;p&gt;CPU is running also for me at 100% after cbench end. I am not sure about connected switches. How do you check it? &lt;/p&gt;

&lt;p&gt;That is what we have for now. &lt;/p&gt;

&lt;p&gt;Thanks,&lt;br/&gt;
Miro&lt;/p&gt;</comment>
                            <comment id="58020" author="ecelgp" created="Mon, 29 Aug 2016 06:58:50 +0000"  >&lt;p&gt;With more testing I could fix the test by adding some start delay (-D 10000) and reducing the number of MACs (-M 100):&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44773/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44773/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;However even with these changes the controller stays with 100% CPU after the test which I think we should fix as this could be a security vulnerability.&lt;/p&gt;</comment>
                            <comment id="58021" author="ecelgp" created="Mon, 29 Aug 2016 18:15:44 +0000"  >&lt;p&gt;After connecting a profiler, this thread seems to be the issue:&lt;/p&gt;

&lt;p&gt;AbstractStackedOutboundQueue.java:333 org.opendaylight.openflowjava.protocol.impl.core.connection.StackedSegment.failAll(OutboundQueueException)&lt;/p&gt;</comment>
                            <comment id="58022" author="andrejleitner" created="Tue, 30 Aug 2016 15:35:39 +0000"  >&lt;p&gt;Hi Luis, &lt;br/&gt;
I think, there is still problem with excessive logging. I prepared the patch where log level is set to debug and also precondition isDebugEnabled is used. I ran cbench locally on this patch (+logging in karaf set to ERROR) and my CPU after test and karaf shutdown get to normal (2%) and didn&apos;t get stuck at 100%. Could you please test it in jenkins enviroment?&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44868/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44868/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="58023" author="shuva.jyoti.kar.87@gmail.com" created="Tue, 30 Aug 2016 15:49:00 +0000"  >&lt;p&gt;(In reply to Andrej Leitner from comment #8)&lt;br/&gt;
&amp;gt; Hi Luis, &lt;br/&gt;
&amp;gt; I think, there is still problem with excessive logging. I prepared the patch&lt;br/&gt;
&amp;gt; where log level is set to debug and also precondition isDebugEnabled is&lt;br/&gt;
&amp;gt; used. I ran cbench locally on this patch (+logging in karaf set to ERROR)&lt;br/&gt;
&amp;gt; and my CPU after test and karaf shutdown get to normal (2%) and didn&apos;t get&lt;br/&gt;
&amp;gt; stuck at 100%. Could you please test it in jenkins enviroment?&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44868/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44868/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Logging the failures as error should be ok, since those are failures. any trace should be removed and lets have the debug logs checked before logging.I will also take a look at the places we log and come out with an improved logging&lt;/p&gt;</comment>
                            <comment id="58024" author="andrejleitner" created="Wed, 31 Aug 2016 07:17:14 +0000"  >&lt;p&gt;Hi Shuva, &lt;br/&gt;
I think that despite the fact that unsuccessful RPC result brings errors from device there is no reason for throwing error since it is not error/failure of karaf or OFP itself. It&apos;s &quot;only&quot; problem of device and we can go on. I think (can&apos;t find it on mailing-lists now) we should throw ERROR only if there is a real problem and we are not able to recover (e.g. restart needed). Hence log level for this shouldn&apos;t be more than info. I chose debug level because information about result is useful/targeted for admin and not common user (like everyone, there is still operational/DS where you see if flow got installed or not)&lt;/p&gt;</comment>
                            <comment id="58025" author="ecelgp" created="Thu, 1 Sep 2016 06:37:15 +0000"  >&lt;p&gt;Upgrading to blocker due to impact in ODL perf reports.&lt;/p&gt;</comment>
                            <comment id="58026" author="shuva.jyoti.kar.87@gmail.com" created="Thu, 1 Sep 2016 06:47:17 +0000"  >&lt;p&gt;(In reply to Luis Gomez from comment #11)&lt;br/&gt;
&amp;gt; Upgrading to blocker due to impact in ODL perf reports.&lt;/p&gt;

&lt;p&gt;Luis do we still see controller closing the switch connections ? or is it that the controller cpu stays at 100% even after the test ?&lt;/p&gt;</comment>
                            <comment id="58027" author="ecelgp" created="Thu, 1 Sep 2016 06:51:48 +0000"  >&lt;p&gt;I am currently testing the proposed patch, I will update shortly on it.&lt;/p&gt;</comment>
                            <comment id="58028" author="ecelgp" created="Thu, 1 Sep 2016 08:32:51 +0000"  >&lt;p&gt;This patch may improve some perf number but it does not help with:&lt;/p&gt;

&lt;p&gt;1) Test abort due to cbench switches disconnect issue: This is mostly addressed in this patch: &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44773/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44773/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;2) CPU very high after test finishes and switches are disconnected. This is to me the blocker now as controller seems to be inoperable after the test.&lt;/p&gt;

&lt;p&gt;BR/Luis&lt;/p&gt;</comment>
                            <comment id="58029" author="shuva.jyoti.kar.87@gmail.com" created="Thu, 1 Sep 2016 08:37:00 +0000"  >&lt;p&gt;(In reply to Luis Gomez from comment #14)&lt;br/&gt;
&amp;gt; This patch may improve some perf number but it does not help with:&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 1) Test abort due to cbench switches disconnect issue: This is mostly&lt;br/&gt;
&amp;gt; addressed in this patch: &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44773/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44773/&lt;/a&gt;&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; 2) CPU very high after test finishes and switches are disconnected. This is&lt;br/&gt;
&amp;gt; to me the blocker now as controller seems to be inoperable after the test.&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; BR/Luis&lt;/p&gt;


&lt;p&gt;Does the CPU usage come down after sometime or it remains same eternally ?&lt;/p&gt;</comment>
                            <comment id="58030" author="ecelgp" created="Thu, 1 Sep 2016 09:26:49 +0000"  >&lt;p&gt;For as long as my patient allows which is some minutes after the test the CPU is still 100%.&lt;/p&gt;</comment>
                            <comment id="58031" author="ecelgp" created="Thu, 1 Sep 2016 18:00:41 +0000"  >&lt;p&gt;FYI &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/44773/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/44773/&lt;/a&gt; is already merged so test should go to green but CPU issue is still there and I will be probably extending the test to catch this scenario&lt;/p&gt;</comment>
                            <comment id="58032" author="anipbu" created="Fri, 2 Sep 2016 04:22:39 +0000"  >&lt;p&gt;Is there an ETA for this bug and someone assigned to fix?&lt;/p&gt;</comment>
                            <comment id="58033" author="ecelgp" created="Fri, 2 Sep 2016 16:03:13 +0000"  >&lt;p&gt;FYI, I added check to ver&lt;/p&gt;</comment>
                            <comment id="58034" author="ecelgp" created="Fri, 2 Sep 2016 16:08:24 +0000"  >&lt;p&gt;FYI, I added extra check in cbench test to track this bug:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/45046/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/45046/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This job will fail until this issue gets fixed:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;BR/Luis&lt;/p&gt;</comment>
                            <comment id="58035" author="anipbu" created="Tue, 6 Sep 2016 14:38:40 +0000"  >&lt;p&gt;To better assess the impact of this bug and fix, could someone from your team please help us identify the following:&lt;br/&gt;
Severity: Could you elaborate on the severity of this bug?  Is this a BLOCKER such that we cannot release Boron without it?  Is there a workaround such that we can write a release note and fix in future Boron SR1?&lt;br/&gt;
Testing: Could you also elaborate on the testing of this patch?  How extensively has this patch been tested?  Is it covered by any unit tests or system tests?  &lt;br/&gt;
Impact: Does this fix impact any dependent projects?&lt;/p&gt;</comment>
                            <comment id="58036" author="andrejleitner" created="Tue, 6 Sep 2016 17:11:52 +0000"  >&lt;p&gt;Severity&lt;br/&gt;
The bug is blocker because of performance regression -&amp;gt;&lt;br/&gt;
(In reply to Luis Gomez from comment #11)&lt;br/&gt;
&amp;gt; Upgrading to blocker due to impact in ODL perf reports.&lt;/p&gt;

&lt;p&gt;Testing&lt;br/&gt;
The patch for this bug in ofplugin consists of decreasing log level to prevent overloading of netty thread with logging errors (seen in troughput test) and ConcurrentModificationException at higher load (ibid.). There was no need to unit test update, we ran patch-test for ofplugin more times to check impact.&lt;/p&gt;

&lt;p&gt;There is also an issue in ofjava described in &lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWJAVA-79&quot; title=&quot;Infinite reschedule of flush&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWJAVA-79&quot;&gt;&lt;del&gt;OPNFLWJAVA-79&lt;/del&gt;&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;Impact&lt;br/&gt;
In principle, the fix does not impact any dependent projects.&lt;/p&gt;</comment>
                            <comment id="58037" author="andrejleitner" created="Tue, 6 Sep 2016 17:12:29 +0000"  >&lt;p&gt;merged in boron&lt;/p&gt;</comment>
                            <comment id="58038" author="ecelgp" created="Tue, 6 Sep 2016 21:30:14 +0000"  >&lt;p&gt;This is fixed now according to:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/openflowplugin-csit-1node-cbench-performance-only-boron/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/openflowplugin-csit-1node-cbench-performance-only-boron/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;BR/Luis&lt;/p&gt;</comment>
                            <comment id="58039" author="ecelgp" created="Wed, 7 Sep 2016 07:33:33 +0000"  >&lt;p&gt;Reopening the bug: the CPU issue is fixed but after running throughput test couple of times (cbench -c 192.168.0.1 -t -m 12000 -M 100 -l 10 -s 16 -D 5000) I see memory issues:&lt;/p&gt;

&lt;p&gt;OpenJDK 64-Bit Server VM warning: INFO: os::commit_memory(0x00000000cdc00000, 121634816, 0) failed; error=&apos;Cannot allocate memory&apos; (errno=12)&lt;br/&gt;
#&lt;/p&gt;
&lt;ol&gt;
	&lt;li&gt;There is insufficient memory for the Java Runtime Environment to continue.&lt;/li&gt;
	&lt;li&gt;Native memory allocation (mmap) failed to map 121634816 bytes for committing reserved memory.&lt;/li&gt;
	&lt;li&gt;An error report file with more information is saved as:&lt;/li&gt;
	&lt;li&gt;/home/vagrant/controller-test/distribution-karaf-0.5.0-SNAPSHOT/hs_err_pid10272.log&lt;br/&gt;
Killed&lt;/li&gt;
&lt;/ol&gt;
</comment>
                            <comment id="58040" author="ecelgp" created="Wed, 7 Sep 2016 07:49:51 +0000"  >&lt;p&gt;After increasing the physical RAM size, I do not see issues running the test locally. Still some unstability in CI so downgrading to Major until we understand why Beryllium does not show this unstability.&lt;/p&gt;</comment>
                            <comment id="58041" author="anipbu" created="Thu, 8 Sep 2016 01:04:57 +0000"  >&lt;p&gt;(In reply to Luis Gomez from comment #24)&lt;br/&gt;
&amp;gt; This is fixed now according to:&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/openflowplugin-csit-1node-cbench-&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/openflowplugin-csit-1node-cbench-&lt;/a&gt;&lt;br/&gt;
&amp;gt; performance-only-boron/&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; BR/Luis&lt;/p&gt;

&lt;p&gt;Based on Luis&apos;s comment, can I safely assume that this bug been verified as fixed in the latest Boron RC 3.1 Build?&lt;/p&gt;</comment>
                            <comment id="58042" author="ecelgp" created="Thu, 8 Sep 2016 07:23:41 +0000"  >&lt;p&gt;This bug needs to remain open with lower priority in case we get some questions after Boron release. The reason is I had to modify the test to sleep 10 secs between Cbench runs in order to stabilize it:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/45294/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/45294/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;In Beryllium we did not have to do this.&lt;/p&gt;

&lt;p&gt;BR/Luis&lt;/p&gt;</comment>
                            <comment id="58043" author="andrejleitner" created="Thu, 8 Sep 2016 09:43:14 +0000"  >&lt;p&gt;(In reply to Luis Gomez from comment #25)&lt;br/&gt;
&amp;gt; Reopening the bug: the CPU issue is fixed but after running throughput test&lt;br/&gt;
&amp;gt; couple of times (cbench -c 192.168.0.1 -t -m 12000 -M 100 -l 10 -s 16 -D&lt;br/&gt;
&amp;gt; 5000) I see memory issues:&lt;br/&gt;
&amp;gt; &lt;br/&gt;
&amp;gt; OpenJDK 64-Bit Server VM warning: INFO:&lt;br/&gt;
&amp;gt; os::commit_memory(0x00000000cdc00000, 121634816, 0) failed; error=&apos;Cannot&lt;br/&gt;
&amp;gt; allocate memory&apos; (errno=12)&lt;br/&gt;
&amp;gt; #&lt;br/&gt;
&amp;gt; # There is insufficient memory for the Java Runtime Environment to continue.&lt;br/&gt;
&amp;gt; # Native memory allocation (mmap) failed to map 121634816 bytes for&lt;br/&gt;
&amp;gt; committing reserved memory.&lt;br/&gt;
&amp;gt; # An error report file with more information is saved as:&lt;br/&gt;
&amp;gt; #&lt;br/&gt;
&amp;gt; /home/vagrant/controller-test/distribution-karaf-0.5.0-SNAPSHOT/&lt;br/&gt;
&amp;gt; hs_err_pid10272.log&lt;br/&gt;
&amp;gt; Killed&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWJAVA-78&quot; title=&quot;Failed entries not freed from queue, causing superfluous iterations&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWJAVA-78&quot;&gt;&lt;del&gt;OPNFLWJAVA-78&lt;/del&gt;&lt;/a&gt; in openflowjava could be related to memory issues. &lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWJAVA-79&quot; title=&quot;Infinite reschedule of flush&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWJAVA-79&quot;&gt;&lt;del&gt;OPNFLWJAVA-79&lt;/del&gt;&lt;/a&gt; merged as dependency for our cbench blocker allows to fail unflushed segments, however failed entries in segment are not counted into completed (&lt;a href=&quot;https://jira.opendaylight.org/browse/OPNFLWJAVA-78&quot; title=&quot;Failed entries not freed from queue, causing superfluous iterations&quot; class=&quot;issue-link&quot; data-issue-key=&quot;OPNFLWJAVA-78&quot;&gt;&lt;del&gt;OPNFLWJAVA-78&lt;/del&gt;&lt;/a&gt;) and segment is not considered as completed thus not removed. The patch is already merged in carbon, tested by Luis as per comments: &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/45210/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/45210/&lt;/a&gt; and got +2 from Michal. Not sure if it still possible to merge it for RC3.&lt;/p&gt;</comment>
                            <comment id="58044" author="tomas.slusny@pantheon.tech" created="Thu, 8 Sep 2016 14:25:04 +0000"  >&lt;p&gt;According to CBench unstability - this is bug with CBench.&lt;/p&gt;

&lt;p&gt;When we think that switch is IDLE, we send HELLO message to switch, and we are expecting that switch will also reply with HELLO message. But CBench was not sending this HELLO message and silently ignored all incoming HELLO messages.&lt;/p&gt;

&lt;p&gt;I added proper HELLO reply when HELLO is received to CBench sources. Here is my fork of CBench repo with this fix: &lt;a href=&quot;https://github.com/deathbeam/oflops&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://github.com/deathbeam/oflops&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;I also created pull request to official repository, but since it is pretty inactive, I doubt that it will be ever merged.&lt;/p&gt;</comment>
                            <comment id="58045" author="tomas.slusny@pantheon.tech" created="Thu, 8 Sep 2016 15:04:58 +0000"  >&lt;p&gt;After some more testing, we are actually sending ECHO and on HELLO on switch IDLE, and CBench is actually trying to send ECHO_REPLY, but I think for some reason we are not receiving this ECHO_REPLY in time. I will investigate this a bit more.&lt;/p&gt;</comment>
                            <comment id="58046" author="ecelgp" created="Thu, 8 Sep 2016 18:32:01 +0000"  >&lt;p&gt;OK there is a line on the sand of 11:59p UTC on sunday, if you find something and can get some patch by that fine, otherwise it will have to wait until SR1.&lt;/p&gt;</comment>
                            <comment id="58047" author="andrejleitner" created="Tue, 13 Sep 2016 09:13:52 +0000"  >&lt;p&gt;we are getting 100% pass on jenkins from Sep 8&lt;/p&gt;</comment>
                            <comment id="58048" author="andrejleitner" created="Mon, 19 Sep 2016 13:06:33 +0000"  >&lt;p&gt;Luis, could we close the bug as resolved?&lt;/p&gt;</comment>
                            <comment id="58049" author="ecelgp" created="Thu, 22 Sep 2016 18:24:46 +0000"  >&lt;p&gt;Yes, this is fixed now.&lt;/p&gt;</comment>
                            <comment id="58050" author="andrejleitner" created="Thu, 27 Oct 2016 16:01:44 +0000"  >&lt;p&gt;latency rerun is failing occasionally&lt;/p&gt;</comment>
                            <comment id="58051" author="tomas.slusny@pantheon.tech" created="Mon, 21 Nov 2016 14:48:01 +0000"  >&lt;p&gt;According to jenkins: &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-carbon/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-carbon/&lt;/a&gt; and &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/openflowplugin/job/openflowplugin-csit-1node-cbench-performance-only-boron/&lt;/a&gt; cbench runs are now pretty stable, only with occasional fails (but there failed all 3 tests, so it is probably environment issue) so closing this, again.&lt;/p&gt;</comment>
                            <comment id="58052" author="ecelgp" created="Mon, 21 Nov 2016 21:22:08 +0000"  >&lt;p&gt;Right, this can be closed for now.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10000">
                    <name>Blocks</name>
                                                                <inwardlinks description="is blocked by">
                                        <issuelink>
            <issuekey id="24309">OPNFLWJAVA-79</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6176</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=6176]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i032xr:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>