<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 19:59:54 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[GENIUS-97] IdPoolListener OOM</title>
                <link>https://jira.opendaylight.org/browse/GENIUS-97</link>
                <project id="10126" key="GENIUS">genius</project>
                    <description>&lt;p&gt;Internal downstream testing reports OOM with latest stable/nitrogen builds.&lt;/p&gt;

&lt;p&gt;HPROF analysis by MAT points to something really badly wrong in IdPoolListener.&lt;/p&gt;

&lt;p&gt;see attached ZIP (ignore DJC, that&apos;s &lt;a href=&quot;https://jira.opendaylight.org/browse/GENIUS-96&quot; title=&quot;DataStoreJobCoordinator OOM&quot; class=&quot;issue-link&quot; data-issue-key=&quot;GENIUS-96&quot;&gt;&lt;del&gt;GENIUS-96&lt;/del&gt;&lt;/a&gt;) &lt;/p&gt;</description>
                <environment></environment>
        <key id="28694">GENIUS-97</key>
            <summary>IdPoolListener OOM</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="1" iconUrl="https://jira.opendaylight.org/images/icons/priorities/blocker.svg">Highest</priority>
                        <status id="10004" iconUrl="https://jira.opendaylight.org/images/icons/status_generic.gif" description="">Verified</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10003">Cannot Reproduce</resolution>
                                        <assignee username="Kency">Kency Kurian</assignee>
                                    <reporter username="vorburger">Michael Vorburger</reporter>
                        <labels>
                    </labels>
                <created>Fri, 3 Nov 2017 22:06:00 +0000</created>
                <updated>Thu, 19 Apr 2018 14:39:50 +0000</updated>
                            <resolved>Thu, 19 Apr 2018 14:39:50 +0000</resolved>
                                    <version>Nitrogen-SR1</version>
                                    <fixVersion>Nitrogen-SR1</fixVersion>
                                        <due></due>
                            <votes>0</votes>
                                    <watches>7</watches>
                                                                                                                <comments>
                            <comment id="59910" author="vorburger" created="Fri, 3 Nov 2017 22:21:08 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=tpantelis&quot; class=&quot;user-hover&quot; rel=&quot;tpantelis&quot;&gt;tpantelis&lt;/a&gt; points out that:&lt;/p&gt;

&lt;blockquote&gt;&lt;p&gt;Looks like an Executor queue is backed up - something&apos;s probably stuck - we need to get thread dumps.&lt;/p&gt;&lt;/blockquote&gt;

&lt;p&gt;Does anyone know any automatic way to get the JVM to do that, just like the -XX:+HeapDumpOnOutOfMemoryError makes it create the HRPOF? Or we ask folks to keep doing jstack up to until it boums - is that the best we can offer? (Perhaps I could come up with some nifty .. thingie in infrautils that does this - start logging WARN level if the thread count is above a (configurable) threshold, that would be quite useful in this kind of situation, wouldn&apos;t it?)&lt;/p&gt;</comment>
                            <comment id="59912" author="tpantelis" created="Fri, 3 Nov 2017 23:25:09 +0000"  >&lt;p&gt;It&apos;s probably not the thread count - it&apos;s a particular thread that gets stuck. Maybe the app code gets deadlocked or blocking on something (there&apos;s a lot of that).  Or it&apos;s the cumulative effect of many queued tasks that are slow, ie the classic producer faster than the consumer. All of the listeners and JC use unbounded queues so if something gets stuck or runs slowly, it is inevitable a queue will back up and hit OOM. &lt;/p&gt;

&lt;p&gt;What the testers are doing?  Scale testing? longevity? is it reproducible? Are they tracking the memory usage?  If so, as it climbs then start taking thread dumps. &lt;/p&gt;</comment>
                            <comment id="59916" author="faseela.k@ericsson.com" created="Sat, 4 Nov 2017 02:04:48 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=Kency&quot; class=&quot;user-hover&quot; rel=&quot;Kency&quot;&gt;Kency&lt;/a&gt; will you be able to analyze this? I don&apos;t think it is IdPoolListener, probably too many other things have slowed down the system, and this is just an after effect&lt;/p&gt;</comment>
                            <comment id="59918" author="vorburger" created="Sat, 4 Nov 2017 02:24:47 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/browse/INFRAUTILS-23&quot; title=&quot;Remove Executors.newSingleThreadExecutor() and replace users with SpecialExecutors.newBoundedSingleThreadExecutor()&quot; class=&quot;issue-link&quot; data-issue-key=&quot;INFRAUTILS-23&quot;&gt;INFRAUTILS-23&lt;/a&gt; proposes to switch to using bounded queues in listeners, and &lt;a href=&quot;https://jira.opendaylight.org/browse/INFRAUTILS-24&quot; title=&quot;JobCoordinator should use bounded Executors and bound its own job queue&quot; class=&quot;issue-link&quot; data-issue-key=&quot;INFRAUTILS-24&quot;&gt;&lt;del&gt;INFRAUTILS-24&lt;/del&gt;&lt;/a&gt; the same for JC.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/browse/INFRAUTILS-21&quot; title=&quot;Automatic deadlock detection logging&quot; class=&quot;issue-link&quot; data-issue-key=&quot;INFRAUTILS-21&quot;&gt;&lt;del&gt;INFRAUTILS-21&lt;/del&gt;&lt;/a&gt; and maybe &lt;a href=&quot;https://jira.opendaylight.org/browse/INFRAUTILS-22&quot; title=&quot;Automatic threads over threshold detection and logging&quot; class=&quot;issue-link&quot; data-issue-key=&quot;INFRAUTILS-22&quot;&gt;&lt;del&gt;INFRAUTILS-22&lt;/del&gt;&lt;/a&gt; will do something re. automatic way to dump deadlocked threads (there&apos;s no readily available JVM option AFAIK).&lt;/p&gt;

&lt;p&gt;We need to fix the bug hit during testing in this JIRA issue; I&apos;ll get us a jstack.&lt;/p&gt;</comment>
                            <comment id="59921" author="pperiyasamy" created="Sat, 4 Nov 2017 08:46:02 +0000"  >&lt;p&gt;The task (in update method) being executed under IdPoolListener is a short-lived one which i don&apos;t think causes a thread starvation. &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; is invoked only in a corner case where updating local id pool when reclamation happen from other local id pool due to id exhaustion in node&apos;s local and parent id pools. Does that happen in your tests ?&lt;/p&gt;

&lt;p&gt;It&apos;s better to look at jstack and jmap to see what is actually going on.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/gitweb?p=genius.git;a=blob;f=idmanager/idmanager-impl/src/main/java/org/opendaylight/genius/idmanager/IdPoolListener.java;h=adc77023c371853543745fe50ac4f649c7c5bab6;hb=refs/heads/master#l67&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/gitweb?p=genius.git;a=blob;f=idmanager/idmanager-impl/src/main/java/org/opendaylight/genius/idmanager/IdPoolListener.java;h=adc77023c371853543745fe50ac4f649c7c5bab6;hb=refs/heads/master#l67&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="59945" author="klou" created="Mon, 6 Nov 2017 20:47:21 +0000"  >&lt;p&gt;Is this issue related to &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-974&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;https://jira.opendaylight.org/browse/NETVIRT-974&lt;/a&gt; ?&lt;/p&gt;

&lt;p&gt;Is this truly a blocker for Nitrogen-SR1?  Thanks!&lt;/p&gt;</comment>
                            <comment id="59951" author="vorburger" created="Mon, 6 Nov 2017 22:18:08 +0000"  >&lt;p&gt;&amp;gt; Is this issue related to &lt;a href=&quot;https://jira.opendaylight.org/browse/NETVIRT-974&quot; class=&quot;external-link&quot; rel=&quot;nofollow&quot;&gt;https://jira.opendaylight.org/browse/NETVIRT-974&lt;/a&gt; ?&lt;/p&gt;

&lt;p&gt;no, not at all. But it could turn out to have one and the same single cause  as &lt;a href=&quot;https://jira.opendaylight.org/browse/GENIUS-96&quot; title=&quot;DataStoreJobCoordinator OOM&quot; class=&quot;issue-link&quot; data-issue-key=&quot;GENIUS-96&quot;&gt;&lt;del&gt;GENIUS-96&lt;/del&gt;&lt;/a&gt; though - we don&apos;t know yet.&lt;/p&gt;

&lt;p&gt;&amp;gt; Is this truly a blocker for Nitrogen-SR1? Thanks!&lt;/p&gt;

&lt;p&gt;Yup.&lt;/p&gt;</comment>
                            <comment id="59969" author="vorburger" created="Tue, 7 Nov 2017 14:45:37 +0000"  >&lt;p&gt;Attached jstack_info.log.xz has 67x jstack taken once per minute leading up to the OOM. Anyone spotting anything useful here?&lt;/p&gt;</comment>
                            <comment id="59978" author="vorburger" created="Tue, 7 Nov 2017 17:02:04 +0000"  >&lt;p&gt;This can only be reproduced with latest stable/Nitrogen HEAD (which will be SR1), NOT with the first September 26 Nitro, so recently broke.&lt;/p&gt;</comment>
                            <comment id="59987" author="klou" created="Tue, 7 Nov 2017 22:41:48 +0000"  >&lt;p&gt;Do we have ETA on resolution?  Need input to assess how far we have to push out Nitrogen-SR1.  Thanks!&lt;/p&gt;</comment>
                            <comment id="59993" author="vorburger" created="Wed, 8 Nov 2017 02:02:58 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=klou&quot; class=&quot;user-hover&quot; rel=&quot;klou&quot;&gt;klou&lt;/a&gt; ETA is when it&apos;s Fixed. We can try to do earlier then when it&apos;s Done, but we would need a time machine.&lt;/p&gt;</comment>
                            <comment id="59998" author="vorburger" created="Wed, 8 Nov 2017 18:22:38 +0000"  >&lt;p&gt; &lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=klou&quot; class=&quot;user-hover&quot; rel=&quot;klou&quot;&gt;klou&lt;/a&gt; &amp;amp; &lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=vivekshiv&quot; class=&quot;user-hover&quot; rel=&quot;vivekshiv&quot;&gt;vivekshiv&lt;/a&gt; FYI I&apos;m hoping to update and possibly close this one tomorrow, currentlywaiting for feedback from &lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=ltomasbo%40redhat.com&quot; class=&quot;user-hover&quot; rel=&quot;ltomasbo@redhat.com&quot;&gt;ltomasbo@redhat.com&lt;/a&gt; ...&lt;/p&gt;</comment>
                            <comment id="60000" author="vorburger" created="Wed, 8 Nov 2017 20:36:15 +0000"  >&lt;p&gt;Closing as CANNOT REPRO, because &lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=ltomasbo&quot; class=&quot;user-hover&quot; rel=&quot;ltomasbo&quot;&gt;ltomasbo&lt;/a&gt;  has clarified that he only hits an OOM with the 512 MB instead of 2 GB due to devstack, see &lt;a href=&quot;https://jira.opendaylight.org/browse/GENIUS-96&quot; title=&quot;DataStoreJobCoordinator OOM&quot; class=&quot;issue-link&quot; data-issue-key=&quot;GENIUS-96&quot;&gt;&lt;del&gt;GENIUS-96&lt;/del&gt;&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;What was particularly confusing me here was that the attached Leak Suspects MAT report I produced from his HPROF &lt;b&gt;does&lt;/b&gt; show 2 GB, BUT this was only because: (quote) &lt;em&gt;&quot;probably the previous one left the env (DB) unstable, with existing ports and flows... I then removed the env, recreated it with 0.7.1 (= SR1 to-be) with 2 GB on a completely fresh install. I&apos;m re-testing it and is not seeing the OOM anymore.&quot;&lt;/em&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10000">
                    <name>Blocks</name>
                                                                <inwardlinks description="is blocked by">
                                        <issuelink>
            <issuekey id="28697">INFRAUTILS-23</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                            <attachment id="14214" name="java_pid65530_Leak_Suspects.zip" size="117540" author="vorburger" created="Fri, 3 Nov 2017 22:06:26 +0000"/>
                            <attachment id="14219" name="jstack_info.log.xz" size="120188" author="vorburger" created="Tue, 7 Nov 2017 14:42:20 +0000"/>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    <customfield id="customfield_10202" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Priority</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10313"><![CDATA[Highest]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i03793:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>