<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 19:55:46 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[CONTROLLER-1524] Clustering: Jolokia fails to report joining after 2 members were killed</title>
                <link>https://jira.opendaylight.org/browse/CONTROLLER-1524</link>
                <project id="10113" key="CONTROLLER">controller</project>
                    <description>&lt;p&gt;This is a recent Boron regression detected by system tests.&lt;/p&gt;

&lt;p&gt;The test scenario starts with 3-node cluster, then it kills 2 members, and then it boots one member back up. Failure occurs after the booted-up member reports it is in sync, the symptom is that 30 seconds is not enough for a (car-config) shard leader to appear.&lt;/p&gt;

&lt;p&gt;During May, sync, was achieved typically in 36 seconds &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;, and shard leader was confirmed in under 1 second &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;. In June, sync status is achieved in 18 seconds &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;, but searching for shard leader still shows NPE in the jolokia output after 30 seconds &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;Looking at karaf.log of the booting-up member (index 1) &lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt;, 44 second gap is seen, without any explanation what was jolokia waiting for:&lt;/p&gt;

&lt;p&gt;2016-06-06 09:02:52,218 | INFO  | config-pusher    | ConfigPusherImpl                 | 102 - org.opendaylight.controller.config-persister-impl - 0.5.0.SNAPSHOT | Successfully pushed configuration snapshot &lt;br/&gt;
10-restconf-service.xml(odl-restconf,odl-restconf)&lt;br/&gt;
2016-06-06 09:03:36,456 | INFO  | lt-dispatcher-43 | kka://opendaylight-cluster-data) | 139 - com.typesafe.akka.slf4j - 2.4.4 | Cluster Node &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.155:2550&amp;#93;&lt;/span&gt; - Node &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; is JOINING, roles &lt;span class=&quot;error&quot;&gt;&amp;#91;member-2&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;And this is from karaf.log of the member that was still up (index 3):&lt;br/&gt;
2016-06-06 09:02:50,297 | INFO  | ult-dispatcher-4 | ShardManager                     | 155 - org.opendaylight.controller.sal-distributed-datastore - 1.4.0.SNAPSHOT | shard-manager-operational: Received role changed for member-3-shard-entity-ownership-operational from Candidate to Leader&lt;br/&gt;
2016-06-06 09:02:50,297 | INFO  | ult-dispatcher-4 | ShardManager                     | 155 - org.opendaylight.controller.sal-distributed-datastore - 1.4.0.SNAPSHOT | shard-manager-operational: All Shards are ready - data store operational is ready, available count is 0&lt;br/&gt;
2016-06-06 09:02:51,003 | WARN  | lt-dispatcher-23 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:02:56,014 | WARN  | lt-dispatcher-31 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:01,065 | WARN  | ult-dispatcher-4 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:06,088 | WARN  | lt-dispatcher-16 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:11,135 | WARN  | lt-dispatcher-43 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:16,153 | WARN  | lt-dispatcher-15 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:21,163 | WARN  | ult-dispatcher-4 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:26,174 | WARN  | lt-dispatcher-42 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:31,223 | WARN  | lt-dispatcher-43 | ReliableDeliverySupervisor       | 139 - com.typesafe.akka.slf4j - 2.4.4 | Association with remote system &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt; has failed, address is now gated for &lt;span class=&quot;error&quot;&gt;&amp;#91;5000&amp;#93;&lt;/span&gt; ms. Reason: [Association failed with &lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.11.182:2550&amp;#93;&lt;/span&gt;] Caused by: &lt;span class=&quot;error&quot;&gt;&amp;#91;Connection refused: /10.30.11.182:2550&amp;#93;&lt;/span&gt;&lt;br/&gt;
2016-06-06 09:03:37,180 | INFO  | ult-dispatcher-2 | ShardInformation                 | 155 - org.opendaylight.controller.sal-distributed-datastore - 1.4.0.SNAPSHOT | updatePeerAddress for peer member-2-shard-default-config with address akka.tcp://opendaylight-cluster-data@10.30.11.182:2550/user/shardmanager-config/member-2-shard-default-config&lt;/p&gt;

&lt;p&gt;Note that data manipulation test cases were successful during the time member-2 was down, but both karaf.log segments only show activity with respect to member-2.&lt;/p&gt;

&lt;p&gt;Strangely, after member-2 was brought up, member-1 started reporting its shard status correctly, but member-2 started to show the NPE instead (making subsequent tests fail).&lt;br/&gt;
Here is the the beginning of the NPE:&lt;br/&gt;
{&quot;request&quot;:&lt;/p&gt;
{&quot;mbean&quot;:&quot;org.opendaylight.controller:Category=Shards,name=member-2-shard-car-config,type=DistributedConfigDatastore&quot;,&quot;type&quot;:&quot;read&quot;}
&lt;p&gt;,&quot;stacktrace&quot;:&quot;java.lang.NullPointerException\n\tat com.google.common.base.Preconditions.checkNotNull(Preconditions.java:210)\n\tat com.google.common.base.Joiner.toString(Joiner.java:433)\n\tat com.google.common.base.Joiner$MapJoiner.appendTo(Joiner.java:359)\n\tat com.google.common.base.Joiner$MapJoiner.appendTo(Joiner.java:393)\n\tat com.google.common.base.Joiner$MapJoiner.join(Joiner.java:419)\n\tat com.google.common.base.Joiner$MapJoiner.join(Joiner.java:408)\n\tat com.google.common.base.Joiner$MapJoiner.join(Joiner.java:330)\n\tat org.opendaylight.controller.cluster.datastore.jmx.mbeans.shard.ShardStats.toStringMap(ShardStats.java:319)\n\tat org.opendaylight.controller.cluster.datastore.jmx.mbeans.shard.ShardStats.getPeerAddresses(ShardStats.java:315)\n&lt;br/&gt;
...&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/513/robot/report/log.html#s1-s5-t3-k2-k5&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/513/robot/report/log.html#s1-s5-t3-k2-k5&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/513/robot/report/log.html#s1-s5-t3-k3&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/513/robot/report/log.html#s1-s5-t3-k3&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/529/robot/report/log.html#s1-s5-t3-k2-k5&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/529/robot/report/log.html#s1-s5-t3-k2-k5&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/529/robot/report/log.html#s1-s5-t3-k3-k14-k1-k1-k1-k1-k5-k2-k1-k5-k1-k4-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/529/robot/report/log.html#s1-s5-t3-k3-k14-k1-k1-k1-k1-k5-k2-k1-k5-k1-k4-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/lastSuccessfulBuild/artifact/odl1_karaf.log.xz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/controller/job/controller-csit-3node-clustering-only-boron/lastSuccessfulBuild/artifact/odl1_karaf.log.xz&lt;/a&gt;&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="26078">CONTROLLER-1524</key>
            <summary>Clustering: Jolokia fails to report joining after 2 members were killed</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="-1">Unassigned</assignee>
                                    <reporter username="vrpolak">Vratko Polak</reporter>
                        <labels>
                    </labels>
                <created>Mon, 6 Jun 2016 13:51:52 +0000</created>
                <updated>Tue, 25 Jul 2023 08:24:10 +0000</updated>
                            <resolved>Tue, 14 Jun 2016 17:54:36 +0000</resolved>
                                                                    <component>clustering</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>3</watches>
                                                                                                                <comments>
                            <comment id="51443" author="vrpolak" created="Mon, 6 Jun 2016 14:15:20 +0000"  >&lt;p&gt;Also note that there are previous suites which pass. Car_Failover_Crud kills at most 1 member at a time, Car_Persistence_Recovery kills all 3 members (and then brings them up) at once.&lt;/p&gt;</comment>
                            <comment id="51444" author="ecelgp" created="Wed, 8 Jun 2016 07:14:48 +0000"  >&lt;p&gt;This is regression vs Beryllium, therefore rising priority to critical.&lt;/p&gt;</comment>
                            <comment id="51445" author="tpantelis" created="Tue, 14 Jun 2016 02:21:40 +0000"  >&lt;p&gt;The NPE is fixed by &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/40173/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/40173/&lt;/a&gt;.&lt;/p&gt;</comment>
                            <comment id="51446" author="ecelgp" created="Tue, 14 Jun 2016 17:54:36 +0000"  >&lt;p&gt;I agree, controller cluster jobs are back GREEN.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>6013</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=6013]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10206" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Issue Type</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10300"><![CDATA[Bug]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i02r3r:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>