<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 19:56:12 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[CONTROLLER-1693] UnreachableMember during remove-shard-replica prevents new leader to get elected</title>
                <link>https://jira.opendaylight.org/browse/CONTROLLER-1693</link>
                <project id="10113" key="CONTROLLER">controller</project>
                    <description>&lt;p&gt;This manifests as a CSIT failure &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt;. The UnreachableMember is another issue (&lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1645&quot; title=&quot;shard moved during 1M bgp prefix advertizing (with tell-based=true)&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1645&quot;&gt;CONTROLLER-1645&lt;/a&gt; for example). It is possible that cluster members end up with an inconsistent shard configuration.&lt;/p&gt;

&lt;p&gt;Karaf.log on member-1 &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; shows the replica removal started at 01:55:43,472, then this happened:&lt;br/&gt;
2017-05-22 01:56:06,569 | WARN  | lt-dispatcher-32 | aftActorLeadershipTransferCohort | 193 - org.opendaylight.controller.sal-akka-raft - 1.5.0.Carbon | member-1-shard-default-config: Failed to transfer leadership in 10.01 s&lt;br/&gt;
2017-05-22 01:56:06,572 | INFO  | lt-dispatcher-22 | Shard                            | 192 - org.opendaylight.controller.sal-clustering-commons - 1.5.0.Carbon | Stopping Shard member-1-shard-default-config&lt;/p&gt;

&lt;p&gt;Finally, the test teardown started adding the replica back at 01:56:28,959.&lt;br/&gt;
Thus, even though the test waited 45 seconds, the other members had only about 20 seconds to realize the previous leader was gone (we can add more time to the test if needed).&lt;/p&gt;

&lt;p&gt;As the member-3 karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; shows no activity between 01:56:03,165 and 01:56:56,244, it looks like member-1 was perhaps somehow still the leader, but the &quot;has no leader&quot; response &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; from member-1 when adding the replica back proves there really was no leader, at least from member-1's point of view.&lt;/p&gt;

&lt;p&gt;Every member shows multiple UnreachableMember messages. It is unclear whether the subsequent ones are the cause or the result of the missing leader.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/log.html.gz#s1-s36-t1-k2-k13-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/log.html.gz#s1-s36-t1-k2-k13-k1-k3-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/odl1_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/odl1_karaf.log.gz&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/odl3_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/odl3_karaf.log.gz&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/log.html.gz#s1-s36-t1-k2-k14-k2-k3-k1-k4-k7-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/715/archives/log.html.gz#s1-s36-t1-k2-k14-k2-k3-k1-k4-k7-k1&lt;/a&gt;&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="26247">CONTROLLER-1693</key>
            <summary>UnreachableMember during remove-shard-replica prevents new leader to get elected</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="-1">Unassigned</assignee>
                                    <reporter username="vrpolak">Vratko Polak</reporter>
                        <labels>
                    </labels>
                <created>Mon, 22 May 2017 08:47:46 +0000</created>
                <updated>Tue, 25 Jul 2023 08:24:40 +0000</updated>
                            <resolved>Mon, 18 Sep 2017 12:12:37 +0000</resolved>
                                                                    <component>clustering</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                                                                <comments>
                            <comment id="52299" author="vrpolak" created="Tue, 23 May 2017 11:34:06 +0000"  >&lt;p&gt;This seems to be a stable failure for listener tests when the listener is located on the leader member and the shard replica is removed there. It happened in two runs in a row, for both module-based and prefix-based shard tests.&lt;/p&gt;

&lt;p&gt;Example karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt;: from 09:14:05,199 to 09:14:57,946.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/720/archives/odl2_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/720/archives/odl2_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52300" author="vrpolak" created="Tue, 23 May 2017 11:49:35 +0000"  >&lt;p&gt;&amp;gt; when the listener is located on the leader member and shard replica is removed there&lt;/p&gt;

&lt;p&gt;It also happened once &lt;span class=&quot;error&quot;&gt;&amp;#91;5&amp;#93;&lt;/span&gt; when the (prefix-based shard) replica was removed from a leader on a different member than the listener.&lt;br/&gt;
Previously, that scenario was running into &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1694&quot; title=&quot;unsubscribe-dtcl fails with NullPointerException when UnreachableMember happens&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1694&quot;&gt;&lt;del&gt;CONTROLLER-1694&lt;/del&gt;&lt;/a&gt; instead.&lt;/p&gt;

&lt;p&gt;&amp;gt; Example karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;From 09:29:43,066 to 09:31:27,574.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;5&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/720/archives/log.html.gz#s1-s38-t3-k2-k12-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/720/archives/log.html.gz#s1-s38-t3-k2-k12-k1-k3-k1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52301" author="pgubka@cisco.com" created="Tue, 23 May 2017 18:48:22 +0000"  >&lt;p&gt;New debug logs &lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/10/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/10/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;ODL used from &lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/37/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/37/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52302" author="pgubka@cisco.com" created="Thu, 25 May 2017 08:53:10 +0000"  >&lt;p&gt;New debug logs (including akka)&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/12/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/12/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;ODL was built from controller=63/57763/2:99/57699/3 at&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/43&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/43&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52303" author="pgubka@cisco.com" created="Thu, 25 May 2017 09:51:08 +0000"  >&lt;p&gt;New debug logs (including akka)&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/13/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/13/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;ODL built from controller=70/57770/4:99/57699/3 at&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/45/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/45/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Shard without the leader&lt;br/&gt;
&lt;a href=&quot;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/13/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/13/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52304" author="pgubka@cisco.com" created="Fri, 26 May 2017 10:03:30 +0000"  >&lt;p&gt;Testing without akka logs:&lt;br/&gt;
----------------------------&lt;br/&gt;
ODL built from: controller=22/57822/1&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/49/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/49/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Testing job: &lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/21/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/21/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Link to problem: &lt;a href=&quot;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/21/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/21/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Testing with akka logs:&lt;br/&gt;
----------------------------&lt;br/&gt;
ODL built from: controller=22/57822/1:99/57699/5&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/48/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/view/integration/job/integration-multipatch-test-carbon/48/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Testing job: &lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/20/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon-2nd/20/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Link to problem: &lt;a href=&quot;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/20/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/sandbox/jenkins091/controller-csit-3node-clustering-only-carbon-2nd/20/archives/log.html.gz#s1-s2-t1-k2-k13-k1-k3-k1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52305" author="vrpolak" created="Thu, 8 Jun 2017 15:10:03 +0000"  >&lt;p&gt;This is still present on Releng &lt;span class=&quot;error&quot;&gt;&amp;#91;19&amp;#93;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;19&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/736/log.html.gz#s1-s38-t3-k2-k12-k1-k3-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/736/log.html.gz#s1-s38-t3-k2-k12-k1-k3-k1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52306" author="vrpolak" created="Fri, 9 Jun 2017 14:57:36 +0000"  >&lt;p&gt;Marking &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1706&quot; title=&quot;Large transaction traffic prevents leader to be moved&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1706&quot;&gt;&lt;del&gt;CONTROLLER-1706&lt;/del&gt;&lt;/a&gt; as dependency.&lt;/p&gt;

&lt;p&gt;If the new election takes too long, a well-timed UnreachableMember could actually make the election finish sooner.&lt;br/&gt;
So if the suite still fails, we should be fixing &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1706&quot; title=&quot;Large transaction traffic prevents leader to be moved&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1706&quot;&gt;&lt;del&gt;CONTROLLER-1706&lt;/del&gt;&lt;/a&gt; instead of this.&lt;/p&gt;

&lt;p&gt;Also, it will save me some time, as now I can assign failures to &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1706&quot; title=&quot;Large transaction traffic prevents leader to be moved&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1706&quot;&gt;&lt;del&gt;CONTROLLER-1706&lt;/del&gt;&lt;/a&gt; without karaf.log investigation.&lt;/p&gt;

&lt;p&gt;Keeping this open for now, as it is possible that UnreachableMember could mess with elections even after &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1706&quot; title=&quot;Large transaction traffic prevents leader to be moved&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1706&quot;&gt;&lt;del&gt;CONTROLLER-1706&lt;/del&gt;&lt;/a&gt; is fixed.&lt;/p&gt;</comment>
                            <comment id="52307" author="vrpolak" created="Mon, 18 Sep 2017 12:12:37 +0000"  >&lt;p&gt;&amp;gt; Keeping this open for now, as it is possible that UnreachableMember&lt;br/&gt;
&amp;gt; could mess with elections even after &lt;a href=&quot;https://jira.opendaylight.org/browse/CONTROLLER-1706&quot; title=&quot;Large transaction traffic prevents leader to be moved&quot; class=&quot;issue-link&quot; data-issue-key=&quot;CONTROLLER-1706&quot;&gt;&lt;del&gt;CONTROLLER-1706&lt;/del&gt;&lt;/a&gt; is fixed.&lt;/p&gt;

&lt;p&gt;No such failures were seen, marking as fixed.&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10000">
                    <name>Blocks</name>
                                                                <inwardlinks description="is blocked by">
                                        <issuelink>
            <issuekey id="26260">CONTROLLER-1706</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8524</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=8524]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10206" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Issue Type</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10300"><![CDATA[Bug]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i02s5b:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>