<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 19:56:09 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>
    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[CONTROLLER-1674] Timeout waiting for task in producer when moving shard leader</title>
                <link>https://jira.opendaylight.org/browse/CONTROLLER-1674</link>
                <project id="10113" key="CONTROLLER">controller</project>
                    <description>&lt;p&gt;This happened &lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; in a suite for module-based shards with tell-based protocol.&lt;/p&gt;

&lt;p&gt;Karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; shows an unexpected UnreachableMember at 06:17:00,808, followed by many warnings like this:&lt;br/&gt;
2017-05-14 06:17:16,519 | WARN  | ult-dispatcher-2 | FrontendClientMetadataBuilder    | 199 - org.opendaylight.controller.sal-distributed-datastore - 1.5.0.Carbon | member-2-shard-default-config: Unknown history for commited transaction member-2-datastore-config-fe-0-chn-2-txn-2-0, ignoring&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;0&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/684/archives/log.html.gz#s1-s29-t1-k2-k9-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/684/archives/log.html.gz#s1-s29-t1-k2-k9-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/684/archives/odl2_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/684/archives/odl2_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</description>
                <environment>&lt;p&gt;Operating System: All&lt;br/&gt;
Platform: All&lt;/p&gt;</environment>
        <key id="26228">CONTROLLER-1674</key>
            <summary>Timeout waiting for task in producer when moving shard leader</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                                <status id="5" iconUrl="https://jira.opendaylight.org/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="green"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="rovarga">Robert Varga</assignee>
                                    <reporter username="vrpolak">Vratko Polak</reporter>
                        <labels>
                    </labels>
                <created>Mon, 15 May 2017 10:38:13 +0000</created>
                <updated>Tue, 25 Jul 2023 08:24:37 +0000</updated>
                            <resolved>Mon, 3 Jul 2017 12:09:51 +0000</resolved>
                                                                    <component>clustering</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                                                                <comments>
                            <comment id="52198" author="vrpolak" created="Wed, 17 May 2017 10:38:54 +0000"  >&lt;p&gt;Today, the writer timed out &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; at the RESTCONF level, without returning a timeout error of its own. Karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; shows no suspicious messages within the test case duration (between 03:01:22,254 and 03:02:52,753).&lt;/p&gt;

&lt;p&gt;Note that this run did not yet include the potential fix &lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/692/archives/log.html.gz#s1-s24-t1-k2-k8&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/692/archives/log.html.gz#s1-s24-t1-k2-k8&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/692/archives/odl2_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/692/archives/odl2_karaf.log.gz&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/56874&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/56874&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52199" author="vrpolak" created="Thu, 18 May 2017 13:20:58 +0000"  >&lt;p&gt;This is still happening. The good news is that now only one writer reports this error &lt;span class=&quot;error&quot;&gt;&amp;#91;5&amp;#93;&lt;/span&gt;; the others pass (albeit with lower performance, as they wait for submits).&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;5&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/694/archives/log.html.gz#s1-s36-t3-k2-k13&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/694/archives/log.html.gz#s1-s36-t3-k2-k13&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52200" author="vrpolak" created="Thu, 18 May 2017 15:31:34 +0000"  >&lt;p&gt;Something similar &lt;span class=&quot;error&quot;&gt;&amp;#91;6&amp;#93;&lt;/span&gt; happened in another scenario. The previous scenario has a listener and moves the leader gracefully; this one has no listener but isolates the leader for a brief time. Both scenarios use module-based shards with tell-based protocol and have 3 transaction writers, 2 of which now finish correctly.&lt;/p&gt;

&lt;p&gt;But the exceptions are slightly different. The graceful movement response starts with:&lt;br/&gt;
{&quot;errors&quot;:{&quot;error&quot;:[{&quot;error-type&quot;:&quot;application&quot;,&quot;error-tag&quot;:&quot;operation-failed&quot;,&quot;error-message&quot;:&quot;Unexpected-exception&quot;,&quot;error-info&quot;:&quot;java.util.concurrent.TimeoutException: Timeout waiting for task.\n\tat com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:269)\n\tat com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:96)\n\tat com.google.common.util.concurrent.ForwardingFuture.get(ForwardingFuture.java:69)\n\tat org.opendaylight.yangtools.util.concurrent.MappingCheckedFuture.get(MappingCheckedFuture.java:85)\n\tat &lt;/p&gt;

&lt;p&gt;The brief isolation response starts with:&lt;br/&gt;
{&quot;errors&quot;:{&quot;error&quot;:[{&quot;error-type&quot;:&quot;application&quot;,&quot;error-tag&quot;:&quot;operation-failed&quot;,&quot;error-message&quot;:&quot;Unexpected-exception&quot;,&quot;error-info&quot;:&quot;java.util.concurrent.ExecutionException: TransactionCommitFailedException{message=canCommit encountered an unexpected failure, errorList=[RpcError [message=canCommit encountered an unexpected failure, severity=ERROR, errorType=APPLICATION, tag=operation-failed, applicationTag=null, info=null, cause=org.opendaylight.controller.cluster.access.client.RequestTimeoutException: Timed out after 144970935433ns]]}\n\tat com.google.common.util.concurrent.AbstractFuture$Sync.getValue(AbstractFuture.java:299)\n\tat com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:272)\n\tat com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:96)\n\tat org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.maybeFinish(WriteTransactionsHandler.java:238)\n\tat org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.run(WriteTransactionsHandler.java:112)\n\tat &lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;6&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/694/archives/log.html.gz#s1-s28-t1-k2-k25-k1-k2-k3-k1-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/694/archives/log.html.gz#s1-s28-t1-k2-k25-k1-k2-k3-k1-k1&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52201" author="vzelcamo@cisco.com" created="Mon, 29 May 2017 12:20:23 +0000"  >&lt;p&gt;Vratko, please add a comment when you have a new run. Thank you.&lt;/p&gt;</comment>
                            <comment id="52202" author="vrpolak" created="Mon, 29 May 2017 14:42:34 +0000"  >&lt;p&gt;This is still happening. In a recent Sandbox test &lt;span class=&quot;error&quot;&gt;&amp;#91;7&amp;#93;&lt;/span&gt; (which failed to upload the archive with logs), one writer failed with &quot;TimeoutException: Timeout waiting for task&quot; (as opposed to RequestTimeoutException), one finished successfully, and one exceeded the generous (440s) timeout for the Restconf response.&lt;/p&gt;

&lt;p&gt;This was the brief isolation scenario (module-based shards, tell-based protocol), with the &lt;span class=&quot;error&quot;&gt;&amp;#91;8&amp;#93;&lt;/span&gt; codebase.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;7&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon/6/robot/report/log.html#s1-s10-t1-k2-k25-k1-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/sandbox/job/controller-csit-3node-clustering-only-carbon/6/robot/report/log.html#s1-s10-t1-k2-k25-k1-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;8&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/#/c/57698/3&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/#/c/57698/3&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52203" author="vrpolak" created="Fri, 2 Jun 2017 20:15:15 +0000"  >&lt;p&gt;2017-06-01 23:37:21,958 | INFO  | h for user karaf | command                          | 265 - org.apache.karaf.log.command - 3.0.8 | ROBOT MESSAGE: Starting test Local_Leader_Shutdown&lt;br/&gt;
2017-06-01 23:37:27,553 | INFO  | qtp10706451-296  | ClusterAdminRpcService           | 201 - org.opendaylight.controller.sal-cluster-admin-impl - 1.5.1.SNAPSHOT | Removing replica for shard default memberName member-2, datastoreType Config&lt;br/&gt;
2017-06-01 23:37:27,569 | INFO  | lt-dispatcher-19 | ClusterAdminRpcService           | 201 - org.opendaylight.controller.sal-cluster-admin-impl - 1.5.1.SNAPSHOT | Successfully removed replica for shard default&lt;br/&gt;
2017-06-01 23:37:36,439 | INFO  | ult-dispatcher-4 | tActorServerConfigurationSupport | 193 - org.opendaylight.controller.sal-akka-raft - 1.5.1.SNAPSHOT | member-2-shard-default-config: ServerConfigurationPayload [serverConfig=[ServerInfo &lt;span class=&quot;error&quot;&gt;&amp;#91;id=member-1-shard-default-config, isVoting=true&amp;#93;&lt;/span&gt;, ServerInfo &lt;span class=&quot;error&quot;&gt;&amp;#91;id=member-3-shard-default-config, isVoting=true&amp;#93;&lt;/span&gt;]] has been successfully replicated to a majority of followers&lt;br/&gt;
2017-06-01 23:37:36,445 | INFO  | ult-dispatcher-4 | ShardManager                     | 199 - org.opendaylight.controller.sal-distributed-datastore - 1.5.1.SNAPSHOT | shard-manager-config: Received LeaderStateChanged message: LeaderStateChanged &lt;span class=&quot;error&quot;&gt;&amp;#91;memberId=member-2-shard-default-config, leaderId=null, leaderPayloadVersion=5&amp;#93;&lt;/span&gt;&lt;br/&gt;
2017-06-01 23:37:38,659 | INFO  | lt-dispatcher-32 | ClientActorBehavior              | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | member-2-frontend-datastore-config: connection ConnectedClientConnection{client=ClientIdentifier{frontend=member-2-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt;, sessionId=0, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree present=true}} indicated no leadership, reconnecting it&lt;/p&gt;
org.opendaylight.controller.cluster.access.commands.NotLeaderException: Actor Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt; is not the current leader&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleRequest(Shard.java:455)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleNonRaftCommand(Shard.java:293)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.raft.RaftActor.handleCommand(RaftActor.java:270)&lt;span class=&quot;error&quot;&gt;&amp;#91;193:org.opendaylight.controller.sal-akka-raft:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.common.actor.AbstractUntypedPersistentActor.onReceiveCommand(AbstractUntypedPersistentActor.java:31)&lt;span class=&quot;error&quot;&gt;&amp;#91;192:org.opendaylight.controller.sal-clustering-commons:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.onReceive(PersistentActor.scala:170)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.common.actor.MeteringBehavior.apply(MeteringBehavior.java:104)&lt;span class=&quot;error&quot;&gt;&amp;#91;192:org.opendaylight.controller.sal-clustering-commons:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell$$anonfun$become$1.applyOrElse(ActorCell.scala:544)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.Actor$class.aroundReceive(Actor.scala:502)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.akka$persistence$Eventsourced$$super$aroundReceive(PersistentActor.scala:168)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.Eventsourced$$anon$1.stateReceive(Eventsourced.scala:727)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.Eventsourced$class.aroundReceive(Eventsourced.scala:183)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.aroundReceive(PersistentActor.scala:168)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell.invoke(ActorCell.scala:495)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.run(Mailbox.scala:224)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.exec(Mailbox.scala:234)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;p&gt;2017-06-01 23:37:38,662 | INFO  | lt-dispatcher-32 | ClientActorBehavior              | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | member-2-frontend-datastore-config: connection ConnectedClientConnection{client=ClientIdentifier{frontend=member-2-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt;, sessionId=0, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree present=true}} reconnecting as ReconnectingClientConnection{client=ClientIdentifier{frontend=member-2-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt;, sessionId=0, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree present=true}}&lt;/p&gt;
2017-06-01 23:37:38,662 | INFO  | lt-dispatcher-32 | ClientActorBehavior              | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | member-2-frontend-datastore-config: refreshing backend for shard 0&lt;br/&gt;
2017-06-01 23:37:38,703 | WARN  | monPool-worker-2 | AbstractShardBackendResolver     | 199 - org.opendaylight.controller.sal-distributed-datastore - 1.5.1.SNAPSHOT | Failed to resolve shard&lt;br/&gt;
java.util.concurrent.TimeoutException: Connection attempt failed&lt;br/&gt;
	at org.opendaylight.controller.cluster.databroker.actors.dds.AbstractShardBackendResolver.wrap(AbstractShardBackendResolver.java:129)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.databroker.actors.dds.AbstractShardBackendResolver.lambda$connectShard$2(AbstractShardBackendResolver.java:142)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:760)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:736)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.CompletableFuture$Completion.exec(CompletableFuture.java:443)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:157)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
Caused by: org.opendaylight.controller.cluster.access.commands.NotLeaderException: Actor Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt; is not the current leader&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleConnectClient(Shard.java:436)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleNonRaftCommand(Shard.java:306)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.raft.RaftActor.handleCommand(RaftActor.java:270)&lt;span class=&quot;error&quot;&gt;&amp;#91;193:org.opendaylight.controller.sal-akka-raft:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.common.actor.AbstractUntypedPersistentActor.onReceiveCommand(AbstractUntypedPersistentActor.java:31)&lt;span class=&quot;error&quot;&gt;&amp;#91;192:org.opendaylight.controller.sal-clustering-commons:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.onReceive(PersistentActor.scala:170)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.common.actor.MeteringBehavior.apply(MeteringBehavior.java:104)&lt;span class=&quot;error&quot;&gt;&amp;#91;192:org.opendaylight.controller.sal-clustering-commons:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell$$anonfun$become$1.applyOrElse(ActorCell.scala:544)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.Actor$class.aroundReceive(Actor.scala:502)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.akka$persistence$Eventsourced$$super$aroundReceive(PersistentActor.scala:168)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.Eventsourced$$anon$1.stateReceive(Eventsourced.scala:727)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.Eventsourced$class.aroundReceive(Eventsourced.scala:183)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.persistence.UntypedPersistentActor.aroundReceive(PersistentActor.scala:168)&lt;span class=&quot;error&quot;&gt;&amp;#91;180:com.typesafe.akka.persistence:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.actor.ActorCell.invoke(ActorCell.scala:495)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.run(Mailbox.scala:224)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at akka.dispatch.Mailbox.exec(Mailbox.scala:234)&lt;span class=&quot;error&quot;&gt;&amp;#91;173:com.typesafe.akka.actor:2.4.18&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)&lt;span class=&quot;error&quot;&gt;&amp;#91;169:org.scala-lang.scala-library:2.11.11.v20170413-090219-8a413ba7cc&amp;#93;&lt;/span&gt;&lt;br/&gt;
2017-06-01 23:37:39,688 | INFO  | lt-dispatcher-23 | Shard                            | 192 - org.opendaylight.controller.sal-clustering-commons - 1.5.1.SNAPSHOT | member-2-shard-default-config (Leader): Term 3 in &quot;AppendEntriesReply &lt;span class=&quot;error&quot;&gt;&amp;#91;term=3, success=false, followerId=member-3-shard-default-config, logLastIndex=1047, logLastTerm=2, forceInstallSnapshot=false, payloadVersion=5, raftVersion=3&amp;#93;&lt;/span&gt;&quot; message is greater than leader&apos;s term 2 - switching to Follower&lt;br/&gt;
2017-06-01 23:37:39,689 | INFO  | lt-dispatcher-23 | Shard                            | 192 - org.opendaylight.controller.sal-clustering-commons - 1.5.1.SNAPSHOT | member-2-shard-default-config (Leader) :- Switching from behavior Leader to Follower, election term: 3&lt;br/&gt;
2017-06-01 23:37:39,689 | INFO  | lt-dispatcher-20 | ShardManager                     | 199 - org.opendaylight.controller.sal-distributed-datastore - 1.5.1.SNAPSHOT | shard-manager-config: Received LeaderStateChanged message: LeaderStateChanged &lt;span class=&quot;error&quot;&gt;&amp;#91;memberId=member-2-shard-default-config, leaderId=null, leaderPayloadVersion=-1&amp;#93;&lt;/span&gt;&lt;br/&gt;
2017-06-01 23:37:39,689 | INFO  | ult-dispatcher-4 | RoleChangeNotifier               | 192 - org.opendaylight.controller.sal-clustering-commons - 1.5.1.SNAPSHOT | RoleChangeNotifier for member-2-shard-default-config , received role change from Leader to Follower&lt;br/&gt;
2017-06-01 23:37:39,689 | INFO  | ult-dispatcher-4 | ShardManager                     | 199 - org.opendaylight.controller.sal-distributed-datastore - 1.5.1.SNAPSHOT | shard-manager-config: Received role changed for member-2-shard-default-config from Leader to Follower&lt;br/&gt;
2017-06-01 23:37:40,765 | INFO  | lt-dispatcher-22 | aftActorLeadershipTransferCohort | 193 - org.opendaylight.controller.sal-akka-raft - 1.5.1.SNAPSHOT | member-2-shard-default-config: Successfully transferred leadership to null in 4.320 s&lt;br/&gt;
2017-06-01 23:37:40,768 | INFO  | ult-dispatcher-3 | Shard                            | 192 - org.opendaylight.controller.sal-clustering-commons - 1.5.1.SNAPSHOT | Stopping Shard member-2-shard-default-config&lt;br/&gt;
&lt;p&gt;2017-06-01 23:37:50,032 | INFO  | lt-dispatcher-32 | AbstractClientConnection         | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | Capping ReconnectingClientConnection{client=ClientIdentifier{frontend=member-2-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;a href=&quot;#-579249557&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-579249557&lt;/a&gt;, sessionId=0, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree present=true}} throttle delay from 6 to 5 seconds&lt;br/&gt;
2017-06-01 23:40:15,167 | ERROR | pool-30-thread-1 | WriteTransactionsHandler         | 256 - org.opendaylight.controller.samples.clustering-it-provider - 1.5.1.SNAPSHOT | Write transactions failed.&lt;br/&gt;
java.util.concurrent.TimeoutException: Timeout waiting for task.&lt;br/&gt;
	at com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:269)&lt;span class=&quot;error&quot;&gt;&amp;#91;57:com.google.guava:18.0.0&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:96)&lt;span class=&quot;error&quot;&gt;&amp;#91;57:com.google.guava:18.0.0&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.maybeFinish(WriteTransactionsHandler.java:238)&lt;span class=&quot;error&quot;&gt;&amp;#91;256:org.opendaylight.controller.samples.clustering-it-provider:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.run(WriteTransactionsHandler.java:112)&lt;span class=&quot;error&quot;&gt;&amp;#91;256:org.opendaylight.controller.samples.clustering-it-provider:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at java.lang.Thread.run(Thread.java:745)&lt;span class=&quot;error&quot;&gt;&amp;#91;:1.8.0_121&amp;#93;&lt;/span&gt;&lt;br/&gt;
2017-06-01 23:40:15,367 | INFO  | qtp10706451-308  | ClusterAdminRpcService           | 201 - org.opendaylight.controller.sal-cluster-admin-impl - 1.5.1.SNAPSHOT | Adding replica for shard default&lt;/p&gt;</comment>
                            <comment id="52204" author="vrpolak" created="Mon, 5 Jun 2017 11:38:55 +0000"  >&lt;p&gt;The karaf log segment above is from Sandbox running against 57996/5 with cherry-picked 57699/5.&lt;/p&gt;

&lt;p&gt;The scenario uses a module-based shard (default-config) with tell-based protocol. The transaction writer is started on the leader member, then remove-shard-replica is called. A similar test case (writer on a follower, still calling remove-shard-replica on the leader) passed (the writer finished without reporting any failure).&lt;/p&gt;

&lt;p&gt;A similar failure happened on a recent RelEng run &lt;span class=&quot;error&quot;&gt;&amp;#91;9&amp;#93;&lt;/span&gt;, but the karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;10&amp;#93;&lt;/span&gt; segment between 13:53:54,938 and 13:56:47,140 shows that UnreachableMember occurred.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;9&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/733/log.html.gz#s1-s20-t1-k2-k10-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/733/log.html.gz#s1-s20-t1-k2-k10-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;10&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/733/odl1_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/733/odl1_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52205" author="vrpolak" created="Fri, 16 Jun 2017 11:40:57 +0000"  >&lt;p&gt;This symptom has reappeared; it now happens &lt;span class=&quot;error&quot;&gt;&amp;#91;11&amp;#93;&lt;/span&gt; consistently, but only in one test case. That test case exercises module-based shards (tell-based protocol) with a single writer started on a follower, after which cluster-admin:make-leader-local attempts to move the leader to the writer member. The test expects the writer to finish writing successfully.&lt;/p&gt;

&lt;p&gt;The full writer response:&lt;br/&gt;
{&quot;errors&quot;:{&quot;error&quot;:[{&quot;error-type&quot;:&quot;application&quot;,&quot;error-tag&quot;:&quot;operation-failed&quot;,&quot;error-message&quot;:&quot;Unexpected-exception&quot;,&quot;error-info&quot;:&quot;java.util.concurrent.TimeoutException: Timeout waiting for task.\n\tat com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:269)\n\tat com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:96)\n\tat org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.maybeFinish(WriteTransactionsHandler.java:234)\n\tat org.opendaylight.controller.clustering.it.provider.impl.WriteTransactionsHandler.run(WriteTransactionsHandler.java:112)\n\tat java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)\n\tat java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)\n\tat java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)\n\tat java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)\n\tat java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)\n\tat java.lang.Thread.run(Thread.java:748)\n&quot;}]}}&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;11&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/750/log.html.gz#s1-s24-t5-k2-k10&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/750/log.html.gz#s1-s24-t5-k2-k10&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52206" author="vrpolak" created="Wed, 21 Jun 2017 10:58:41 +0000"  >&lt;p&gt;&amp;gt; consistently but only in one test case&lt;/p&gt;

&lt;p&gt;Now I would say 50% of the time, still in only one &lt;span class=&quot;error&quot;&gt;&amp;#91;12&amp;#93;&lt;/span&gt; test case.&lt;br/&gt;
The response message is now more specific: &quot;Final submit was timed out by the test provider or was interrupted&quot;.&lt;/p&gt;

&lt;p&gt;The writer took 160 seconds to respond.&lt;/p&gt;

&lt;p&gt;A few suspicious lines from the writer member's karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;13&amp;#93;&lt;/span&gt;, starting at 07:45:22,131, follow. The first one is especially confusing: the connection has already been reconnected to the new (local, member-2) leader, yet it is still reporting a NotLeaderException from the previous (remote, member-1) leader.&lt;/p&gt;

&lt;p&gt;2017-06-21 07:45:38,519 | INFO  | lt-dispatcher-20 | ClientActorBehavior              | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | member-2-frontend-datastore-config: connection ConnectedClientConnection{client=ClientIdentifier{frontend=member-2-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor[akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#494504499], sessionId=1, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree present=true}} indicated no leadership, reconnecting it&lt;br/&gt;
org.opendaylight.controller.cluster.access.commands.NotLeaderException: Actor Actor[akka://opendaylight-cluster-data/user/shardmanager-config/member-1-shard-default-config#-1667137708] is not the current leader&lt;/p&gt;

&lt;p&gt;2017-06-21 07:45:38,624 | ERROR | lt-dispatcher-22 | OneForOneStrategy                | 174 - com.typesafe.akka.slf4j - 2.4.18 | Transaction member-2-datastore-config-fe-0-chn-4-txn-5365-0 has already been sealed&lt;br/&gt;
java.lang.IllegalStateException: Transaction member-2-datastore-config-fe-0-chn-4-txn-5365-0 has already been sealed&lt;br/&gt;
	at com.google.common.base.Preconditions.checkState(Preconditions.java:197)&lt;span class=&quot;error&quot;&gt;&amp;#91;57:com.google.guava:18.0.0&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.databroker.actors.dds.AbstractProxyTransaction.checkNotSealed(AbstractProxyTransaction.java:363)&lt;span class=&quot;error&quot;&gt;&amp;#91;199:org.opendaylight.controller.sal-distributed-datastore:1.5.1.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;2017-06-21 07:45:38,657 | INFO  | ult-dispatcher-5 | ClientActorBehavior              | 197 - org.opendaylight.controller.cds-access-client - 1.1.1.SNAPSHOT | member-2-frontend-datastore-config: Ignoring unknown response Envelope{sessionId=1, txSequence=1, message=TransactionPurgeResponse{target=member-2-datastore-config-fe-0-chn-4-txn-5346-0, sequence=0}}&lt;/p&gt;

&lt;p&gt;2017-06-21 07:48:18,136 | WARN  | pool-30-thread-1 | WriteTransactionsHandler         | 257 - org.opendaylight.controller.samples.clustering-it-provider - 1.5.1.SNAPSHOT | Future #5362/7971 not completed yet&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;12&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/756/log.html.gz#s1-s24-t5-k2-k10&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/756/log.html.gz#s1-s24-t5-k2-k10&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;13&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/756/odl2_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/756/odl2_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52207" author="vrpolak" created="Mon, 26 Jun 2017 11:31:42 +0000"  >&lt;p&gt;&amp;gt; the connection is now reconnected to the new (local, member-2) leader,&lt;br/&gt;
&amp;gt; but it is still reporting NotLeaderException from the previous (remote, member-1) leader.&lt;/p&gt;

&lt;p&gt;Comparing the log messages to the code, I think this is a case of multiple FailureEnvelope messages arriving from the same old backend.&lt;/p&gt;

&lt;p&gt;Specifically, this &lt;span class=&quot;error&quot;&gt;&amp;#91;14&amp;#93;&lt;/span&gt; line confirms that the connection to the new backend is established, but then this &lt;span class=&quot;error&quot;&gt;&amp;#91;15&amp;#93;&lt;/span&gt; line says that the repeated message from the old backend is causing a reconnect on the new connection.&lt;/p&gt;

&lt;p&gt;In karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;16&amp;#93;&lt;/span&gt; both log messages occur at 09:14:37,757.&lt;/p&gt;

&lt;p&gt;In theory, spurious reconnects should not cause system test failures, so there is something to fix in other parts of the code as well.&lt;br/&gt;
But limiting reconnects (for example by having sessionId in FailureEnvelope and comparing it to the current value) would be helpful.&lt;/p&gt;
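
&lt;p&gt;A minimal sketch of that sessionId comparison follows. It is only an illustration of the idea: StaleFailureFilter, SketchFailureEnvelope and shouldReconnect are hypothetical names, not the real cds-access-client classes, and the sketch assumes the failure message carries the sessionId of the connection it was sent on.&lt;/p&gt;

```java
// Hypothetical sketch; these are NOT the real cds-access-client classes.
public final class StaleFailureFilter {

    /** Stand-in for a failure envelope that records the session it was sent on (assumption). */
    static final class SketchFailureEnvelope {
        final long sessionId;
        final String cause;

        SketchFailureEnvelope(long sessionId, String cause) {
            this.sessionId = sessionId;
            this.cause = cause;
        }
    }

    /** Returns true only when the failure belongs to the session the connection currently uses. */
    static boolean shouldReconnect(long currentSessionId, SketchFailureEnvelope failure) {
        // A failure tagged with an older session refers to a backend that was already left behind.
        return failure.sessionId == currentSessionId;
    }

    public static void main(String[] args) {
        long currentSessionId = 1; // session established with the new (local) leader
        SketchFailureEnvelope stale = new SketchFailureEnvelope(0, "NotLeaderException from old backend");
        SketchFailureEnvelope fresh = new SketchFailureEnvelope(1, "NotLeaderException from current backend");
        System.out.println(shouldReconnect(currentSessionId, stale)); // prints false
        System.out.println(shouldReconnect(currentSessionId, fresh)); // prints true
    }
}
```

&lt;p&gt;With such a check, a repeated failure from the old backend (sessionId=0) would be ignored once the connection has moved to sessionId=1, while a failure on the current session would still trigger a reconnect.&lt;/p&gt;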

&lt;p&gt;A cosmetic remark: ReconnectingClientConnection probably only needs to know maxMessages value; printing the whole ShardBackendInfo pointing to the old backend is slightly confusing, as that is not the backend ODL is likely to connect to.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;14&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/gitweb?p=controller.git;a=blob;f=opendaylight/md-sal/cds-access-client/src/main/java/org/opendaylight/controller/cluster/access/client/ClientActorBehavior.java;h=554ffe97c77ae7c3011daca2a28c22f660a20e78;hb=refs/heads/stable/carbon#l307&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/gitweb?p=controller.git;a=blob;f=opendaylight/md-sal/cds-access-client/src/main/java/org/opendaylight/controller/cluster/access/client/ClientActorBehavior.java;h=554ffe97c77ae7c3011daca2a28c22f660a20e78;hb=refs/heads/stable/carbon#l307&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;15&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://git.opendaylight.org/gerrit/gitweb?p=controller.git;a=blob;f=opendaylight/md-sal/cds-access-client/src/main/java/org/opendaylight/controller/cluster/access/client/ClientActorBehavior.java;h=554ffe97c77ae7c3011daca2a28c22f660a20e78;hb=refs/heads/stable/carbon#l177&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/gitweb?p=controller.git;a=blob;f=opendaylight/md-sal/cds-access-client/src/main/java/org/opendaylight/controller/cluster/access/client/ClientActorBehavior.java;h=554ffe97c77ae7c3011daca2a28c22f660a20e78;hb=refs/heads/stable/carbon#l177&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;16&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/761/odl2_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/761/odl2_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52208" author="rovarga" created="Mon, 26 Jun 2017 14:37:41 +0000"  >&lt;p&gt;&lt;a href=&quot;https://git.opendaylight.org/gerrit/59527&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://git.opendaylight.org/gerrit/59527&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="52209" author="vrpolak" created="Thu, 29 Jun 2017 11:29:09 +0000"  >&lt;p&gt;After the rework of the transaction writer, the error message &lt;span class=&quot;error&quot;&gt;&amp;#91;17&amp;#93;&lt;/span&gt; is different, but I believe it is the same bug. This time it happened on releng when the old leader received remove-shard-replica and the single writer was on the member that remained a follower.&lt;/p&gt;

&lt;p&gt;The new writer response:&lt;br/&gt;
{&quot;errors&quot;:{&quot;error&quot;:[{&quot;error-type&quot;:&quot;application&quot;,&quot;error-tag&quot;:&quot;operation-failed&quot;,&quot;error-message&quot;:&quot;Final submit was timed out by the test provider or was interrupted&quot;,&quot;error-info&quot;:&quot;java.util.concurrent.TimeoutException: Collection did not finish in 300 seconds\n\tat org.opendaylight.controller.clustering.it.provider.impl.AbstractTransactionHandler.checkComplete(AbstractTransactionHandler.java:187)\n\tat java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)\n\tat java.util.concurrent.FutureTask.run(FutureTask.java:266)\n\tat java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)\n\tat java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)\n\tat java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)\n\tat java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)\n\tat java.lang.Thread.run(Thread.java:748)\n&quot;}]}}&lt;/p&gt;

&lt;p&gt;The karaf.log &lt;span class=&quot;error&quot;&gt;&amp;#91;18&amp;#93;&lt;/span&gt; reports 2011 futures as not completed; nothing else is visible at INFO level.&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;17&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/765/log.html.gz#s1-s20-t3-k2-k8&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/765/log.html.gz#s1-s20-t3-k2-k8&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;18&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/765/odl1_karaf.log.gz&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/jenkins092/controller-csit-3node-clustering-only-carbon/765/odl1_karaf.log.gz&lt;/a&gt;&lt;/p&gt;</comment>
                    </comments>
                <issuelinks>
                            <issuelinktype id="10000">
                    <name>Blocks</name>
                                                                <inwardlinks description="is blocked by">
                                        <issuelink>
            <issuekey id="26222">CONTROLLER-1668</issuekey>
        </issuelink>
                            </inwardlinks>
                                    </issuelinktype>
                    </issuelinks>
                <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                        <customfield id="customfield_10208" key="com.atlassian.jira.plugin.system.customfieldtypes:textfield">
                        <customfieldname>External issue ID</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>8445</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10201" key="com.atlassian.jira.plugin.system.customfieldtypes:url">
                        <customfieldname>External issue URL</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue><![CDATA[https://bugs.opendaylight.org/show_bug.cgi?id=8445]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                            <customfield id="customfield_10206" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>Issue Type</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10300"><![CDATA[Bug]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10204" key="com.atlassian.jira.plugin.system.customfieldtypes:select">
                        <customfieldname>ODL SR Target Milestone</customfieldname>
                        <customfieldvalues>
                                <customfieldvalue key="10324"><![CDATA[Carbon-SR1]]></customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                    <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i02s13:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>