<!-- 
RSS generated by JIRA (8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d) at Wed Feb 07 19:56:37 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>OpenDaylight JIRA</title>
    <link>https://jira.opendaylight.org</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>8.20.10</version>
        <build-number>820010</build-number>
        <build-date>22-06-2022</build-date>
    </build-info>


<item>
            <title>[CONTROLLER-1858] DOMDataBroker Explicit Leader Movement test fails</title>
                <link>https://jira.opendaylight.org/browse/CONTROLLER-1858</link>
                <project id="10113" key="CONTROLLER">controller</project>
                    <description>&lt;p&gt;DOMDataBroker Explicit Leader Movement test, controller-ddb-expl-lead-movement-longevity, has been failing fairly consistently &lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt;. The purpose of the test, from the test documentation:&lt;/p&gt;

&lt;p&gt;The goal is to ensure that applications do not observe disruption when a shard leader is moved as the result of explicit application request. This is performed by having a steady-stream producer execute operations against the shard and then initiate shard leader shutdown, then the producer is shut down cleanly.&lt;/p&gt;

&lt;p&gt;Though it shows 3/4 passing, those 3 are for restarting ODL with tell; the actual test case has been failing consistently, with only one pass on 28th July. The steps mentioned above are repeated in a loop, and the test fails 1097 times during the entire ~14-hour run with error 500 &lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;.&lt;/p&gt;

&lt;p&gt;Update:&lt;br/&gt;
-----------&lt;br/&gt;
Test steps detailed in &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt;.&lt;br/&gt;
ODL Logs &lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;As per &lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; searching ODL logs for &apos;starting leader movement from node&apos; gives us:&lt;br/&gt;
018-08-04T23:40:26,930 | INFO  | opendaylight-cluster-data-akka.actor.default-dispatcher-34 | ClientActorBehavior              | 191 - org.opendaylight.controller.cds-access-client - 1.3.3.SNAPSHOT | member-1-frontend-datastore-config: connection ConnectedClientConnection{client=ClientIdentifier&lt;/p&gt;
{frontend=member-1-frontend-datastore-config, generation=0}, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;a href=&quot;#542053028&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-1-shard-default-config#542053028&lt;/a&gt;, sessionId=0, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree=present}} indicated sequencing mismatch on member-1-datastore-config-fe-0-txn-13-0 sequence 0 (26), reconnecting it&lt;br/&gt;
org.opendaylight.controller.cluster.access.commands.OutOfSequenceEnvelopeException: Expecting envelope 0&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.LeaderFrontendState.checkRequestSequence(LeaderFrontendState.java:98) &lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.LeaderFrontendState.handleTransactionRequest(LeaderFrontendState.java:191) &lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleRequest(Shard.java:573) &lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleRequestEnvelope(Shard.java:398) &lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
&lt;br/&gt;
Also:&lt;br/&gt;
&lt;br/&gt;
2018-08-05T01:17:16,637 | INFO  | opendaylight-cluster-data-akka.actor.default-dispatcher-36 | ClientActorBehavior              | 191 - org.opendaylight.controller.cds-access-client - 1.3.3.SNAPSHOT | member-1-frontend-datastore-config: connection ConnectedClientConnection{client=ClientIdentifier{frontend=member-1-frontend-datastore-config, generation=0}
&lt;p&gt;, cookie=0, backend=ShardBackendInfo{actor=Actor&lt;span class=&quot;error&quot;&gt;&amp;#91;akka.tcp://opendaylight-cluster-data@10.30.170.171:2550/user/shardmanager-config/member-2-shard-default-config#-1686300808&amp;#93;&lt;/span&gt;, sessionId=71, version=BORON, maxMessages=1000, cookie=0, shard=default, dataTree=absent}} indicated no leadership, reconnecting it&lt;br/&gt;
org.opendaylight.controller.cluster.access.commands.NotLeaderException: Actor Actor&lt;a href=&quot;#-1686300808&quot; target=&quot;_blank&quot; rel=&quot;noopener&quot;&gt;akka://opendaylight-cluster-data/user/shardmanager-config/member-2-shard-default-config#-1686300808&lt;/a&gt; is not the current leader&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleRequest(Shard.java:566) ~&lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleRequestEnvelope(Shard.java:398) ~&lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;br/&gt;
	at org.opendaylight.controller.cluster.datastore.Shard.handleNonRaftCommand(Shard.java:324) ~&lt;span class=&quot;error&quot;&gt;&amp;#91;220:org.opendaylight.controller.sal-distributed-datastore:1.7.3.SNAPSHOT&amp;#93;&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span class=&quot;error&quot;&gt;&amp;#91;1&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;3&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1&lt;/a&gt;&lt;br/&gt;
&lt;span class=&quot;error&quot;&gt;&amp;#91;4&amp;#93;&lt;/span&gt; &lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/&lt;/a&gt;&lt;/p&gt;</description>
                <environment></environment>
        <key id="30567">CONTROLLER-1858</key>
            <summary>DOMDataBroker Explicit Leader Movement test fails</summary>
                <type id="10104" iconUrl="https://jira.opendaylight.org/secure/viewavatar?size=xsmall&amp;avatarId=10303&amp;avatarType=issuetype">Bug</type>
                                            <priority id="2" iconUrl="https://jira.opendaylight.org/images/icons/priorities/critical.svg">High</priority>
                        <status id="1" iconUrl="https://jira.opendaylight.org/images/icons/statuses/open.png" description="The issue is open and ready for the assignee to start work on it.">Open</status>
                    <statusCategory id="2" key="new" colorName="blue-gray"/>
                                    <resolution id="-1">Unresolved</resolution>
                                        <assignee username="thapar">Vishal Thapar</assignee>
                                    <reporter username="thapar">Vishal Thapar</reporter>
                        <labels>
                            <label>csit:3node</label>
                    </labels>
                <created>Tue, 7 Aug 2018 07:53:17 +0000</created>
                <updated>Fri, 17 Aug 2018 18:01:31 +0000</updated>
                                            <version>Oxygen SR3</version>
                                                    <component>clustering</component>
                        <due></due>
                            <votes>0</votes>
                                    <watches>4</watches>
                                                                                                                <comments>
                            <comment id="64575" author="tpantelis" created="Tue, 7 Aug 2018 12:01:58 +0000"  >&lt;p&gt;What are the exact steps this test performs and what failed? I see &quot;FAIL [ 200 | 201 | 204 ] does not contain value &apos;500&apos;.&quot; but I don&apos;t see from the robot log what URL failed and the error message.&lt;/p&gt;</comment>
                            <comment id="64578" author="tpantelis" created="Tue, 7 Aug 2018 13:19:23 +0000"  >&lt;p&gt;I think it would really help to outline the test steps in sentences for those not all that familiar with all the keywords and the author&apos;s intent - I can ascertain/infer some meaning by looking at the log output (perhaps this is already documented somewhere?) but I don&apos;t know what all the keywords do/mean. The first red output I see is:&lt;/p&gt;

&lt;p&gt;&#160;&lt;br/&gt;
 KEYWORD&#160;BuiltIn .&#160;Fail&#160;${output}&lt;br/&gt;
 &#160;&lt;br/&gt;
 &#160;&lt;/p&gt;
&lt;div class=&apos;table-wrap&apos;&gt;
&lt;table class=&apos;confluenceTable&apos;&gt;&lt;tbody&gt;
&lt;tr&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;Documentation:&lt;/th&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;Fails the test with the given message and optionally alters its tags.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;th class=&apos;confluenceTh&apos;&gt;Start / End / Elapsed:&lt;/th&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;20180805 09:36:16.606 / 20180805 09:36:16.607 / 00:00:00.001&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;09:36:16.607&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;FAIL&lt;/td&gt;
&lt;td class=&apos;confluenceTd&apos;&gt;None&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;/div&gt;



&lt;p&gt;But I can&apos;t tell what actually failed - the prior URL,&#160; /restconf/operations/cluster-admin:make-leader-local, succeeded.&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;</comment>
                            <comment id="64579" author="tpantelis" created="Tue, 7 Aug 2018 13:28:40 +0000"  >&lt;p&gt;Also is this Oxygen or Fluorine or both? Can you please update the Affected Version? If this is from&#160;Oxygen, does it also fail on&#160;Fluorine?&lt;/p&gt;</comment>
                            <comment id="64581" author="thapar" created="Tue, 7 Aug 2018 13:43:10 +0000"  >&lt;p&gt;Agreed, I have also been trying to decipher the controller tests and thought you might be able to understand them better than me. One has to really dig into the robot keywords to get to useful information.&lt;/p&gt;

&lt;p&gt;My understanding is that the test does the following:&lt;/p&gt;

&lt;p&gt;1. Move leader from nodeA to a randomly chosen nodeB.&lt;/p&gt;

&lt;p&gt;2. Write transaction using RPCs - &lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k5&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k5&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;3. Wait for 5 seconds.&lt;/p&gt;

&lt;p&gt;4. Check whether all transactions went through. It expects the HTTP code to be one of &lt;span class=&quot;error&quot;&gt;&amp;#91;200 | 201 | 204&amp;#93;&lt;/span&gt;, but finds a 500.&lt;/p&gt;

&lt;p&gt;5. So, basically, a write transaction that is in progress when the leader movement is done gets HTTP error 500.&lt;/p&gt;
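&lt;p&gt;A minimal Python sketch of the check in step 4, assuming the test simply compares each response code against the allowed set. The helper name check_status is hypothetical and is not taken from the robot suite; only the allowed codes and the message wording come from this thread:&lt;/p&gt;

```python
ALLOWED_CODES = (200, 201, 204)  # the codes the robot check accepts, per step 4


def check_status(code, allowed=ALLOWED_CODES):
    """Return None if the code is allowed, else a failure message (hypothetical helper)."""
    if code in allowed:
        return None
    expected = " | ".join(str(c) for c in allowed)
    return f"[ {expected} ] does not contain value '{code}'."


# Example: one write during leader movement came back with a 500.
failures = [msg for msg in map(check_status, [200, 204, 500]) if msg is not None]
print(failures)
```

&lt;p&gt;Only the 500 produces a failure message, which matches the kind of output reported in the robot log.&lt;/p&gt;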

&lt;p&gt;This step in robot:&lt;br/&gt;
&lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k2&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k2&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;logs the log message in karaf.&lt;/p&gt;

&lt;p&gt;This one is Oxygen; I believe we don&apos;t have a Fluorine job for it yet.&lt;/p&gt;</comment>
                            <comment id="64584" author="tpantelis" created="Tue, 7 Aug 2018 14:26:20 +0000"  >&lt;p&gt;So the question is what URL failed with 500 and what was the error info?&#160;&lt;/p&gt;</comment>
                            <comment id="64585" author="thapar" created="Tue, 7 Aug 2018 14:51:42 +0000"  >&lt;p&gt;I thought I pasted this:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k5&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k1-k1-k1-k2-k5&lt;/a&gt;&lt;/p&gt;


&lt;p&gt;Documentation:	&lt;br/&gt;
Invoke write-transactions rpc on given nodes.&lt;/p&gt;

&lt;p&gt;Start / End / Elapsed:	20180805 19:05:36.555 / 20180805 19:05:36.558 / 00:00:00.003&lt;br/&gt;
19:05:36.556	INFO	Input parameters: host_list:&lt;span class=&quot;error&quot;&gt;&amp;#91;u&amp;#39;10.30.170.171&amp;#39;&amp;#93;&lt;/span&gt;, index_list:&lt;span class=&quot;error&quot;&gt;&amp;#91;2&amp;#93;&lt;/span&gt;, id_prefix:prefix-, duration:40, rate:1000, chained_flag:True	&lt;br/&gt;
19:05:36.556	INFO	url: &lt;a href=&quot;http://10.30.170.171:8181/restconf/operations/odl-mdsal-lowlevel-control:write-transactions&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;http://10.30.170.171:8181/restconf/operations/odl-mdsal-lowlevel-control:write-transactions&lt;/a&gt;, data: &amp;lt;input xmlns=&quot;tag:opendaylight.org,2017:controller:yang:lowlevel:control&quot;&amp;gt;&lt;br/&gt;
  &amp;lt;id&amp;gt;prefix-2&amp;lt;/id&amp;gt;&lt;br/&gt;
  &amp;lt;seconds&amp;gt;40&amp;lt;/seconds&amp;gt;&lt;br/&gt;
  &amp;lt;transactions-per-second&amp;gt;1000&amp;lt;/transactions-per-second&amp;gt;&lt;br/&gt;
  &amp;lt;chained-transactions&amp;gt;True&amp;lt;/chained-transactions&amp;gt;&lt;br/&gt;
&amp;lt;/input&amp;gt;, timeout: 425&lt;/p&gt;
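&lt;p&gt;For reference, a rough Python sketch that rebuilds the request body logged above. The namespace, element names and values are copied from the log; the helper name build_write_transactions_input is hypothetical, and the sketch only constructs the document without sending anything to the RPC URL:&lt;/p&gt;

```python
import xml.etree.ElementTree as ET

# Namespace and element names are copied from the logged request body.
NS = "tag:opendaylight.org,2017:controller:yang:lowlevel:control"


def build_write_transactions_input(tx_id, seconds, tps, chained):
    """Build the write-transactions input document (hypothetical helper)."""
    root = ET.Element(f"{{{NS}}}input")
    for name, value in (
        ("id", tx_id),
        ("seconds", seconds),
        ("transactions-per-second", tps),
        ("chained-transactions", chained),
    ):
        ET.SubElement(root, f"{{{NS}}}{name}").text = str(value)
    return root


# Values from the logged call: id prefix-2, 40 seconds, 1000 tx/s, chained.
payload = build_write_transactions_input("prefix-2", 40, 1000, True)
print(ET.tostring(payload, encoding="unicode"))
```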

&lt;p&gt;From the URL, it looks like the test app is running in ODL and the RPC just triggers it. Correct? That is why I couldn&apos;t find logs for any python script making REST calls. I suspect some test app within the controller code generates the transactions when it gets this RPC call.&lt;/p&gt;</comment>
                            <comment id="64587" author="tpantelis" created="Tue, 7 Aug 2018 15:22:55 +0000"  >&lt;p&gt;Yeah, it looks like&#160;restconf/operations/odl-mdsal-lowlevel-control:write-transactions.&#160; However, AFAICT, the response from that was 200:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;KEYWORD&#160;BuiltIn .&#160;Log&#160;${response.text}
&#160;
||Documentation:|Logs the given message with the given level.|
||Start / End / Elapsed:|20180805 09:36:16.600 / 20180805 09:36:16.600 / 00:00:00.000|

|09:36:16.600|INFO|{&quot;output&quot;:\{&quot;delete-tx&quot;:731,&quot;all-tx&quot;:40000,&quot;insert-tx&quot;:39269}}|&#160;|
00:00:00.001KEYWORD&#160;BuiltIn .

&#160;Log&#160;${response.status_code}
||Documentation:|Logs the given message with the given level.|
||Start / End / Elapsed:|20180805 09:36:16.600 / 20180805 09:36:16.601 / 00:00:00.001|
|09:36:16.601|INFO|200|
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The encapsulating keyword seems to be OK as well, AFAICT:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;09:36:16.604	INFO	${status} = PASS	
09:36:16.604	INFO	${output} = None&#160;
&#160;&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;The code for that RPC would be somewhere in the clustering-it-provider in controller. &lt;/p&gt;

&lt;p&gt;So I don&apos;t see where the 500 is coming from or the nature of this failure:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;09:36:16.607	FAIL	None
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="64589" author="thapar" created="Tue, 7 Aug 2018 15:48:35 +0000"  >&lt;p&gt;I think it is this one.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k2-k1-k1-k2-k10&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51/robot-plugin/log.html.gz#s1-s2-t1-k2-k1-k1-k1-k1-k1-k1-k2-k1-k1-k2-k10&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="64590" author="tpantelis" created="Tue, 7 Aug 2018 16:07:12 +0000"  >&lt;p&gt;OK - so the underlying error that caused the 500 occurred on odl2:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;2018-08-05T13:36:22,354 | WARN  | opendaylight-cluster-data-shard-dispatcher-42 | AbstractFrontendHistory          | 220 - org.opendaylight.controller.sal-distributed-datastore - 1.7.3.SNAPSHOT | member-2-shard-default-config: Request ModifyTransactionRequest{target=member-1-datastore-config-fe-0-txn-1114-0, sequence=0, replyTo=Actor[akka.tcp://opendaylight-cluster-data@10.30.170.146:2550/user/$a#-1538190485], modifications=1, protocol=SIMPLE} is contained purged transactions [[0..1114]]

TransactionCommitFailedException{message=canCommit encountered an unexpected failure, errorList=[RpcError [message=canCommit encountered an unexpected failure, severity=ERROR, errorType=APPLICATION, tag=operation-failed, applicationTag=null, info=null, 
cause=org.opendaylight.controller.cluster.access.commands.DeadTransactionException: Transactions [[0..1114]] have been purged]]}
org.opendaylight.controller.md.sal.dom.broker.impl.TransactionCommitFailedExceptionMapper.newWithCause(TransactionCommitFailedExceptionMapper.java:36)
org.opendaylight.controller.md.sal.dom.broker.impl.TransactionCommitFailedExceptionMapper.newWithCause(TransactionCommitFailedExceptionMapper.java:18)
org.opendaylight.yangtools.util.concurrent.ExceptionMapper.apply(ExceptionMapper.java:106)
org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker.handleException(ConcurrentDOMDataBroker.java:227)
org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker.access$000(ConcurrentDOMDataBroker.java:50)
org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker$1.onFailure(ConcurrentDOMDataBroker.java:124)
com.google.common.util.concurrent.Futures$CallbackListener.run(Futures.java:1387)
com.google.common.util.concurrent.MoreExecutors$DirectExecutor.execute(MoreExecutors.java:398)
com.google.common.util.concurrent.AbstractFuture.executeListener(AbstractFuture.java:1015)
com.google.common.util.concurrent.AbstractFuture.complete(AbstractFuture.java:868)
com.google.common.util.concurrent.AbstractFuture.setException(AbstractFuture.java:713)
com.google.common.util.concurrent.SettableFuture.setException(SettableFuture.java:54)
org.opendaylight.controller.cluster.databroker.actors.dds.AbstractProxyTransaction.lambda$directCommit$4(AbstractProxyTransaction.java:507)
org.opendaylight.controller.cluster.access.client.ConnectionEntry.complete(ConnectionEntry.java:50)
org.opendaylight.controller.cluster.access.client.AbstractClientConnection.receiveResponse(AbstractClientConnection.java:469)
org.opendaylight.controller.cluster.access.client.AbstractReceivingClientConnection.receiveResponse(AbstractReceivingClientConnection.java:68)
org.opendaylight.controller.cluster.access.client.ClientActorBehavior.onResponse(ClientActorBehavior.java:174)
org.opendaylight.controller.cluster.access.client.ClientActorBehavior.onRequestFailure(ClientActorBehavior.java:186)
org.opendaylight.controller.cluster.access.client.ClientActorBehavior.internalOnRequestFailure(ClientActorBehavior.java:238)
org.opendaylight.controller.cluster.access.client.ClientActorBehavior.onReceiveCommand(ClientActorBehavior.java:145)
org.opendaylight.controller.cluster.access.client.ClientActorBehavior.onReceiveCommand(ClientActorBehavior.java:49)
org.opendaylight.controller.cluster.access.client.AbstractClientActor.onReceiveCommand(AbstractClientActor.java:69)
akka.persistence.UntypedPersistentActor.onReceive(PersistentActor.scala:275)
akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
akka.actor.Actor.aroundReceive(Actor.scala:517)
akka.actor.Actor.aroundReceive$(Actor.scala:515)
akka.persistence.UntypedPersistentActor.akka$persistence$Eventsourced$$super$aroundReceive(PersistentActor.scala:273)
akka.persistence.Eventsourced$$anon$1.stateReceive(Eventsourced.scala:691)
akka.persistence.Eventsourced.aroundReceive(Eventsourced.scala:192)
akka.persistence.Eventsourced.aroundReceive$(Eventsourced.scala:191)
akka.persistence.UntypedPersistentActor.aroundReceive(PersistentActor.scala:273)
akka.actor.ActorCell.receiveMessage(ActorCell.scala:590)
akka.actor.ActorCell.invoke(ActorCell.scala:559)
akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)
akka.dispatch.Mailbox.run(Mailbox.scala:224)
akka.dispatch.Mailbox.exec(Mailbox.scala:234)
akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
Caused by: org.opendaylight.controller.cluster.access.commands.DeadTransactionException: Transactions [[0..1114]] have been purged
org.opendaylight.controller.cluster.datastore.AbstractFrontendHistory.handleTransactionRequest(AbstractFrontendHistory.java:88)
org.opendaylight.controller.cluster.datastore.LeaderFrontendState.handleTransactionRequest(LeaderFrontendState.java:212)
org.opendaylight.controller.cluster.datastore.Shard.handleRequest(Shard.java:573)
org.opendaylight.controller.cluster.datastore.Shard.handleRequestEnvelope(Shard.java:398)
org.opendaylight.controller.cluster.datastore.Shard.handleNonRaftCommand(Shard.java:324)
org.opendaylight.controller.cluster.raft.RaftActor.handleCommand(RaftActor.java:270)
org.opendaylight.controller.cluster.common.actor.AbstractUntypedPersistentActor.onReceiveCommand(AbstractUntypedPersistentActor.java:44)
akka.persistence.UntypedPersistentActor.onReceive(PersistentActor.scala:275)
org.opendaylight.controller.cluster.common.actor.MeteringBehavior.apply(MeteringBehavior.java:104)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;&lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=rovarga&quot; class=&quot;user-hover&quot; rel=&quot;rovarga&quot;&gt;rovarga&lt;/a&gt; any ideas on this one?&lt;/p&gt;</comment>
                            <comment id="64608" author="tpantelis" created="Thu, 9 Aug 2018 13:36:54 +0000"  >&lt;p&gt;I think we&apos;re going to have to enable some debug logging. Set &lt;em&gt;org.opendaylight.controller.cluster.datastore&lt;/em&gt; and &lt;em&gt;org.opendaylight.controller.cluster.databroker&lt;/em&gt; to DEBUG. This will also include Shard, which emits a lot of output, so to minimize it, set &lt;em&gt;org.opendaylight.controller.cluster.datastore.Shard&lt;/em&gt; to INFO. This should do it:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;log4j2.logger.datastore.name=org.opendaylight.controller.cluster.datastore
log4j2.logger.datastore.level=DEBUG
log4j2.logger.databroker.name=org.opendaylight.controller.cluster.databroker
log4j2.logger.databroker.level=DEBUG
log4j2.logger.shard.name=org.opendaylight.controller.cluster.datastore.Shard
log4j2.logger.shard.level=INFO
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
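&lt;p&gt;If the same settings need to be applied on several nodes, a small Python sketch like the following could render them from one mapping. This is only an illustration: the helper name render_logger_properties is hypothetical, and the logger keys and levels simply mirror the lines above:&lt;/p&gt;

```python
# Logger keys ("datastore", "databroker", "shard") mirror the snippet above.
LOGGERS = {
    "datastore": ("org.opendaylight.controller.cluster.datastore", "DEBUG"),
    "databroker": ("org.opendaylight.controller.cluster.databroker", "DEBUG"),
    "shard": ("org.opendaylight.controller.cluster.datastore.Shard", "INFO"),
}


def render_logger_properties(loggers):
    """Render log4j2 properties-style logger lines (hypothetical helper)."""
    lines = []
    for key, (name, level) in loggers.items():
        lines.append(f"log4j2.logger.{key}.name={name}")
        lines.append(f"log4j2.logger.{key}.level={level}")
    return "\n".join(lines)


print(render_logger_properties(LOGGERS))
```

&lt;p&gt;The Shard entry is the more specific logger name, so its INFO level is the one that applies to Shard while the rest of the datastore package stays at DEBUG.&lt;/p&gt;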

&lt;p&gt;Note: the log files may roll over, i.e. there may also be karaf.log.1, karaf.log.2, etc.&lt;/p&gt;</comment>
                            <comment id="64609" author="jluhrsen" created="Thu, 9 Aug 2018 15:44:07 +0000"  >&lt;p&gt;this &lt;a href=&quot;https://jenkins.opendaylight.org/releng/user/jluhrsen/my-views/view/controller%203node/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/51&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;job &lt;/a&gt; will be running with the above debug levels&lt;/p&gt;</comment>
                            <comment id="64625" author="thapar" created="Tue, 14 Aug 2018 16:07:01 +0000"  >&lt;p&gt;&lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/52/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/52/&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;^Passed with debug&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/53/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/53/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;^Had failures, but debug logging was not enabled; probably a scheduled run.&lt;/p&gt;</comment>
                            <comment id="64650" author="jluhrsen" created="Wed, 15 Aug 2018 23:27:39 +0000"  >&lt;p&gt;trying again:&lt;br/&gt;
&lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/54/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/54/&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="64663" author="thapar" created="Fri, 17 Aug 2018 08:42:13 +0000"  >&lt;p&gt;This one failed with debug logs enabled:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/54/&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://logs.opendaylight.org/releng/vex-yul-odl-jenkins-1/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/54/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Warning: Log files are ~10/19/19MB compressed. I would recommend fetching them with wget and then untarring them.&lt;/p&gt;</comment>
                            <comment id="64665" author="tpantelis" created="Fri, 17 Aug 2018 11:23:20 +0000"  >&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;. Fail ${output}
Documentation:	
Fails the test with the given message and optionally alters its tags.

Start / End / Elapsed:	20180816 08:36:31.438 / 20180816 08:36:31.439 / 00:00:00.001
08:36:31.439	FAIL	Dictionary does not contain key &apos;value&apos;.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;This is a different error. A jolokia request failed:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;To JSON using : content={
 &quot;error&quot;: &quot;javax.management.InstanceNotFoundException : org.opendaylight.controller:Category=Shards,name=member-2-shard-default-config,type=DistributedConfigDatastore&quot;,
 &quot;error_type&quot;: &quot;javax.management.InstanceNotFoundException&quot;,
 &quot;request&quot;: {
  &quot;mbean&quot;: &quot;org.opendaylight.controller:Category=Shards,name=member-2-shard-default-config,type=DistributedConfigDatastore&quot;,
  &quot;type&quot;: &quot;read&quot;
 },
 &quot;stacktrace&quot;: &quot;javax.management.InstanceNotFoundException: org.opendaylight.controller:Category=Shards,name=member-2-shard-default-config,type=DistributedConfigDatastore\n\tat com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.getMBean(DefaultMBeanServerInterceptor.java:1095)\n\tat com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.getMBeanInfo(DefaultMBeanServerInterceptor.java:1375)\n\tat com.sun.jmx.mbeanserver.JmxMBeanServer.getMBeanInfo(JmxMBeanServer.java:920)\n\tat org.jolokia.handler.ReadHandler$1.execute(ReadHandler.java:46)\n\tat org.jolokia.handler.ReadHandler$1.execute(ReadHandler.java:41)\n\tat org.jolokia.backend.executor.AbstractMBeanServerExecutor.call(AbstractMBeanServerExecutor.java:90)\n\tat org.jolokia.handler.ReadHandler.getMBeanInfo(ReadHandler.java:229)\n\tat org.jolokia.handler.ReadHandler.getAllAttributesNames(ReadHandler.java:241)\n\tat org.jolokia.handler.ReadHandler.resolveAttributes(ReadHandler.java:217)\n\tat org.jolokia.handler.ReadHandler.fetchAttributes(ReadHandler.java:183)\n\tat org.jolokia.handler.ReadHandler.doHandleRequest(ReadHandler.java:118)\n\tat org.jolokia.handler.ReadHandler.doHandleRequest(ReadHandler.java:37)\n\tat org.jolokia.handler.JsonRequestHandler.handleRequest(JsonRequestHandler.java:161)\n\tat org.jolokia.backend.MBeanServerHandler.dispatchRequest(MBeanServerHandler.java:156)\n\tat org.jolokia.backend.LocalRequestDispatcher.dispatchRequest(LocalRequestDispatcher.java:99)\n\tat org.jolokia.backend.BackendManager.callRequestDispatcher(BackendManager.java:428)\n\tat org.jolokia.backend.BackendManager.handleRequest(BackendManager.java:158)\n\tat org.jolokia.http.HttpRequestHandler.executeRequest(HttpRequestHandler.java:197)\n\tat org.jolokia.http.HttpRequestHandler.handleGetRequest(HttpRequestHandler.java:86)\n\tat org.jolokia.http.AgentServlet$4.handleRequest(AgentServlet.java:470)\n\tat org.jolokia.http.AgentServlet.handleSecurely(AgentServlet.java:350)\n\tat 
org.jolokia.http.AgentServlet.handle(AgentServlet.java:321)\n\tat org.jolokia.http.AgentServlet.doGet(AgentServlet.java:277)\n\tat javax.servlet.http.HttpServlet.service(HttpServlet.java:687)\n\tat javax.servlet.http.HttpServlet.service(HttpServlet.java:790)\n\tat org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:848)\n\tat org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1772)\n\tat org.eclipse.jetty.websocket.server.WebSocketUpgradeFilter.doFilter(WebSocketUpgradeFilter.java:205)\n\tat org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1759)\n\tat org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:582)\n\tat org.ops4j.pax.web.service.jetty.internal.HttpServiceServletHandler.doHandle(HttpServiceServletHandler.java:71)\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)\n\tat org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)\n\tat org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:226)\n\tat org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1180)\n\tat org.ops4j.pax.web.service.jetty.internal.HttpServiceContext.doHandle(HttpServiceContext.java:284)\n\tat org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:512)\n\tat org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)\n\tat org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1112)\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)\n\tat org.ops4j.pax.web.service.jetty.internal.JettyServerHandlerCollection.handle(JettyServerHandlerCollection.java:80)\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:134)\n\tat org.eclipse.jetty.server.Server.handle(Server.java:534)\n\tat org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:333)\n\tat 
org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:251)\n\tat org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:283)\n\tat org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:108)\n\tat org.eclipse.jetty.io.SelectChannelEndPoint$2.run(SelectChannelEndPoint.java:93)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.executeProduceConsume(ExecuteProduceConsume.java:303)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.produceConsume(ExecuteProduceConsume.java:148)\n\tat org.eclipse.jetty.util.thread.strategy.ExecuteProduceConsume.run(ExecuteProduceConsume.java:136)\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:671)\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:589)\n\tat java.lang.Thread.run(Thread.java:748)\n&quot;,
 &quot;status&quot;: 404
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;</comment>
                            <comment id="64666" author="tpantelis" created="Fri, 17 Aug 2018 11:32:46 +0000"  >&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;2018-08-16T12:36:11,723 | ERROR | opendaylight-cluster-data-akka.actor.default-dispatcher-4 | Shard                            | 213 - org.opendaylight.controller.sal-clustering-commons - 1.7.4.SNAPSHOT | Failed to persist event type [org.opendaylight.controller.cluster.raft.persisted.SimpleReplicatedLogEntry] with sequence number [31300240] for persistenceId [member-2-shard-default-config].
akka.pattern.CircuitBreaker$$anon$1: Circuit Breaker Timed out.
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Akka timed out trying to write to disk. This indicates the disk was busy, probably due to all the logging. We&apos;ll have to increase the timeout. I&apos;ll need to look up the setting and report back later.&lt;/p&gt;</comment>
                            <comment id="64670" author="jluhrsen" created="Fri, 17 Aug 2018 15:41:43 +0000"  >&lt;p&gt;Thanks &lt;a href=&quot;https://jira.opendaylight.org/secure/ViewProfile.jspa?name=tpantelis&quot; class=&quot;user-hover&quot; rel=&quot;tpantelis&quot;&gt;tpantelis&lt;/a&gt;. I&apos;m assuming this timeout increase has to be made in code rather than in a config file?&lt;br/&gt;
In the meantime, I removed the cluster.datastore DEBUG logging and left only these:&lt;/p&gt;

&lt;p&gt;controller.cluster.databroker:DEBUG&lt;br/&gt;
controller.cluster.datastore.Shard:INFO&lt;/p&gt;

&lt;p&gt;Running in &lt;a href=&quot;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/55&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://jenkins.opendaylight.org/releng/job/controller-csit-3node-ddb-expl-lead-movement-longevity-only-oxygen/55&lt;/a&gt;&lt;/p&gt;</comment>
                            <comment id="64672" author="tpantelis" created="Fri, 17 Aug 2018 18:00:46 +0000"  >&lt;p&gt;It&apos;s an Akka setting:&lt;/p&gt;

&lt;p&gt;In controller/configuration/initial/akka.conf, set the circuit-breaker call-timeout to something like 5 minutes:&lt;/p&gt;

&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;akka {
  persistence {
    journal-plugin-fallback {
      circuit-breaker {
        call-timeout = 300s
      }
    }
    snapshot-store-plugin-fallback {
      circuit-breaker {
        max-failures = 10
        call-timeout = 300s
      }
    }
  }
}
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;We still need cluster.datastore DEBUG logging enabled.&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                            <customfield id="customfield_11400" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                <customfield id="customfield_10000" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>0|i03hp3:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>