controller / CONTROLLER-1543

Cluster test regression: No cars can be added after 2 followers are taken down and then up

Details

    • Bug
    • Status: Resolved
    • Resolution: Done
    • 0.4.0
    • None
    • clustering
    • None
    • Operating System: All
      Platform: All

    • 6587
    • High

    Description

      As shown in this test for Boron:

      https://jenkins.opendaylight.org/releng/view/CSIT-3node/job/controller-csit-3node-clustering-only-boron/

      Adding cars fails after 2 instances are taken down and then recovered:

      2016-08-30 18:30:07,383 | WARN | lt-dispatcher-17 | ConcurrentDOMDataBroker | 197 - org.opendaylight.controller.sal-distributed-datastore - 1.4.0.Boron-RC1 | Tx: DOM-18 Error during phase CAN_COMMIT, starting Abort
      akka.pattern.AskTimeoutException: Ask timed out on ActorSelection[Anchor(akka://opendaylight-cluster-data/), Path(/user/shardmanager-config/member-1-shard-car-config#1713972084)] after [30000 ms]. Sender[null] sent message of type "org.opendaylight.controller.cluster.datastore.messages.ReadyLocalTransaction".
      at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:331)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:282)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:286)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:238)[180:com.typesafe.akka.actor:2.4.7]
      at java.lang.Thread.run(Thread.java:745)[:1.8.0_72-internal]
      2016-08-30 18:30:07,382 | ERROR | ult-dispatcher-5 | LocalThreePhaseCommitCohort | 197 - org.opendaylight.controller.sal-distributed-datastore - 1.4.0.Boron-RC1 | Failed to prepare transaction member-1-datastore-config-fe-0-txn-18 on backend
      akka.pattern.AskTimeoutException: Ask timed out on ActorSelection[Anchor(akka://opendaylight-cluster-data/), Path(/user/shardmanager-config/member-1-shard-car-config#1713972084)] after [30000 ms]. Sender[null] sent message of type "org.opendaylight.controller.cluster.datastore.messages.ReadyLocalTransaction".
      at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:331)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:282)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:286)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:238)[180:com.typesafe.akka.actor:2.4.7]
      at java.lang.Thread.run(Thread.java:745)[:1.8.0_72-internal]
      2016-08-30 18:30:07,385 | WARN | CommitFutures-18 | RestconfImpl | 265 - org.opendaylight.netconf.sal-rest-connector - 1.4.0.Boron-RC1 | Error creating data config/car:cars
      TransactionCommitFailedException{message=canCommit encountered an unexpected failure, errorList=[RpcError [message=canCommit encountered an unexpected failure, severity=ERROR, errorType=APPLICATION, tag=operation-failed, applicationTag=null, info=null, cause=akka.pattern.AskTimeoutException: Ask timed out on [ActorSelection[Anchor(akka://opendaylight-cluster-data/), Path(/user/shardmanager-config/member-1-shard-car-config#1713972084)]] after [30000 ms]. Sender[null] sent message of type "org.opendaylight.controller.cluster.datastore.messages.ReadyLocalTransaction".]]}
      at org.opendaylight.controller.md.sal.dom.broker.impl.TransactionCommitFailedExceptionMapper.newWithCause(TransactionCommitFailedExceptionMapper.java:37)[166:org.opendaylight.controller.sal-broker-impl:1.4.0.Boron-RC1]
      at org.opendaylight.controller.md.sal.dom.broker.impl.TransactionCommitFailedExceptionMapper.newWithCause(TransactionCommitFailedExceptionMapper.java:18)[166:org.opendaylight.controller.sal-broker-impl:1.4.0.Boron-RC1]
      at org.opendaylight.yangtools.util.concurrent.ExceptionMapper.apply(ExceptionMapper.java:103)[68:org.opendaylight.yangtools.util:1.0.0.Boron-RC1]
      at org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker.handleException(ConcurrentDOMDataBroker.java:220)[197:org.opendaylight.controller.sal-distributed-datastore:1.4.0.Boron-RC1]
      at org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker.access$000(ConcurrentDOMDataBroker.java:49)[197:org.opendaylight.controller.sal-distributed-datastore:1.4.0.Boron-RC1]
      at org.opendaylight.controller.cluster.databroker.ConcurrentDOMDataBroker$1.onFailure(ConcurrentDOMDataBroker.java:128)[197:org.opendaylight.controller.sal-distributed-datastore:1.4.0.Boron-RC1]
      at com.google.common.util.concurrent.Futures$6.run(Futures.java:1310)[65:com.google.guava:18.0.0]
      at com.google.common.util.concurrent.MoreExecutors$DirectExecutor.execute(MoreExecutors.java:457)[65:com.google.guava:18.0.0]
      at com.google.common.util.concurrent.ExecutionList.executeListener(ExecutionList.java:156)[65:com.google.guava:18.0.0]
      at com.google.common.util.concurrent.ExecutionList.execute(ExecutionList.java:145)[65:com.google.guava:18.0.0]
      at com.google.common.util.concurrent.AbstractFuture.setException(AbstractFuture.java:202)[65:com.google.guava:18.0.0]
      at com.google.common.util.concurrent.SettableFuture.setException(SettableFuture.java:68)[65:com.google.guava:18.0.0]
      at org.opendaylight.controller.cluster.datastore.SingleCommitCohortProxy$1.onComplete(SingleCommitCohortProxy.java:58)[197:org.opendaylight.controller.sal-distributed-datastore:1.4.0.Boron-RC1]
      at akka.dispatch.OnComplete.internal(Future.scala:258)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.OnComplete.internal(Future.scala:256)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.japi$CallbackBridge.apply(Future.scala:186)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.japi$CallbackBridge.apply(Future.scala:183)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:32)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:39)[180:com.typesafe.akka.actor:2.4.7]
      at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:409)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      Caused by: akka.pattern.AskTimeoutException: Ask timed out on ActorSelection[Anchor(akka://opendaylight-cluster-data/), Path(/user/shardmanager-config/member-1-shard-car-config#1713972084)] after [30000 ms]. Sender[null] sent message of type "org.opendaylight.controller.cluster.datastore.messages.ReadyLocalTransaction".
      at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)[180:com.typesafe.akka.actor:2.4.7]
      at scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)[176:org.scala-lang.scala-library:2.11.8.v20160304-115712-1706a37eb8]
      at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:331)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:282)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:286)[180:com.typesafe.akka.actor:2.4.7]
      at akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:238)[180:com.typesafe.akka.actor:2.4.7]
      at java.lang.Thread.run(Thread.java:745)[:1.8.0_72-internal]
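
For triage, the AskTimeoutException entries in the log above all carry the same three facts: the target actor path, the timeout, and the message type. The following is a minimal sketch for pulling those fields out of a karaf.log line. It is not part of the CSIT suite; the function name `parse_ask_timeout`, the `shard_actor` field, and the regular expression are assumptions written against the message format seen in this report only.

```python
import re

# Pattern for the Akka AskTimeoutException message as it appears in this report.
# The optional extra brackets cover the "Ask timed out on [ActorSelection[...]]"
# variant that shows up inside the TransactionCommitFailedException text.
ASK_TIMEOUT = re.compile(
    r'Ask timed out on \[?ActorSelection\[Anchor\((?P<anchor>[^)]*)\), '
    r'Path\((?P<path>[^)]*)\)\]\]? after \[(?P<timeout_ms>\d+) ms\]\. '
    r'Sender\[(?P<sender>[^\]]*)\] sent message of type "(?P<message_type>[^"]+)"'
)


def parse_ask_timeout(line):
    """Return the fields of an ask-timeout log line as a dict, or None if absent."""
    match = ASK_TIMEOUT.search(line)
    if match is None:
        return None
    info = match.groupdict()
    info["timeout_ms"] = int(info["timeout_ms"])
    # Hypothetical convenience field: last path segment without the "#<uid>" suffix.
    info["shard_actor"] = info["path"].rsplit("/", 1)[-1].split("#", 1)[0]
    return info
```

Run over the three traces in this report, every match should point at the same actor (member-1-shard-car-config) and the same message type (ReadyLocalTransaction), which suggests a single unresponsive local shard rather than several independent failures.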

          People

            Assignee: Unassigned
            Reporter: Luis Gomez (ecelgp)