Uploaded image for project: 'netconf'
  1. netconf
  2. NETCONF-880

Netconf does not close device mountpoint properly

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Medium Medium
    • None
    • 2.0.14
    • netconf
    • None

      We have an ODL-based app which manages netconf devices in a DC.
      Recently, we've been experiencing netconf connectivity/sync issues between ODL app and netconf devices in some of our prod environments.
      So here's what has been happening:

      1. We start the ODL app which connects to devices and fetches data from them.
          All is good, devices are connected properly and are in sync (data was fetched correctly).
      2. After a short time, ODL lose connection to some devices.
      3. ODL netconf attempts automatic reconnect and the device gets reconnected.
      4. ODL app tries to load data from device again using a netconf get RPC call.
      5. ODL netconf (not the device) returns "transport error" and ODL app now marks the        device with sync-failed status. See the attachment "netconf-rpc-transport-error.txt"
      6. User then calls ODL app RPC for reconnecting the device.
         This RPC first deletes the node from ODL netconf-topology and then recreates it.
         We added a 5 seconds delay (for investigation purposes only) between the topology  node deletion and recreation to be sure ODL netconf has enough time to dismount and mount the device again.
         We added a check if the ODL netconf mountpoint exists (using DOMMountPointService) after the topology node was deleted and before it's going to be recreated again.
      7. ODL app asks ODL netconf to close the netconf session.
         Then we see the ODL netconf logs as they are in the attachment "netconf-logs-after-device-reconnect-rpc-call.txt".
      8. ODL app asks ODL netconf to start the netconf session again, but it seems that the previous session is stuck.
         Our check if the mountpoint still exists, throws an exception because it still exists, even though it should not at this point.
         Devops guys from customer also reported lots of hanging netconf session between the device and ODL.
      9. Afterwards we see lots of logs "netconf session established" followed by "netconf session promise complete already" errors in short intervals (few secs).
         See the logs in the attachment "netconf-session-promise-complete-already-error.txt"

            rovarga Robert Varga
            ifoltin Igor Foltin
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: