[OPNFLWPLUG-860] Flows are not installed into switches when connected to Leader Node in cluster Created: 01/Mar/17  Updated: 27/Sep/21  Resolved: 21/Mar/17

Status: Resolved
Project: OpenFlowPlugin
Component/s: General
Affects Version/s: None
Fix Version/s: None

Type: Bug
Reporter: Naveen kumar Thatipalli Assignee: Luis Gomez
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Operating System: All
Platform: All


Attachments: File flows_not_installed_connected_to_Leader_Node_SF142.mht    
External issue ID: 7884

 Description   

Steps to Reproduce:
====================
1. Get Inventory Follower and Leader information.

2. Start Mininet and connect it to Follower Node1.

3. Add bulk flows (1000 flows) from Follower Node1.

4. Get the bulk flows, verify them in the cluster config DS, and verify that all flows are installed.

5. Verify default flows in the switches when connected to the Follower node. Here it shows zero flows in all 15 switches.

6. Verify flows in the switches when connected to the Follower node.

7. Flows get installed when connected to the Follower node.

8. Verify flows in the switches when connected to the Leader node.

9. Issue observed here: flows do not get installed when connected to the Leader node.

10. The issue is inconsistent: flows get installed into a few switches while the other switches have zero flows.
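
Steps 4 to 9 compare what the config DS holds with what each switch reports. A minimal Python sketch of that comparison, assuming the config DS content has already been fetched as JSON in the usual opendaylight-inventory layout (nodes / node / flow-node-inventory:table / flow). The helper names `count_config_flows` and `find_short_switches` and the sample payload are hypothetical and not part of the CSIT suite:

```python
import json


def count_config_flows(inventory):
    """Map node id -> number of flows configured under all of its tables."""
    result = {}
    for node in inventory.get("nodes", {}).get("node", []):
        tables = node.get("flow-node-inventory:table", [])
        result[node["id"]] = sum(len(t.get("flow", [])) for t in tables)
    return result


def find_short_switches(config_counts, switch_counts):
    """Return node ids whose switch reports fewer flows than the config DS holds."""
    return sorted(
        node_id
        for node_id, wanted in config_counts.items()
        if switch_counts.get(node_id, 0) < wanted
    )


# Illustrative payload only; real data would come from the controller.
sample = json.loads("""
{
  "nodes": {
    "node": [
      {"id": "openflow:1",
       "flow-node-inventory:table": [
         {"id": 0, "flow": [{"id": "f1"}, {"id": "f2"}]}
       ]},
      {"id": "openflow:2",
       "flow-node-inventory:table": [{"id": 0}]}
    ]
  }
}
""")

config_counts = count_config_flows(sample)
# Hypothetical per-switch counts, as a switch with no flows installed would report.
switch_counts = {"openflow:1": 0, "openflow:2": 0}
short = find_short_switches(config_counts, switch_counts)
print(config_counts, short)
```

A switch shows up in `short` only when the config DS holds flows for it that the switch does not report, which is the symptom described in step 9.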

Attached failed log



 Comments   
Comment by Naveen kumar Thatipalli [ 01/Mar/17 ]

Attachment flows_not_installed_connected_to_Leader_Node_SF142.mht has been added with description: Log_file for flows not installed into ovs switches when connected to leader node

Comment by Naveen kumar Thatipalli [ 01/Mar/17 ]

Verified with minimal flows.
500 flows per switch with 30 switches. Total = 15000 flows.
Four switches had zero flows and the remaining switches each had 500 flows installed.

Comment by Luis Gomez [ 01/Mar/17 ]

This actually reminds me of https://bugs.opendaylight.org/show_bug.cgi?id=7770, so maybe they are related. Also raising importance to critical, as it really is.

Comment by Luis Gomez [ 06/Mar/17 ]

Actually we can track this issue in this test case:

https://jenkins.opendaylight.org/releng/view/CSIT-3node/job/openflowplugin-csit-3node-periodic-bulkomatic-clustering-daily-only-carbon/199/robot/openflowplugin-bulkomatic-clustering-daily.txt/Cluster%20Reconcilliation%20Multi%20DPN/Verify%20Flows%20In%20Switch%20Connected%20To%20Leader/

It looks like the failure happens often but not consistently.

Comment by Luis Gomez [ 06/Mar/17 ]

This issue can be reproduced by just adding 1 flow in a follower and then connecting 1 switch to the leader of inventory config. It is similar to but not the same as OPNFLWPLUG-853. Also, this issue is only in Carbon, while 7770 is in both Carbon and Boron.

Comment by Luis Gomez [ 09/Mar/17 ]

This issue seems very inconsistent: when I tried today in my local cluster setup, I could not reproduce it.

Comment by Naveen kumar Thatipalli [ 09/Mar/17 ]

Hi Luis,
Reproduced the issue with fewer flows.

15 switches
Per switch : 1000 flows
Total : 15000 flows.

Log file:
https://jenkins.opendaylight.org/sandbox/job/openflowplugin-csit-3node-periodic-bulkomatic-clustering-daily-only-boron/9/robot/report/log.html

Some of the switches were populated with zero flows when connected to Leader Node.
Log snippet:
============

${flows} = [u'0', u'1000', u'1000', u'1000', u'1000', u'1000', u'1000', u'0', u'1000', u'1000', u'1000', u'1000', u'1000', u'1000', u'1000']
Start / End / Elapsed: 20170309 10:07:27.749 / 20170309 10:07:27.750 / 00:00:00.001
10:07:27.750 FAIL Keyword 'MininetKeywords.Check Flows In Mininet' failed after retrying for 1 minute 40 seconds. The last error was: 13000.0 != 15000.0
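
The totals in the failure message can be checked directly from the `${flows}` list above. A minimal Python sketch (the helper name `summarize_flows` is hypothetical and not part of the CSIT suite) that sums the per-switch counts and reports which list positions hold zero flows:

```python
def summarize_flows(flows, expected_per_switch):
    """Return (installed total, expected total, 1-based positions with zero flows)."""
    counts = [int(f) for f in flows]
    total = sum(counts)
    expected = expected_per_switch * len(counts)
    empty = [i + 1 for i, c in enumerate(counts) if c == 0]
    return total, expected, empty


# Per-switch counts copied from the log snippet above.
flows = ['0', '1000', '1000', '1000', '1000', '1000', '1000', '0',
         '1000', '1000', '1000', '1000', '1000', '1000', '1000']

total, expected, empty = summarize_flows(flows, 1000)
print(total, expected, empty)  # 13000 15000 [1, 8]
```

Two entries at zero account for the 2000 missing flows, matching the `13000.0 != 15000.0` error. Whether list positions 1 and 8 correspond to particular switch names is not stated in the log, so that mapping is left open here.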

Comment by Luis Gomez [ 14/Mar/17 ]

Right, I tried with 15 switches and 15K flows and it reproduces consistently. It is important to note that with a small number of switches (3 switches) and flows (1K flows) the issue does not show up, which means this is some kind of perf/scale-related issue.

Comment by Luis Gomez [ 15/Mar/17 ]

OK, after doing more tests, this issue also happens with a single node, so raising priority to blocker because this is an important regression. Also consider that this bug could be the same as 7915.

Comment by Luis Gomez [ 15/Mar/17 ]

Also, you can use odl_tester.py in the int/test repo to generate the flows as an alternative to bulkomatic:

python ./odl_tester.py --threads 5 --flows 10000 --fpr 100 --no-delete

Comment by Luis Gomez [ 15/Mar/17 ]

Reproducing is as simple as:

1) Start 1 controller
2) Start mininet: sudo mn --controller 'remote,ip=192.168.0.1,port=6633' --topo linear,15
3) Push 10K flows: python ./odl_tester.py --threads 5 --flows 10000 --fpr 100 --no-delete
4) Stop and start mininet
5) Check flows in switches: mininet> dpctl dump-aggregate -O OpenFlow13

2 additional notes:

  • The issue does not show when I enable FRS as you mention.
  • The regression was introduced after Boron SR2 and is present on both Boron and master branches.
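
The check in step 5 can be scripted by pulling `flow_count` out of the `dpctl dump-aggregate` output. A minimal Python sketch, assuming Mininet prints a `*** <switch> ---` header before each switch's reply and that each reply contains a `flow_count=N` field. The sample text and its numbers are illustrative only and are not taken from a real run; `parse_dump_aggregate` is a hypothetical helper:

```python
import re

# Assumed line shapes: a Mininet per-switch header, then an aggregate reply.
SWITCH_HEADER = re.compile(r"^\*\*\* (\S+) -+")
FLOW_COUNT = re.compile(r"flow_count=(\d+)")


def parse_dump_aggregate(text):
    """Map switch name -> flow_count parsed from dump-aggregate output."""
    counts = {}
    current = None
    for line in text.splitlines():
        header = SWITCH_HEADER.match(line)
        if header:
            current = header.group(1)
            continue
        match = FLOW_COUNT.search(line)
        if match and current is not None:
            counts[current] = int(match.group(1))
    return counts


# Illustrative output only (format and values are assumptions).
sample_output = """\
*** s1 ------------------------------------------------------------
OFPST_AGGREGATE reply (OF1.3) (xid=0x2): packet_count=0 byte_count=0 flow_count=0
*** s2 ------------------------------------------------------------
OFPST_AGGREGATE reply (OF1.3) (xid=0x2): packet_count=0 byte_count=0 flow_count=700
"""

counts = parse_dump_aggregate(sample_output)
empty_switches = sorted(name for name, n in counts.items() if n == 0)
print(counts, empty_switches)
```

Any switch listed in `empty_switches` after the Mininet restart in step 4 would indicate the problem described in this issue.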

Comment by Tomas Slusny [ 21/Mar/17 ]

Anil's fix for FRM was merged, so can this one be closed now?

Generated at Wed Feb 07 20:33:34 UTC 2024 using Jira 8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d.