[OPNFLWPLUG-865] Zero flows populated in all switches when connected to Leader Node. Created: 07/Mar/17  Updated: 27/Sep/21  Resolved: 13/Apr/17

Status: Resolved
Project: OpenFlowPlugin
Component/s: General
Affects Version/s: None
Fix Version/s: None

Type: Bug
Reporter: Naveen kumar Thatipalli Assignee: Luis Gomez
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Operating System: Linux
Platform: All


Attachments: File 010_small_config_single_switch_10k_flows_SF142_march_13th_2017.mht     File zero_flows_installed_when_connected_to_Leader_Node_boron_latest.mht    
External issue ID: 7915

 Description   

Zero flows populated in switches when connected to Leader Node.

Flows : 10k flows/switch
Switches : 15

Steps to Reproduce :
====================
1. Get Inventory Follower and Leader information

2. Start Mininet Connect To Follower Node1

3. Add Bulk Flow from follower node1

4. Get Bulk Flows and Verify In Cluster config DS and verify all flows are installed

5. Verify Default flows in switches when connected to Follower node. Here it shows zero flows in all 15 switches

6. Verify Flows In Switch when connected to Follower Node .

7. Flows will get installed when connected to Follower node.

8. Stop and start mininet to Leader Node.

9. Verify Flows in switch when connected to Leader Node .

10. Issue is consistent . Zero flows will be installed in all 15 switches when connected to Leader Node.

When connected to Follower node1 , populated total flows in all switches (150000.0)
After stop and start mininet when connected to Follower Node , populated total flows in all switches (150000.0)
Stop and start mininet connect to Leader node

sudo mn --controller 'remote,ip=10.29.12.
213,port=6633' --topo linear,15 --switch ovsk,protocols=OpenFlow13

Zero flows populated when connected to Leader Node.(0.0 != 150000.0)

given log file : https://jenkins.opendaylight.org/sandbox/job/openflowplugin-csit-3node-periodic-bulkomatic-clustering-daily-only-boron/6/robot/report/log.html



 Comments   
Comment by Naveen kumar Thatipalli [ 07/Mar/17 ]

Attachment zero_flows_installed_when_connected_to_Leader_Node_boron_latest.mht has been added with description: Log file for zero flows in 15 switches connected to Leader Node

Comment by Luis Gomez [ 08/Mar/17 ]

Is this not duplicate of https://bugs.opendaylight.org/show_bug.cgi?id=7884?

Comment by Naveen kumar Thatipalli [ 09/Mar/17 ]

Hi Luis,
This is not duplicate of bug <7884>.
7884 : Some of the switches are not getting installed with flows . Zero flows will be present in few switches. This happens when connected to Follower and Leader Node as well.
Example: 4 out of 15 switches will have zero flows.

7915 : All switches will be having zero flows when connected to Leader Node only. This issue is consistently reproducing.

Regards,
Naveen.

Comment by Luis Gomez [ 09/Mar/17 ]

OK, one question: can you reproduce this issue in some local environment or only in ODL jenkins?

Comment by Luis Gomez [ 09/Mar/17 ]

Also, now I realize you are using vey high # flows (150K), does the issue reproduces with less number (e.g. 15K)?

Comment by Naveen kumar Thatipalli [ 13/Mar/17 ]

Hi Luis,
Executed with less number of flows and given below log file. Issue is getting reproduced when connected to Leader Node.

Flows per switch : 10000
switches: 1
Total flows : 10000

log snippet:
============
Keyword 'MininetKeywords.Check Flows In Mininet' failed after retrying for 4 minutes 10 seconds. The last error was: 0.0 != 10000.0

Build : https://nexus.opendaylight.org/content/repositories/opendaylight.snapshot/org/opendaylight/integration/distribution-karaf/0.5.3-SNAPSHOT/distribution-karaf-0.5.3-20170312.234446-802.zip

Attached Log file for the same.

Regards,
Naveen.

Comment by Naveen kumar Thatipalli [ 13/Mar/17 ]

Executed in latest build with single switch with 10k flows. Able to reproduce issue and attached log file.

Comment by Naveen kumar Thatipalli [ 13/Mar/17 ]

Attachment 010_small_config_single_switch_10k_flows_SF142_march_13th_2017.mht has been added with description: Latest log with 10000 flows with single switch

Comment by Luis Gomez [ 14/Mar/17 ]

Right, this also reproduces easily with 1 switch and 10K flows, I believe this bug is connected with https://bugs.opendaylight.org/show_bug.cgi?id=7884 as both share same scenario of switch reconnecting from no-leader to leader.

Comment by Luis Gomez [ 15/Mar/17 ]

I think this could be a duplicate of https://bugs.opendaylight.org/show_bug.cgi?id=7884, therefore blocker too.

Comment by Luis Gomez [ 15/Mar/17 ]

Reproduce is as simple as:

1) Start 1 controller
2) Start mininet: sudo mn --controller 'remote,ip=192.168.0.1,port=6633' --topo linear,1
3) Push 10K flows: python ./odl_tester.py --threads 5 --flows 10000 --fpr 100 --no-delete
4) Stop and start mininet
5) check flows in switch: mininet> dpctl dump-aggregate -O OpenFlow13

2 additional notes:

  • The issue does not show when I enable FRS.
  • The regression was introduced after Boron SR2 and is present on both Boron and master branches.
Comment by Anil Vishnoi [ 16/Mar/17 ]

Hi Luis/Naveen,

I pushed the following patch that fixes this issue

https://git.opendaylight.org/gerrit/53383

Can you please take this patch try to recreate the issue again? I locally tried using the instruction that luis provided and it works for me.

Comment by Anil Vishnoi [ 19/Mar/17 ]

carbon :https://git.opendaylight.org/gerrit/#/c/53521/

Comment by Luis Gomez [ 28/Mar/17 ]

Reopen issue for carbon. Carbon fails the following tests for single controller (4 cpus):

10K flows in 1 switch:

1) Start 1 controller
2) Start mininet: sudo mn --controller 'remote,ip=192.168.0.1,port=6633' --topo linear,1
3) Push 10K flows: python ./odl_tester.py --threads 5 --flows 10000 --fpr 100 --no-delete
4) Stop and start mininet
5) check flows in switch: mininet> dpctl dump-aggregate -O OpenFlow13

20K flows in 8 switch:

1) Start 1 controller
2) Start mininet: sudo mn --controller 'remote,ip=192.168.0.1,port=6633' --topo linear,8
3) Push 20K flows: python ./odl_tester.py --threads 5 --flows 20000 --fpr 100 --no-delete
4) Stop and start mininet
5) check flows in switch: mininet> dpctl dump-aggregate -O OpenFlow13

Comment by Tomas Slusny [ 03/Apr/17 ]

I tested this on this patch: https://git.opendaylight.org/gerrit/#/c/53972/ and it is working. On master it is not, so this issue was probably caused by DeviceFlowRegistry performance regression.

Comment by Tomas Slusny [ 13/Apr/17 ]

Can you recheck this please Luis after my patches that fixed regression in 6917 was merged?

Comment by Luis Gomez [ 13/Apr/17 ]

Yes, this is fixed now.

Generated at Wed Feb 07 20:33:35 UTC 2024 using Jira 8.20.10#820010-sha1:ace47f9899e9ee25d7157d59aa17ab06aee30d3d.