neigh: enable garbage collection #1068

dirkmueller · 2017-02-04T20:40:44Z

VIPs and floating IPs that move between different interfaces might stay
incorrectly cached in the neighbor table for a very long time, until
garbage collection kicks in. By default a STALE entry (one that used to
have an active connection but no longer does) gets garbage collected
after gc_stale_time, but only if the neighbor table holds at least
gc_thresh1 entries in total. The default of 128 means that one has to
accumulate 128 entries (or trigger a forced cache flush) before this
happens, which for small/low-traffic clouds can take an eternity.
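
To illustrate the condition described above, here is a minimal sketch in Python. It is a simplified model, not the kernel's implementation: the `Neighbor` class and the `collectable` function are hypothetical names, and the model assumes that the periodic pass is skipped entirely while the table holds fewer than gc_thresh1 entries and that only STALE entries older than gc_stale_time (60 seconds by default) are dropped.

```python
from dataclasses import dataclass


@dataclass
class Neighbor:
    """Hypothetical stand-in for one neighbor table entry."""
    ip: str
    state: str        # e.g. "REACHABLE", "STALE", "PERMANENT"
    last_used: float  # seconds, on the same clock as `now`


def collectable(entries, now, gc_thresh1=128, gc_stale_time=60.0):
    """Return the entries a periodic GC pass would drop in this simplified model."""
    # Below the lower threshold the periodic pass does nothing at all,
    # no matter how old the individual entries are.
    if len(entries) < gc_thresh1:
        return []
    # Otherwise, STALE entries unused for longer than gc_stale_time go away.
    return [
        e for e in entries
        if e.state == "STALE" and now - e.last_used > gc_stale_time
    ]


if __name__ == "__main__":
    table = [Neighbor("192.0.2.10", "STALE", last_used=0.0)]
    # One hour later the entry is still kept: the table is below gc_thresh1.
    print(collectable(table, now=3600.0))
    # With the threshold lowered to 0, the same entry becomes collectable.
    print(collectable(table, now=3600.0, gc_thresh1=0))
```

In this model a single stale entry on a quiet node is never collected with the default threshold, which is the situation the change is meant to address.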

aplanas · 2017-02-06T10:57:40Z

chef/cookbooks/network/templates/default/sysctl_10gbe.conf.erb

@@ -6,6 +6,14 @@
 net.ipv4.ip_local_reserved_ports = 35357
 # Increase system IP port range to allow for more concurrent connections
 net.ipv4.ip_local_port_range = 27018 64999
+# ensure STALE arp neighbor entries expire from the cache, otherwise


aplanas · 2017-02-06T10:58:09Z

chef/cookbooks/network/templates/default/sysctl_10gbe.conf.erb

+# VIPs of an OpenStack service or the floating IP of a VM
+# might not become reachable
+# gc_thresh1 is the lower threshold that needs to be reached before
+# stale entries are getting garbage collected. the default of 128 means


vuntz

openstack-ansible is using a different approach: https://git.openstack.org/cgit/openstack/openstack-ansible-openstack_hosts/tree/defaults/main.yml#n46

Does that make sense? Or is your approach better?

dirkmueller added this to the Cloud 7 Update1 milestone Feb 4, 2017

Itxaka previously approved these changes Feb 6, 2017

aplanas reviewed Feb 6, 2017

dirkmueller force-pushed the fix_hanging_floating_ips branch from 3f69a02 to 80901e3 Compare February 6, 2017 13:47

dirkmueller force-pushed the fix_hanging_floating_ips branch from 80901e3 to 748540c Compare February 7, 2017 07:38

vuntz reviewed Mar 9, 2017

jsuchome added the needs backport to SOC7 (stable/4.0) label Jun 15, 2017

dirkmueller removed the needs backport to SOC7 (stable/4.0) label Oct 5, 2017

dirkmueller dismissed Itxaka’s stale review via 748540c June 12, 2023 01:46
