[DCOS_OSS-5116] dcos overlay is not working with CoreOS Stable 2079.3.0 Created: 28/Apr/19  Updated: 18/Jul/19  Resolved: 18/Jul/19

Status: Resolved
Project: DC/OS
Component/s: networking
Affects Version/s: DC/OS 1.12.3
Fix Version/s: None

Type: Bug Priority: Medium
Reporter: Comte Frédéric (Inactive) Assignee: Deepak Goel
Resolution: Cannot Reproduce  
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Team: DELETE Networking Team

 Description   

Hey,

CoreOS publish a new release for stable channel and I think dcos overlay is not working with it.

7: d-dcos: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1420 qdisc noqueue state UP group default
link/ether 02:42:59:ad:d6:66 brd ff:ff:ff:ff:ff:ff
inet 9.0.9.129/25 brd 9.0.9.255 scope global d-dcos
valid_lft forever preferred_lft forever
inet6 fe80::42:59ff:fead:d666/64 scope link
valid_lft forever preferred_lft forever

My d-dcos is working... each node can ping his own container with IP 9.0.X.Y ...
But the overlay network doesn't work and nodes can't ping anoter node container...

9: minuteman: <BROADCAST,NOARP,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN group default qlen 1000
link/ether 82:a4:de:2e:0c:c8 brd ff:ff:ff:ff:ff:ff
inet6 fe80::80a4:deff:fe2e:cc8/64 scope link
valid_lft forever preferred_lft forever
10: spartan: <BROADCAST,NOARP,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN group default qlen 1000
link/ether da:f7:9d:1f:ae:5f brd ff:ff:ff:ff:ff:ff
inet 198.51.100.1/32 scope global spartan
valid_lft forever preferred_lft forever
inet 198.51.100.2/32 scope global spartan
valid_lft forever preferred_lft forever
inet 198.51.100.3/32 scope global spartan
valid_lft forever preferred_lft forever
inet6 fe80::d8f7:9dff:fe1f:ae5f/64 scope link
valid_lft forever preferred_lft forever

Interface Spartan is OK but it seems that minuteman is not.

Route are not populated for the nodes :

0.0.0.0 10.192.253.1 0.0.0.0 UG 0 0 0 enp21s0f0
9.0.9.128 0.0.0.0 255.255.255.128 U 0 0 0 d-dcos
10.192.253.0 0.0.0.0 255.255.255.0 U 0 0 0 enp21s0f0
172.17.0.0 0.0.0.0 255.255.0.0 U 0 0 0 docker0

When node start i get the following journal log

Apr 27 04:05:31 datanode1 systemd-udevd[3123]: Could not generate persistent MAC address for minuteman: No such file or directory
Apr 27 04:05:32 datanode1 systemd-networkd[659]: minuteman: Gained carrier
Apr 27 04:05:32 datanode1 systemd-networkd[659]: minuteman: Gained IPv6LL
Apr 27 04:05:59 datanode1 mesos-agent[3443]: /proc/sys/net/ipv4/conf/minuteman/rp_filter: 2

Frédéric



 Comments   
Comment by Automation Bot [ 28/Apr/19 ]

JIRA automation rule triggered: Team field was updated based on the assignee

Comment by Comte Frédéric (Inactive) [ 29/Apr/19 ]

This morning all my routes are back and the dcos overlay is up. I don't understand what happened but it's ok now.

Comment by Deepak Goel [ 29/Apr/19 ]

Comte Frédéric logs might reveal more information.

Comment by Deepak Goel [ 17/Jul/19 ]

Comte Frédéric are you still facing this issue?

Generated at Tue May 24 03:57:23 CDT 2022 using JIRA 7.8.4#78004-sha1:5704c55c9196a87d91490cbb295eb482fa3e65cf.