New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docker causes system freeze on Ubuntu 14.10 #10355
Comments
Looks like it also caused FS coruption, some files are lost. Not cool. |
If you can provide Also, are you sure this is a docker issue? If the system completely freezes, that sounds like a kernel panic. If this is a cloud hosted system, you can try getting the console output from the cloud hosting console (or whatever the provider has), as the kernel panic output won't be in filesystem logs. |
+1 I have seen this 2 times in the past month
my sysinfo
|
@anandkumarpatel That kernel is ancient. The latest is @relgames kernel 3.16 might also be affected by that bug, but I'm not sure. Either way, you should make sure you've installed all system updates on your system. |
This is real machine, not a cloud. Not sure if it is related, I will try to reproduce tomorrow, but I also connected my phone to the USB port around that time:
Also last lines from docker log:
|
@relgames If you want something more likely to be stable, you should probably try Ubuntu 14.04. Ubuntu 14.10 is going to be replaced by Ubuntu 15.04 soon, so I wouldn't expect some hidden bug in kernel 3.16 to be fixed soon (edit: I'm not saying it wouldn't get fixed, just that it might have lower priority for a fix). |
Updated to the latest kernel 3.18.3, will see how it goes. |
I'm seeing a similar thing (same kernel, same syslog messages), but the client can't connect to the daemon at all; it fails with |
It looks very similar to my issue, but I'm not sure it's the same thing if you install crashdump (https://help.ubuntu.com/lts/serverguide/kernel-crash-dump.html) we can check if it's the same thing |
I am having a similar issue on Ubuntu 14.04. My dedicated server just hangs and I need to reboot it. What I can do to help debugging this issue? The output from
|
Similar issue on Ubuntu 14.04.03 , kernel 3.19. All networking freeze for 1-2 minutes, and then all become normal. It's repeated by 10-15 min interval. docker -D info:
|
Ping @thaJeztah
|
@rbjorklin thanks for reporting, please provide as much information as possible (e.g. How are containers started, what kind of processes are started in the container, amount of logging, etc.). Note that there has been a kernel issue with aufs, but that should be fixed in 3.13.0-79.123 (see #18180 (comment)). When did you encounter the hang? Were those machines fresh installs or just upgraded from 1.9.x? |
@thaJeztah We are running Marathon on top of Mesos so containers are started by the Mesos slave. All containers are running the official tomcat image with a bash script as ENTRYPOINT that traps sigterm to handle signals nicely. Inside the container we are also running the zabbix-agent to poll JMX values and report back. Pretty much all logging is sent out of the container to logstash with gelf. Tomcat is using this to get it's logs out. We encountered #18180 but this is different, this is way more serious since the entire machines froze. Sometime between 15.00 & 16.00 CET today (2016-03-11). The machines were upgraded to 1.10 the day it was released (2016-02-04) and then upgraded to 1.10.2 about two weeks ago. |
@rbjorklin if you're running Mesos, also be sure to upgrade to 1.10.3; 1.10.3 carries a patch that affected users running Mesos (see #19950). These hangs started after upgrading to 1.10.2? If so, can you open a new issue (Monday would be fine if you have access to those logs), to start "fresh". |
I have the same problem too with kernel root:/var/log/upstart# docker info root:/var/log/upstart# docker --version is it fixed in any release yet? |
@ninchan8328 What problem? When are you seeing problems? What is your daemon configuration? |
is this problem solved? I had the same problem... Server: |
@ZaZaLee this looks to be a kernel issue, so make sure you have your kernel up to date. |
i have the same problem with ubuntu 17.04 and recent kernel: Containers: 45 WARNING: No swap limit support |
@daveoncode can you open a new issue with details, steps to reproduce and relevant daemon, system logs? |
I have the same problem with centos7 and kernel 3.10.0-514.26.2.el7.x86_64: Containers: 2 WARNING: bridge-nf-call-iptables is disabled vmcore-dmesg [2525425.910118] device veth087bf86 entered promiscuous mode And I use crash to analysis vmcore and find out this RIP line crash> bt |
Mee too:
|
I'm closing this. If somebody is still hitting this, please open a new issue and also consider contacting to the distro's kernel maintainers. Note that system hang-up may happen in various different reasons. |
Starting multiple Docker containers hangs the system.
Not sure what exact steps are but I have seen such behaviour several times.
Jan 26 15:57:27 oleg kernel: [257250.221647] device vethf7a6cc6 entered promiscuous mode
Jan 26 15:57:27 oleg kernel: [257250.221822] IPv6: ADDRCONF(NETDEV_UP): vethf7a6cc6: link is not ready
Jan 26 15:57:27 oleg kernel: [257250.271640] IPv6: ADDRCONF(NETDEV_CHANGE): vethf7a6cc6: link becomes ready
Jan 26 15:57:27 oleg kernel: [257250.271692] docker0: port 1(vethf7a6cc6) entered forwarding state
Jan 26 15:57:27 oleg kernel: [257250.271705] docker0: port 1(vethf7a6cc6) entered forwarding state
Jan 26 15:57:28 oleg kernel: [257251.014089] docker0: port 1(vethf7a6cc6) entered disabled state
Jan 26 15:57:28 oleg kernel: [257251.015661] device vethf7a6cc6 left promiscuous mode
Jan 26 15:57:28 oleg kernel: [257251.015677] docker0: port 1(vethf7a6cc6) entered disabled state
Jan 26 15:57:30 oleg kernel: [257252.550674] device veth7707973 entered promiscuous mode
Jan 26 15:57:30 oleg kernel: [257252.551075] IPv6: ADDRCONF(NETDEV_UP): veth7707973: link is not ready
Jan 26 15:57:30 oleg kernel: [257252.598878] IPv6: ADDRCONF(NETDEV_CHANGE): veth7707973: link becomes ready
Jan 26 15:57:30 oleg kernel: [257252.598919] docker0: port 1(veth7707973) entered forwarding state
Jan 26 15:57:30 oleg kernel: [257252.598935] docker0: port 1(veth7707973) entered forwarding state
Jan 26 15:57:45 oleg kernel: [257267.637453] docker0: port 1(veth7707973) entered forwarding state
Here it hangs. Only off/on with a power button helps. Event Kernel Reset keys are not working ( https://en.wikipedia.org/wiki/Magic_SysRq_key )
Jan 26 15:58:43 oleg kernel: [ 0.000000] Initializing cgroup subsys cpuset
Jan 26 15:58:43 oleg kernel: [ 0.000000] Initializing cgroup subsys cpu
Jan 26 15:58:43 oleg kernel: [ 0.000000] Initializing cgroup subsys cpuacct
Jan 26 15:58:43 oleg kernel: [ 0.000000] Linux version 3.16.0-29-generic (buildd@tipua) (gcc version 4.9.1 (Ubuntu 4.9.1-16ubuntu6) ) #39-Ubuntu SMP Mon Dec 15 22:27:29 UTC 2014 (Ubuntu 3.16.0-29.39-gen
Jan 26 15:58:43 oleg kernel: [ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.16.0-29-generic.efi.signed root=/dev/mapper/ubuntu--vg-root ro
I'm not a Linux guru so let me know where else should I look for logs, dumps, etc
The text was updated successfully, but these errors were encountered: