
All jobs should be run with --rm now which makes this not necessary. #80

Merged

@tfoote merged 1 commit into master from remote_container_cleanup on Jan 13, 2016

Conversation

@tfoote (Member) commented Jan 6, 2016
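
For reference, the --rm flag asks Docker to remove a container automatically as soon as its main process exits, so a successfully completed job leaves nothing behind on the host. A minimal sketch of such an invocation, assuming the jobs shell out to the docker CLI from Python (the run_job helper, image, and command are placeholders, not this repository's actual code):

```python
import subprocess


def run_job(image, command):
    """Run a build job in a throwaway container.

    --rm tells Docker to delete the container as soon as the job
    process exits, so finished jobs do not accumulate on the host.
    """
    return subprocess.call(['docker', 'run', '--rm', image] + command)


# Placeholder image and command, for illustration only.
run_job('ubuntu:trusty', ['echo', 'hello from a throwaway container'])
```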

@tfoote (Member, Author) commented Jan 8, 2016

@dirk-thomas please review

@dirk-thomas (Member) commented

If, after numerous builds, no containers accumulate on the host machine, it looks good to me.

@tfoote (Member, Author) commented Jan 8, 2016

We still leak the occasional container. Introspecting one of our active slaves shows well over 100 containers leaked in the last 48 hours: https://gist.github.com/tfoote/ba1e8d608e98df9f1af7

I think it's worth leaving this cleanup in place. We just have to be sure that the minimum cleanup age is longer than any timeouts. I haven't seen any recent failures since we added the minimum age.
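
A rough sketch of the kind of cleanup being discussed (not the repository's actual script), assuming it shells out to the docker CLI from Python: only containers that have already exited and are older than a minimum age are removed, and that age must stay longer than any job timeout so a slow but still legitimate job never has its container deleted out from under it. The 24 hour threshold below is illustrative.

```python
import subprocess
from datetime import datetime, timedelta

# Assumption: this must be longer than any job timeout, so we never
# remove a container that a long-running job still needs.
MINIMUM_AGE = timedelta(hours=24)


def exited_container_ids():
    """IDs of all containers that have already exited."""
    output = subprocess.check_output(
        ['docker', 'ps', '-aq', '--filter', 'status=exited'],
        universal_newlines=True)
    return output.split()


def container_age(container_id):
    """Age of a container, based on its creation timestamp."""
    created = subprocess.check_output(
        ['docker', 'inspect', '--format', '{{.Created}}', container_id],
        universal_newlines=True).strip()
    # Docker reports e.g. 2016-01-08T12:34:56.123456789Z; drop the
    # fractional seconds and UTC suffix before parsing.
    created_at = datetime.strptime(created.split('.')[0].rstrip('Z'),
                                   '%Y-%m-%dT%H:%M:%S')
    return datetime.utcnow() - created_at


def cleanup_leaked_containers():
    """Remove exited containers that are older than the minimum age."""
    for container_id in exited_container_ids():
        if container_age(container_id) > MINIMUM_AGE:
            subprocess.call(['docker', 'rm', container_id])


if __name__ == '__main__':
    cleanup_leaked_containers()
```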

@dirk-thomas (Member) commented

While fixing the symptom with the cleanup script might work for now, I think we should investigate why these containers remain in the first place and try to fix the source of the problem.

@tfoote (Member, Author) commented Jan 11, 2016

This PR removes the cleanup script, which deals with a symptom. @esteve is looking into the underlying issue. I'm going to close this removal of the cleanup, since it is important to clean up this symptom in case it happens again: it can be catastrophic (i.e. slaves run out of disk space).

@tfoote closed this on Jan 11, 2016
@tfoote deleted the remote_container_cleanup branch on January 11, 2016 20:18
@dirk-thomas (Member) commented

If we do fix the actual problem, I would recommend merging this. Even if the result would be "catastrophic": without the merge, any future problem in this area would go unnoticed. I think a hard failure is better than hiding a problem.

@tfoote restored the remote_container_cleanup branch on January 11, 2016 21:28
@tfoote reopened this on Jan 11, 2016
@tfoote self-assigned this on Jan 11, 2016
@tfoote (Member, Author) commented Jan 11, 2016

I'm not sure I completely agree, but we can do that. Reopened.

@tfoote (Member, Author) commented Jan 13, 2016

Merging this so that any container leakage is apparent, and letting the hoped-for fix resolve it: ros-infrastructure/ros_buildfarm#124

tfoote added a commit that referenced this pull request Jan 13, 2016
All jobs should be run with --rm now which makes this not necessary.
@tfoote merged commit e3bc486 into master on Jan 13, 2016
@tfoote deleted the remote_container_cleanup branch on January 13, 2016 01:02