Support for nested build steps #8021

tonistiigi · 2014-09-13T18:56:49Z

This is a nested build implementation roughly based on #7149 by @proppy

`BUILD /path/to/build/context`

The BUILD instruction starts building a new image as if docker build would
have been run in the specified directory of the current image. Rest of the
instructions in the Dockerfile are now considered to be part of this new build
process. If there are no instructions and the context directory contains a
file called Dockerfile then this file is used instead. Files located in build
context directory are available as source paths to the ADD and COPY commands.
No previously set instruction has any effect to the new build.

Consider a Dockerfile:

FROM crosbymichael/golang
ADD . /go/src/github.com/tonistiigi/dnsdock
ENV CGO_ENABLED 0
RUN cd /go/src/github.com/tonistiigi/dnsdock && \
    go get -d ./... && \
    go install -a ./...

BUILD /go/bin

FROM scratch
ADD dnsdock /
ENTRYPOINT ["/dnsdock"]

Running docker build -t tonistiigi/dnsdock . will result in the creation of following images:

> docker images
tonistiigi/dnsdock          latest  3f7a44c4701a  6 seconds ago    7.844 MB
tonistiigi/dnsdock-builder  latest  dac44acfcdd9  10 seconds ago   474.7 MB

The final image is minimal, containing only a static binary compiled by previous image. Builder layers are kept to keep build cache working.

Parent build step can be also used for dynamic Dockerfile creation. In this case BUILD instruction needs to be the final instruction in the Dockerfile.

FROM ubuntu
ADD . /src
RUN /src/build-new-dockerfile.sh > /src/Dockerfile
BUILD /src

Number of build steps isn't limited. Last image gets the tag.

cat <<EOT > Dockerfile
FROM busybox
RUN /bin/mkdir -p /out && /bin/touch /out/a
BUILD /out

FROM busybox
ADD . /out
RUN /bin/mkdir -p /out && /bin/touch /out/b
BUILD /out

FROM busybox
ADD . /out
RUN /bin/mkdir -p /out && /bin/touch /out/c

EOT

docker build -t abc .
docker run abc /bin/ls /out

# prints:  a   b    c

I realize that the proposals with different solutions for fixing this problem have not yet been agreed on by the maintainers. Just hoping that having one implementation ready for testing/review brings the final solution for minimal containers closer for everybody.

SvenDowideit · 2014-09-15T00:00:08Z

This would need some more documentation in builder.md and in the userguide. (no need to rush, I think this is already complete enough as a tech demo)

I've added your man page excerpt at the top of your description.

proppy · 2014-09-18T17:28:32Z

builder/dispatchers.go

+	copy := *b
+	builder := &copy
+
+	_, err = builder.Run(archive)


Did you consider create a new Job("build") instead?

Not sure it would be more appropriate, but it would make this tracked by the engine.

/cc @vieux

Yes I did.

The problem is that if we just remember the current job and clone it then we would break usage of this package just by its public api. It would have to be changed so that you can only build from a Job object.

If you mean making a new Job from the properties of the current Builder then this would mean maintaining a double logic of how a Job is converted to a Builder and how a Builder is converted to a Job. I think this part changes quite often with new features and if its not perfectly in sync then bad stuff can easily happen.

Up for discussion of course. Maybe I'm missing some important value that having a separate Job adds. Shutdown and logging should work the same AFAIK.

Why not just change the current builder's context ?

@plietar The current image Id is stored as a property of the Builder. If there is no clone then inner build would overwrite this Id for the outer build.

tonistiigi · 2014-10-02T08:53:27Z

I rebased and fixed 2 issues:

-builder suffix didn't work properly if more than 2 images were created
generated Dockerfile for subbuild could invalidate the cache(for example when whole root was copied).

diff without rebase

proppy · 2014-10-07T15:19:16Z

/cc @erikh, @shykes who gave informal endorsement during last contributor meeting

tonistiigi · 2014-10-07T18:26:11Z

Rebased.

IRC meeting log: https://botbot.me/freenode/docker-dev/2014-10-02/?msg=22753567&page=5

I guess the first step would be to clear the design issues. Mainly it seems to be concerning the parent image tagging. I'm not against name/builder, but would like to remind that this means name/builder/builder when there are more build steps. That may not be so easy to understand because it starts to look like a directory structure, but directories don't run from inside to outside.

Current implementation adds the suffix to whatever tag-like string user entered. I left it so specifically so that one could use it to generate suffixes for both tags or version, whichever the user liked more. This is probably even more controversial than hyphens. From IRC it seems that most people expect it to be added only to the image name part.

For me personally I don't see any other value in the builder images than to just keep the cache around. I probably wouldn't ever want to publish a builder image. Also, this part was not in the original proposal. But when I run docker images I'd like to know that this big image with no tag is there because it was used to build something that I use. There could be a different strategy - to store the connection internally and just show something like <builder of user/imagename> instead of <none> when you list images.

SvenDowideit · 2014-10-08T01:56:45Z

I suspect in a 3 phase i might want to use the builder images - but perhaps thats something that could be optional - something like docker build -t test --name-phases might result in test as the final result, and test-1 and test-2 as the 2 intermediates?

colemickens · 2014-10-13T06:41:06Z

I assume, since your examples work, that I must be doing something wrong when trying to use your nested-builds-0 branch: https://github.com/colemickens/polykube/blob/ca13c034a2697ca2a3065e937219fdb2adc02271/Dockerfile-goapi

rm -f Dockerfile
ln -s Dockerfile-goapi Dockerfile
docker build -t polykube/goapi .
Sending build context to Docker daemon   576 kB
Sending build context to Docker daemon 
Step 0 : FROM google/golang
 ---> ef0bc6d3875f
Step 1 : WORKDIR /gopath/src/goapi
 ---> Using cache
 ---> dcba2158c9ac
Step 2 : ADD ./src/goapi /gopath/src/goapi/
 ---> Using cache
 ---> 971f9f977933
Step 3 : RUN go get goapi
 ---> Using cache
 ---> 85b206603766
Step 4 : BUILD /gopath/bin 
FROM scratch

ADD goapi /

ENTRYPOINT ["/goapi"]

2014/10/12 23:37:09 No command specified
Makefile:91: recipe for target 'docker-goapi' failed
make: *** [docker-goapi] Error 1

tonistiigi · 2014-10-13T07:52:50Z

@colemickens That is a bug. I'll try to fix it later today. Quick workaround is to use a CMD before the BUILD keyword. The command itself doesn't matter as its not really executed, something like CMD ["/bin/bash"] should do.

Also note that if you want to base your images from scratch they have to be completely static binaries. Use ldd to check that. Here's some tutorials about static go binaries https://medium.com/@kelseyhightower/optimizing-docker-images-for-static-binaries-b5696e26eb07
http://blog.hashbangbash.com/2014/04/linking-golang-statically/

edit: fixed

colemickens · 2014-10-14T00:05:27Z

I can confirm it's fixed! Thank you for the links as well. I knew to disable CGO (though I'd forgot) but I didn't know the extensiveness of the rest...

stp-ip · 2014-10-18T23:43:37Z

Is there any ETA for this function or similar to be integrated into Docker? Is it "production" usable (with the usual beta hiccups) and could be used as long as it's going to be included later on anyway?

WhisperingChaos · 2014-10-24T19:59:03Z

No quite true: “No previously set instruction has any effect to the new build”

Given:

FROM Ubuntu
ADD Hello /NewContext/Hello
BUILD /NewContext

FROM BusyBox
ADD Hello /Hello
ENTRYPOINT Hello

Assume the above successfully builds image.

Now change the file named “Hello” to “HelloThere” in the NewContext:

FROM Ubuntu
ADD Hello /NewContext/HelloThere
BUILD /NewContext

FROM BusyBox
ADD Hello /Hello  # This add will fail because “Hello” no longer exists in the build context.
ENTRYPOINT Hello

Building this image will fail.

There is no indirect binding element in this design that would improve on its encapsulation and prevent coding changes applied to a prior build step from affecting subsequent ones. Absent indirect binding, the Dockerfile build context references within the resultant image of this example couple directly to the build context “/NewContext”. Therefore, alterations to the build context introduced by “previous instructions” can certainly affect the implementation of the “new build” (subsequent build steps).

The second example involving multiple build steps demonstrates some of the issues documented here such as:

Adding to a Developer’s cognitive load by introducing N contexts where N is number of build steps.
Forced copying of the initial and any subsequent partial build context, as it travels through the build pipeline (steps) until it’s been completely constructed, as the final/aggregated context.

Additionally, this solution diminishes the declarative nature of a nontrivial Dockerfile as a Developer:

Must encode an ADD in each build step to copy the partial build context, created so far, to a writeable location in the current step’s file system.
Must encode a BUILD, after an intermediate step, to identify the build context passed to the next one.
Must encode one or more ADD commands in the body of each step to copy appropriate source (input) artifacts required to generate the resultant output ones from the current build context.
Must encode a RUN command to perform the desired transform.
Must encode the final ADD commands to populate the resultant image.

tonistiigi · 2014-10-24T20:19:41Z

@WhisperingChaos

This is a docs issue. I should find a better wording to avoid confusion. The reason for having a builder image is to prepare a new context for the nested build, so of course everything in this directory by that time is available(as also stated by the docs). This sentence was meant to indicate that this is a completely new build and any ENV, WORKDIR, USER, CMD etc commands do not apply any more.

edit: To clarify more, your sample does not fail because you ran ADD Hello but because there is no file Hello in directory /NewContext.

Regarding everything else, I'd appreciate leaving this discussion to your own proposal thread or to the proposal thread for this feature, and leaving this PR for discussions about the current implementation.

WhisperingChaos · 2014-10-24T21:31:43Z

@tonistiigi

Yes – I agree the build fails because the file “Hello” is missing but the example also demonstrates the tight coupling between build steps that suggest the design is weakly encapsulated. It’s this weak encapsulation that most concerns me.
Sure, I’ll move the section of my post that contrasts this proposal to mine, however, the concerns regarding weak encapsulation, cognitive load, and diminished declarative nature are directly relevant to this PR and have nothing to do with mine.

tonistiigi · 2014-10-24T22:02:52Z

@WhisperingChaos

the example also demonstrates the tight coupling between build steps that suggest the design is weakly encapsulated

I do not follow.

Sure, I’ll move the section of my post that contrasts this proposal to mine, however, the concerns regarding weak encapsulation, cognitive load, and diminished declarative nature are directly relevant to this PR and have nothing to do with mine.

Absolutely. I'm not trying to silence your opinion in any way.

Regarding my own examples, the first one is the use case for this PR. Second and third are just there to explain things that may not be so obvious. My third example is complete nonsense, without any practical value, just to show what would happen in this case. You can make the same image today, with a 2 line Dockerfile.

stp-ip · 2014-10-30T11:09:10Z

Any progress so far? I'm eagerly looking forward to this.

tonistiigi · 2014-10-30T12:10:45Z

@stp-ip I wouldn't be too optimistic. While this is waiting for design decisions from @shykes, @erikh and @tiborvass are working on Dockerfile2 that has some overlapping https://gist.github.com/erikh/2447342612fbbd938b8c. Probably weeks until there are decisions if we're moving on with this PR or abandoning in favor of something else.

edit: btw, if anyone needs a rebase, just let me know

stp-ip · 2014-10-30T12:22:54Z

@tonistiigi That's unfortunate as this would effect decissions concerning secret distribution as discussed in kubernetes/kubernetes#2030, which could be interesting in docker itself. Any ETA for Dockerfile2 decissions itself? Or which version it could hopefully land?

tonistiigi · 2014-10-30T13:22:24Z

@stp-ip From my discussion yesterday with @erikh at #docker-dev , it seems to be in very early stages. Make sure to share your thoughts when it gets closer to official proposal.

thaJeztah · 2014-11-01T11:49:47Z

@tonistiigi I noticed a "secret" badge when visiting that gist; Is it something to be shared publicly, or did you "spill the beans"?

tonistiigi · 2014-11-01T11:56:28Z

@thaJeztah no, its ok. It was shared publicly in irc.

thaJeztah · 2014-11-01T12:23:19Z

@tonistiigi okay then, just confirming, thnx 😄

erikh · 2014-11-01T18:35:15Z

Hang on. :) Nobody has made any decisions yet.

On Oct 30, 2014, at 5:10 AM, Tõnis Tiigi notifications@github.com wrote:

@stp-ip https://github.com/stp-ip I wouldn't be too optimistic. While this is waiting for design decisions from @shykes https://github.com/shykes, @erikh https://github.com/erikh and @tiborvass https://github.com/tiborvass are working on Dockerfile2 that has some overlapping https://gist.github.com/erikh/2447342612fbbd938b8c https://gist.github.com/erikh/2447342612fbbd938b8c. Probably weeks until there are decisions if we're moving on with this PR or abandoning in favor of something else.

—
Reply to this email directly or view it on GitHub #8021 (comment).

prologic · 2014-11-14T01:24:48Z

+1 just wanted to say I like this proposal :) It seems it could be very composable :)

BUILD instruction starts a new empty builder process. Context for the new build is set to one of directories in current image. Signed-off-by: Tõnis Tiigi <tonistiigi@gmail.com> (github: tonistiigi)

tonistiigi · 2014-12-15T20:44:30Z

Not giving up on this yet. Here's another rebase.

If anyone wants to play with this you can get a x64 binary from https://www.dropbox.com/s/smj2b9tvl63f1qw/docker-1.4.0-nested-builds?dl=0 that has this patch on top of v1.4.0. Client side binaries don't require modification.

icecrime · 2015-01-06T18:45:55Z

Review session with @unclejack @tiborvass @crosbymichael

Thank you for your contribution, however we decided to close this because no proposal has been decided on. We need an agreement on a design from @shykes (probably with @proppy's participation), and that debate can continue on the already opened proposals.

tonistiigi · 2015-01-06T20:10:30Z

Have to say I'm a bit disappointed about this. The reason for this implementation was to force the maintainers to make a decision instead of letting the proposals die out.

Although many people need something like this, there hasn't been any meaningful response to @proppy's proposal since July. Same goes to the @shykes original proposal or the proposal by @WhisperingChaos on the same topic.

Its fine for me to close it because the proposal sucks or the implementation is crappy. But I can only read the current reasoning as "we don't have time to deal with this".

stp-ip · 2015-01-06T22:57:04Z

I agree. There are a few proposals, but none seem to be worked with. This one included.

colemickens · 2015-01-06T23:03:34Z

I'm also quite disappointed in this. Outside of this patch, and cheating with tar and pipes, it is rather difficult to achieve minimal containers in some instances.

Closing functional proposals with working code seems more than a tad bit rude. What "already opened proposals" are allowed to have discussion, and why isn't this one of them? Especially since tonisttiigi's maintaining this patch, it's easily testable...

crosbymichael · 2015-01-06T23:57:46Z

@colemickens I would not call closing functional PRs rude as there are only two possible outcomes with a PR. It's either merged or it's closed. This is part of open source. Decisions have to be made and if we merged every single PR into docker it would certainly become unusable for most of us.

@tonistiigi Sorry you feel this way, it was not our intention. The reason why this was closed is because it is based on a proposal that has not yet been finalized, nothing more.

As far as the responsiveness on the proposals for this, it is true that there have not been any meaningful comments for a while now. We have a huge amount of volume on PRs, issues, and requests and it is true that we don't have time to review and finalize everything. All of the maintainers review almost every single comment on the repository and if you have ever watched this repo you know the amount of volume that we are working with. Even if the maintainers don't immediately respond, we do read them, all of them. We just have to make some decisions on priorities and make sure that we are able to address more pressing issues over others. Bugs and security issues usually take priority over new feature requests.

Part of the work being done by the maintainers right now is to be more responsive. We are always looking for more people to help in the maintainer roles as there are more ways to contribute to an open source project than writing code ( even though this is the fun part ).

Anyways, sorry again if you felt that we closed this for the wrong reasons. Feel free to ping me personally on issues, PRs, and proposals. I try really hard to respond quickly when pinged and reaching out to the other maintainers on IRC or pings on issues helps us see where people are having issues with the project and what we should focus on.

proppy · 2015-01-07T00:01:00Z

@tonistiigi I could update #7149 to match your implementation, would you be ok with this? @crosbymichael @icecrime do you see this as a possible path forward?

tonistiigi · 2015-01-07T09:40:17Z

@proppy Yes, fine by me if you think that helps to move this process forward.

WhisperingChaos · 2015-03-11T17:21:12Z

Sorry @tonistiigi for the excessive delay in my reply. However, I needed time to reconstruct and extend the comparison between Nested/Chained Build and Function Idiom proposals that was originally presented in this thread after you correctly requested that it be removed and associated to the Function Idiom one.

the example also demonstrates the tight coupling between build steps that suggest the design is weakly encapsulated

I do not follow.

Please review the comparison between Nested Build and Function Idiom that now exists as part of the Function Idiom thread #8660. You'll find a table at the end of the fourth reply that should better present the 'tight coupling' argument.

My third example is complete nonsense, without any practical value, just to show what would happen in this case. You can make the same image today, with a 2 line Dockerfile.

Simple examples eliminate the implication that an approach's weakness is somehow related to the complexity of the example. Instead, encountering an issue in simple examples should cause reassessment, as the problem is likely innate to the approach. The 'nonsense' example presents the bare minimum operations of a given Nested Build: a single ADD, a single core statement, in this case RUN, and a potentially subsequent BUILD. It also includes a small number of adjacent build steps. If weak encapsulation/tight coupling can be demonstrated by this example then it's probably innate to Nested Build.

jamtur01 added the /project/doc label Sep 14, 2014

SvenDowideit mentioned this pull request Sep 16, 2014

Let Dockerfile ADD pick file from another docker image #7992

Closed

proppy reviewed Sep 18, 2014
View reviewed changes

proppy mentioned this pull request Sep 25, 2014

cmd: build minimal docker images for all cmds kubernetes/kubernetes#1444

Closed

tonistiigi force-pushed the nested-builds-0 branch from c4923d6 to ee54c51 Compare October 2, 2014 08:48

tonistiigi force-pushed the nested-builds-0 branch from ee54c51 to 871f145 Compare October 7, 2014 18:19

tonistiigi mentioned this pull request Oct 7, 2014

Proposal: Dockerfile BUILD instruction #7149

Closed

tonistiigi force-pushed the nested-builds-0 branch from 871f145 to 5fa8c08 Compare October 13, 2014 11:01

stp-ip mentioned this pull request Oct 22, 2014

Support dynamic configuration distribution kubernetes/kubernetes#1553

Closed

stp-ip mentioned this pull request Oct 30, 2014

Command order at runtime with secret distribution #2030 and dynamic configuration #2068 kubernetes/kubernetes#2069

Closed

vbatts added the UX label Nov 13, 2014

prologic mentioned this pull request Nov 24, 2014

Provide a deploy-able Dockerfile (the existing one is more for tests) sandstorm-io/sandstorm#163

Open

Add BUILD instruction support to Dockerfiles

aa74bbd

BUILD instruction starts a new empty builder process. Context for the new build is set to one of directories in current image. Signed-off-by: Tõnis Tiigi <tonistiigi@gmail.com> (github: tonistiigi)

tonistiigi force-pushed the nested-builds-0 branch from 5fa8c08 to aa74bbd Compare December 15, 2014 20:24

icecrime closed this Jan 6, 2015

tonistiigi mentioned this pull request Jan 19, 2015

Build in a busybox image? aacebedo/dnsdock#6

Closed

WhisperingChaos mentioned this pull request Mar 8, 2015

Proposal: Incorporate docker run IMAGE into BUILD #8660

Closed

tiborvass mentioned this pull request Mar 23, 2015

Proposal: Nested builds #7115

Closed

WhisperingChaos mentioned this pull request May 24, 2015

Proposal: remove support for multiple FROMs in Dockerfile #13026

Closed

Support for nested build steps #8021

Support for nested build steps #8021

Conversation

tonistiigi commented Sep 13, 2014

BUILD /path/to/build/context

SvenDowideit commented Sep 15, 2014

proppy Sep 18, 2014

Choose a reason for hiding this comment

tonistiigi Sep 18, 2014

Choose a reason for hiding this comment

plietar Sep 18, 2014

Choose a reason for hiding this comment

tonistiigi Sep 19, 2014

Choose a reason for hiding this comment

tonistiigi commented Oct 2, 2014

proppy commented Oct 7, 2014

tonistiigi commented Oct 7, 2014

SvenDowideit commented Oct 8, 2014

colemickens commented Oct 13, 2014

tonistiigi commented Oct 13, 2014

colemickens commented Oct 14, 2014

stp-ip commented Oct 18, 2014

WhisperingChaos commented Oct 24, 2014

tonistiigi commented Oct 24, 2014

WhisperingChaos commented Oct 24, 2014

tonistiigi commented Oct 24, 2014

stp-ip commented Oct 30, 2014

tonistiigi commented Oct 30, 2014

stp-ip commented Oct 30, 2014

tonistiigi commented Oct 30, 2014

thaJeztah commented Nov 1, 2014

tonistiigi commented Nov 1, 2014

thaJeztah commented Nov 1, 2014

erikh commented Nov 1, 2014

prologic commented Nov 14, 2014

tonistiigi commented Dec 15, 2014

icecrime commented Jan 6, 2015

tonistiigi commented Jan 6, 2015

stp-ip commented Jan 6, 2015

colemickens commented Jan 6, 2015

crosbymichael commented Jan 6, 2015

proppy commented Jan 7, 2015

tonistiigi commented Jan 7, 2015

WhisperingChaos commented Mar 11, 2015

`BUILD /path/to/build/context`