[v2,1/3] classes: Add initramfs class

Message ID 20200923162046.206888-1-hws@denx.de
State Superseded, archived

Commit Message

Harald Seiler Sept. 23, 2020, 8:20 a.m. UTC
Add a new "image" class for generating a custom initramfs.  It works
like this: A new minimal debian rootfs is bootstrapped and all
dependency packages for the new initramfs are installed.  Then, an
initramfs is generated from this rootfs and deployed like usual.

This new initramfs.bbclass "image" class should be pulled in by an
"initramfs image" recipe.  Said recipe then specifies all dependencies
of the initramfs via INITRAMFS_INSTALL and INITRAMFS_PREINSTALL (which
are analogous to the respective IMAGE_* variables).

initramfs.bbclass intentionally does _not_ expose a mechanism to change
/etc/initramfs-tools/initramfs.conf and /etc/initramfs-tools/modules.
Changes to their settings are better done via packages that deploy
conf-hooks to /usr/share/initramfs-tools/conf-hooks.d/ and module
fragment files to /usr/share/initramfs-tools/modules.d/.

Signed-off-by: Harald Seiler <hws@denx.de>
---

Notes:
    I had this idea while searching for a way to build an initramfs that
    uses dm-verity to assert integrity of the rootfs.  To me, this feels
    like a much cleaner solution than anything else I tried and I'm happy to
    report that, using this approach, I got everything working nicely in the
    original project.
    
    In my opinion, this design has a number of advantages over the previous
    solutions we have seen so far:
    
     - It does not suffer any kind of initramfs pollution, caused by
       packages installed into a rootfs.  This is a big problem when trying
       to generate an initramfs from e.g. `buildchroot-target` as many
       unrelated packages could be installed there which would all get
       pulled into the initrd (if they install hooks/scripts).
    
       This also means, with this new approach, the integrator has maximum
       control over the contents of the initramfs.
    
     - There is no need to change the initramfs generation process in any
       way; the debian tooling can be used exactly like it's meant to.
    
     - As most isar-generated images will never regenerate the initramfs
       from the running system, all initramfs related packages are dead-weight
       to the image.  This is a problem when trying to generate the initramfs
       from the actual image rootfs.
    
       When it is necessary to rebuild the initramfs in a running system,
       the packages designed for this new class could just be installed into
       the rootfs, without any changes necessary.  This means, any generic
       initramfs module packages can be used both with the in-rootfs mechanism
       and initramfs.bbclass.
    
     - Because of this complete isolation and independence, implementation
       of complex logic is much easier:  For example dm-verity needs
       a root-hash that is only available after the rootfs has been cast into
       a filesystem image.  With this new approach, this can be modelled with
       a simple task dependency.
    
    Changes in v2:
    - None (just added examples in new patches)

 meta/classes/initramfs.bbclass | 41 ++++++++++++++++++++++++++++++++++
 1 file changed, 41 insertions(+)
 create mode 100644 meta/classes/initramfs.bbclass
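
As a rough illustration of the usage described in the commit message, an
"initramfs image" recipe might look like the sketch below.  The recipe
name, the description and both package names are assumptions made up for
this example; only the `initramfs` class and the INITRAMFS_INSTALL and
INITRAMFS_PREINSTALL variables come from the patch.

```
# my-initramfs.bb (hypothetical recipe name)
DESCRIPTION = "Custom initramfs generated from its own minimal rootfs"

inherit initramfs

# Packages taken from the distro mirrors, analogous to IMAGE_PREINSTALL.
# "busybox" is only an illustrative choice.
INITRAMFS_PREINSTALL += "busybox"

# Packages built by other recipes, analogous to IMAGE_INSTALL.  The class
# also adds these to DEPENDS.  "my-initramfs-hooks" is a hypothetical
# package that would ship conf-hooks, module fragments or scripts.
INITRAMFS_INSTALL += "my-initramfs-hooks"
```

With the class as posted, the result would be deployed as
${DEPLOY_DIR_IMAGE}/${PN}-${DISTRO}-${MACHINE}.initrd.img, following
INITRAMFS_IMAGE_FILE and INITRAMFS_FULLNAME.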

Comments

Henning Schild Sept. 25, 2020, 5:16 a.m. UTC | #1
To me this sounds like a "buildchroot" specifically for the initramfs.
And the target would be to keep the stuff we need only in the initramfs
out of the "final" rootfs.

More inline.

On Wed, 23 Sep 2020 18:20:44 +0200
Harald Seiler <hws@denx.de> wrote:

> Add a new "image" class for generating a custom initramfs.  It works
> like this: A new minimal debian rootfs is bootstrapped and all
> dependency packages for the new initramfs are installed.  Then, an
> initramfs is generated from this rootfs and deployed like usual.
> 
> This new initramfs.bbclass "image" class should be pulled in by an
> "initramfs image" recipe.  Said recipe then specifies all dependencies
> of the initramfs via INITRAMFS_INSTALL and INITRAMFS_PREINSTALL (which
> are analogous to the respective IMAGE_* variables).
> 
> initramfs.bbclass intentionally does _not_ expose a mechanism to
> change /etc/initramfs-tools/initramfs.conf and
> /etc/initramfs-tools/modules. Changes to their settings are better
> done via packages that deploy conf-hooks to
> /usr/share/initramfs-tools/conf-hooks.d/ and module fragment files to
> /usr/share/initramfs-tools/modules.d/.
> 
> Signed-off-by: Harald Seiler <hws@denx.de>
> ---
> 
> Notes:
>     I had this idea while searching for a way to build an initramfs that
>     uses dm-verity to assert integrity of the rootfs.  To me, this feels
>     like a much cleaner solution than anything else I tried and I'm happy to
>     report that, using this approach, I got everything working nicely in the
>     original project.
>
>     In my opinion, this design has a number of advantages over the previous
>     solutions we have seen so far:
>
>      - It does not suffer any kind of initramfs pollution, caused by
>        packages installed into a rootfs.  This is a big problem when trying
>        to generate an initramfs from e.g. `buildchroot-target` as many
>        unrelated packages could be installed there which would all get
>        pulled into the initrd (if they install hooks/scripts).

I was about to ask why not just use that chroot ... got it.

>        This also means, with this new approach, the integrator has maximum
>        control over the contents of the initramfs.

And maximum responsibility. Using an initrd that was not generated from
the "final" rootfs probably violates some assumptions or can at least
be seen as highly unusual. Not sure that is or could become a problem.

>      - There is no need to change the initramfs generation process in any
>        way; the debian tooling can be used exactly like it's meant to.
>
>      - As most isar-generated images will never regenerate the initramfs
>        from the running system, all initramfs related packages are dead-weight
>        to the image.  This is a problem when trying to generate the initramfs
>        from the actual image rootfs.

Yes dead-weight, but is it heavy? In terms of disk-space we are
probably talking about really not much, if you compare to what
applications will pull in. But I do not know. I guess lvm, mdadm,
cryptsetup and friends could really add up ... but wic does not have
that anyways ;)

>        When it is necessary to rebuild the initramfs in a running system,
>        the packages designed for this new class could just be installed into
>        the rootfs, without any changes necessary.  This means, any generic
>        initramfs module packages can be used both with the in-rootfs mechanism
>        and initramfs.bbclass.
>
>      - Because of this complete isolation and independence, implementation
>        of complex logic is much easier:  For example dm-verity needs
>        a root-hash that is only available after the rootfs has been cast into
>        a filesystem image.  With this new approach, this can be modelled with
>        a simple task dependency.

I guess one can pass that hash to the initrd with an argument from the
kernel command line. So you probably pass it into your wks, or whatever
imager controls the cmdline.

How much does the generation of the initrd impose on the "real" rootfs,
in terms of "dead" packages? And how polluted can an initrd become when
generated from a rootfs that "contains too much"? Maybe you have
numbers for your layer giving a perspective on the gain.

Generating the initramfs not from the rootfs means that the manifest
will not mention all packages shipped in the image, which could cause
legal issues when overlooked. Adding the manifests might result in
too much clearing effort, because not all that is in the
initramfs-buildchroot will be in the initramfs.

Henning

>     Changes in v2:
>     - None (just added examples in new patches)
> 
>  meta/classes/initramfs.bbclass | 41 ++++++++++++++++++++++++++++++++++
>  1 file changed, 41 insertions(+)
>  create mode 100644 meta/classes/initramfs.bbclass
>
> diff --git a/meta/classes/initramfs.bbclass b/meta/classes/initramfs.bbclass
> new file mode 100644
> index 000000000000..8af9b4b379a5
> --- /dev/null
> +++ b/meta/classes/initramfs.bbclass
> @@ -0,0 +1,41 @@
> +# This software is a part of ISAR.
> +
> +# Make workdir and stamps machine-specific without changing common PN target
> +WORKDIR = "${TMPDIR}/work/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/${PV}-${PR}"
> +STAMP = "${STAMPS_DIR}/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/${PV}-${PR}"
> +STAMPCLEAN = "${STAMPS_DIR}/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/*-*"
> +
> +INITRAMFS_INSTALL ?= ""
> +INITRAMFS_PREINSTALL ?= ""
> +INITRAMFS_ROOTFS ?= "${WORKDIR}/rootfs"
> +INITRAMFS_IMAGE_FILE = "${DEPLOY_DIR_IMAGE}/${INITRAMFS_FULLNAME}.initrd.img"
> +
> +# Install proper kernel
> +INITRAMFS_INSTALL += "${@ ("linux-image-" + d.getVar("KERNEL_NAME", True)) if d.getVar("KERNEL_NAME", True) else ""}"
> +
> +# Name of the initramfs including distro&machine names
> +INITRAMFS_FULLNAME = "${PN}-${DISTRO}-${MACHINE}"
> +
> +DEPENDS += "${INITRAMFS_INSTALL}"
> +
> +ROOTFSDIR = "${INITRAMFS_ROOTFS}"
> +ROOTFS_FEATURES = ""
> +ROOTFS_PACKAGES = "initramfs-tools ${INITRAMFS_PREINSTALL} ${INITRAMFS_INSTALL}"
> +
> +inherit rootfs
> +
> +do_generate_initramfs() {
> +    rootfs_do_mounts
> +    rootfs_do_qemu
> +
> +    sudo -E chroot "${INITRAMFS_ROOTFS}" \
> +        update-initramfs -u -v
> +
> +    if [ ! -e "${INITRAMFS_ROOTFS}/initrd.img" ]; then
> +        die "No initramfs was found after generation!"
> +    fi
> +
> +    rm -rf "${INITRAMFS_IMAGE_FILE}"
> +    cp "${INITRAMFS_ROOTFS}/initrd.img" "${INITRAMFS_IMAGE_FILE}"
> +}
> +addtask generate_initramfs after do_rootfs before do_build
Harald Seiler Oct. 6, 2020, 3:24 a.m. UTC | #2
Hi,

On Fri, 2020-09-25 at 15:16 +0200, Henning Schild wrote:
> To me this sounds like a "buildchroot" specifically for the initramfs.

That's pretty much what it is.  The difference from the other
buildchroots is that it is not shared when building multiple initramfs
images, so each one has its own rootfs.

> And the target would be to keep the stuff we need only in the initramfs
> out of the "final" rootfs.

This is a nice side-effect but not the main reason I propose this.  Just
to make it clear: I do _not_ want this to be a replacement for the current
way of building the initramfs from the image rootfs.  This new bbclass is
meant for situations where in-image generation falls short or is otherwise
inappropriate.  A few examples:

 - As already mentioned, it makes dm-verity setups _much_ easier and a lot
   cleaner because it avoids the dependency cycle that other approaches
   have.

 - This could be used to generate a completely custom initramfs-based
   rescue system.

> More inline.
> 
> On Wed, 23 Sep 2020 18:20:44 +0200
> Harald Seiler <hws@denx.de> wrote:
> 
> > Add a new "image" class for generating a custom initramfs.  It works
> > like this: A new minimal debian rootfs is bootstrapped and all
> > dependency packages for the new initramfs are installed.  Then, an
> > initramfs is generated from this rootfs and deployed like usual.
> > 
> > This new initramfs.bbclass "image" class should be pulled in by an
> > "initramfs image" recipe.  Said recipe then specifies all dependencies
> > of the initramfs via INITRAMFS_INSTALL and INITRAMFS_PREINSTALL (which
> > are analogous to the respective IMAGE_* variables).
> > 
> > initramfs.bbclass intentionally does _not_ expose a mechanism to
> > change /etc/initramfs-tools/initramfs.conf and
> > /etc/initramfs-tools/modules. Changes to their settings are better
> > done via packages that deploy conf-hooks to
> > /usr/share/initramfs-tools/conf-hooks.d/ and module fragment files to
> > /usr/share/initramfs-tools/modules.d/.
> > 
> > Signed-off-by: Harald Seiler <hws@denx.de>
> > ---
> > 
> > Notes:
> >     I had this idea while searching for a way to build an initramfs that
> >     uses dm-verity to assert integrity of the rootfs.  To me, this feels
> >     like a much cleaner solution than anything else I tried and I'm happy to
> >     report that, using this approach, I got everything working nicely in the
> >     original project.
> >
> >     In my opinion, this design has a number of advantages over the previous
> >     solutions we have seen so far:
> >
> >      - It does not suffer any kind of initramfs pollution, caused by
> >        packages installed into a rootfs.  This is a big problem when trying
> >        to generate an initramfs from e.g. `buildchroot-target` as many
> >        unrelated packages could be installed there which would all get
> >        pulled into the initrd (if they install hooks/scripts).
> 
> I was about to ask why not just use that chroot ... got it.
> 
> >        This also means, with this new approach, the integrator has maximum
> >        control over the contents of the initramfs.
> 
> And maximum responsibility. Using an initrd that was not generated from
> the "final" rootfs probably violates some assumptions or can at least
> be seen as highly unusual. Not sure that is or could become a problem.

The vanilla initramfs from debian does not suffer any problems, as long as
you make sure the correct kernel modules are included.  This should
usually just work(tm) because initramfs.bbclass does the exact same thing
as image.bbclass:

> > +# Install proper kernel
> > +INITRAMFS_INSTALL += "${@ ("linux-image-" + d.getVar("KERNEL_NAME",

With upstream initramfs modules, care has to be taken, of course.  Let's
look at cryptroot from cryptsetup as an example:  This module needs
a correct /etc/crypttab in the initramfs-chroot to properly unlock the
root filesystem.  For any other partitions, crypttab needs to be in the
actual image rootfs.

You could now either have two separate crypttabs, one with just the rootfs
in initramfs-chroot and one for the rest in the actual image.  Or, what
I'd find the cleaner solution: A config-package containing /etc/crypttab
that is deployed to both the image and the initramfs-chroot.

That said, I do not see why one would want the custom initramfs for this
scenario ... The in-image version will work just fine.

> >      - There is no need to change the initramfs generation process in any
> >        way; the debian tooling can be used exactly like it's meant to.
> >
> >      - As most isar-generated images will never regenerate the initramfs
> >        from the running system, all initramfs related packages are dead-weight
> >        to the image.  This is a problem when trying to generate the initramfs
> >        from the actual image rootfs.
> 
> Yes dead-weight, but is it heavy? In terms of disk-space we are
> probably talking about really not much, if you compare to what
> applications will pull in. But I do not know. I guess lvm, mdadm,
> cryptsetup and friends could really add up ... but wic does not have
> that anyways ;)

I don't think it would amount to much.  If you're really short on space,
it could matter (as you said, one could pull any dm tooling out of the
rootfs), but apart from that I don't think it is that relevant.

> >        When it is necessary to rebuild the initramfs in a running system,
> >        the packages designed for this new class could just be installed into
> >        the rootfs, without any changes necessary.  This means, any generic
> >        initramfs module packages can be used both with the in-rootfs mechanism
> >        and initramfs.bbclass.
> >
> >      - Because of this complete isolation and independence, implementation
> >        of complex logic is much easier:  For example dm-verity needs
> >        a root-hash that is only available after the rootfs has been cast into
> >        a filesystem image.  With this new approach, this can be modelled with
> >        a simple task dependency.
> 
> I guess one can pass that hash to the initrd with an argument from the
> kernel command line. So you probably pass it into your wks, or whatever
> imager controls the cmdline.

I don't see this working without creating a dependency cycle somewhere ...
I did try to explore alternative solutions but it all ended up requiring
ugly hacks at some point or another.
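
For comparison, the one-way ordering that a separate initramfs recipe
allows can be sketched roughly as follows.  This is only an assumption
about how such a setup might be wired: `my-verity-image` and
`do_verity_image` are hypothetical names that do not appear in this
series; the `[depends]` varflag itself is standard BitBake.

```
# In the initramfs recipe (hypothetical names throughout)
inherit initramfs

# Hypothetical: generate the initramfs only after the image recipe has
# cast its rootfs into a verity-protected filesystem image, so that the
# root hash already exists at this point.  The image recipe does not
# depend on this recipe, so no cycle is introduced.
do_generate_initramfs[depends] += "my-verity-image:do_verity_image"
```

How the hash is then placed into the initramfs (for example by a hook
script from one of the installed packages) is left open in this sketch.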

> How much does the generation of the initrd impose on the "real" rootfs,
> in terms of "dead" packages? And how polluted can an initrd become when
> generated from a rootfs that "contains too much"? Maybe you have
> numbers for your layer giving a perspective on the gain.

The main motivation for me was a clean dm-verity setup so I do not have
numbers on this aspect.  I don't expect it to be much though.

> Generating the initramfs not from the rootfs means that the manifest
> will not mention all packages shipped in the image, which could cause
> legal issues when overlooked. Adding the manifests might result in
> too much clearing effort, because not all that is in the
> initramfs-buildchroot will be in the initramfs.

Well, you won't install anything into the initramfs-chroot that won't, in
some form or another, leave a trace in the initramfs, so I'd say merging
the manifests does make sense here.

Patch

diff --git a/meta/classes/initramfs.bbclass b/meta/classes/initramfs.bbclass
new file mode 100644
index 000000000000..8af9b4b379a5
--- /dev/null
+++ b/meta/classes/initramfs.bbclass
@@ -0,0 +1,41 @@ 
+# This software is a part of ISAR.
+
+# Make workdir and stamps machine-specific without changing common PN target
+WORKDIR = "${TMPDIR}/work/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/${PV}-${PR}"
+STAMP = "${STAMPS_DIR}/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/${PV}-${PR}"
+STAMPCLEAN = "${STAMPS_DIR}/${DISTRO}-${DISTRO_ARCH}/${PN}-${MACHINE}/*-*"
+
+INITRAMFS_INSTALL ?= ""
+INITRAMFS_PREINSTALL ?= ""
+INITRAMFS_ROOTFS ?= "${WORKDIR}/rootfs"
+INITRAMFS_IMAGE_FILE = "${DEPLOY_DIR_IMAGE}/${INITRAMFS_FULLNAME}.initrd.img"
+
+# Install proper kernel
+INITRAMFS_INSTALL += "${@ ("linux-image-" + d.getVar("KERNEL_NAME", True)) if d.getVar("KERNEL_NAME", True) else ""}"
+
+# Name of the initramfs including distro&machine names
+INITRAMFS_FULLNAME = "${PN}-${DISTRO}-${MACHINE}"
+
+DEPENDS += "${INITRAMFS_INSTALL}"
+
+ROOTFSDIR = "${INITRAMFS_ROOTFS}"
+ROOTFS_FEATURES = ""
+ROOTFS_PACKAGES = "initramfs-tools ${INITRAMFS_PREINSTALL} ${INITRAMFS_INSTALL}"
+
+inherit rootfs
+
+do_generate_initramfs() {
+    rootfs_do_mounts
+    rootfs_do_qemu
+
+    sudo -E chroot "${INITRAMFS_ROOTFS}" \
+        update-initramfs -u -v
+
+    if [ ! -e "${INITRAMFS_ROOTFS}/initrd.img" ]; then
+        die "No initramfs was found after generation!"
+    fi
+
+    rm -rf "${INITRAMFS_IMAGE_FILE}"
+    cp "${INITRAMFS_ROOTFS}/initrd.img" "${INITRAMFS_IMAGE_FILE}"
+}
+addtask generate_initramfs after do_rootfs before do_build