[Bug 1430074] Re: d-i multipath disk-partition separator overhaul

Mathieu Trudel-Lapierre mathieu.tl at gmail.com
Wed Mar 11 15:10:41 UTC 2015


I'm looking into this now.

** Changed in: debian-installer (Ubuntu)
       Status: New => In Progress

** Changed in: debian-installer (Ubuntu)
   Importance: Undecided => Critical

-- 
You received this bug notification because you are a member of Ubuntu
Sponsors Team, which is subscribed to the bug report.
https://bugs.launchpad.net/bugs/1430074

Title:
  d-i multipath disk-partition separator overhaul

Status in debian-installer package in Ubuntu:
  In Progress
Status in grub-installer package in Ubuntu:
  In Progress
Status in partman-base package in Ubuntu:
  In Progress
Status in partman-multipath package in Ubuntu:
  In Progress

Bug description:
  This bug is an umbrella for a number of bugs in the debian-installer with common 
  root cause: the different disk-partition separator used for multipath partition
  block devices by parted_server (libparted) and some components of the installer.

  The specific problems will be presented shortly. They are summarized in this bug
  rather than reported individually because the solution is common, and involves 3
  source packages; so it's simple to glance over the bug (.. big) picture, problem
  and solution at once, and likewise for reviewing the patches.

  The demonstration and steps to reproduce are contained in an attached text file,
  for being an install session with descriptive steps and comments.  It's somewhat
  long, but very easy to follow.  The solution testing is also attached in a text
  file (and is shorter).


  Overall problem:
  ---------------

  The installer (i.e., its components; e.g., partman-*, grub-installer) makes use
  of the 'parted_server' program (which relies on 'libparted') to handle disk and
  partition devices.  It's used to list disks and their partitions, and to modify
  partitions in the disks (e.g., create, resize, flag, delete).

  Whenever parted_server (libparted) produces a list of partitions in a multipath
  disk (for consumption by installer components), or creates such  (i.e., a block
  device in /dev/mapper for a partition in a multipath disk), the following disk-
  parition separator is used: 'p'. This is hard-coded in libparted (arch/linux.c).

  For example: /dev/mapper/mpath0p1

  The problems arise on some installer components which have code for handling an
  installation on multipath disks/partitions (e.g., partition the multipath disks,
  format the partitions in multipath disks, install the bootloader in one of them).

  Those components assume the multipath disk-partition separator is '-part'. This
  is hard-coded in them.

  For example: /dev/mapper/mpath0-part1


  Overall solution:
  ----------------

  After the evaluation of a few solution alternatives, this one looked
  like right.

  It's short, simple, and non-invasive -- all changes are delivered in multipath-
  specific functions or conditional blocks, except for 2 ppc64el-specific changes,
  that actually are required to improve grub-installer for multipath on ppc64el).

  Fundamentally, stick to 'p' as multipath disk-partition separator.

  Reasons:
  - First of all: no impact nor change to non-multipath stuff.
  - Second: multipath partitions listed by parted_server exist in /dev/mapper,
            with the listed block device name; consistently:
            - including those created/modified by parted_server, 
            - and those existing/non-modified -- untouched by parted_server.


  Specific problems:
  -----------------

  In order of appearance (in the installer). See the attached file with the steps
  to reproduce/demonstration.

  P.S.: some problems reported are not related to the disk-partition separator,
        but are minor errors related to the multipath support in the installer
        (still contained in multipath-specific functions or conditional blocks)
        and have fixes provided in the best interest of the debian-installer.

  
  Problem #1:  The underlying devices of multipath devices are listed in the
  ----------   partitioning dialogs.
              
  The underlying devices cannot be accessed, because the device-mapper locks them
  with exclusive access.

  There's already code to do this, but it's broken due to a wrong grep expression,
  that checks whether a device is an underlying device of a multipath devices.

  Fixed in partman-base:
  	- init.d/parted: part_of_multipath():
  	  Update grep expression for more recent output of 'multipath -l'.

  
  Problem #2:  Human-description of a multipath device is not a single device string;
  ----------   it actually contains multiple strings from the same device name/WWID.

  Some multipath devices' WWID have more than only hexadecimal characters; this
  breaks a sed substitution expression, thus leaving spaces in its output, which
  are not correctly handled.

  Fixed in partman-base:
  	- lib/base.sh: humandev():
  	  Accept spaces/non-hex-only characters in multipath WWID.

  
  Problem #3:  Non-existing LVM VGs/LVs shown for existing multipath partitions
  ----------

  If multipath partitions are not correctly identified (because of the multipath
  disk-partition separator), they end up considered LVM VGs/LVs in the dialog for
  listing/choosing partitions.

  Fixed in partman-base:
  	- lib/base.sh: is_multipath_part():
  	  Use 'p' (not '-part') as multipath disk-partition separator.

  
  Problem #4:  Failure when formatting existing multipath partitions (i.e.,
  ----------   not created/modified by the partitioner via parted_server)

  If multipath partitions are not created/modified with the partitioner, they are
  not created in /dev/mapper by parted_server/libparted  (note: this would create
  them with the 'p' disk-partition separator).

  In that case, the multipath partitions end up created in /dev/mapper by kpartx
  with the '-part' disk-partition separator in a commit.d/ hook.

  Afterwards, in the format.d/ hooks, parted_server lists the partitions in a
  multipath disk with the 'p' disk-partition separator, which leads to an error,
  because (as described) /dev/mapper/mpathX*p*Y doesn't exist, but mpathX*-part*Y.

  Fixed in partman-multipath:
  	- commit.d/partition_multipath:
  	  Use 'p' (not '-part') as multipath disk-partition separator.
  	  (during d-i.)

  
  Problem #5:  Incorrect multipath partition device names in target's /etc/fstab
  ----------

  In the finish.d/ hooks, the device names that are inserted in /target/etc/fstab
  come from partition listings by parted_server -- therefore, with the 'p' disk-
  partition separator.

  This is wrong for the target system, because it has kpartx udev rules shipped
  with the multipath-tools package (and carried to initramfs by multipath-tools-boot),
  and the kpartx udev rules specify the '-part' disk-partition separator.

  So, this is one place where we actually have to change from 'p' to
  '-part'.

  It is done immediately after the hook/script that inserts /dev/mapper devices
  in the /target/etc/fstab file, using a sed expression.

  Fixed in partman-multipath:
  	- (new file) finish.d/fstab_hd_entries_multipath:
  	  Use '-part' (not 'p') as multipath disk-partition separator.
  	  (for the target.)

  
  Problem #6:  PReP partition-flag is lost on multipath disk every 2 changes to
  ----------   its partition table

  (this should be automatically fixed with problem #1.)

  When a multipath disk is first partitioned, and a multipath partition is defined
  as PReP boot partition, the partition has the flag 'prep' set.

  The flag propagates to the underlying device, obviously (multipath is just multi-
  ple paths to the a device).

  The next time that disk is partitioned, that occurs normally for the multipath
  disk; however, the partitioner notices that the underlying device now has a 
  partition with the 'prep' flag set, and because it was not requested on the
  underlying device (only on the multipath disk), the partitioner resets the flag.

  The flag reset in the underlying device also propagates to the
  multipath device.

  Now there's no partition w/ the PReP flag anymore.

  This is a problem for grub-installer on ppc64el, because it currently uses the
  'prep-bootdev' program, which finds a PReP partition using that flag/libparted.

  Fixed in problem #1.

  
  Problem #7:  GRUB is installed on an underlying device, not the multipath disk.
  ----------

  The grub-installer script fails to identify the partition with boot filesystem
  as a multipath partition because it expects the '-part' disk-partition separator.

  Therefore, an underlying disk/partition is selected.  Although it is working on
  the test scenario, it is certainly not the right approach with multipath, and
  should be fixed.

  This is fixed differently for non-ppc64el and ppc64el architectures, because
  ppc64el currently uses the 'prep-bootdev' program to find a PReP partition.

  Fixed in grub-installer for non-ppc64el:
  	- grub-installer:
  	  Use 'p' (not '-part') as multipath disk-partition separator.

  Fixed in grub-installer for ppc64el:
  	- grub-installer:
  	  [ppc64el] bootdev/wipe_bootdev: prefer PReP partition in the
  	  same disk as the boot file system partition ($disc_offered).
  	- prep-bootdev:
  	  add '-l' option to list all PReP partitions.


  
  Steps to Reproduce
  ------------------

  See attachment (reproducing_mpathsep.txt)

  
  Solution
  --------

  See attachments:
  - partman-base_mpathsep.debdiff
  - partman-multipath_mpathsep.debdiff
  - grub-installer_mpathsep.debdiff

  
  Solution Testing (ppc64el & amd64)
  ----------------------------------

  See attachment (testing_mpathsep.txt)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/debian-installer/+bug/1430074/+subscriptions



More information about the Ubuntu-sponsors mailing list