Initial MDRAID support #277

Harvie · 2024-12-20T08:38:33Z

This allows to create level 1 MDRAID with 1 device. Can be empty or prepopulated with data image.
Includes unit test using mdadm to check generated image.

Eg.:

image mdraid-md.img {
  mdraid {
    level = 1
  }
  partition data {
    image = "mdraid-ext4.img"
  }
}

It might sound stupid to create single device raid, but it actualy fits my reallife usecase where i pre-generate such raid when making image and user then can very easily add more devices during runtime when needed later. For example like this:

mdadm --grow /dev/md23 --raid-disks=2 --force
maddm /dev/md23 --add /dev/sdb1

Although it should be simple to pre-generate images for all raid members in genimage. It shouldn't be much harder than generating multiple images with the same RAID UUID and few metadata changes. But i haven't researched that so far.

Also see #191

Harvie · 2024-12-20T09:00:18Z

I have no idea why the test fails here. It says file mdraid.config is missing, but it's obviously there.
Also mdraid tests are passing at my PC when i do make check-TESTS:

michaelolbrich · 2024-12-20T11:08:38Z

https://github.com/pengutronix/genimage/actions/runs/12428468239/job/34700133730?pr=277#step:6:1000

The test run make distcheck which runs the test from the generated tarball. You forgot to add the test config to EXTRA_DIST so it's missing.

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie · 2024-12-20T11:45:14Z

Seems to be fixed now.

image-mdraid.c

test/misc.test

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie · 2024-12-21T12:05:46Z

Happy holidays.
@michaelolbrich I think i've resolved most of your objections, but i think i could really use your guidance about the multi-image thing. While i am not gonna implement generating multi device raid now in the first mdraid PR, i think it's bound to happen in the future and i would like to prepare the codebase for it, so we don't have to change config syntax in the future. What do you think would be the correct way of doing that?

It would kinda make sense for the image-mdraid.c to actualy generate two (or more) images, all being part of that RAID1 mirror. But on the other hand i don't think this would allow genimage to resolve dependencies correctly. Maybe i should manualy create other image structs and add them to list in mdraid_setup() ? would that be enough for image-hd to find them and trigger the build of all of them when needed? what would you suggest?

Harvie · 2024-12-21T12:24:04Z

One approach might be specifing all the output images of that array in single mdraid image node.

But maybe i should just do something like this instead:

image raid1-a.img {
  mdraid {
    level = 1
    devices = 2
  }
  partition data {
    image = "mdraid-ext4.img"
  }
}

image raid1-b.img {
  mdraid {
    master = raid1-a.img    #most of the config is gonna be inherited from master
    position = 2      #this is 2nd device in the array described by master image
  }
}

and then B image can lookup all the configuration from A image config node... but still this does not guarantee that B will be only generated after A. (UUIDs and other details of the A image need to be decided before generating B).

Is there way to enforce dependency?

Harvie · 2024-12-22T08:42:15Z

OK, i did binary diff of superblocks of two disks belonging to same RAID1 and highlighted the important parts:

Some things i am confused about:

Dev numbers 0 and 2 (why skipped 1? maybe the first disk should be 1, not 0?)
Why is there 3rd role with 0x0100 ???? Maybe the system is really confused about device ids starting at 0

Update: oh, the active role is simply number of the disk in the array (not sure why, because we already have DEV_NUMBER)

[harvie@anemophobia mdadm]$ grep DISK_ROLE md_p.h 
#define MD_DISK_ROLE_SPARE	0xffff
#define MD_DISK_ROLE_FAULTY	0xfffe
#define MD_DISK_ROLE_JOURNAL	0xfffd
#define MD_DISK_ROLE_MAX	0xff00 /* max value of regular disk role */

UPDATE: i did some research on this and it seems there are no rules on how the system should number the devices and roles down the road and that 0xFFFF roles are just OK to be ignored when there are no such disks. Therefore i no longer worry about this as long as the image we generate makes sense.

…single array (UUID needs to be specified manualy for now) Signed-off-by: Tomas Mudrunka <[email protected]>

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie · 2024-12-26T18:02:24Z

I am now able to create two partitions belonging to single raid like this:

image mdraid-hd.img {

  hdimage {
    partition-table-type = "gpt"
  }

  partition mdraid-a {
    image = "mdraid-a.img"
    partition-type-uuid = R
  }

  partition mdraid-b {
    image = "mdraid-b.img"
    partition-type-uuid = R
  }

}

image mdraid-a.img {
	mdraid {
		devices = 2
		role = 0
		timestamp = 638022222
		raid-uuid = "de9980f1-0449-4e83-84bd-98e4b1ca3fe3"
		image = "mdraid-ext4.img"
	}
}

image mdraid-b.img {
	mdraid {
		devices = 2
		role = 1
		timestamp = 638022222
		raid-uuid = "de9980f1-0449-4e83-84bd-98e4b1ca3fe3"
		image = "mdraid-ext4.img"
	}
}

image mdraid-ext4.img {
  ext4 {
    label = "TEST_FS"
  }
  size = 5M
}

When i do losetup -fP mdraid-hd.img the kernel automaticaly recognizes both partitions as members of the array and assembles it automaticaly, while reporting the filesystem in the raid to be clean:

# cat /proc/mdstat 
Personalities : [raid1] 
md127 : active raid1 loop2p2[1] loop2p1[0]
      5120 blocks super 1.2 [2/2] [UU]
      bitmap: 0/1 pages [0KB], 65536KB chunk
      
unused devices: <none>

# LANG=C fsck.ext4 /dev/md127 
e2fsck 1.47.1 (20-May-2024)
TEST_FS: clean, 12/1280 files, 1434/5120 blocks

…d is to decide how to size bitmaps Signed-off-by: Tomas Mudrunka <[email protected]>

Signed-off-by: Tomas Mudrunka <[email protected]>

…e sure all member disks refer to the same array Signed-off-by: Tomas Mudrunka <[email protected]>

Signed-off-by: Tomas Mudrunka <[email protected]>

…nent to others Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie · 2024-12-30T15:03:05Z

Great news. With last version the required metadata fields are automaticaly exchanged and synced between components. Therefore there is no longer need to manualy set indentical array UUID, timestamp and data to all images.

image mdraid-hd.img {

  hdimage {
    partition-table-type = "gpt"
  }

  partition mdraid-a {
    image = "mdraid-a.img"
    partition-type-uuid = R
  }

  partition mdraid-b {
    image = "mdraid-b.img"
    partition-type-uuid = R
  }

}

image mdraid-a.img {
	mdraid {
		level = 1
		devices = 2
		image = "mdraid-ext4.img"
	}
}

image mdraid-b.img {
	mdraid {
		parent = "mdraid-a.img"
	}
}

image mdraid-ext4.img {
  ext4 {
    label = "TEST_FS"
  }
  size = 5M
}

You only need to specify array format for one of the images and other images can refer to it by setting parent = "file.img" config option. Roles numbers can be assigned automaticaly for all inheritant images, when omitted.

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie · 2024-12-30T20:29:13Z

I think i am ready here. I've implemented all the features i can wish for. Added tests, documented everything. And also added documentation for f2fs, which is something i kinda owed since my f2fs PR was merged 👼

Signed-off-by: Tomas Mudrunka <[email protected]>

…"parent" Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch 3 times, most recently from e3f0cc8 to 3d315c9 Compare December 20, 2024 08:56

Initial MDRAID support pengutronix#191

3f7ec9f

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 3d315c9 to 3f7ec9f Compare December 20, 2024 11:22

michaelolbrich requested changes Dec 20, 2024

View reviewed changes

Harvie force-pushed the master branch 3 times, most recently from 0f935d6 to 02d0983 Compare December 20, 2024 13:29

MDRAID: Reflected first part of suggestions made by @michaelolbrich

2209801

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 02d0983 to 2209801 Compare December 20, 2024 13:33

Harvie added 2 commits December 21, 2024 01:12

Configurable timestamp and uuids for repeatable builds and testing

e807fa6

Signed-off-by: Tomas Mudrunka <[email protected]>

MDRAID: Cleanup of size and alignment handling

237d917

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 6f8e73a to 237d917 Compare December 21, 2024 00:28

Harvie force-pushed the master branch from 056717b to 0197ea3 Compare December 26, 2024 16:56

Harvie added 2 commits December 26, 2024 18:47

MDRAID initial support for creating multiple images that are part of …

38ff037

…single array (UUID needs to be specified manualy for now) Signed-off-by: Tomas Mudrunka <[email protected]>

MDRAID Use intermediate file for mdadm testing

23b009a

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 6991d55 to 23b009a Compare December 26, 2024 17:48

Harvie requested a review from michaelolbrich December 26, 2024 17:50

MDRAID: Prepared most of the code to create bitmaps, only thing neede…

a423b56

…d is to decide how to size bitmaps Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch 2 times, most recently from 8573a5f to e4c5464 Compare December 29, 2024 13:18

MDRAID: Finished proper bitmap generation

7857b56

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from e4c5464 to 7857b56 Compare December 29, 2024 14:21

Keep the timestamp consistent across whole genimage invocation to mak…

1eef6ed

…e sure all member disks refer to the same array Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 79abd4a to 1eef6ed Compare December 29, 2024 14:37

Harvie added 2 commits December 29, 2024 17:15

MDRAID: Removed need to specify data as partition in config

09806bf

Signed-off-by: Tomas Mudrunka <[email protected]>

MDRAID: Minor fixes

f6bdf28

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 7c9af1c to 6b07e1e Compare December 30, 2024 13:41

MDRAID Infrastructure for inheriting metadata from master array compo…

5d4a6d1

…nent to others Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from 6b07e1e to 5d4a6d1 Compare December 30, 2024 15:00

srandom is already called in main

fd6215f

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from ff8f6a5 to fd6215f Compare December 30, 2024 15:50

MDRAID: test case for inheritance

43a3588

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch 3 times, most recently from f0488ac to cbf50b0 Compare December 30, 2024 20:16

Harvie force-pushed the master branch from cbf50b0 to 6f40ffd Compare December 30, 2024 20:42

Added documentation for mdraid and f2fs images

1bca566

Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch 5 times, most recently from 2d377c0 to e2662a1 Compare December 31, 2024 02:20

MDRAID: refactored to use parse() and setup(), rendamed "inherit" to …

d3202a2

…"parent" Signed-off-by: Tomas Mudrunka <[email protected]>

Harvie force-pushed the master branch from e2662a1 to d3202a2 Compare December 31, 2024 02:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial MDRAID support #277

Initial MDRAID support #277

Harvie commented Dec 20, 2024 •

edited

Loading

Harvie commented Dec 20, 2024 •

edited

Loading

michaelolbrich commented Dec 20, 2024

Harvie commented Dec 20, 2024

Harvie commented Dec 21, 2024

Harvie commented Dec 21, 2024 •

edited

Loading

Harvie commented Dec 22, 2024 •

edited

Loading

Harvie commented Dec 26, 2024 •

edited

Loading

Harvie commented Dec 30, 2024 •

edited

Loading

Harvie commented Dec 30, 2024 •

edited

Loading

Initial MDRAID support #277

Are you sure you want to change the base?

Initial MDRAID support #277

Conversation

Harvie commented Dec 20, 2024 • edited Loading

Harvie commented Dec 20, 2024 • edited Loading

michaelolbrich commented Dec 20, 2024

Harvie commented Dec 20, 2024

Harvie commented Dec 21, 2024

Harvie commented Dec 21, 2024 • edited Loading

Harvie commented Dec 22, 2024 • edited Loading

Harvie commented Dec 26, 2024 • edited Loading

Harvie commented Dec 30, 2024 • edited Loading

Harvie commented Dec 30, 2024 • edited Loading

Harvie commented Dec 20, 2024 •

edited

Loading

Harvie commented Dec 20, 2024 •

edited

Loading

Harvie commented Dec 21, 2024 •

edited

Loading

Harvie commented Dec 22, 2024 •

edited

Loading

Harvie commented Dec 26, 2024 •

edited

Loading

Harvie commented Dec 30, 2024 •

edited

Loading

Harvie commented Dec 30, 2024 •

edited

Loading