Can't get ceph-osd to work properly

I’m deploying OpenStack Yoga using Charms to 4 identical servers. I have completed all of the steps to get the pieces installed and running, but ceph-osd is still blocked with ‘Non-pristine devices detected, consult list-disks, zap-disk and blacklist-* actions.’

I can see that the disks exist, using ‘juju ssh ceph-osd/0’ and ‘fdisk -l’, and I’ve run the zap-disk action, e.g. ‘juju run-action --wait ceph-osd/0 zap-disk devices=/dev/sdb i-really-mean-it=true’, against both sdb and sdc on all 4 nodes.
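
For reference, the zapping pass looked roughly like the loop below (a sketch only; it assumes the same device names on every unit and that the zap-disk action accepts a space-separated devices list):

# hypothetical sweep over all four units; adjust unit and device names to match your deployment
for unit in ceph-osd/0 ceph-osd/1 ceph-osd/2 ceph-osd/3; do
  juju run-action --wait "$unit" zap-disk devices='/dev/sdb /dev/sdc' i-really-mean-it=true
done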

Output of list-disks from one of the nodes is below.

Any ideas?

$ juju run-action --wait ceph-osd/0 list-disks
unit-ceph-osd-0:
  UnitId: ceph-osd/0
  id: "66"
  results:
    Stderr: |2
        Failed to find physical volume "/dev/sdc".
        Cannot use /dev/sda: device is partitioned
        Cannot use /dev/sdd: device is too small (pv_min_size)
        Failed to find physical volume "/dev/sdb".
    blacklist: '[''/dev/sdd'', ''/dev/sda'']'
    disks: '[''/dev/sdc'', ''/dev/sda'', ''/dev/sdd'', ''/dev/sdb'']'
    non-pristine: '[''/dev/sda'', ''/dev/sdd'']'
  status: completed
  timing:
    completed: 2022-09-26 18:11:17 +0000 UTC
    enqueued: 2022-09-26 18:11:15 +0000 UTC
    started: 2022-09-26 18:11:15 +0000 UTC

Indeed, it appears that you don’t have any usable disks for ceph-osd to run with. You mention zapping /dev/sdb, but it’s listed as non-existent. Are you certain that such a device exists on the machine where that OSD resides?

Yes, I can see them. Here’s fdisk output from ceph-osd/0:

$ sudo fdisk -l
Disk /dev/loop0: 63.22 MiB, 66293760 bytes, 129480 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop1: 102.98 MiB, 107986944 bytes, 210912 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop2: 47.99 MiB, 50323456 bytes, 98288 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop3: 108.64 MiB, 113917952 bytes, 222496 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/sdb: 1.75 TiB, 1920383410176 bytes, 3750748848 sectors
Disk model: VK001920GWSRU   
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes


Disk /dev/sdc: 1.75 TiB, 1920383410176 bytes, 3750748848 sectors
Disk model: VK001920GWSRU   
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes


Disk /dev/sda: 894.25 GiB, 960197124096 bytes, 1875385008 sectors
Disk model: VK000960GWSRT   
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: 04827D77-D6AB-41ED-8F35-02005C4A7A4A

Device       Start        End    Sectors   Size Type
/dev/sda1     2048    1050623    1048576   512M EFI System
/dev/sda2  1050624 1875384974 1874334351 893.8G Linux filesystem

Alright. Can you show us the output of the following commands?

  • pvdisplay
  • lvdisplay
  • vgdisplay

I have the suspicion that this is an LVM-related issue.

All 3 of those commands show nothing.

What does your ‘juju status ceph-osd’ output look like?

Hi Don,

For some reason the charm code thinks the disks that are configured are non-pristine.

One of the main checks that the charm does is that the first 2048 bytes of the disk are zero, i.e. the disk header is blank.

The most common cause of this issue is that the charm tried to set up the disk and failed part-way for some reason, but only after it had written some data to the disk header, such as an LVM header, LUKS header or similar. It’s also possible the disk header wasn’t blank to start with. MAAS zeros out each disk’s header at deploy time, but depending on whether you are using MAAS, and exactly what else was done during the deploy, it’s possible the header was never blank.

I’d use the following command to check each disk for the first 2048 bytes being 0:

xxd -l 2048 /dev/sdb
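
Or, to check both candidate disks on a unit in one go (a small sketch; the device names are assumed from your list-disks output):

# inspect the first 2048 bytes of each candidate OSD device
for d in /dev/sdb /dev/sdc; do
  echo "== $d =="
  sudo xxd -l 2048 "$d"
done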

If they are not zero, first make 100% sure that the disk is not in use, holds no data, and is the disk you think it is; then you can wipe its header with dd. Please be careful not to accidentally do this to the wrong disk/machine :slight_smile:
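
A minimal sketch of the dd wipe, assuming /dev/sdb really is the disk you intend to blank (triple-check the device path before running it):

# zero only the first 2048 bytes that the charm inspects
sudo dd if=/dev/zero of=/dev/sdb bs=2048 count=1 && sync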

Once you’ve cleaned up the disks you’ll need to trigger the charm’s config-changed hook. An easy way is to change the osd-devices config value, e.g. by adding a space on the end, or this may also work:

juju run --application ceph-osd ./hooks/config-changed
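
For example, something along these lines should re-apply the current value with a trailing space and so trigger the hook (just a sketch; check what the inner command prints back before setting it):

# append a space to the existing osd-devices value to force a config-changed event
juju config ceph-osd osd-devices="$(juju config ceph-osd osd-devices) "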

If this happened with a clean MAAS deployment, we may be able to figure out why it failed to set up the disk the first time if you upload a full copy of /var/log/juju/unit-ceph-osd-*.log from one of the affected units. I’d recommend checking the file for any sensitive data or secrets before uploading it publicly; if you’d prefer to keep it private, you could e-mail it to me directly (first.last@canonical.com).

It would also be good to check the output of sudo lsblk and not just fdisk; sometimes a disk is in use by another part of the storage subsystem in a way that shows up in lsblk but not in fdisk, so it’s a good command to check in that regard.
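
Something like this gives a compact view of any filesystem signatures or mounts on each device (the column list is just a suggestion):

sudo lsblk -o NAME,SIZE,TYPE,FSTYPE,MOUNTPOINT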

Regards, Trent

Thanks for the info. I should have mentioned at the top that these are all managed by MAAS. All 4 servers are new, and this is a fresh installation, so there’s no risk of losing any data.

I only tried the xxd command on one of the nodes, but it shows all zeroes for both /dev/sdb and /dev/sdc.

So then I ran the ‘config-changed’ hook as you suggested. And… it worked. Ceph-osd found both drives on all 4 servers, and all services are active/idle now. Whatever was hanging it up may have been cleared by the zap-disk commands that I ran, but I didn’t know about the ‘config-changed’ hook. I had tried ‘juju run-action ceph-osd/0 add-disk osd-devices=/dev/sdb,/dev/sdc’, but that didn’t seem to help.

On to the next step. Based on a previous test a few months ago, it’ll probably all go well until I get to trying to integrate Keystone with our Active Directory.

Thanks again for the help.

I think you are right; the zap-disk commands probably removed whatever was left on the disks from the first failure.

If you’re happy to share /var/log/juju/unit-ceph-osd-*.log and /var/log/syslog from one of the ceph-osd units via e-mail, I’m happy to see if we can figure out why it failed the first time. There may be a configuration change that will prevent it doing that if you try to re-deploy again.

Otherwise, glad it’s working now!

Cheers, Trent

Thanks, Trent. I sent the logs to your email.