Looking at the pastebin (thanks!), I can see that the pacemaker is attempting to perform a stonith via the MAAS api, but failing in its invocations. The “logd is not running” bit is not the concerning part in the logs, rather the concerning part is the tracebacks following, which indicate that the stonith commands are failing:
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ logd is not runninglogd is not runninglogd is not runninglogd is not runninglogd is not runningTraceback (most recent call last): ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/stonith/plugins/external/maas”, line 380, in ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ sys.exit(map_commands(sys.argv)) ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/stonith/plugins/external/maas”, line 373, in map_commands ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ rc = commandscmd ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/stonith/plugins/external/maas”, line 244, in power_reset ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ machine.power_on() ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/maas/client/utils/maas_async.py”, line 49, in wrapper ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ result = eventloop.run_until_complete(result) ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3.8/asyncio/base_events.py”, line 616, in run_until_complete ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ return future.result() ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/maas/client/viscera/machines.py”, line 713, in power_on ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ self._reset(await self._handler.power_on(**params)) ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/maas/client/bones/init.py”, line 316, in call ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ response = await self.bind(**params).call(**data) ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/maas/client/bones/init.py”, line 461, in dispatch ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ response = await session.request( ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/aiohttp/client.py”, line 504, in _request ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ await resp.start(conn) ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/aiohttp/client_reqrep.py”, line 847, in start ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ message, payload = await self._protocol.read() # type: ignore # noqa ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ File “/usr/lib/python3/dist-packages/aiohttp/streams.py”, line 591, in read ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ await self._waiter ]
Feb 25 12:09:16 juju-de1a68-3-lxd-7 pacemaker-fenced [31093] (log_op_output) notice: fence_legacy_reboot_1[572338] error output [ aiohttp.client_exceptions.ServerDisconnectedError ]
I suspect what is happening is that the commands are going through on the MAAS server side, but the client is unable to interpret the results due to the error. Pacemaker is continually trying to stonith the node in this scenario, which is likely why you see the node going down.
May I ask what version of MAAS you are using in this deployment?
I see a similar issue reported in https://github.com/maas/python-libmaas/issues/251