VM libvirtError - internal error: client socket is closed

Description

BRIEF:

Cannot boot VM after updating TrueNAS 12.0-RELEASE to TrueNAS 12.0-U2.

STEPS AND RESULTS:

TrueNAS 12.0-RELEASE with RancherOS VM (with RAW AHCI disk installed originally with FreeNAS 11.3)
Update to TrueNAS 12.0-U2.

ACTUAL RESULT:

VM is not auto-booted
Cannot manually start VM using UI: libvirtError
Can manually create a new VM with the same disk, but this results in the same error: libvirtError
Cannot remove the copy of the VM in the UI: libvirtError

SCREENSHOT / FAULT DETAILS:

libvirtError

internal error: client socket is closed

ADDITIONAL INFO:

Error: Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/middlewared/main.py", line 137, in call_method
    result = await self.middleware._call(message['method'], serviceobj, methodobj, params, app=self,
  File "/usr/local/lib/python3.8/site-packages/middlewared/main.py", line 1206, in _call
    return await self.run_in_executor(prepared_call.executor, methodobj, *prepared_call.args)
  File "/usr/local/lib/python3.8/site-packages/middlewared/main.py", line 1110, in run_in_executor
    return await loop.run_in_executor(pool, functools.partial(method, *args, **kwargs))
  File "/usr/local/lib/python3.8/concurrent/futures/thread.py", line 57, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/usr/local/lib/python3.8/site-packages/middlewared/schema.py", line 977, in nf
    return f(*args, **kwargs)
  File "/usr/local/lib/python3.8/site-packages/middlewared/plugins/vm.py", line 1595, in start
    self.vms[vm['name']].start(vm_data=vm)
  File "/usr/local/lib/python3.8/site-packages/middlewared/plugins/vm.py", line 166, in start
    if self.domain.isActive():
  File "/usr/local/lib/python3.8/site-packages/libvirt.py", line 1566, in isActive
    if ret == -1: raise libvirtError ('virDomainIsActive() failed', dom=self)
libvirt.libvirtError: internal error: client socket is closed
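
The traceback suggests that a cached libvirt handle was used after its client connection had been closed. As an illustration only (this is not the middleware's code, nor the fix that resolved this ticket), the sketch below shows the general pattern of checking a connection for liveness and reopening it before use. `ReconnectingClient`, `FakeConn`, and `fake_connect` are hypothetical names; the fake connection stands in for libvirt so the example runs without it. With libvirt-python, the factory could plausibly be `lambda: libvirt.open("bhyve:///system")` (the URI is an assumption), and `virConnect.isAlive()` would serve as the liveness check.

```python
class ReconnectingClient:
    """Hypothetical helper: re-open a dropped connection before use."""

    def __init__(self, connect):
        # `connect` is any zero-argument callable that returns a connection
        # object exposing isAlive(). With libvirt-python this could be
        # (assumption): lambda: libvirt.open("bhyve:///system")
        self._connect = connect
        self._conn = None

    def connection(self):
        # Open lazily, and open a new connection if the old one is dead.
        if self._conn is None or not self._conn.isAlive():
            self._conn = self._connect()
        return self._conn


class FakeConn:
    """Stand-in for a libvirt connection so the sketch runs without libvirt."""

    def __init__(self):
        self.alive = True

    def isAlive(self):
        return self.alive


opened = []  # records every connection the fake factory hands out


def fake_connect():
    conn = FakeConn()
    opened.append(conn)
    return conn


client = ReconnectingClient(fake_connect)
first = client.connection()
first.alive = False           # simulate "client socket is closed"
second = client.connection()  # a fresh connection is opened instead
```

In a real setup, domain objects looked up through the old connection would also be stale and would need to be looked up again through the new one, and the liveness check itself may raise, so it would need its own error handling.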

Problem/Justification

None

Impact

None

Activity

Jacob McDonald December 22, 2021 at 8:30 PM

Be careful. After cloning that zvol, I later ran into a livelist bug that caused BSD to kernel panic on boot and later, with the Linux kernel, caused boots to take 15-20 minutes (but at least it didn't kernel panic).

I ended up deleting all zvols and recreating them from scratch. Even still, I have livelist debug output occasionally. It doesn't seem to be causing me any harm at the moment, but it still makes me uneasy. I would not enjoy rebuilding these 10 disks from backup.

Francesco Virgillito December 20, 2021 at 10:15 PM

I also had the same problem with my VM. Did you manage to remove the zpool of the VM?

Jacob McDonald May 24, 2021 at 6:34 PM

All of a sudden both of my VMs are in this same state. One of them had been running ever since the system booted (uptime of 21 days). At some point in the last few days, that VM has entered a stopped state and would not start, showing this same error. The other VM shows the same error.

There have been no changes to my system whatsoever, including pools, datasets, snapshots, etc.

I was able to work around this bug by simply cloning the VM.

RL March 25, 2021 at 8:11 AM
Edited

One additional question: the option to change the RancherOS password in the raw device seems to have been removed, and my 'rancher' user password seems to have been reset to the default password. Is RancherOS completely deprecated now?

EDIT: Oh, never mind, the password is in the GRUB config file. However, it's probably a good idea to upgrade this VM anyway.

RL March 25, 2021 at 8:05 AM

Yep, that solved it! Many thanks for your help. It seems to have been a user error after all. By the way, my '.bhyve_containers' folder is actually a dataset now, just as on my other machine, and this works without any trouble.

Complete

Created February 10, 2021 at 9:11 AM
Updated July 1, 2022 at 5:13 PM
Resolved March 9, 2021 at 2:54 PM