python3.8.core (libthr)
Description
Problem/Justification
Impact
Activity

Caleb March 11, 2021 at 1:20 AM
Thanks for opening a ticket and providing the information. I've created an umbrella ticket, https://jira.ixsystems.com/browse/NAS-109709, where I'm tracking this problem, so I'll be closing this ticket as a duplicate. I have found a threading regression in python3.8, which is what TrueNAS 12 currently ships with. 12.0-U3 will have a fix for this regression; however, I'm not confident it will fix the core dump. My hope is that it will, but it's hard to tell since I was unable to reproduce the core dump.

Caleb March 2, 2021 at 1:07 PM
I've responded to your ticket explaining what I've found in your core dumps.
Okay, thanks. If it happens again, let me know and I'll send instructions on how to install the debug version of python.

George March 1, 2021 at 10:31 PM
I've only seen the one python dump so far. If it happens again, I'd be happy to turn on debugging as you suggest.
Previously, when I've had unexpected reboots, I've looked at logs afterwards and not been able to find any reason for the reboot: for example, dmesg just suddenly has the boot info with nothing strange preceding it.

Matthew Hoare March 1, 2021 at 9:32 PM
Could you find anything useful in the files I posted in https://jira.ixsystems.com/browse/NAS-109426?
I've only had this python dump once, after upgrading and letting the system run for a few days. Since then it has not occurred again. Could it be related to the upgrade process?

Caleb March 1, 2021 at 4:12 PM
Hi, thank you for the core dump files. I've taken a look, and your core dump is particularly interesting. It seems that Python's stack is being corrupted, which I have not seen before. Unfortunately, because python3.8 is built with no debug symbols and is stripped, the thread that crashed and caused the core dump is worthless: I cannot glean any useful information from it. I have a couple of questions for you, if you don't mind answering.
1. Are you seeing python core dump routinely?
2. If it is dumping routinely, do you by chance know if something is triggering it? (e.g. you started a replication, a VM gets rebooted every night, a cloud task kicks off, etc.)
If python is routinely core dumping for you, it would be invaluable to get the non-stripped, with-debug version of python installed on your system. We could do this in a new boot environment relatively easily and it would allow us to "rollback" to the previous boot environment getting you back to a default state.
Thanks!
After I was unable to log in to the server (the login screen would not appear, only the page indicating the server was running), it had an unexpected reboot followed by the message above, so I am creating a ticket. As an anecdotal aside, with FreeNAS 11 I had an unexpected reboot at full load maybe once every few months. Since updating to the various versions of TrueNAS 12, I get unexpected reboots weekly. I am very concerned about this apparent change in reliability.