Help with diagnosing an OS freeze

If none of the specific sub-forums seem right for your thread, ask here.
Post Reply
Message
Author
scarm
Posts: 31
Joined: 2023-06-14 20:05
Has thanked: 1 time
Been thanked: 2 times

Help with diagnosing an OS freeze

#1 Post by scarm »

Hi all,

I am recently experiencing an issue with Debian 12. The system occasionally freezes completely (nothing is responsive, audio stuck in a loop of the last 2 seconds and SysRq does not work) so I am forced to do a hard reset. This happens quite infrequently (maybe every 2-3 weeks) at seemingly random times.
Long story short...the most difficult scenario one could possibly imagine to diagnose.
Now, I do not expect anyone to tell me what is wrong with my system, considering it could be anything from software to harware to configuration to BIOS (I use coreboot) to power issues and so on.

What I hope is that somebody could help me to set up Debian so that when this happens again I can gather as many logs as possible to at least get some clues where to start troubleshooting. Of couse, I already went through the usual /var/log options and journalctl but could not find anything useful. No logs for the time just before the crash event.

Are there any tools that can collect more logs or memory dumps before a crash?
I have read a bit about kdump and kexec, but I am having a hard time understanding how to properly set them up.
Does the capture of kernel memory even work if I have to hard reset the system and cannot reboot?

My system specs are: AMD A8-6500 APU, A88XM-E motherboard, coreboot and 8GB of DDR3 RAM.
I checked the RAM for errors and PSU for voltage outputs, everything seems fine.

Aki
Global Moderator
Global Moderator
Posts: 4036
Joined: 2014-07-20 18:12
Location: Europe
Has thanked: 112 times
Been thanked: 532 times

Re: Help with diagnosing an OS freeze

#2 Post by Aki »

⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Debian - The universal operating system
⢿⡄⠘⠷⠚⠋⠀ https://www.debian.org
⠈⠳⣄⠀

scarm
Posts: 31
Joined: 2023-06-14 20:05
Has thanked: 1 time
Been thanked: 2 times

Re: Help with diagnosing an OS freeze

#3 Post by scarm »

OK, I installed kdump-tools and used this command

Code: Select all

sudo grep USE_KDUMP /etc/default/kdump-tools
to check and I have USE_KDUMP=1.

With

Code: Select all

sudo grep LOAD_KEXEC /etc/default/kexec
I get LOAD_KEXEC=true

I tried to reboot as recommeded to enable the crash kernel parameter, however the system hanged on reboot (is this normal?).
After doing a hard reset, I restarted the system and checked with

Code: Select all

sudo kdump-config show
I get "current state: ready to dump".

So I decided to test with

Code: Select all

echo c > /proc/sysrq-trigger
This command triggers exactly the freeze I am trying to diagnose (which I now assume is caused by a kernel panic). Nothing responds, so I have to hard reset again.
After restarting the system I check my /var/crash and there is no kernel dump there.

Am I doing something wrong or did I miss something?

lindi
Debian Developer
Debian Developer
Posts: 594
Joined: 2022-07-12 14:10
Has thanked: 2 times
Been thanked: 117 times

Re: Help with diagnosing an OS freeze

#4 Post by lindi »

You could try netconsole but I would seriously just consider buying new hardware.

Aki
Global Moderator
Global Moderator
Posts: 4036
Joined: 2014-07-20 18:12
Location: Europe
Has thanked: 112 times
Been thanked: 532 times

Re: Help with diagnosing an OS freeze

#5 Post by Aki »

Hello,
scarm wrote: 2024-09-20 21:32 Am I doing something wrong or did I miss something?
See here: Is the kdump.service enabled ? See:
⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Debian - The universal operating system
⢿⡄⠘⠷⠚⠋⠀ https://www.debian.org
⠈⠳⣄⠀

scarm
Posts: 31
Joined: 2023-06-14 20:05
Has thanked: 1 time
Been thanked: 2 times

Re: Help with diagnosing an OS freeze

#6 Post by scarm »

Yes, that was it! I managed to make it work by enabling kdump-tools.service in Debian (in Red Hat is kdump.service).
After that I tested again with

Code: Select all

echo c > /proc/sysrq-trigger

The system rebooted to the dump kernel and I found the dump files in /var/crash.

Hope I can learn something more when the crash happens again.

Aki
Global Moderator
Global Moderator
Posts: 4036
Joined: 2014-07-20 18:12
Location: Europe
Has thanked: 112 times
Been thanked: 532 times

Re: Help with diagnosing an OS freeze

#7 Post by Aki »

Hello @scarm,

Have you made any progress ?
⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Debian - The universal operating system
⢿⡄⠘⠷⠚⠋⠀ https://www.debian.org
⠈⠳⣄⠀

scarm
Posts: 31
Joined: 2023-06-14 20:05
Has thanked: 1 time
Been thanked: 2 times

Re: Help with diagnosing an OS freeze

#8 Post by scarm »

A couple of days ago the freeze event happened again.
However, the system did not reboot. I suppose it gets completely frozen and the only thing I could do was a hard reset with the power button.
Unfortunately, no useful logs to help going forward.
I think it might not be a software issue, but probably hardware related.

arzgi
Posts: 1587
Joined: 2008-02-21 17:03
Location: Finland
Been thanked: 81 times

Re: Help with diagnosing an OS freeze

#9 Post by arzgi »

You could try some live-CD, like Debian or Knoppix to make sure. Might be good to track the temperatures also any link those (don't know if already mentioned, red just the thread, not the links)

Post Reply