Scheduled Maintenance: We are aware of an issue with Google, AOL, and Yahoo services as email providers which are blocking new registrations. We are trying to fix the issue and we have several internal and external support tickets in process to resolve the issue. Please see: viewtopic.php?t=158230

 

 

 

[Solved] 3G of almost exactly the same message.

If none of the specific sub-forums seem right for your thread, ask here.
Post Reply
Message
Author
rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

[Solved] 3G of almost exactly the same message.

#1 Post by rayandrews »

Occasionally my system hangs and when it does, after rebooting I see my /var/log/syslog has expanded to the point where it uses up the entire free space on my disk and, helpfully, contains countless millions of these nearly identical lines:

Code: Select all

Mar 20 20:26:52 d5-12-Debian1 kernel: [17176.391603] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000
Mar 20 20:26:52 d5-12-Debian1 kernel: [17176.391615] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000
Mar 20 20:26:53 d5-12-Debian1 kernel: [17176.443010] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000
Mar 20 20:26:53 d5-12-Debian1 kernel: [17176.443031] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000
Mar 20 20:26:53 d5-12-Debian1 kernel: [17176.443054] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000

Code: Select all

... which only vary here: [17176.391615]
                                 ^^^^^^
I presume the freeze happens because my system has devoted itself to writing these lines until the disk is full. Is there anything I can do about this?

Speaking of syslog, is there a primer somewhere on all the various files in /var/log? It would be nice to know what they all do.
Last edited by rayandrews on 2023-03-25 17:20, edited 1 time in total.

User avatar
sunrat
Administrator
Administrator
Posts: 6412
Joined: 2006-08-29 09:12
Location: Melbourne, Australia
Has thanked: 116 times
Been thanked: 462 times

Re: [Software] 3G of almost exactly the same message.

#2 Post by sunrat »

You should work out why all those messages occur. Firmware maybe? May need firmware-misc-nonfree which contains firmware for Nvidia cards.
“ computer users can be divided into 2 categories:
Those who have lost data
...and those who have not lost data YET ”
Remember to BACKUP!

User avatar
donald
Debian Developer, Site Admin
Debian Developer, Site Admin
Posts: 1046
Joined: 2021-03-30 20:08
Has thanked: 186 times
Been thanked: 240 times

Re: [Software] 3G of almost exactly the same message.

#3 Post by donald »

$20 bet you have Nvidia card installed and UEFI on.
Typo perfectionish.


"The advice given above is all good, and just because a new message has appeared it does not mean that a problem has arisen, just that a new gremlin hiding in the hardware has been exposed." - FreewheelinFrank

reinob
Posts: 1189
Joined: 2014-06-30 11:42
Has thanked: 97 times
Been thanked: 47 times

Re: [Software] 3G of almost exactly the same message.

#4 Post by reinob »

In case you don't know.. you don't have to use syslog (rsyslogd) if you don't want it and/or can't manage it.

systemd-journald has the option of forwarding to syslog, which defaults to "yes". Just change it to "no" in /etc/systemd/journald.conf.

As for the journal, you can also limit its size using various options. You can even make it volatile (i.e. RAM-only, so it doesn't survive a reboot). For desktop systems I think it's not a bad idea (and you can always turn it back on if you need to debug something).

If you nevertheless want to keep the syslog, you can always filter specific messages.

(for the sake of clarity: if your problem is the logging, then fix the logging. If your problem is the freezing, then fix the freezing..)

Aki
Global Moderator
Global Moderator
Posts: 2823
Joined: 2014-07-20 18:12
Location: Europe
Has thanked: 69 times
Been thanked: 385 times

Re: [Software] 3G of almost exactly the same message.

#5 Post by Aki »

Hello,
rayandrews wrote: 2023-03-21 22:50 Occasionally my system hangs [..] I see my /var/log/syslog has expanded to the point where it uses up the entire free space on my disk and, helpfully, contains countless millions of these nearly identical lines:

Code: Select all

Mar 20 20:26:52 d5-12-Debian1 kernel: [17176.391603] nouveau 0000:01:00.0: fifo: PBDMA0: 80000000 [SIGNATURE] ch 3 [00ff947000 Xorg[585]] subc 0 mthd 0000 data 00000000
[..]
I presume the freeze happens because my system has devoted itself to writing these lines until the disk is full. Is there anything I can do about this?
The message reported above is from the "nouveau" kernel module, the kernel module that drives your nvidia graphic card. It reports that there is something unusual in the commands sent to the GPU pipeline (commands executed by the GPU to draw the screen).

If you are interested in solving it, it could be useful to understand what triggers it. For example, you can analyze system logs to identify last logged activities before these messages start appearing.

It could be useful reporting Debian installed version, used kernel and other information about your hardware (computer manufacturer, model and GPU model) and software. The inxi software can be useful to detail some of this information:

Code: Select all

inxi -F
⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Debian - The universal operating system
⢿⡄⠘⠷⠚⠋⠀ https://www.debian.org
⠈⠳⣄⠀

User avatar
sunrat
Administrator
Administrator
Posts: 6412
Joined: 2006-08-29 09:12
Location: Melbourne, Australia
Has thanked: 116 times
Been thanked: 462 times

Re: [Software] 3G of almost exactly the same message.

#6 Post by sunrat »

Indeed system information may be helpful. I would suggest to install inxi and show the output of

Code: Select all

inxi -Fxxz
the x are for extra details and z is to sanitise it of personally identifiable information.

Whether firmware is installed as in my above post is important.
“ computer users can be divided into 2 categories:
Those who have lost data
...and those who have not lost data YET ”
Remember to BACKUP!

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#7 Post by rayandrews »

donald wrote: 2023-03-22 02:25 $20 bet you have Nvidia card installed and UEFI on.
It's a Nvidia ZOTAC GEFORCE GT-730. Just replaced it, previous of same model was starting to fail. Need a card that has all three styles of output. How do I check for this UEFI?
Last edited by rayandrews on 2023-03-22 15:40, edited 1 time in total.

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#8 Post by rayandrews »

"Firmware maybe? May need firmware-misc-nonfree which contains firmware for Nvidia cards."

Installed.


"(for the sake of clarity: if your problem is the logging, then fix the logging. If your problem is the freezing, then fix the freezing..)"

Both! But thanks, I've figured out how to limit the size of the logs. Really tho, you'd think there'd be some default protection against syslog filling up your entire disk.

"If you are interested in solving it, it could be useful to understand what triggers it. For example, you can analyze system logs to identify last logged activities before these messages start appearing."

Where do I find that info? As I mentioned, there are so many things in /var/log, I'd like to understand what all those files do. I haven't noticed anything in particular that causes it so far.

"inxi -Fxxz"

Nice program, haven't used that before:

Code: Select all


System:    Kernel: 5.10.0-19-amd64 x86_64 bits: 64 compiler: gcc v: 10.2.1 
           Desktop: Xfce 4.16.0 tk: Gtk 3.24.24 wm: xfwm4 dm: LightDM 
           Distro: Debian GNU/Linux 11 (bullseye) 
Machine:   Type: Desktop System: MapleTronics product: DQ45CB v: vPro serial: <filter> 
           Mobo: Intel model: DQ45CB v: AAE30148-205 serial: <filter> BIOS: Intel 
           v: CBQ4510H.86A.0073.2009.0122.1353 date: 01/22/2009 
Battery:   Device-1: hidpp_battery_0 model: Logitech Wireless Mouse serial: <filter> 
           charge: 55% (should be ignored) status: Discharging 
CPU:       Info: Dual Core model: Intel Core2 Duo E8500 bits: 64 type: MCP arch: Penryn 
           rev: A L1 cache: 32 KiB L2 cache: 6 MiB 
           flags: lm nx pae sse sse2 sse3 sse4_1 ssse3 vmx bogomips: 12664 
           Speed: 2000 MHz min/max: 1998/3163 MHz Core speeds (MHz): 1: 2000 2: 2000 
Graphics:  Device-1: NVIDIA GK208B [GeForce GT 730] vendor: ZOTAC driver: nouveau 
           v: kernel bus ID: 01:00.0 chip ID: 10de:1287 
           Display: x11 server: X.Org 1.20.11 driver: loaded: modesetting 
           unloaded: fbdev,vesa resolution: 1: 1920x1080~60Hz 2: 1920x1080~60Hz 
           3: 1366x768~60Hz s-dpi: 96 
           OpenGL: renderer: NV106 v: 4.3 Mesa 20.3.5 direct render: Yes 
Audio:     Device-1: Intel 82801JD/DO HD Audio driver: snd_hda_intel v: kernel 
           bus ID: 00:1b.0 chip ID: 8086:3a6e 
           Device-2: NVIDIA GK208 HDMI/DP Audio vendor: ZOTAC driver: snd_hda_intel 
           v: kernel bus ID: 01:00.1 chip ID: 10de:0e0f 
           Sound Server: ALSA v: k5.10.0-19-amd64 
Network:   Device-1: Intel 82567LM-3 Gigabit Network driver: e1000e v: kernel port: e100 
           bus ID: 00:19.0 chip ID: 8086:10de 
           IF: enp0s25 state: up speed: 1000 Mbps duplex: full mac: <filter> 
RAID:      Hardware-1: Intel SATA Controller [RAID mode] driver: ahci v: 3.0 
           bus ID: 00:1f.2 chip ID: 8086.2822 
Drives:    Local Storage: total: 149.04 GiB used: 15.78 GiB (10.6%) 
           ID-1: /dev/sda vendor: Western Digital model: WD800JD-75MSA3 size: 74.51 GiB 
           speed: 3.0 Gb/s serial: <filter> temp: 24 C 
           ID-2: /dev/sdb vendor: Western Digital model: WD800AAJS-60WAA0 
           size: 74.53 GiB speed: 3.0 Gb/s serial: <filter> 
Partition: ID-1: / size: 11.71 GiB used: 5.5 GiB (47.0%) fs: ext4 dev: /dev/sda5 
           ID-2: /boot size: 188.2 MiB used: 135.5 MiB (72.0%) fs: ext4 dev: /dev/sda2 
Swap:      ID-1: swap-1 type: partition size: 5 GiB used: 0 KiB (0.0%) priority: -2 
           dev: /dev/sda1 
Sensors:   System Temperatures: cpu: 38.0 C mobo: N/A gpu: nouveau temp: 29.0 C 
           Fan Speeds (RPM): N/A 
Info:      Processes: 176 Uptime: 1h 33m Memory: 7.7 GiB used: 1.71 GiB (22.1%) 
           Init: systemd v: 247 runlevel: 5 Compilers: gcc: N/A Packages: apt: 1398 
           Shell: Zsh v: 5.8 running in: xfce4-terminal inxi: 3.3.01

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#9 Post by rayandrews »

BTW the video card comes with a CD full of drivers, should I attempt an install? That presuming there is a Linux offering. I had thought that it would be safer to stick with Debian's drivers, but ...

Once tried the Nvidia driver, didn't notice any improvement and the thing is half a GB ferkrisakes. I've always used nouveau and back with Debian 9, my system was rock solid.

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#10 Post by rayandrews »

Ok, just had a hang, reboot and found this in syslog:

Code: Select all

Mar 22 18:16:44 d5-12-Debian1 kernel: [24081.734680] nouveau 0000:01:00.0: fifo: fault 00 [READ] at 0000000a08738000 engine 00 [GR] client 02 [GPC0/PE_0] reason 00 [PDE] on channel 6 [00ff717000 Xorg[597]]
Mar 22 18:16:44 d5-12-Debian1 kernel: [24081.734690] nouveau 0000:01:00.0: fifo: channel 6: killed
Mar 22 18:16:44 d5-12-Debian1 kernel: [24081.734694] nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Mar 22 18:16:44 d5-12-Debian1 kernel: [24081.734700] nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery

User avatar
kent_dorfman766
Posts: 535
Joined: 2022-12-16 06:34
Location: socialist states of america
Has thanked: 57 times
Been thanked: 70 times

Re: [Software] 3G of almost exactly the same message.

#11 Post by kent_dorfman766 »

rayandrews wrote: 2023-03-22 15:24
donald wrote: 2023-03-22 02:25 $20 bet you have Nvidia card installed and UEFI on.
How do I check for this UEFI?
UEFI is a bootloader technology that you would see in the BIOS setup if you go there and it's present, but I digres lest I "soapbox".

I am very curious about @donald comment about nvidia and UEFI. Common sense question is why would GPU have anything to do with it?

Small soapbox: gaining only about three inches...I never could use nouveau, but always had better performance and more features with closed source nvidia drivers, but then I use CUDA a lot.

User avatar
donald
Debian Developer, Site Admin
Debian Developer, Site Admin
Posts: 1046
Joined: 2021-03-30 20:08
Has thanked: 186 times
Been thanked: 240 times

Re: [Software] 3G of almost exactly the same message.

#12 Post by donald »

kent_dorfman766 wrote: 2023-03-23 01:43
rayandrews wrote: 2023-03-22 15:24
donald wrote: 2023-03-22 02:25 $20 bet you have Nvidia card installed and UEFI on.
How do I check for this UEFI?
UEFI is a bootloader technology that you would see in the BIOS setup if you go there and it's present, but I digres lest I "soapbox".

I am very curious about @donald comment about nvidia and UEFI. Common sense question is why would GPU have anything to do with it?

Small soapbox: gaining only about three inches...I never could use nouveau, but always had better performance and more features with closed source nvidia drivers, but then I use CUDA a lot.
Some older Nvidia cards needed UEFI and some do not and can run under Legacy support, and some are really finicky with signed drivers. I have a 390xx that with each update breaks and I have to drop down to 2 prior kernels to reinstall the drivers, disable secure boot to load the drivers or other such stuff. Its a problem with older cards.

@rayandrews Follow this page: https://wiki.debian.org/NvidiaGraphicsDrivers. My card is older but I can try to walk you through my process.
Typo perfectionish.


"The advice given above is all good, and just because a new message has appeared it does not mean that a problem has arisen, just that a new gremlin hiding in the hardware has been exposed." - FreewheelinFrank

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#13 Post by rayandrews »

... but this does look like a software issue then? Thing is that as I said, my system was rock solid using nouveau under Deb 9. Still, I suppose I should probably backup and try the proprietary driver again.

Aki
Global Moderator
Global Moderator
Posts: 2823
Joined: 2014-07-20 18:12
Location: Europe
Has thanked: 69 times
Been thanked: 385 times

Re: [Software] 3G of almost exactly the same message.

#14 Post by Aki »

rayandrews wrote: 2023-03-23 12:52 ... but this does look like a software issue then? Thing is that as I said, my system was rock solid using nouveau under Deb 9. Still, I suppose I should probably backup and try the proprietary driver again.
In rare cases, programs can trigger incorrect behavior in the nouveau kernel module.
⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Debian - The universal operating system
⢿⡄⠘⠷⠚⠋⠀ https://www.debian.org
⠈⠳⣄⠀

rayandrews
Posts: 111
Joined: 2014-01-31 21:32
Has thanked: 4 times
Been thanked: 1 time

Re: [Software] 3G of almost exactly the same message.

#15 Post by rayandrews »

Weell, I've reinstalled all .5 GB of the Nvidia driver and let's see how she runs. And I've tweaked things to keep my logs from taking over the entire disk -- which I still think should be protected against by default, esp. since logrotate makes it so easy once you know how. Thanks gentlemen.

Post Reply