kernel error message - Etch


Post by migster » 2008-06-29 01:23

Hello,

I'm getting weird, reproducible kernel error messages. I installed Debian Etch with software RAID1, and when both disks are under heavy load I get this error message:
Code:
Message from syslogd@localhost at Sat Jun 28 20:41:02 2008 ...
localhost kernel: Oops: 0000 [#1]

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: SMP

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: CPU:    0

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: EIP is at generic_file_buffered_write+0x23e/0x5ea

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: eax: f1b94160   ebx: f820b220   ecx: 000003ad   edx: c13a4b80

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: esi: 000000a9   edi: 000005b4   ebp: c13a4b80   esp: e42bbdb0

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: ds: 007b   es: 007b   ss: 0068

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: Process wget (pid: 11586, ti=e42ba000 task=f1d80aa0 task.ti=e42ba000)

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: Stack: 00000961 00000001 e42bbefc 00000001 00000961 000005b4 f1b94160 e1bbef48

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel:        f820b220 e1bbee9c 00000000 b7f97000 000003ad 00000000 e42bbebc 00000000

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel:        000005b4 00000000 00000000 c0250a8a e42bbeb4 00000000 000005b4 dc045a50

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: Call Trace:

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: Code: 3c 00 c7 44 24 30 00 00 00 00 0f 84 d6 01 00 00 8b 4c 24 2c 03 4c 24 3c 89 ea 89 4c 24 0c 51 8b 5c 24 20 8b 4c 24 30 8b 44 24 18 <ff> 53 18 89 c7 85 ff 58 74 79 8b 54 24 20 8b 82 5c 01 00 00 f0

Message from syslogd@localhost at Sat Jun 28 20:41:03 2008 ...
localhost kernel: EIP: [<c0143120>] generic_file_buffered_write+0x23e/0x5ea SS:ESP 0068:e42bbdb0

max@max-amd:~/Sources/xine$
Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: Oops: 0000 [#2]

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: SMP

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: CPU:    0

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: EIP is at do_writepages+0x15/0x32

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: eax: f820b220   ebx: dfb11f78   ecx: 00000000   edx: dfb11f78

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: esi: e1bbef48   edi: c2185a00   ebp: e1bbef48   esp: dfb11f04

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: ds: 007b   es: 007b   ss: 0068

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: Process pdflush (pid: 174, ti=dfb10000 task=dff8faa0 task.ti=dfb10000)

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: Stack: 00000007 e1bbee9c c0177017 dfb11f78 00000000 00000000 f6452ea4 dff8faa0

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel:        f35011c0 c2009340 c200c260 f3eb6e40 f3501000 e1bbee9c c22f74b4 c2185a00

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel:        dfb11f78 c0177467 00056081 00000005 c2185a00 c2185a3c dfb11f78 00000000

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: Call Trace:

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: Code: 03 00 ba 01 00 00 00 5b 5e 89 d0 c3 ff 40 10 89 d0 e9 45 ff ff ff 56 89 c6 31 c0 53 83 7a 0c 00 89 d3 7e 21 80 4a 24 10 8b 46 38 <8b> 48 0c 85 c9 74 06 89 f0 ff d1 eb 09 31 c9 89 f0 e8 f9 1f 03

Message from syslogd@localhost at Sat Jun 28 20:41:33 2008 ...
localhost kernel: EIP: [<c0146656>] do_writepages+0x15/0x32 SS:ESP 0068:dfb11f04



Here's the output from `uname -ra`:
Linux max-amd 2.6.18-6-k7 #1 SMP Fri Jun 6 22:56:53 UTC 2008 i686 GNU/Linux

The system:
ASUS A7N8X-Deluxe motherboard, Athlon XP 2600+, Nvidia GeForce 6800 (kernel module compiled with the Nvidia utility). Two SATA I drives, partitioned into a bunch of Linux partitions (/root, /var, etc.) and mirrored using RAID1.

(So, why does the above kernel show SMP? Not sure...)

The problem:
During heavy disk load the kernel messages above appear and the system becomes unstable (sometimes it crashes, sometimes the X server restarts, sometimes I just get error messages). By heavy load I mean: both drives are resyncing and I'm compiling 6-7 applications at the same time. GKrellM shows about 50 MB/s of drive throughput.

When I leave the system alone with both drives resyncing, the error does not happen (at least it hasn't happened yet).

Temporary fix:
I will consider changing the kernel version and see how that goes... but I have no idea what the error message means.

Any ideas?

Thanks in advance!
migster
 
Posts: 3
Joined: 2008-06-29 00:56
Location: Canada

Post by BioTube » 2008-06-29 02:39

All x86 kernels (except maybe the 486 one; I'm also including AMD64) are built with SMP enabled. As for the message, something's screwing up in the kernel; it may be a buggy driver that can't keep up with the load.
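
For anyone trying to read reports like the ones quoted in this thread, a small script can pull out the two fields that say the most at a glance: the function named on the "EIP is at" line and the process named on the "Process" line. This is only a rough sketch; the function name `summarize_oopses` and the regular expressions are made up for illustration and assume the 2.6.18-era i386 oops layout shown in these posts.

```python
import re

# Patterns for fields that appear in the oops reports quoted in this thread.
# They are assumptions based on that layout, not a general oops parser.
OOPS_RE = re.compile(r"Oops: [0-9a-f]{4} \[#(\d+)\]")
EIP_RE = re.compile(r"EIP is at (\w+)\+0x[0-9a-f]+/0x[0-9a-f]+")
PROC_RE = re.compile(r"Process (\S+) \(pid: (\d+)")


def summarize_oopses(text):
    """Return one dict per oops: its number, faulting function, process and pid."""
    reports = []
    for line in text.splitlines():
        m = OOPS_RE.search(line)
        if m:
            # A new "Oops: ... [#N]" line starts a new report.
            reports.append({"n": int(m.group(1)), "function": None,
                            "process": None, "pid": None})
            continue
        if not reports:
            continue  # ignore anything before the first oops
        m = EIP_RE.search(line)
        if m:
            reports[-1]["function"] = m.group(1)
            continue
        m = PROC_RE.search(line)
        if m:
            reports[-1]["process"] = m.group(1)
            reports[-1]["pid"] = int(m.group(2))
    return reports
```

Fed the first report in this thread, it would list `generic_file_buffered_write` under wget and `do_writepages` under pdflush. Both are page-cache writeback paths rather than one specific driver, which fits the "something in the kernel" reading but does not narrow it down.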
Ludwig von Mises wrote: The elite should be supreme by virtue of persuasion, not by the assistance of firing squads.
BioTube
 
Posts: 7551
Joined: 2007-06-01 04:34

Post by migster » 2008-06-29 14:36

Okay, so I installed 2.6.18-4-k7 and recompiled the Nvidia video drivers - again, same weird kernel error... this is really annoying:

Code:
Jun 29 10:23:34 localhost kernel: BUG: unable to handle kernel paging request at virtual address f8205248
Jun 29 10:23:35 localhost kernel:  printing eip:
Jun 29 10:23:35 localhost kernel: c015bbee
Jun 29 10:23:36 localhost kernel: *pde = 00000000
Jun 29 10:23:36 localhost kernel: Oops: 0000 [#1]
Jun 29 10:23:36 localhost kernel: SMP
Jun 29 10:23:36 localhost kernel: Modules linked in: nvidia xt_limit xt_tcpudp iptable_mangle ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc ip_conntrack_ftp xt_state ip_conntrack nfnetlink iptable_filter ip_tables x_tables ppdev lp button ac battery ipv6 dm_snapshot dm_mirror dm_mod w83l785ts asb100 hwmon_vid eeprom ds1621 sbp2 loop snd_mpu401 snd_mpu401_uart snd_seq_dummy snd_seq_oss snd_ca0106 snd_seq_midi snd_seq_midi_event analog snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm_oss snd_mixer_oss snd_seq gameport snd_rawmidi snd_seq_device parport_pc parport psmouse floppy rtc snd_pcm snd_timer serio_raw pcspkr shpchp pci_hotplug snd soundcore i2c_nforce2 snd_page_alloc i2c_core nvidia_agp agpgart eth1394 tsdev evdev
ext3 jbd mbcache raid1 md_mod ide_generic usb_storage sd_mod ide_cd cdrom sata_nv generic usbhid ohci1394 ieee1394 3c59x mii ehci_hcd sata_sil libata
scsi_mod amd74xx ide_core ohci_hcd forcedeth usbcore thermal processor fan
Jun 29 10:23:36 localhost kernel: CPU:    0
Jun 29 10:23:36 localhost kernel: EIP:    0060:[<c015bbee>]    Tainted: P      VLI
Jun 29 10:23:36 localhost kernel: EFLAGS: 00010286   (2.6.18-4-k7 #1)
Jun 29 10:23:36 localhost kernel: EIP is at try_to_release_page+0x27/0x46
Jun 29 10:23:36 localhost kernel: eax: f8205220   ebx: f0f6ff48   ecx: c1622200   edx: 00000000
Jun 29 10:23:36 localhost kernel: esi: 000000d0   edi: 00000001   ebp: c02d1800   esp: dfb1fe40
Jun 29 10:23:36 localhost kernel: ds: 007b   es: 007b   ss: 0068
Jun 29 10:23:36 localhost kernel: Process kswapd0 (pid: 175, ti=dfb1e000 task=dff90aa0 task.ti=dfb1e000)
Jun 29 10:23:36 localhost kernel: Stack: c1622200 f0f6ff48 c0148df3 dfb1ff80 00000020 00000000 00000000 00000020
Jun 29 10:23:36 localhost kernel:        00000003 00000001 00000000 00000020 00000003 00000001 c147b9a0 c12ada80
Jun 29 10:23:36 localhost kernel:        c10a8160 c1d8dcc0 c18e1800 c1e54b40 c1ae3a00 c1b519a0 c1c373c0 c1d51e60
Jun 29 10:23:36 localhost kernel: Call Trace:
Jun 29 10:23:36 localhost kernel:  [<c0148df3>] shrink_inactive_list+0x44b/0x71c
Jun 29 10:23:36 localhost kernel:  [<f89ac2ea>] mb_cache_shrink_fn+0x1d/0xb5 [mbcache]
Jun 29 10:23:36 localhost kernel:  [<c0149173>] shrink_zone+0xaf/0xd0
Jun 29 10:23:36 localhost kernel:  [<c01495fc>] kswapd+0x295/0x399
Jun 29 10:23:36 localhost kernel:  [<c012db81>] autoremove_wake_function+0x0/0x2d
Jun 29 10:23:36 localhost kernel:  [<c0149367>] kswapd+0x0/0x399
Jun 29 10:23:36 localhost kernel:  [<c012dab3>] kthread+0xc2/0xef
Jun 29 10:23:36 localhost kernel:  [<c012d9f1>] kthread+0x0/0xef
Jun 29 10:23:36 localhost kernel:  [<c0101005>] kernel_thread_helper+0x5/0xb
Jun 29 10:23:36 localhost kernel: Code: 89 f8 5f c3 56 89 c1 89 d6 53 8b 58 10 8b 00 a8 01 75 08 0f 0b 38 06 46 f5 29 c0 8b 01 31 d2 f6 c4 10 75 21 85 db 74 14 8b 43 38 <8b> 58 28 85 db 74 0a 89 f2 89 c8 ff d3 89 c2 eb 09 5b 5e 89 c8
Jun 29 10:23:36 localhost kernel: EIP: [<c015bbee>] try_to_release_page+0x27/0x46 SS:ESP 0068:dfb1fe40
Jun 29 10:23:36 localhost kernel:  <1>BUG: unable to handle kernel paging request at virtual address f8205248
Jun 29 10:23:36 localhost kernel:  printing eip:
Jun 29 10:23:36 localhost kernel: c015bbee
Jun 29 10:23:36 localhost kernel: *pde = 00000000
Jun 29 10:23:36 localhost kernel: Oops: 0000 [#2]
Jun 29 10:23:36 localhost kernel: SMP
Jun 29 10:23:36 localhost kernel: Modules linked in: nvidia xt_limit xt_tcpudp iptable_mangle ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc ip_conntrack_ftp xt_state ip_conntrack nfnetlink iptable_filter ip_tables x_tables ppdev lp button ac battery ipv6 dm_snapshot dm_mirror dm_mod w83l785ts asb100 hwmon_vid eeprom ds1621 sbp2 loop snd_mpu401 snd_mpu401_uart snd_seq_dummy snd_seq_oss snd_ca0106 snd_seq_midi snd_seq_midi_event analog snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm_oss snd_mixer_oss snd_seq gameport snd_rawmidi snd_seq_device parport_pc parport psmouse floppy rtc snd_pcm snd_timer serio_raw pcspkr shpchp pci_hotplug snd soundcore i2c_nforce2 snd_page_alloc i2c_core nvidia_agp agpgart eth1394 tsdev evdev
ext3 jbd mbcache raid1 md_mod ide_generic usb_storage sd_mod ide_cd cdrom sata_nv generic usbhid ohci1394 ieee1394 3c59x mii ehci_hcd sata_sil libata
scsi_mod amd74xx ide_core ohci_hcd forcedeth usbcore thermal processor fan
Jun 29 10:23:36 localhost kernel: CPU:    0
Jun 29 10:23:36 localhost kernel: EIP:    0060:[<c015bbee>]    Tainted: P      VLI
Jun 29 10:23:36 localhost kernel: EFLAGS: 00213286   (2.6.18-4-k7 #1)
Jun 29 10:23:36 localhost kernel: EIP is at try_to_release_page+0x27/0x46
Jun 29 10:23:36 localhost kernel: eax: f8205220   ebx: f0f6ff48   ecx: c1550480   edx: 00000000
Jun 29 10:23:36 localhost kernel: esi: 000280d2   edi: 00000001   ebp: c02d1800   esp: f3c4fdbc
Jun 29 10:23:36 localhost kernel: ds: 007b   es: 007b   ss: 0068
Jun 29 10:23:36 localhost kernel: Process Xorg (pid: 5385, ti=f3c4e000 task=dfa81550 task.ti=f3c4e000)
Jun 29 10:23:36 localhost kernel: Stack: c1550480 f0f6ff48 c0148df3 f3c4fef0 00000020 00000000 00000000 00000020
Jun 29 10:23:36 localhost kernel:        00000016 00000001 00000000 00000020 00000008 00000001 c1570080 c11f8180
Jun 29 10:23:36 localhost kernel:        c13943c0 c140aea0 c1213100 c155d520 c16a5dc0 c127fc00 c13157a0 c10bb680
Jun 29 10:23:36 localhost kernel: Call Trace:
Jun 29 10:23:36 localhost kernel:  [<c0148df3>] shrink_inactive_list+0x44b/0x71c
Jun 29 10:23:36 localhost kernel:  [<c0147c81>] __pagevec_release+0x15/0x1d
Jun 29 10:23:36 localhost kernel:  [<c01488a4>] shrink_active_list+0x384/0x38c
Jun 29 10:23:36 localhost kernel:  [<c0149173>] shrink_zone+0xaf/0xd0
Jun 29 10:23:36 localhost kernel:  [<c0149af4>] try_to_free_pages+0x138/0x224
Jun 29 10:23:36 localhost kernel:  [<c0145f52>] __alloc_pages+0x184/0x275
Jun 29 10:23:36 localhost kernel:  [<c014c5f0>] __handle_mm_fault+0xf1/0x705
Jun 29 10:23:36 localhost kernel:  [<c011554e>] do_page_fault+0x18a/0x46c
Jun 29 10:23:36 localhost kernel:  [<c01153c4>] do_page_fault+0x0/0x46c
Jun 29 10:23:36 localhost kernel:  [<c01037d5>] error_code+0x39/0x40
Jun 29 10:23:36 localhost kernel: Code: 89 f8 5f c3 56 89 c1 89 d6 53 8b 58 10 8b 00 a8 01 75 08 0f 0b 38 06 46 f5 29 c0 8b 01 31 d2 f6 c4 10 75 21 85 db 74 14 8b 43 38 <8b> 58 28 85 db 74 0a 89 f2 89 c8 ff d3 89 c2 eb 09 5b 5e 89 c8
Jun 29 10:23:36 localhost kernel: EIP: [<c015bbee>] try_to_release_page+0x27/0x46 SS:ESP 0068:f3c4fdbc


Any ideas? Maybe try some other kernel? There are only 3 K7 kernels available for Debian Etch I think, and the earliest and latest versions failed, so... I have no idea what to do now :(
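
One thing that can be read straight off the log above, offered as a sketch and not a diagnosis: the bytes marked `<8b> 58 28` in the Code line decode as `mov 0x28(%eax),%ebx`, so the faulting access should be at eax plus 0x28. The snippet below checks that arithmetic against the logged values; the helper name `fault_offset` is hypothetical and the patterns assume the log format shown in this post.

```python
import re


def fault_offset(log_text):
    """Return (faulting virtual address - eax), both read from an oops log.

    Assumes the log has a "virtual address XXXXXXXX" line and an
    "eax: XXXXXXXX" register line, as in the 2.6.18-4-k7 report above.
    """
    addr = int(re.search(r"virtual address ([0-9a-f]{8})", log_text).group(1), 16)
    eax = int(re.search(r"eax: ([0-9a-f]{8})", log_text).group(1), 16)
    return addr - eax


log = """\
kernel: BUG: unable to handle kernel paging request at virtual address f8205248
kernel: eax: f8205220   ebx: f0f6ff48   ecx: c1622200   edx: 00000000
"""
print(hex(fault_offset(log)))  # 0xf8205248 - 0xf8205220
```

The difference is 0x28, which matches the displacement in the instruction, and eax holds the same value (f8205220) in both oopses even though the processes differ (kswapd0 and Xorg). That suggests both faults follow the same pointer into an unmapped address (`*pde = 00000000`), but it does not by itself say whether a driver or the hardware put it there.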
migster
 
Posts: 3
Joined: 2008-06-29 00:56
Location: Canada

Post by migster » 2008-06-29 15:32

Another update: I started to receive the aforementioned error message rather frequently without any load on the system - the failing process is Xorg - could this be related to the hardware in any way?

Is it possible that maybe some chunk of RAM in the video card or the motherboard cannot be allocated due to physical damage?

Before I installed Debian I had Ubuntu 8.04 and the video card would freeze randomly... I blamed it on the new compiz packages, but maybe it's the hardware?

Thanks in advance!
migster
 
Posts: 3
Joined: 2008-06-29 00:56
Location: Canada

