|
|
硬件 4超线程2.8,DDR 256M,四个IDE口硬盘,
系统:Mandrake Linux release 10.0 (Official)
内核:2.6.3-7
以前用另外一台机器用此版本非常稳定,现在的机器配置要高得多,却经常死机,除机器配置外,就是很多硬盘分区我由原来的reiserfs改为了XFS,
以下为死机后log的最后记录,似乎是某个地方出了BUG ,
Dec 11 04:03:49 ftp kernel: ------------[ cut here ]------------
Dec 11 04:03:49 ftp kernel: kernel BUG at include/linux/list.h:148!
Dec 11 04:03:49 ftp kernel: invalid operand: 0000 [#1]
Dec 11 04:03:49 ftp kernel: CPU: 0
Dec 11 04:03:49 ftp kernel: EIP: 0060:[prune_dcache+329/352] Not tainted VLI
Dec 11 04:03:49 ftp kernel: EIP: 0060:[<c01680a9>] Not tainted VLI
Dec 11 04:03:49 ftp kernel: EFLAGS: 00010293
Dec 11 04:03:49 ftp kernel: EIP is at prune_dcache+0x149/0x160
Dec 11 04:03:49 ftp kernel: eax: ce07dd70 ebx: c3bed8e0 ecx: c3bed8f4 edx: c3bec854
Dec 11 04:03:49 ftp kernel: esi: 00000080 edi: c3bed950 ebp: c13dfe90 esp: c13dfe84
Dec 11 04:03:49 ftp kernel: ds: 007b es: 007b ss: 0068
Dec 11 04:03:49 ftp kernel: Process kswapd0 (pid: 8, threadinfo=c13de000 task=c13e4ca0)
Dec 11 04:03:49 ftp kernel: Stack: c13de000 00000080 00000456 c13dfe9c c016847d 00000080 c13dfed0 c0143c3b
Dec 11 04:03:49 ftp kernel: 00000080 000000d0 000055e1 02cc2938 00000000 00000856 00000000 cffeeb80
Dec 11 04:03:49 ftp kernel: 00000102 c03275d4 00000081 c13dff04 c0144f2f 00000102 000000d0 00000000
Dec 11 04:03:49 ftp kernel: Call Trace:
Dec 11 04:03:49 ftp kernel: [shrink_dcache_memory+29/48] shrink_dcache_memory+0x1d/0x30
Dec 11 04:03:49 ftp kernel: [<c016847d>] shrink_dcache_memory+0x1d/0x30
Dec 11 04:03:49 ftp kernel: [shrink_slab+267/368] shrink_slab+0x10b/0x170
Dec 11 04:03:49 ftp kernel: [<c0143c3b>] shrink_slab+0x10b/0x170
Dec 11 04:03:49 ftp kernel: [balance_pgdat+575/592] balance_pgdat+0x23f/0x250
Dec 11 04:03:49 ftp kernel: [<c0144f2f>] balance_pgdat+0x23f/0x250
Dec 11 04:03:49 ftp kernel: [kswapd+199/224] kswapd+0xc7/0xe0
Dec 11 04:03:49 ftp kernel: [<c0145007>] kswapd+0xc7/0xe0
Dec 11 04:03:49 ftp kernel: [autoremove_wake_function+0/64] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [<c0121170>] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [autoremove_wake_function+0/64] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [<c0121170>] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [autoremove_wake_function+0/64] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [<c0121170>] autoremove_wake_function+0x0/0x40
Dec 11 04:03:49 ftp kernel: [kswapd+0/224] kswapd+0x0/0xe0
Dec 11 04:03:49 ftp kernel: [<c0144f40>] kswapd+0x0/0xe0
Dec 11 04:03:49 ftp kernel: [kernel_thread_helper+5/16] kernel_thread_helper+0x5/0x10
Dec 11 04:03:49 ftp kernel: [<c0109275>] kernel_thread_helper+0x5/0x10
Dec 11 04:03:49 ftp kernel:
Dec 11 04:03:49 ftp kernel: Code: ff 39 df 5a 0f 84 e8 fe ff ff 57 e8 22 fb ff ff 58 e9 dc fe ff ff 57 53 ff d0 5f 58 eb da 0f 0b 95 00 dd 60 2e c0 e9 78 ff ff ff <0f> 0b 94 00 dd 60 2e c0 e9 63 ff ff ff 8d 76 00 8d bc 27 00 00
Dec 11 04:05:03 ftp CROND[19302]: (mail) CMD (/usr/bin/python -S /usr/lib/mailman/cron/gate_news)
Dec 11 04:10:08 ftp CROND[19305]: (mail) CMD (/usr/bin/python -S /usr/lib/mailman/cron/gate_news)
不知道到底是XFS的某种BUG导致了死机,还是因为CROND对硬盘的频繁读取选成了死机,还请赐教
有一点比较奇怪,几乎每次死机都是凌晨四点后, 而四点左右,似乎crond只有mail的一个任务在运行,
27 3 * * * /usr/bin/python -S /usr/lib/mailman/cron/nightly_gzip |
|