LinuxLists.cc - smp_call_function/flush_tlb_all hang on large memory system

2003-03-20 20:10:17

Subject: smp_call_function/flush_tlb_all hang on large memory system

On my 2.4.18 system, I am seeing a hang in
smp_call_function() when my machine is configured with
>2GB of RAM. Examining the call_data, it appears that
only 2 out of the other 3 cpus have responded to the
IPI (started == finished == 2). Examining eflags on
each of the cpus, shows that one of the cpus has
interrupts disabled (bit 9 was clear). However, that
cpu's stack does not show anything that would have
disabled interrupts, i.e. appears as if somebody
forgot to sti. Interestingly, manually setting that
bit (from kdb) will actually unwedge the system.

I always see smp_call_function() called from
flush_tbl_all(), which is usually called from
kmap_high; there has been one occassion when it was
called from vmfree_area_pages.

3]kdb> bt
EBP EIP Function (args)
4d4afec4 40111398 smp_call_function+0x88 (40111180 0 1
1 755)
4d4afee0 401111dc flush_tlb_all+0x14 (0 4d4ac000
433ca6b0)
4d4afef4 40138779 flush_all_zero_pkmaps+0x89 (433ca6b0
4d4aff6c 403 40129fd0 433ca6b0)
4d4aff28 4013888a kmap_high+0x106 (4d112474 2000
499392e4 4d1124dc 0)
4d4aff74 4012c997 generic_file_write+0x40b (499392c4
3fffd4a4 1000 499392e4 0)
4d4aff98 40178df5 nfs_file_write+0xa9 (499392c4
3fffc4a4 2000 499392e4 4d4ac000)
4d4affbc 4013ad18 sys_write+0x98 (4 3fffc4a4 2000 2000
3fffc4a4)
401071ab system_call+0x33

This hang is fairly reproducible when exercising NFS
(as shown above), e.g. repeatedly copying a large
file.

Browsing the web, I found a similar occurrence, but
didn't find a reply to the post:
http://oss.sgi.com/projects/xfs/mail_archive/200010/msg00122.html

Has anyone ever seen this or possibly have any ideas?
Please CC me directly on the response. Thanks in
advance.

- William Chow

__________________________________________________
Do you Yahoo!?
Yahoo! Platinum - Watch CBS' NCAA March Madness, live on your desktop!
http://platinum.yahoo.com

2004-11-22 09:48:13

by Deepak Kumar Gupta, Noida

[permalink] [raw]

Subject: Re: smp_call_function/flush_tlb_all hang on large memory system

Hi William/Rajagopal

I saw your posting related to problem on internet. Just curious to ask you
have you got any solution for that or not.. as i am facing same problem on
SGI Propack 3 (based on kernel 2.4.18)on 2 CPU IA64 machine..

If you got any solution for this then pls let me know..

Any help in this regard is appreciated.

posting: http://www.cs.helsinki.fi/linux/linux-kernel/2003-11/1153.html

Thanks and Best Regards
Deepak Kumar Gupta
Operating System Group
HCL Technologies Limited
India.

2004-11-22 12:15:38

by Robin Holt

[permalink] [raw]

Subject: Re: smp_call_function/flush_tlb_all hang on large memory system

On Mon, Nov 22, 2004 at 03:15:00PM +0530, Deepak Kumar Gupta, Noida wrote:
> Hi William/Rajagopal
>
> I saw your posting related to problem on internet. Just curious to ask you
> have you got any solution for that or not.. as i am facing same problem on
> SGI Propack 3 (based on kernel 2.4.18)on 2 CPU IA64 machine..
>
> If you got any solution for this then pls let me know..
>
> Any help in this regard is appreciated.
>
> posting: http://www.cs.helsinki.fi/linux/linux-kernel/2003-11/1153.html
>

Can you provide the output from an L2 "leds" command? This will tell us
what the cpus are doing and whether they have interrupts enabled. Have you
contacted your support people yet? I did not see an open case for this,
but have no idea how your support person exactly filed it.

Thanks,
Robin Holt

2004-11-23 02:19:07

by Deepak Kumar Gupta, Noida

[permalink] [raw]

Subject: RE: smp_call_function/flush_tlb_all hang on large memory system

Hi Robin

The output of CPU is

CPU A: 0x02: Kernel: CPU busy
0x03: Kernel: CPU busy
CPU C: 0x03: Kernel: CPU busy

well regarding filing the issue.. i haven't yet contactated support
persons.. send the mail to just know whether there is already a solution
available or not..

If you are interested in stack trace.. then it is as follows:-

[0]kdb> bt
Stack traceback for pid 7
0xe00000307b818000 7 1 1 0 R 0xe00000307b8185a0 *kswapd
0xe00000000444b120 smp_call_function+0x5e0
args (0xe000000005033698, 0xe000000005033698, 0x1,
0xa000000000008000, 0x1)
kernel .text 0xe000000004400000 0xe00000000444ab40
0xe00000000444b160
0xe00000000444a330 smp_flush_tlb_all+0x30
args (0xe0000000044545a0, 0x288)
kernel .text 0xe000000004400000 0xe00000000444a300
0xe00000000444a360
0xe0000000044545a0 flush_tlb_range+0x40
args (0xe00000307a5b64c8, 0x2000000002128000, 0x200000000212c000,
0xe000000004559880, 0x58e)
kernel .text 0xe000000004400000 0xe000000004454560
0xe000000004454700
0xe000000004559880 try_to_swap_out+0x320
args (0xe00000307a5b64c8, 0xe00000303b910468, 0x27be00,
0xe000003045638250, 0xa0007fffffe20300)
kernel .text 0xe000000004400000 0xe000000004559560
0xe000000004559c60
0xe0000000045564d0 swap_out+0x810
args (0xa0007fffffe20300, 0x1d0, 0xe000003005400000,
0xe000003045638250, 0xe00000307a5b64c8)
kernel .text 0xe000000004400000 0xe000000004555cc0
0xe000000004556680
0xe000000004556b10 shrink_cache+0x490
args (0xe0000030054187f0, 0xc, 0xe000003005400000, 0x1d0,
0xa0007fffff6b9110)
kernel .text 0xe000000004400000 0xe000000004556680
0xe0000000045574a0
0xe000000004557aa0 shrink_caches+0xe0
args (0xe000003005400000, 0x6, 0x1d0, 0x20, 0xe000003005400000)
kernel .text 0xe000000004400000 0xe0000000045579c0
0xe000000004557b60
0xe000000004557bf0 try_to_free_pages_zone+0x90
args (0xe000003005400000, 0x1d0, 0x5, 0xe00000000541c998,
0xe000000005033848)
kernel .text 0xe000000004400000 0xe000000004557b60
0xe000000004557cc0
0xe0000000045590d0 kswapd_balance_pgdat+0x110
args (0xe000003005400000, 0xe000003005400030, 0x0,
0xe000003005400000, 0x0)
kernel .text 0xe000000004400000 0xe000000004558fc0
0xe000000004559140
0xe0000000045591b0 kswapd_balance+0x70
args (0x0, 0xe00000000586eb10, 0xe000000004559510, 0x287)
kernel .text 0xe000000004400000 0xe000000004559140
0xe000000004559220
0xe000000004559510 kswapd+0x170
args (0x0, 0xe000000004f4e250, 0x1, 0xe000000004416b00, 0x30c)
kernel .text 0xe000000004400000 0xe0000000045593a0
0xe000000004559560
0xe000000004416b00 arch_kernel_thread+0x160
args (0xe000000004d0f2b8, 0xe00000000521b660, 0x0, 0x0,
0xe0000000044e4a30)
kernel .text 0xe000000004400000 0xe0000000044169a0
0xe000000004416c20
0xe0000000044e4a30 kernel_thread+0xf0
args (0xe000000004d0f2b0, 0x0, 0xe00, 0x0, 0xe000000004d34990)
kernel .text 0xe000000004400000 0xe0000000044e4940
0xe0000000044e4a60
0xe000000004d34990 kswapd_init+0x70
args (0xe000000004d1d030, 0x285)
kernel .text.init 0xe000000004d1c000 0xe000000004d34920
0xe000000004d34a00
0xe000000004d1d030 do_initcalls+0x50
args (0xe000000004e5b2d8, 0xe00000000521b660, 0xe000000004e5b578,
0xe000000004408e20, 0x20a)
kernel .text.init 0xe000000004d1c000 0xe000000004d1cfe0
0xe000000004d1d080
0xe000000004408e20 init+0xc0
args (0x0, 0xe000003007014830, 0xe000000004416b00, 0x30c)
kernel .text 0xe000000004400000 0xe000000004408d60
0xe0000000044090a0
0xe000000004416b00 arch_kernel_thread+0x160
args (0xe000000004d10f88, 0xe00000000521b660, 0x0,
0xaeeeeeee8badbeef, 0xe0000000044e4a30)
kernel .text 0xe000000004400000 0xe0000000044169a0
0xe000000004416c20
0xe0000000044e4a30 kernel_thread+0xf0
args (0xe000000004d10f80, 0x0, 0xe00, 0x0, 0xe000000004408cd0)
kernel .text 0xe000000004400000 0xe0000000044e4940
0xe0000000044e4a60
0xe000000004408cd0 rest_init+0x50
args (0xe000000004d1cf60, 0x58e)
kernel .text 0xe000000004400000 0xe000000004408c80
0xe000000004408d60
0xe000000004d1cf60 start_kernel+0x480
args (0x307bda9c08, 0xb1f, 0x300467e378, 0x3004875a00, 0x307bd4b7b0)
kernel .text.init 0xe000000004d1c000 0xe000000004d1cae0
0xe000000004d1cfe0
0xe0000000044081c0 start_ap+0x2a0
args (0x307bf77000, 0x3004a3eb50, 0x0, 0x1, 0x307bda9c08)
kernel .text 0xe000000004400000 0xe000000004407f20
0xe0000000044081e0

Best Regards
Deepak Kumar Gupta.

-----Original Message-----
From: Robin Holt
To: Deepak Kumar Gupta, Noida
Cc: '[email protected]'; '[email protected]';
'[email protected]'; '[email protected]'
Sent: 11/22/04 5:44 PM
Subject: Re: smp_call_function/flush_tlb_all hang on large memory system

On Mon, Nov 22, 2004 at 03:15:00PM +0530, Deepak Kumar Gupta, Noida
wrote:
> Hi William/Rajagopal
>
> I saw your posting related to problem on internet. Just curious to ask
you
> have you got any solution for that or not.. as i am facing same
problem on
> SGI Propack 3 (based on kernel 2.4.18)on 2 CPU IA64 machine..
>
> If you got any solution for this then pls let me know..
>
> Any help in this regard is appreciated.
>
> posting:
http://www.cs.helsinki.fi/linux/linux-kernel/2003-11/1153.html
>

Can you provide the output from an L2 "leds" command? This will tell us
what the cpus are doing and whether they have interrupts enabled. Have
you
contacted your support people yet? I did not see an open case for this,
but have no idea how your support person exactly filed it.

Thanks,
Robin Holt

2004-11-23 02:31:35

by tip-bot for Jack Steiner

[permalink] [raw]

Subject: Re: smp_call_function/flush_tlb_all hang on large memory system

On Tue, Nov 23, 2004 at 07:41:07AM +0530, Deepak Kumar Gupta, Noida wrote:
> Hi Robin
>
> The output of CPU is
>
> CPU A: 0x02: Kernel: CPU busy
> 0x03: Kernel: CPU busy
> CPU C: 0x03: Kernel: CPU busy

Looks like cpu 1 is stuck. Can you nmi cpu #1 & send the
NMI record + the System.map for the OS:

at kdb
ps
nmi <hung cpu>
(wait 20 sec)
pod
error
error a

<hung cpu> is the cpu number of the cpu that does not show LED values that
are changing. In the output above, cpu #1 is hung.

>
> well regarding filing the issue.. i haven't yet contactated support
> persons.. send the mail to just know whether there is already a solution
> available or not..
>
> If you are interested in stack trace.. then it is as follows:-
>
> [0]kdb> bt
> Stack traceback for pid 7
> 0xe00000307b818000 7 1 1 0 R 0xe00000307b8185a0 *kswapd
> 0xe00000000444b120 smp_call_function+0x5e0
> args (0xe000000005033698, 0xe000000005033698, 0x1,
> 0xa000000000008000, 0x1)
> kernel .text 0xe000000004400000 0xe00000000444ab40
> 0xe00000000444b160
> 0xe00000000444a330 smp_flush_tlb_all+0x30
> args (0xe0000000044545a0, 0x288)
> kernel .text 0xe000000004400000 0xe00000000444a300
> 0xe00000000444a360
> 0xe0000000044545a0 flush_tlb_range+0x40
> args (0xe00000307a5b64c8, 0x2000000002128000, 0x200000000212c000,
> 0xe000000004559880, 0x58e)
> kernel .text 0xe000000004400000 0xe000000004454560
> 0xe000000004454700
> 0xe000000004559880 try_to_swap_out+0x320
> args (0xe00000307a5b64c8, 0xe00000303b910468, 0x27be00,
> 0xe000003045638250, 0xa0007fffffe20300)
> kernel .text 0xe000000004400000 0xe000000004559560
> 0xe000000004559c60
> 0xe0000000045564d0 swap_out+0x810
> args (0xa0007fffffe20300, 0x1d0, 0xe000003005400000,
> 0xe000003045638250, 0xe00000307a5b64c8)
> kernel .text 0xe000000004400000 0xe000000004555cc0
> 0xe000000004556680
> 0xe000000004556b10 shrink_cache+0x490
> args (0xe0000030054187f0, 0xc, 0xe000003005400000, 0x1d0,
> 0xa0007fffff6b9110)
> kernel .text 0xe000000004400000 0xe000000004556680
> 0xe0000000045574a0
> 0xe000000004557aa0 shrink_caches+0xe0
> args (0xe000003005400000, 0x6, 0x1d0, 0x20, 0xe000003005400000)
> kernel .text 0xe000000004400000 0xe0000000045579c0
> 0xe000000004557b60
> 0xe000000004557bf0 try_to_free_pages_zone+0x90
> args (0xe000003005400000, 0x1d0, 0x5, 0xe00000000541c998,
> 0xe000000005033848)
> kernel .text 0xe000000004400000 0xe000000004557b60
> 0xe000000004557cc0
> 0xe0000000045590d0 kswapd_balance_pgdat+0x110
> args (0xe000003005400000, 0xe000003005400030, 0x0,
> 0xe000003005400000, 0x0)
> kernel .text 0xe000000004400000 0xe000000004558fc0
> 0xe000000004559140
> 0xe0000000045591b0 kswapd_balance+0x70
> args (0x0, 0xe00000000586eb10, 0xe000000004559510, 0x287)
> kernel .text 0xe000000004400000 0xe000000004559140
> 0xe000000004559220
> 0xe000000004559510 kswapd+0x170
> args (0x0, 0xe000000004f4e250, 0x1, 0xe000000004416b00, 0x30c)
> kernel .text 0xe000000004400000 0xe0000000045593a0
> 0xe000000004559560
> 0xe000000004416b00 arch_kernel_thread+0x160
> args (0xe000000004d0f2b8, 0xe00000000521b660, 0x0, 0x0,
> 0xe0000000044e4a30)
> kernel .text 0xe000000004400000 0xe0000000044169a0
> 0xe000000004416c20
> 0xe0000000044e4a30 kernel_thread+0xf0
> args (0xe000000004d0f2b0, 0x0, 0xe00, 0x0, 0xe000000004d34990)
> kernel .text 0xe000000004400000 0xe0000000044e4940
> 0xe0000000044e4a60
> 0xe000000004d34990 kswapd_init+0x70
> args (0xe000000004d1d030, 0x285)
> kernel .text.init 0xe000000004d1c000 0xe000000004d34920
> 0xe000000004d34a00
> 0xe000000004d1d030 do_initcalls+0x50
> args (0xe000000004e5b2d8, 0xe00000000521b660, 0xe000000004e5b578,
> 0xe000000004408e20, 0x20a)
> kernel .text.init 0xe000000004d1c000 0xe000000004d1cfe0
> 0xe000000004d1d080
> 0xe000000004408e20 init+0xc0
> args (0x0, 0xe000003007014830, 0xe000000004416b00, 0x30c)
> kernel .text 0xe000000004400000 0xe000000004408d60
> 0xe0000000044090a0
> 0xe000000004416b00 arch_kernel_thread+0x160
> args (0xe000000004d10f88, 0xe00000000521b660, 0x0,
> 0xaeeeeeee8badbeef, 0xe0000000044e4a30)
> kernel .text 0xe000000004400000 0xe0000000044169a0
> 0xe000000004416c20
> 0xe0000000044e4a30 kernel_thread+0xf0
> args (0xe000000004d10f80, 0x0, 0xe00, 0x0, 0xe000000004408cd0)
> kernel .text 0xe000000004400000 0xe0000000044e4940
> 0xe0000000044e4a60
> 0xe000000004408cd0 rest_init+0x50
> args (0xe000000004d1cf60, 0x58e)
> kernel .text 0xe000000004400000 0xe000000004408c80
> 0xe000000004408d60
> 0xe000000004d1cf60 start_kernel+0x480
> args (0x307bda9c08, 0xb1f, 0x300467e378, 0x3004875a00, 0x307bd4b7b0)
> kernel .text.init 0xe000000004d1c000 0xe000000004d1cae0
> 0xe000000004d1cfe0
> 0xe0000000044081c0 start_ap+0x2a0
> args (0x307bf77000, 0x3004a3eb50, 0x0, 0x1, 0x307bda9c08)
> kernel .text 0xe000000004400000 0xe000000004407f20
> 0xe0000000044081e0
>
> Best Regards
> Deepak Kumar Gupta.
>
>
> -----Original Message-----
> From: Robin Holt
> To: Deepak Kumar Gupta, Noida
> Cc: '[email protected]'; '[email protected]';
> '[email protected]'; '[email protected]'
> Sent: 11/22/04 5:44 PM
> Subject: Re: smp_call_function/flush_tlb_all hang on large memory system
>
> On Mon, Nov 22, 2004 at 03:15:00PM +0530, Deepak Kumar Gupta, Noida
> wrote:
> > Hi William/Rajagopal
> >
> > I saw your posting related to problem on internet. Just curious to ask
> you
> > have you got any solution for that or not.. as i am facing same
> problem on
> > SGI Propack 3 (based on kernel 2.4.18)on 2 CPU IA64 machine..
> >
> > If you got any solution for this then pls let me know..
> >
> > Any help in this regard is appreciated.
> >
> > posting:
> http://www.cs.helsinki.fi/linux/linux-kernel/2003-11/1153.html
> >
>
> Can you provide the output from an L2 "leds" command? This will tell us
> what the cpus are doing and whether they have interrupts enabled. Have
> you
> contacted your support people yet? I did not see an open case for this,
> but have no idea how your support person exactly filed it.
>
> Thanks,
> Robin Holt
> -
> To unsubscribe from this list: send the line "unsubscribe linux-ia64" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html

--
Thanks

Jack Steiner ([email protected]) 651-683-5302
Principal Engineer SGI - Silicon Graphics, Inc.

2004-11-23 02:43:37

by Keith Owens

[permalink] [raw]

Subject: Re: smp_call_function/flush_tlb_all hang on large memory system

It has always been an error to call smp_call_function() with interrupts
disabled. Recent 2.6 kernels check for this and issue a warning. The
problem is not smp_call_function() or flush_tlb_all(), it is the code
that called them with interrupts disabled. Find the calling code and
fix it to not disable interrupts.

2004-11-23 21:14:49

by Zwane Mwaikambo

[permalink] [raw]

Subject: RE: smp_call_function/flush_tlb_all hang on large memory system

On Tue, 23 Nov 2004, Deepak Kumar Gupta, Noida wrote:

> Hi Robin
>
> The output of CPU is
>
> CPU A: 0x02: Kernel: CPU busy
> 0x03: Kernel: CPU busy
> CPU C: 0x03: Kernel: CPU busy
>
> well regarding filing the issue.. i haven't yet contactated support
> persons.. send the mail to just know whether there is already a solution
> available or not..
>
> If you are interested in stack trace.. then it is as follows:-
>
> [0]kdb> bt
> Stack traceback for pid 7
> 0xe00000307b818000 7 1 1 0 R 0xe00000307b8185a0 *kswapd
> 0xe00000000444b120 smp_call_function+0x5e0
> args (0xe000000005033698, 0xe000000005033698, 0x1,
> 0xa000000000008000, 0x1)
> kernel .text 0xe000000004400000 0xe00000000444ab40
> 0xe00000000444b160
> 0xe00000000444a330 smp_flush_tlb_all+0x30
> args (0xe0000000044545a0, 0x288)
> kernel .text 0xe000000004400000 0xe00000000444a300
> 0xe00000000444a360
> 0xe0000000044545a0 flush_tlb_range+0x40
> args (0xe00000307a5b64c8, 0x2000000002128000, 0x200000000212c000,
> 0xe000000004559880, 0x58e)
> kernel .text 0xe000000004400000 0xe000000004454560
> 0xe000000004454700
> 0xe000000004559880 try_to_swap_out+0x320
> args (0xe00000307a5b64c8, 0xe00000303b910468, 0x27be00,
> 0xe000003045638250, 0xa0007fffffe20300)
> kernel .text 0xe000000004400000 0xe000000004559560

This function holds mm->page_table_lock which is acquired with interrupts
disabled. As a result there is a window for deadlock when you descend into
smp_call_function. I suggest you run fast from crusty kernels ;)