DevHeads.net

C 7: smpboot: CPU 16 is now offline

Current kernel, and I just booted. dmesg shows that, of the 32 cores, 0, 2,
4 and 6 are OK, and *all* the others show "is now offline".

What's happening here?

mark
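For anyone wanting to compare the dmesg messages with the kernel's current view, here is a minimal sketch that reads the CPU lists the kernel exposes under /sys/devices/system/cpu. The helper name count_cpus is made up for illustration; the sketch assumes a POSIX shell and skips any of the three files that is not readable.

```shell
#!/bin/sh
# Count the CPUs in a kernel CPU-list string such as "0,2,4,6" or "0-3,8-11".
# count_cpus is a hypothetical helper name, not a standard tool.
count_cpus() {
    list=$1
    total=0
    old_ifs=$IFS
    IFS=,
    for part in $list; do
        case $part in
            # A range "a-b" contributes b - a + 1 CPUs.
            *-*) lo=${part%-*}; hi=${part#*-}; total=$((total + hi - lo + 1)) ;;
            '')  ;;
            # A single CPU number contributes one.
            *)   total=$((total + 1)) ;;
        esac
    done
    IFS=$old_ifs
    echo "$total"
}

# Print the present, online and offline CPU lists, where the files exist.
for state in present online offline; do
    f=/sys/devices/system/cpu/$state
    if [ -r "$f" ]; then
        cpus=$(cat "$f")
        echo "$state: ${cpus:-none} ($(count_cpus "$cpus") CPUs)"
    fi
done
```

If the online list is much shorter than the present list, that matches what the smpboot messages report.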

Comments

Re: C 7: smpboot: CPU 16 is now offline, and slabs...

By m.roth at 06/13/2018 - 10:00

m.roth@5-cent.us wrote:
In googling, I see threads about incorrect calculation of slabs. Following
one thread, I find
cat /sys/kernel/slab/:t-0000048/cpu_slabs

gives me

4 N0=4

Meanwhile, slabtop shows
Active / Total Slabs (% used) : 25927 / 25927 (100.0%)

That changes, but only varies around that number, and is always at 100%.

So: should I increase the number of slabs, using the swiotlb kernel
parameter, and if so, given what I show above, should I set it to, say, 32000?

mark
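The cpu_slabs value quoted above ("4 N0=4") appears to follow the usual layout of the SLUB sysfs counters: a total, then one N<node>=<count> field per NUMA node. A small sketch for splitting such a line, assuming that layout; parse_slab_counts is a hypothetical helper name, and reading files under /sys/kernel/slab may require root.

```shell
#!/bin/sh
# Split a SLUB sysfs counter line such as "4 N0=4" into a total and
# per-node values. Assumes the "<total> N<node>=<count> ..." layout.
parse_slab_counts() {
    line=$1
    # Word-split the line into positional parameters.
    set -- $line
    # Nothing to parse on an empty line.
    [ $# -gt 0 ] || return 1
    echo "total=$1"
    shift
    for field in "$@"; do
        case $field in
            N*=*) node=${field%%=*}; echo "node${node#N}=${field#*=}" ;;
        esac
    done
}

# Example with the cache named earlier in the thread, if it is readable.
f=/sys/kernel/slab/:t-0000048/cpu_slabs
if [ -r "$f" ]; then
    parse_slab_counts "$(cat "$f")"
fi
```

On that reading, "4 N0=4" would mean four per-CPU slabs in total, all on NUMA node 0.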

Re: C 7: smpboot: CPU 16 is now offline, and slabs...

By m.roth at 06/13/2018 - 11:10

m.roth@5-cent.us wrote:
Perhaps I should have started with 1, 3, etc., but I was doing the 20s
instead. Got to CPU27... and the system rebooted.

Now I'm wondering if the offlined CPUs have something to do with the fact
that this machine (and an identical one in the datacenter) is rebooting
around 04:00 every day. Btw, they're Dell PE R530s from 2016....

mark

Re: C 7: smpboot: CPU 16 is now offline, and slabs...

By m.roth at 06/13/2018 - 13:36

m.roth@5-cent.us wrote:
Anyone think I might be going down the wrong path? Any thoughts at all? If
not, any comments on my downgrading to the previous microcode? This happened
once a week ago, and then, starting last Friday, began happening every day
at around 04:00.

mark