Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp4910509imu; Mon, 12 Nov 2018 20:34:40 -0800 (PST) X-Google-Smtp-Source: AJdET5ckmOOS4iTg4uzgQi71610IWBoiRBmRqGL+dJ21Z+FaL+fdLOHw3xVH+AOg+8W98fyDGYuB X-Received: by 2002:a63:a84a:: with SMTP id i10mr3390411pgp.263.1542083680884; Mon, 12 Nov 2018 20:34:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1542083680; cv=none; d=google.com; s=arc-20160816; b=VxBnIvyd9j8F0EFBqSol1dG6U1NQvidcJqfo2VCU0ZCRtC0LsWrHcew3+OwhSqtLij Iy+az+OI4gcTrLL054JyZ0FOQp8NFIfwoobWhqXzuOZtkQTKmNtcDEDP/OCjlPLJaa7C hMaEiHHYeB2Q3jw8q081VB4J501GFj1XIm1kCy65k3D2D/vROn9ZD0VAExWx9FZUr4yT 3ZRp7cJnWDQ5hsI45evnwVD1soZAR3njC0yIlhnGrzBkhVqo7Jx89Tsk7hceQ7eQAwgu 99E33IgUkvlIAI85tHrSKjE7S9flobVK5X2wQn2RQhYVZDDaJSLjZy9LZtaR6oY0G89d 2i5g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version; bh=NNeK12wZHP/hs0hix0ff8694vFI89ZFYaVhROaErAOI=; b=GyDUgEmsuXnhO5Ti6NYtwKgOxta9gYWEHmzyXngzrOZRtaUxHy5EYkCIiY92ffKQWr EBwShntM7Rh8KNfYsp7d7mdKa0OdtP+dcRxsFoVFu3Kq1uMtEjIIZV3hx/HLY3MEin7C P3PVNvuTbINgxMW7YOq0yDO4eP7ARGXWcWBrP0GCpHdlSKfT4I2hPMj9jgeyg3xxCsNQ QXu12uZH98I4ZwT/KuDxUb779MvtDrlBN116j27zqnsPhxN6a3BK7yM7bMqtxTaN1eWq VNeV68hJ9m+LOlsNwjU3HkW9443mUa5Yboy2RzjKhataSHg67IH4e5cjQrUL2mwIPQFQ BzFQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r36-v6si18047609pgl.257.2018.11.12.20.34.25; Mon, 12 Nov 2018 20:34:40 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730728AbeKMOaI convert rfc822-to-8bit (ORCPT + 99 others); Tue, 13 Nov 2018 09:30:08 -0500 Received: from mout.gmx.net ([212.227.15.19]:43801 "EHLO mout.gmx.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726103AbeKMOaI (ORCPT ); Tue, 13 Nov 2018 09:30:08 -0500 Received: from [192.168.1.153] ([74.104.183.64]) by mail.gmx.com (mrgmx002 [212.227.17.184]) with ESMTPSA (Nemesis) id 0McmS9-1g527l3nlp-00Hzz7; Tue, 13 Nov 2018 05:33:05 +0100 Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 12.0 \(3445.100.39\)) Subject: Re: ODEBUG: Out of memory. ODEBUG disabled From: Qian Cai In-Reply-To: Date: Mon, 12 Nov 2018 23:33:01 -0500 Cc: Arnd Bergmann , Yang Shi , Zhong Jiang , linux kernel , Thomas Gleixner , "Joel Fernandes (Google)" Content-Transfer-Encoding: 8BIT Message-Id: References: To: Waiman Long X-Mailer: Apple Mail (2.3445.100.39) X-Provags-ID: V03:K1:4t38uTvSR6GfGvn1/YS5lk7H5V9I84X4GWYk95anX1iX9HVpT82 NZ6W8Kff/K+yF1vsLELRvFG0xLiEubX5tPvkrc7uFD1Smuc7Lz3n7T//GEHrki8QtgC07B2 u3toOeIPyMkJiClgU/pOJUXbDGB3TLewF0tQndhSQjWRdxRvQnaZmzeUnYSoo5NYaTk1U9B n9zit7+McHjXMziWv3O1w== X-Spam-Flag: NO X-UI-Out-Filterresults: notjunk:1;V01:K0:KPiVI0gVOBM=:OGkMtYn+/wZzXKNqZ179V+ tyMJPSKdwD/z+k1TTnF4lM1DFlR8yzATIuiHdbJ9aLgjfksJcNE8fcg0AQVobQqzeAzNxefHL k3mjYWwJyJIq/HnKHGpRTqvaMI8sYAqCwAujXOyEXwr5dDb2VlM1mIKaNEC7AG9Q/FjBEMGMm BQlyQ+TSXXyNCQ2W0BeD3oPwt95A+KVWlbRtyOooFnPKnRCQIcrpFrze99akjMLHMQ6SWeX93 b3mPYh0AXH+Es73rfY9P+AT9M4gjOFZBt/gSDfDX4uL52I9Ab8kplIkOaYbdUfLEWCSXQVojn Xk7K6y5BWm50OV+py5Xf4H9QYNHeigTrjAnMJcrL/75k90qtOtqR33/wYdplqq3nhOJ9kY05n SgNjfqjl+sVFEXsxUpEedj7cbxZCrDU+ta9JTGhjvrQM6ljRSXr7ERfYx7xmZcn3YeCbT/dGR thlfciTQmsfQm+BT3z1v6Ker36WUkylXH/vjYh3wa+s6mz9rrJ8fL/Q6G85C9qDRa/Opy9dsR A2rM6ew00K65Q416eB8as2FYfSd2qENJwDCQVdjHVqdJ9bVzuJD346tP3qh96hXIldc0R1fVL jl56swa+Bu58tOaqLkPmQCVncc4EmpzL79JD5G/qb7bRwDEDbT5XFKg2jaB6sZXB1x8OHj1An 5XO9+ZL7nB5UbB/9SyYLL7cvHQqer83WfnQH3TNtBVxDHns9i0wyAXeeE8EMEMpdxYyKU3pBt pR4Xn8X+eTKHjHLdyey8gF5DiPgkFrfHn1n8745h/Jbw8/VyZEiKLd14AEspNjXDoZGEZ8r6R /fVGmSGhH8ra/5R2ywhGyjVIV8yXSyoPVt0vz9oQ3DiGW9cTpaOCW9kYxQQU9EjQQ86CJ/QFR Vu363JJiCho887QkJ1toYj0GSWE5eTEUXjVQtXPRAOjClbw5Sjdf26IBEmMcfF Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > On Nov 10, 2018, at 9:11 AM, Qian Cai wrote: > > On 11/10/18 at 8:59 AM, Waiman Long wrote: > >> On 11/09/2018 08:45 PM, Qian Cai wrote: >>>> Sent: Friday, November 09, 2018 at 5:08 PM >>>> From: "Waiman Long" >>>> To: "Qian Cai" , "Yang Shi" >>>> Cc: "open list" , "Thomas Gleixner" , "Arnd Bergmann" , "Joel Fernandes (Google)" , "Zhong Jiang" >>>> Subject: Re: ODEBUG: Out of memory. ODEBUG disabled >>>> >>>> On 11/09/2018 04:51 PM, Qian Cai wrote: >>>>>> On Nov 9, 2018, at 4:42 PM, Yang Shi wrote: >>>>>> >>>>>> >>>>>> >>>>>> On 11/9/18 1:36 PM, Qian Cai wrote: >>>>>>> It is a bit annoying on this aarch64 server with 64 CPUs that is >>>>>>> booting the latest mainline (3541833fd1f2) causes object debugging >>>>>>> always running out of memory. >>>>>> May you please paste the detail failure log? >>>>> I assume you mean dmesg. >>>>> >>>>> Here is the dmesg for 64 CPUs, >>>>> https://paste.ubuntu.com/p/BnhvXXhn7k/ >>>>>>> I have to boot the kernel with only 16 CPUs instead (nr_cpus=16) >>>>>>> to make it work. Is it expected that object debugging is not going >>>>>>> to work with large machines? >>>>>> I don't think so. I'm supposed it works well with large CPU number on x86. >>>>> Here is the one with nr_cpus workaround, >>>>> https://paste.ubuntu.com/p/qMpd2CCPSV/ >>>> The debugobjects code have a set of 1024 statically allocated debug >>>> objects that can be used in early boot before the slab memory allocator >>>> is initialized. Apparently, the system may have used up all the >>>> statically allocated objects. Try double ODEBUG_POOL_SIZE to see if it >>>> helps. >>> Great, you are right. Doubling the size makes it work. Does it make sense >>> to have a kconfig option instead? >> >> First, I think you need to figure out what your system needed to use up >> so many debug objects in early boot. If there is a legitimate reason for >> this behavior, we can talk about having a kconfig option to increase that. > Anybody else not getting ODEBUG OOM with more than 64-CPU? As > mentioned, restricting to 16-CPU works fine. How can I figure out why the > system uses so much debug objects? On another aarch64 server with 256-CPU, even double the size of ODEBUG_POOL_SIZE, i.e., 2048 will get "ODEBUG: Out of memory. ODEBUG disabled”.