Date: Thu, 5 Oct 2023 11:18:14 +0200
From: Greg Kroah-Hartman
To: Lee Jones
Cc: "Starke, Daniel", "linux-kernel@vger.kernel.org", Fedor Pchelkin,
 Jiri Slaby, "linux-serial@vger.kernel.org",
 "syzbot+5f47a8cea6a12b77a876@syzkaller.appspotmail.com"
Subject: Re: [PATCH 1/1] tty: n_gsm: Avoid sleeping during .write() whilst atomic
Message-ID: <2023100528-directory-arrogant-2ca9@gregkh>
In-Reply-To: <20231005090311.GD83257@google.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, Oct 05, 2023 at 10:03:11AM +0100, Lee Jones wrote:
> On Wed, 04 Oct 2023, Greg Kroah-Hartman wrote:
>
> > On Wed, Oct 04, 2023 at 01:57:04PM +0100, Lee Jones wrote:
> > > On Wed, 04 Oct 2023, Greg Kroah-Hartman wrote:
> > >
> > > > On Wed, Oct 04, 2023 at 09:57:20AM +0100, Lee Jones wrote:
> > > > > On Wed, 04 Oct 2023, Greg Kroah-Hartman wrote:
> > > > >
> > > > > > On Wed, Oct 04, 2023 at 05:59:09AM +0000, Starke, Daniel wrote:
> > > > > > > > Daniel, any thoughts?
> > > > > > > >
> > > > > > > Our application of this protocol is only with specific modems to enable
> > > > > > > circuit switched operation (handling calls, selecting/querying networks,
> > > > > > > etc.) while doing packet switched communication (i.e. IP traffic over PPP).
> > > > > > > The protocol was developed for such use cases.
> > > > > > >
> > > > > > > Regarding the issue itself:
> > > > > > > There was already an attempt to fix all this by switching from spinlocks to
> > > > > > > mutexes resulting in ~20% performance loss. However, the patch was reverted
> > > > > > > as it did not handle the T1 timer leading into sleep during atomic within
> > > > > > > gsm_dlci_t1() on every mutex lock there.
> > > > >
> > > > > That's correct. When I initially saw this report, my initial thought
> > > > > was to replace the spinlocks with mutexes, but having read the previous
> > > > > accepted attempt and its subsequent reversion I started to think of
> > > > > other ways to solve this issue. This solution, unlike the last, does
> > > > > not involve adding sleep inducing locks into atomic contexts, nor
> > > > > should it negatively affect performance.
> > > > >
> > > > > > > There was also a suggestion to fix this in do_con_write() as
> > > > > > > tty_operations::write() appears to be documented as "not allowed to sleep".
> > > > > > > The patch for this was rejected. It did not fix the issue within n_gsm.
> > > > > > >
> > > > > > > Link: https://lore.kernel.org/all/20221203215518.8150-1-pchelkin@ispras.ru/
> > > > > > > Link: https://lore.kernel.org/all/20221212023530.2498025-1-zengheng4@huawei.com/
> > > > > > > Link: https://lore.kernel.org/all/5a994a13-d1f2-87a8-09e4-a877e65ed166@kernel.org/
> > > > > >
> > > > > > Ok, I thought I remembered this, I'll just drop this patch from my
> > > > > > review queue and wait for a better solution if it ever comes up as this
> > > > > > isn't a real issue that people are seeing on actual systems, but just a
> > > > > > syzbot report.
> > > > >
> > > > > What does the "better solution" look like?
> > > >
> > > > One that actually fixes the root problem here (i.e. does not break the
> > > > recursion loop, or cause a performance decrease for normal users, or
> > > > prevent this from being bound to the console).
> > >
> > > Does this solution break the recursion loop or affect performance?
> >
> > This solution broke the recursion by returning an error, right?
>
> This is the part I was least sure about.
>
> If this was considered valid and we were to go forward with a solution
> like this, what would a quality improvement look like? Should we have
> stayed in this function and waited for the previous occupant to leave
> before continuing through ->write()?

This isn't valid, as it obviously never shows up in real use.

The real solution should be to prevent binding a console to this line
discipline as it can not handle the recursion that consoles require for
the write path.

Then, if consoles are really needed, the code can be fixed up to handle
such recursion. That's not a trivial thing to do, as can be seen by the
crazy gyrations that the n_tty line discipline does in its write path...

thanks,

greg k-h
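
For readers following the thread, the idea of "breaking the recursion by
returning an error" discussed above can be sketched in plain userspace C.
This is only an illustration under stated assumptions, not the n_gsm code
and not the patch under review: struct demo_ldisc, demo_write() and
demo_console_log() are hypothetical names, and a C11 atomic_flag stands in
for whatever guard a real driver would use. The log callback models a
console bound to the same line discipline, so any message emitted from the
write path re-enters the write path.

```c
#include <assert.h>
#include <errno.h>
#include <stdatomic.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a line discipline instance (not struct gsm_mux). */
struct demo_ldisc {
	atomic_flag in_write;	/* set while a caller is inside demo_write() */
	int refused;		/* count of re-entrant calls that were turned away */
	int (*log)(struct demo_ldisc *ld, const char *msg); /* models console output */
};

static int demo_write(struct demo_ldisc *ld, const char *buf, size_t len);

/* Models a console bound to this ldisc: logging loops back into write. */
static int demo_console_log(struct demo_ldisc *ld, const char *msg)
{
	return demo_write(ld, msg, strlen(msg));
}

static int demo_write(struct demo_ldisc *ld, const char *buf, size_t len)
{
	assert(ld != NULL);

	/* Never block here: if someone is already inside, refuse and return. */
	if (atomic_flag_test_and_set(&ld->in_write)) {
		ld->refused++;
		return -EAGAIN;
	}

	/* A message emitted on this path recurses when a console is bound. */
	if (ld->log)
		ld->log(ld, "demo: write in progress\n");

	(void)buf;	/* a real implementation would queue the data here */

	atomic_flag_clear(&ld->in_write);
	return (int)len;
}
```

With log set to demo_console_log, one call to demo_write() returns the full
length to the outer caller, while the nested call is refused with -EAGAIN
and counted in refused. Note that the nested message is dropped rather than
delivered, which is the cost of this kind of guard.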