Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp2477235pxb; Fri, 29 Oct 2021 02:07:54 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx7VZixHgPA6I5FlXIN0qZaqkXKbtLMy4nLPpqXN1u7e8/faZa2Tu0x2IHrdC5s1547Bgsn X-Received: by 2002:aa7:8b56:0:b0:44c:10a:4ee9 with SMTP id i22-20020aa78b56000000b0044c010a4ee9mr9800900pfd.46.1635498474609; Fri, 29 Oct 2021 02:07:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1635498474; cv=none; d=google.com; s=arc-20160816; b=AqOpWSWwmwO3r4msAH318NIiJjrSsDYRCJD/ZYFOBeBVVhMhctVNr84Cbn6FZUmiBn 2YeluDDBJMl+E3OCivjAn3dHoqwaRTtDs2VSX997eVMP3cBoBcXtWFioThFMAY9iSXWN NqSZiNQt9OPOthSQw83CzBJ6vFy/Tt+AiGz2dPAcsfG5JHm6vqrV/jd15YU0gSBfLISX Tzowzn9GPsRjKNhIW6IHNl3DDtgC04cmWEybbKKOrCbSdJAgdye38Iydbf+150uRJdt8 VeoUrERxa5qSFUXaN7Bj8XysRLqBc/qW3MNr52jcriGFtCWonDLdZYLxkelsKGNwhMQc rYKg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:subject :from:references:cc:to:content-language:user-agent:mime-version:date :message-id; bh=KX6wb56H7oFx7z8pZuCQGxSazLVm2MZ27DMCPTx906g=; b=CFKYx7dDdynn8Vezv6iVp7xfr2irlsXRbJAtHlB87BJSiaxoiHdKMQE2Htf0NtgS7a cpt/LfJcIwN2K9R/iwnSozi82U3NQDcPh41taO6iMrJd2uhrD9Zc2K51MVHn8Nn+qJop auc3gfN+lfYfebHa5kWNf2iA1AriLnCdT9Za+o7jn0hLKpfg+y1Fb7Hv3hZNU+Mw4tv9 NdiOjh6CmjItsOCgsIcA/xzECx8+J2+lzTe7rgfMyXm1jd4M3u2lmZD9ntNXDsQkeUpG M/vigOIhl8ZWDEyH0oyk648ScKLWqMhmX3Iu+0fkyDzbBlQB9lfZ72L2WN+zyMBW/yGs Wh0A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id hk6si16004196pjb.0.2021.10.29.02.07.41; Fri, 29 Oct 2021 02:07:54 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231303AbhJ2JJ7 (ORCPT + 67 others); Fri, 29 Oct 2021 05:09:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56584 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229844AbhJ2JJz (ORCPT ); Fri, 29 Oct 2021 05:09:55 -0400 Received: from wp530.webpack.hosteurope.de (wp530.webpack.hosteurope.de [IPv6:2a01:488:42:1000:50ed:8234::]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0CDD5C061570 for ; Fri, 29 Oct 2021 02:07:25 -0700 (PDT) Received: from ip4d173d4a.dynamic.kabel-deutschland.de ([77.23.61.74] helo=[192.168.66.200]); authenticated by wp530.webpack.hosteurope.de running ExIM with esmtpsa (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) id 1mgNr9-00029N-Ei; Fri, 29 Oct 2021 11:07:23 +0200 Message-ID: Date: Fri, 29 Oct 2021 11:07:22 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.2.0 Content-Language: en-BS To: Kalle Valo , Francesco Magliocca Cc: ath10k@lists.infradead.org, "linux-wireless@vger.kernel.org" , "regressions@lists.linux.dev" References: <87o88muvvz.fsf@codeaurora.org> From: Thorsten Leemhuis Subject: Re: Bug in Memory Layout of rx_desc for QCA6174 In-Reply-To: <87o88muvvz.fsf@codeaurora.org> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-bounce-key: webpack.hosteurope.de;regressions@leemhuis.info;1635498446;e9740259; X-HE-SMSGID: 1mgNr9-00029N-Ei Precedence: bulk List-ID: X-Mailing-List: linux-wireless@vger.kernel.org On 21.09.21 11:21, Kalle Valo wrote: > (adding linux-wireless and regression lists) thx for that, this made be add the regression to regzbot (https://linux-regtracking.leemhuis.info/regzbot/mainline/ ). That's why I noticed there hasn't been any recent activity wrt to this regression -- at least in this thread. Was progress made somewhere else? If not: what can be done to get things moving again? Sure, it's an old regression, but nevertheless it would be nice to get it fixed. Ciao, Thorsten #regzbot ignore-activity > Francesco Magliocca writes: > >> Hello everyone, >> I have a QCA6174 PCIe board, I am using linux kernel 5.12.10. >> The firmware loaded is: >>> [ 4.483131] ath10k_pci 0000:02:00.0: qca6174 hw3.2 target 0x05030000 >>> chip_id 0x00340aff sub 1a56:143a >>> [ 4.483136] ath10k_pci 0000:02:00.0: kconfig debug 0 debugfs 1 >>> tracing 0 dfs 0 testmode 0 >>> [ 4.483567] ath10k_pci 0000:02:00.0: firmware ver >>> WLAN.RM.4.4.1-00157-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp >>> crc32 90eebefb >>> [ 4.572730] ath10k_pci 0000:02:00.0: board_file api 2 bmi_id N/A crc32 318825bf >>> [ 4.665592] ath10k_pci 0000:02:00.0: htt-ver 3.60 wmi-op 4 htt-op 3 >>> cal otp max-sta 32 raw 0 hwcrypto 1 >> >> around six months ago I reported a bug which is still haunting me: >> When I am connected to my home's Wi-Fi network and my father's Huawei >> smartphone is connected too >> my Wi-Fi card hangs and gets stuck, I have to force restart of the device. >> >> Note that this problem does not happen if my pc and the smartphone are >> connected to different networks (for example >> I tried connecting my pc to the 2.4GHz network and the smartphone to >> the 5GHz network, and the bug does not appear). >> >> Now, I tried bisecting driver changes, and I found the faulty one, >> it is the commit: e3def6f7ddf88636febb12e1e3e86387a4ce5452 > > Ok, so this is the commit: > > commit e3def6f7ddf88636febb12e1e3e86387a4ce5452 > Author: Govind Singh > AuthorDate: Thu Dec 21 14:30:51 2017 +0530 > Commit: Kalle Valo > CommitDate: Wed Dec 27 12:05:35 2017 +0200 > > ath10k: Update rx descriptor for WCN3990 target > > WCN3990 rx descriptor uses different offset of msdu start, msdu end, > ppdu end, rx pkt end and rx frag info. > To accommodate different offsets, define respective fields in > rx descriptor of WCN3990 target. > > Signed-off-by: Govind Singh > Signed-off-by: Kalle Valo > >> It adds some fields to structures like rx_msdu_start, rx_frag_info, etc.. >> The changes modify the size of these structures! >> >> If I revert this commit changes, the bug does not happen >> (I tested it for two weeks, while the bug happens at least once in 2-3 hours >> from when the smartphone is connected to the wifi network). > > Good, I was just about to ask about that. > >> Also, if I selectively remove some of the changes introduced by the >> faulty commit, the bug does not go away, so it looks like the problem >> is in the change of size of the data structures. > > Heh, I was also about to ask about that as well :) The firmware is > supposed to handle length differences but clearly it's not. > >> Now, I'd like to ask you what we can do to fix this problem... Is >> there something I am doing wrong? Or is there a bug in the firmware? >> >> If the firmware can't be easily fixed, I was thinking that we can >> abstract the htt_rx_desc (in the same way we do with ops in other >> parts of the driver) to have two versions: one for 32-bit descriptors >> (like my QCA6174) and one for 64-bit descriptors (i.e. WCN3990, which >> was the cause of this change). >> >> I'd be really happy to help, but I am not sure I fully understand what >> is going on, so what do you think is happening and what should we do? > > Getting the firmware fixed is difficult. I would first try abstracting > the htt_rx_desc, can you send a patch? >