Date: Mon, 3 Jun 2019 17:07:28 +0200 (CEST)
From: "Mkrtchyan, Tigran"
To: linux-nfs
Message-ID: <1811809323.9701664.1559574448351.JavaMail.zimbra@desy.de>
Subject: NFS (pNFS) and VM dirty bytes
X-Mailing-List: linux-nfs@vger.kernel.org

Dear NFS fellows,

although this is not directly an NFS issue, I am posting this question here because we are mostly affected through NFS clients (and you have enough kernel connections to route it to the right people).

We have 25 new data processing nodes, each with 32 cores, 256 GB RAM and a 25 Gb/s NIC. They run CentOS 7 (but this is irrelevant, I think). When each node runs 24 parallel write-intensive (75% write, 25% read) workloads, we see a spike of IO errors on close. The client runs into a timeout due to a slow network or IO starvation on the NFS servers. It stumbles, disconnects, establishes a new connection and stumbles again...

As the default values for dirty pages are

  vm.dirty_background_bytes = 0
  vm.dirty_background_ratio = 10
  vm.dirty_bytes = 0
  vm.dirty_ratio = 30

the first data gets sent only when at least 25 GB of data has accumulated. To make the whole deployment more responsive, we have reduced the defaults to something more reasonable:

  vm.dirty_background_ratio = 0
  vm.dirty_ratio = 0
  vm.dirty_background_bytes = 67108864
  vm.dirty_bytes = 536870912

IOW, we force the client to start sending data as soon as 64 MB has been written.

The question is how to choose optimal values and how to make them file system/mount point specific.

Thanks in advance,
   Tigran.
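
For readers who want to check the arithmetic behind the numbers above, here is a minimal sketch in Python. It assumes the ratio settings are applied to the full 256 GB of RAM; the kernel actually applies them to the currently dirtyable (free plus reclaimable) memory, so the real thresholds are somewhat lower. The function name dirty_thresholds and its parameters are made up for this illustration and are not part of any kernel or library interface.

```python
GIB = 1024 ** 3
MIB = 1024 ** 2


def dirty_thresholds(total_ram_bytes, background_ratio, dirty_ratio,
                     background_bytes=0, dirty_bytes=0):
    """Approximate the (background, hard) dirty thresholds in bytes.

    A non-zero *_bytes setting overrides the matching *_ratio setting.
    Simplification (assumption): ratios are taken against total RAM
    instead of the kernel's notion of dirtyable memory.
    """
    if background_bytes:
        background = background_bytes
    else:
        background = total_ram_bytes * background_ratio // 100
    if dirty_bytes:
        hard = dirty_bytes
    else:
        hard = total_ram_bytes * dirty_ratio // 100
    return background, hard


ram = 256 * GIB

# Defaults quoted in the mail: ratios 10 and 30, byte limits unset.
default_bg, default_hard = dirty_thresholds(ram, 10, 30)

# Tuned values quoted in the mail: ratios zeroed, byte limits set.
tuned_bg, tuned_hard = dirty_thresholds(
    ram, 0, 0, background_bytes=67108864, dirty_bytes=536870912)

print(f"default: background {default_bg / GIB:.1f} GiB, hard {default_hard / GIB:.1f} GiB")
print(f"tuned:   background {tuned_bg / MIB:.0f} MiB, hard {tuned_hard / MIB:.0f} MiB")
```

Under that assumption, background writeback with the defaults starts at roughly a tenth of RAM, which is where the "at least 25 GB" figure comes from, while the tuned byte limits correspond to 64 MiB and 512 MiB.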