Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp3720239imw; Mon, 18 Jul 2022 13:15:53 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tUkYxVxo2WEIbpNPavyzvqvynCN/XTC7g3XMCzcofSKAPsPCz2ZkasdinYpANAl/QRBjev X-Received: by 2002:aa7:c2d7:0:b0:43a:78af:6e57 with SMTP id m23-20020aa7c2d7000000b0043a78af6e57mr38904641edp.163.1658175352817; Mon, 18 Jul 2022 13:15:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658175352; cv=none; d=google.com; s=arc-20160816; b=EM9bq8JIrdiuOtdtVXWvp99warTj6UkFkLAB2+wCtGDLTsA8nQGtm7GAyOXfQNeEe9 vkd8wc/x0ElffSERUFPTCyrnOEcH3QvCkCXI9a4eLcWDff3FwBw8TzKE2ci7kMH8QWAR r8UZ/BLLuxUL2SYnVXdKUShDPVLDvs4RcSSDWgfSwq8wPOGH4MO2SObjXqP4RZEq3OX1 /FptOtPx1p+O9Ge29431aFZrKd3xOZoaVgmzrQzo3txX3VD8uqZ8hS18ApNCiAhQLGaz OcrclgwTFH2R2AeirXTJCjJNk7yN6wn3qygv2X7K52OgTn+ExT9uJmR6hOU+6qIs4AfO b5fw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent :content-transfer-encoding:references:in-reply-to:date:cc:to:from :subject:message-id:dkim-signature; bh=naC571Utkk4UJUqzz7rtUXJkx0UeiIelJK9AHy0k/Zs=; b=mH/qoh61Xl4ZNay4SpDfuQMYuvtGXZXN6Y81xyarfryVjxqHjlRKgv3u8mZiYXtXBG Y1MyKmbu9vnBs5bmGKs66SBnYH4KMQzO5hKUq5npUWDtvETxv5ELt6ut46APXQdVeCFz 2cReU6+dpYkyt5Cy6eLWaKDZauLTJ1BQF1Dp8W0FjdpeJdDjBdEhEvkXKe3K+r1uUdub SsNAnlIQjuPSPz/emux1WVzvAn9s+zp+d5b5RtlP76MVoXvi9qeR0ex44BsXHfKNiyU4 ojEGie06T4oab+WcTtjemfnZzX+g9uyYicMukOZRxuIIUNcond4NJS3sxf8TqJxYHqOK dg0w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=vHGwF98L; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id j8-20020a170906278800b0072b0deda68fsi12825446ejc.354.2022.07.18.13.15.22; Mon, 18 Jul 2022 13:15:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=vHGwF98L; spf=pass (google.com: domain of linux-nfs-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234432AbiGRUK1 (ORCPT + 99 others); Mon, 18 Jul 2022 16:10:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55124 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234344AbiGRUK0 (ORCPT ); Mon, 18 Jul 2022 16:10:26 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 218B5A462 for ; Mon, 18 Jul 2022 13:10:25 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 92A4060B4A for ; Mon, 18 Jul 2022 20:10:24 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 69009C341C0; Mon, 18 Jul 2022 20:10:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1658175023; bh=MnQOSanrNshgyP4jaYz3PkWSnqdbXHXZQtY8Uov4TWE=; h=Subject:From:To:Cc:Date:In-Reply-To:References:From; b=vHGwF98LwtZdTmpVJuroy9Ym08qqYnbbGb9eLr1AlO4UJL4rtjLhwHcaFFD7ha6z7 PmbrijEKdKuUPzZxEK0nzcEtYrzUMwsH+27j4XEAfv/J0JIFzoIHXIISO2gNB8DFD1 PVeB+ir5Q/+ZtQ1fQpxHJ10FOloxMUTFL2wDhiPYR5LTviYBvCeBX9NHEA/OUnOEma ZBRcOZY4VBVD6cnB0WKJvH5wJBFWyVXun2PWrH9RJkpEMlQSFAP7UH8JaMzKu0fdbi xkOQzGm6yILo49Mm0n51+wTG01ktxeFpP0SaciOT3XsgCrcfDrBm43SxIy66DZucub jNw90r5uDhCVQ== Message-ID: <83cba112994dce59b6d7c7f7cea28e37b3a22d2c.camel@kernel.org> Subject: Re: [PATCH v2 12/15] SUNRPC: Add RPC-with-TLS support to xprtsock.c From: Jeff Layton To: Chuck Lever , linux-nfs@vger.kernel.org Cc: trondmy@hammerspace.com Date: Mon, 18 Jul 2022 16:10:22 -0400 In-Reply-To: <165452710606.1496.14773661487729121787.stgit@oracle-102.nfsv4.dev> References: <165452664596.1496.16204212908726904739.stgit@oracle-102.nfsv4.dev> <165452710606.1496.14773661487729121787.stgit@oracle-102.nfsv4.dev> Content-Type: text/plain; charset="ISO-8859-15" Content-Transfer-Encoding: quoted-printable User-Agent: Evolution 3.44.3 (3.44.3-1.fc36) MIME-Version: 1.0 X-Spam-Status: No, score=-7.8 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Mon, 2022-06-06 at 10:51 -0400, Chuck Lever wrote: > Signed-off-by: Chuck Lever > --- > include/linux/sunrpc/xprtsock.h | 1=20 > net/sunrpc/xprtsock.c | 243 ++++++++++++++++++++++++++++++++-= ------ > 2 files changed, 201 insertions(+), 43 deletions(-) >=20 > diff --git a/include/linux/sunrpc/xprtsock.h b/include/linux/sunrpc/xprts= ock.h > index e0b6009f1f69..eaf3d705f758 100644 > --- a/include/linux/sunrpc/xprtsock.h > +++ b/include/linux/sunrpc/xprtsock.h > @@ -57,6 +57,7 @@ struct sock_xprt { > struct work_struct error_worker; > struct work_struct recv_worker; > struct mutex recv_mutex; > + struct completion handshake_done; > struct sockaddr_storage srcaddr; > unsigned short srcport; > int xprt_err; > diff --git a/net/sunrpc/xprtsock.c b/net/sunrpc/xprtsock.c > index a4fee00412d4..63fe97ede573 100644 > --- a/net/sunrpc/xprtsock.c > +++ b/net/sunrpc/xprtsock.c > @@ -48,6 +48,7 @@ > #include > #include > #include > +#include > =20 > #include > #include > @@ -197,6 +198,11 @@ static struct ctl_table sunrpc_table[] =3D { > */ > #define XS_IDLE_DISC_TO (5U * 60 * HZ) > =20 > +/* > + * TLS handshake timeout. > + */ > +#define XS_TLS_HANDSHAKE_TO (20U * HZ) > + > #if IS_ENABLED(CONFIG_SUNRPC_DEBUG) > # undef RPC_DEBUG_DATA > # define RPCDBG_FACILITY RPCDBG_TRANS > @@ -1254,6 +1260,8 @@ static void xs_reset_transport(struct sock_xprt *tr= ansport) > if (atomic_read(&transport->xprt.swapper)) > sk_clear_memalloc(sk); > =20 > + /* TODO: Maybe send a TLS Closure alert */ > + > kernel_sock_shutdown(sock, SHUT_RDWR); > =20 > mutex_lock(&transport->recv_mutex); > @@ -2424,6 +2432,147 @@ static void xs_tcp_setup_socket(struct work_struc= t *work) > =20 > #if IS_ENABLED(CONFIG_TLS) > =20 > +/** > + * xs_tls_handshake_done - TLS handshake completion handler > + * @data: address of xprt to wake > + * @status: status of handshake > + * > + */ > +static void xs_tls_handshake_done(void *data, int status) > +{ > + struct rpc_xprt *xprt =3D data; > + struct sock_xprt *transport =3D > + container_of(xprt, struct sock_xprt, xprt); > + > + transport->xprt_err =3D status ? -EACCES : 0; > + complete(&transport->handshake_done); > + xprt_put(xprt); > +} > + > +static int xs_tls_handshake_sync(struct rpc_xprt *xprt, unsigned int xpr= tsec) > +{ > + struct sock_xprt *transport =3D > + container_of(xprt, struct sock_xprt, xprt); > + int rc; > + > + init_completion(&transport->handshake_done); > + set_bit(XPRT_SOCK_IGNORE_RECV, &transport->sock_state); > + > + transport->xprt_err =3D -ETIMEDOUT; > + switch (xprtsec) { > + case RPC_XPRTSEC_TLS_X509: > + rc =3D tls_client_hello_x509(transport->sock, > + xs_tls_handshake_done, xprt_get(xprt), > + TLSH_DEFAULT_PRIORITIES, TLSH_NO_PEERID, > + TLSH_NO_CERT); > + break; > + case RPC_XPRTSEC_TLS_PSK: > + rc =3D tls_client_hello_psk(transport->sock, xs_tls_handshake_done, > + xprt_get(xprt), TLSH_DEFAULT_PRIORITIES, > + TLSH_NO_PEERID); > + break; > + default: > + rc =3D -EACCES; > + } > + if (rc) > + goto out; > + > + rc =3D wait_for_completion_interruptible_timeout(&transport->handshake_= done, > + XS_TLS_HANDSHAKE_TO); Should this be interruptible or killable? I'm not sure we want to give up on a non-fatal signal, do we? (e.g. SIGCHLD). Actually...it looks like this function always runs in workqueue context, so this should probably just be wait_for_completion...or better yet, consider doing this asynchronously so we don't block a workqueue thread. > + if (rc < 0) > + goto out; > + > + rc =3D transport->xprt_err; > + > +out: > + xs_stream_reset_connect(transport); > + clear_bit(XPRT_SOCK_IGNORE_RECV, &transport->sock_state); > + return rc; > +} > + > +/* > + * Transfer the connected socket to @dst_transport, then mark that > + * xprt CONNECTED. > + */ > +static int xs_tls_finish_connecting(struct rpc_xprt *src_xprt, > + struct sock_xprt *dst_transport) > +{ > + struct sock_xprt *src_transport =3D > + container_of(src_xprt, struct sock_xprt, xprt); > + struct rpc_xprt *dst_xprt =3D &dst_transport->xprt; > + > + if (!dst_transport->inet) { > + struct socket *sock =3D src_transport->sock; > + struct sock *sk =3D sock->sk; > + > + /* Avoid temporary address, they are bad for long-lived > + * connections such as NFS mounts. > + * RFC4941, section 3.6 suggests that: > + * Individual applications, which have specific > + * knowledge about the normal duration of connections, > + * MAY override this as appropriate. > + */ > + if (xs_addr(dst_xprt)->sa_family =3D=3D PF_INET6) { > + ip6_sock_set_addr_preferences(sk, > + IPV6_PREFER_SRC_PUBLIC); > + } > + > + xs_tcp_set_socket_timeouts(dst_xprt, sock); > + tcp_sock_set_nodelay(sk); > + > + lock_sock(sk); > + > + /* > + * @sk is already connected, so it now has the RPC callbacks. > + * Reach into @src_transport to save the original ones. > + */ > + dst_transport->old_data_ready =3D src_transport->old_data_ready; > + dst_transport->old_state_change =3D src_transport->old_state_change; > + dst_transport->old_write_space =3D src_transport->old_write_space; > + dst_transport->old_error_report =3D src_transport->old_error_report; > + sk->sk_user_data =3D dst_xprt; > + > + /* socket options */ > + sock_reset_flag(sk, SOCK_LINGER); > + > + xprt_clear_connected(dst_xprt); > + > + dst_transport->sock =3D sock; > + dst_transport->inet =3D sk; > + dst_transport->file =3D src_transport->file; > + > + release_sock(sk); > + > + /* Reset src_transport before shutting down its clnt */ > + mutex_lock(&src_transport->recv_mutex); > + src_transport->inet =3D NULL; > + src_transport->sock =3D NULL; > + src_transport->file =3D NULL; > + > + xprt_clear_connected(src_xprt); > + xs_sock_reset_connection_flags(src_xprt); > + xs_stream_reset_connect(src_transport); > + mutex_unlock(&src_transport->recv_mutex); > + } > + > + if (!xprt_bound(dst_xprt)) > + return -ENOTCONN; > + > + xs_set_memalloc(dst_xprt); > + > + if (!xprt_test_and_set_connected(dst_xprt)) { > + dst_xprt->connect_cookie++; > + clear_bit(XPRT_SOCK_CONNECTING, &dst_transport->sock_state); > + xprt_clear_connecting(dst_xprt); > + > + dst_xprt->stat.connect_count++; > + dst_xprt->stat.connect_time +=3D (long)jiffies - > + dst_xprt->stat.connect_start; > + xs_run_error_worker(dst_transport, XPRT_SOCK_WAKE_PENDING); > + } > + return 0; > +} > + > /** > * xs_tls_connect - establish a TLS session on a socket > * @work: queued work item > @@ -2433,61 +2582,70 @@ static void xs_tls_connect(struct work_struct *wo= rk) > { > struct sock_xprt *transport =3D > container_of(work, struct sock_xprt, connect_worker.work); > + struct rpc_create_args args =3D { > + .net =3D transport->xprt.xprt_net, > + .protocol =3D transport->xprt.prot, > + .address =3D (struct sockaddr *)&transport->xprt.addr, > + .addrsize =3D transport->xprt.addrlen, > + .timeout =3D transport->xprtsec_clnt->cl_timeout, > + .servername =3D transport->xprt.servername, > + .nodename =3D transport->xprtsec_clnt->cl_nodename, > + .program =3D transport->xprtsec_clnt->cl_program, > + .prognumber =3D transport->xprtsec_clnt->cl_prog, > + .version =3D transport->xprtsec_clnt->cl_vers, > + .authflavor =3D RPC_AUTH_TLS, > + .cred =3D transport->xprtsec_clnt->cl_cred, > + .xprtsec =3D RPC_XPRTSEC_NONE, > + }; > + unsigned int pflags =3D current->flags; > struct rpc_clnt *clnt; > + struct rpc_xprt *xprt; > + int status; > =20 > - clnt =3D transport->xprtsec_clnt; > - transport->xprtsec_clnt =3D NULL; > + if (atomic_read(&transport->xprt.swapper)) > + current->flags |=3D PF_MEMALLOC; > + > + xs_stream_start_connect(transport); > + > + clnt =3D rpc_create(&args); > if (IS_ERR(clnt)) > goto out_unlock; > + rcu_read_lock(); > + xprt =3D xprt_get(rcu_dereference(clnt->cl_xprt)); > + rcu_read_unlock(); > =20 > - xs_tcp_setup_socket(work); > + status =3D xs_tls_handshake_sync(xprt, transport->xprt.xprtsec); > + if (status) > + goto out_close; > =20 > + status =3D xs_tls_finish_connecting(xprt, transport); > + if (status) > + goto out_close; > + trace_rpc_socket_connect(&transport->xprt, transport->sock, 0); > + > + xprt_put(xprt); > rpc_shutdown_client(clnt); > =20 > out_unlock: > + xprt_unlock_connect(&transport->xprt, transport); > + current_restore_flags(pflags, PF_MEMALLOC); > + transport->xprtsec_clnt =3D NULL; > return; > -} > =20 > -static void xs_set_xprtsec_clnt(struct rpc_clnt *clnt, struct rpc_xprt *= xprt) > -{ > - struct sock_xprt *transport =3D container_of(xprt, struct sock_xprt, xp= rt); > - struct rpc_create_args args =3D { > - .net =3D xprt->xprt_net, > - .protocol =3D xprt->prot, > - .address =3D (struct sockaddr *)&xprt->addr, > - .addrsize =3D xprt->addrlen, > - .timeout =3D clnt->cl_timeout, > - .servername =3D xprt->servername, > - .nodename =3D clnt->cl_nodename, > - .program =3D clnt->cl_program, > - .prognumber =3D clnt->cl_prog, > - .version =3D clnt->cl_vers, > - .authflavor =3D RPC_AUTH_TLS, > - .cred =3D clnt->cl_cred, > - .xprtsec =3D RPC_XPRTSEC_NONE, > - .flags =3D RPC_CLNT_CREATE_NOPING, > - }; > - > - switch (xprt->xprtsec) { > - case RPC_XPRTSEC_TLS_X509: > - case RPC_XPRTSEC_TLS_PSK: > - transport->xprtsec_clnt =3D rpc_create(&args); > - break; > - default: > - transport->xprtsec_clnt =3D ERR_PTR(-ENOTCONN); > - } > -} > - > -#else > - > -static void xs_set_xprtsec_clnt(struct rpc_clnt *clnt, struct rpc_xprt *= xprt) > -{ > - struct sock_xprt *transport =3D container_of(xprt, struct sock_xprt, xp= rt); > +out_close: > + xprt_put(xprt); > + rpc_shutdown_client(clnt); > =20 > - transport->xprtsec_clnt =3D ERR_PTR(-ENOTCONN); > + /* xprt_force_disconnect() wakes tasks with a fixed tk_status code. > + * Wake them first here to ensure they get our tk_status code. > + */ > + xprt_wake_pending_tasks(&transport->xprt, status); > + xs_tcp_force_close(&transport->xprt); > + xprt_clear_connecting(&transport->xprt); > + goto out_unlock; > } > =20 > -#endif /*CONFIG_TLS */ > +#endif /* CONFIG_TLS */ > =20 > /** > * xs_connect - connect a socket to a remote endpoint > @@ -2520,8 +2678,7 @@ static void xs_connect(struct rpc_xprt *xprt, struc= t rpc_task *task) > } else > dprintk("RPC: xs_connect scheduled xprt %p\n", xprt); > =20 > - xs_set_xprtsec_clnt(task->tk_client, xprt); > - > + transport->xprtsec_clnt =3D task->tk_client; > queue_delayed_work(xprtiod_workqueue, > &transport->connect_worker, > delay); >=20 >=20 --=20 Jeff Layton