Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932920AbcCKRU2 (ORCPT ); Fri, 11 Mar 2016 12:20:28 -0500 Received: from mx0a-00082601.pphosted.com ([67.231.145.42]:43040 "EHLO mx0a-00082601.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932643AbcCKRU1 (ORCPT ); Fri, 11 Mar 2016 12:20:27 -0500 Subject: Re: [PATCH net-next] bpf: avoid copying junk bytes in bpf_get_current_comm() To: Daniel Borkmann , "David S . Miller" References: <1457582553-395600-1-git-send-email-ast@fb.com> <56E29CDA.5010004@iogearbox.net> CC: Tobias Waldekranz , Brendan Gregg , , , From: Alexei Starovoitov Message-ID: <56E2FE44.7040904@fb.com> Date: Fri, 11 Mar 2016 09:20:04 -0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 In-Reply-To: <56E29CDA.5010004@iogearbox.net> Content-Type: text/plain; charset="windows-1252"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [192.168.52.123] X-Proofpoint-Spam-Reason: safe X-FB-Internal: Safe X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2016-03-11_09:,, signatures=0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2482 Lines: 56 On 3/11/16 2:24 AM, Daniel Borkmann wrote: > On 03/10/2016 05:02 AM, Alexei Starovoitov wrote: >> Lots of places in the kernel use memcpy(buf, comm, TASK_COMM_LEN); but >> the result is typically passed to print("%s", buf) and extra bytes >> after zero don't cause any harm. >> In bpf the result of bpf_get_current_comm() is used as the part of >> map key and was causing spurious hash map mismatches. >> Use strlcpy() to guarantee zero-terminated string. >> bpf verifier checks that output buffer is zero-initialized, > > Sorry for late reply, more below: > >> so even for short task names the output buffer don't have junk bytes. >> Note it's not a security concern, since kprobe+bpf is root only. >> >> Fixes: ffeedafbf023 ("bpf: introduce current->pid, tgid, uid, gid, >> comm accessors") >> Reported-by: Tobias Waldekranz >> Signed-off-by: Alexei Starovoitov > [...] >> diff --git a/kernel/bpf/helpers.c b/kernel/bpf/helpers.c >> index 4504ca66118d..50da680c479f 100644 >> --- a/kernel/bpf/helpers.c >> +++ b/kernel/bpf/helpers.c >> @@ -166,7 +166,7 @@ static u64 bpf_get_current_comm(u64 r1, u64 size, >> u64 r3, u64 r4, u64 r5) >> if (!task) >> return -EINVAL; >> >> - memcpy(buf, task->comm, min_t(size_t, size, sizeof(task->comm))); >> + strlcpy(buf, task->comm, min_t(size_t, size, sizeof(task->comm))); > > If I see this correctly, __set_task_comm() makes sure comm is always zero > terminated, so that seems good, but isn't it already sufficient when > switching > to strlcpy() to simply use: > > strlcpy(buf, task->comm, size); > > The min_t() seems unnecessary work to me, why do we still need it? size > is guaranteed to be > 0 through the eBPF verifier, so strlcpy() should take > care of the rest. that's one clever optimization. yep. we can drop min_t. btw I wanted to add memset to __set_task_comm, keep memcpy in bpf_get_current_comm and optimize perf_event_comm_event (which doing: memset+strlcpy and can be replaced with memcpy), but figured that such 'fix' is not suitable for stable. I guess we can do in the next cycle? strlen is not cheap. Especially since it turned out that bpf_get_current_comm() is used very often in the hot path in bcc/tools. Also for the next cycle I'm planning to extend verifier to allow uninitialized stack to be passed to functions like bpf_get_current_comm() and they would have to zero it in error cases. Then we can save few more cycles from the programs.