Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp129199imm; Tue, 3 Jul 2018 15:24:32 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKTUL+8xVOM2tHsB++dxo3VNKW8DnDOdmUq0Kcs/0lfwkIYOTOVKuMUdpeXVVcsba/QilBE X-Received: by 2002:a65:6292:: with SMTP id f18-v6mr27083286pgv.85.1530656672649; Tue, 03 Jul 2018 15:24:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1530656672; cv=none; d=google.com; s=arc-20160816; b=XhZos1ha04MdsLZWyCnovYPEggrNYqNj53ibKudHz/1d/L/TDVL2x5ZIkArk2a8haZ LbqblcqymdaUOJn+L36mf63/YRZ4j1x6chUTzjtCbnGdc1UZL+QifYAHapae6jk79LSU nLGqopx3WE8m59tYi++9a33JvY3mXllHa7aJCefN8Owe2/rlrYhyt4MYI+EXcWeR/tq7 0dXiDQnTdmdc079EZ3hURaNss6IzEiH/joCuZUPV6gr4ikzTMj8o52iwVkvhuRMDBg6n 66n4IObNWojI1wJUVQrk6a17WvyFDLtt/3CsIKFiDyS5n/VjQV2I7Lo3mOqyi1m4S4U6 zAMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:dkim-signature:arc-authentication-results; bh=rtGgww4yfkrfzuYv0MZhW8sh+wc0pqLVAJGeggss1vg=; b=dYy/zV/YFitVW0uTqrehY8i8d1Szq+vPDomV4OfYgi9fELBzCLrfuYH1uLFPN2SsBr b+XheZWmG4p9Df7xxW5WVIHyjbUNE0qehPCYPS+GNleHKVdx0VOIUwtKF3vVXVSmpiet GqHjHpc9fTlzPlgyk4nNxONXxr53d86n+VkmzOQWO7zI+06QF1/PaZnI69b87j5XQzrz qHuMbz4u2Y05Rs1euarL6uwVLl73fk0PIVSBX36hAUxfwwJ9HDDGPLZVA6IVvNACYcqc O2F7rTi4FIs4I50ldIhti0Tvk3/uTydyMvodiFJKz+r+64LFGxTx8o28MyI9Xc2/jdJO B6FQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@netronome-com.20150623.gappssmtp.com header.s=20150623 header.b=dyGZlD33; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e7-v6si1904382pgf.317.2018.07.03.15.24.17; Tue, 03 Jul 2018 15:24:32 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@netronome-com.20150623.gappssmtp.com header.s=20150623 header.b=dyGZlD33; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753381AbeGCWXj (ORCPT + 99 others); Tue, 3 Jul 2018 18:23:39 -0400 Received: from mail-qt0-f195.google.com ([209.85.216.195]:46391 "EHLO mail-qt0-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753111AbeGCWXh (ORCPT ); Tue, 3 Jul 2018 18:23:37 -0400 Received: by mail-qt0-f195.google.com with SMTP id h5-v6so3017499qtm.13 for ; Tue, 03 Jul 2018 15:23:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=netronome-com.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:in-reply-to:references :organization:mime-version:content-transfer-encoding; bh=rtGgww4yfkrfzuYv0MZhW8sh+wc0pqLVAJGeggss1vg=; b=dyGZlD33eV/F/u94rmlp/u9UgsTXoe7s/g8rkxEGt7BQH61FO5ISxwpmTVtZ1MHGlf DnKPCD5vpitLHKXQcoZlUiUYvfXUd+fGMwKC7lqUT5K/RliL7y6KFHIqdgm1naQMzkCJ rc3tsbfjv0+Oj7jPE1hEdt+KCiJTbSOVqQbs2aLptDjDwy4V8GafGDz3nbhGDaH0Rn3n HL9++H/oYZKx2KtM7WbbCqsVM6rJQKLGUWrKiWCIURPesjfGjnYRhipYaalcQ2IW6yCZ abPPuch55EebEJznZflrdEbRc7RK9OKbo+/B06drAPYm3cBThBWHC044EIKxVre89k9h uHOw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:in-reply-to :references:organization:mime-version:content-transfer-encoding; bh=rtGgww4yfkrfzuYv0MZhW8sh+wc0pqLVAJGeggss1vg=; b=PyHxZKqMQWRnJB/sUYaeO8XKP9uH51BkP8UwcCyeUbYfArRUBrj3un+XQpodxhl62j 9sMrEQZVIczlolArj9/mNw10WTpDIkO89AhdrtzjEvNwutL1v+GZI61IGXeVZWgbTm+F 0WpDS+2bRuItRzIR+c0W+L3OlM75T49zqcabk6aq7TZ5uRPXPF6FjgP7XGLykNCJ5lgv iWD4KI/vYfZIh2iSqtNsAn+4HubX9DBEVa6uLGVtd6VaLGEQypz6PPq3fmSTzrJJeKPa Y4YUM9tLFvgv+Gg/N3w9jLAIUPXZrR3CFLP/pSQfBk4lppMoHWLE478VaxW64DgZHPPO hH2w== X-Gm-Message-State: APt69E1wJlc5FO0KLdP0ZKfa1Q8qkdeiGE6aYxATk0xRvQoHwcnLJzWG LNQYlR4my/MCBaug7SyahDKrNw== X-Received: by 2002:aed:2798:: with SMTP id a24-v6mr29937082qtd.40.1530656616258; Tue, 03 Jul 2018 15:23:36 -0700 (PDT) Received: from cakuba.netronome.com ([75.53.12.129]) by smtp.gmail.com with ESMTPSA id u10-v6sm1815271qki.6.2018.07.03.15.23.34 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 03 Jul 2018 15:23:36 -0700 (PDT) Date: Tue, 3 Jul 2018 15:23:31 -0700 From: Jakub Kicinski To: Okash Khawaja Cc: Daniel Borkmann , Martin KaFai Lau , Alexei Starovoitov , Yonghong Song , "Quentin Monnet" , "David S. Miller" , , , Subject: Re: [PATCH bpf-next v2 2/3] bpf: btf: add btf print functionality Message-ID: <20180703152331.151d1c4b@cakuba.netronome.com> In-Reply-To: <20180703214559.GA4448@w1t1fb> References: <20180702183913.669030439@fb.com> <20180702191324.570616684@fb.com> <20180702220659.6baa77ba@cakuba.netronome.com> <20180703214559.GA4448@w1t1fb> Organization: Netronome Systems, Ltd. MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 3 Jul 2018 22:46:00 +0100, Okash Khawaja wrote: > On Mon, Jul 02, 2018 at 10:06:59PM -0700, Jakub Kicinski wrote: > > On Mon, 2 Jul 2018 11:39:15 -0700, Okash Khawaja wrote: > > > +#define BITS_PER_BYTE_MASK (BITS_PER_BYTE - 1) > > > +#define BITS_PER_BYTE_MASKED(bits) ((bits) & BITS_PER_BYTE_MASK) > > > > Perhaps it's just me but BIT_OFFSET or BIT_COUNT as a name of this macro > > would make it more obvious to parse in the code below. > I don't mind either. However these macro names are also used inside > kernel for same purpose. For sake of consistency, I'd recommend we keep > them :) Ugh, okay :) > > > + } print_num; > > > + > > > + total_bits_offset = bit_offset + BTF_INT_OFFSET(int_type); > > > + data += BITS_ROUNDDOWN_BYTES(total_bits_offset); > > > + bit_offset = BITS_PER_BYTE_MASKED(total_bits_offset); > > > + bits_to_copy = bits + bit_offset; > > > + bytes_to_copy = BITS_ROUNDUP_BYTES(bits_to_copy); > > > + > > > + print_num.u64_num = 0; > > > + memcpy(&print_num.u64_num, data, bytes_to_copy); > > > > This scheme is unlikely to work on big endian machines... > Can you give an example how? On BE: Input: [0x01, 0x82] Bit length: 15 Bytes to copy: 2 bit_offset: 0 upper_bits: 7 print_num.u64_num = 0; # [0, 0, 0, 0, 0, 0, 0, 0] memcpy(&print_num.u64_num, data, bytes_to_copy); # [0x01, 0x82, 0, 0, 0, 0, 0, 0] mask = (1 << upper_bits) - 1; # mask = 0x7f print_num.u8_nums[bytes_to_copy - 1] &= mask; # [0x01, 0x02, 0, 0, 0, 0, 0, 0] printf("0x%llx", print_num.u64_num); # 0x0102000000000000 AKA 72620543991349248 # expected: # 0x0102 AKA 258 Am I missing something? > > > + upper_bits = BITS_PER_BYTE_MASKED(bits_to_copy); > > > + if (upper_bits) { > > > + uint8_t mask = (1 << upper_bits) - 1; > > > + > > > + print_num.u8_nums[bytes_to_copy - 1] &= mask; > > > + } > > > + > > > + print_num.u64_num >>= bit_offset; > > > + > > > + if (is_plain_text) > > > + jsonw_printf(jw, "0x%llx", print_num.u64_num); > > > + else > > > + jsonw_printf(jw, "%llu", print_num.u64_num); > > > +} > > > + > > > +static int btf_dumper_int(const struct btf_type *t, uint8_t bit_offset, > > > + const void *data, json_writer_t *jw, > > > + bool is_plain_text) > > > +{ > > > + uint32_t *int_type = (uint32_t *)(t + 1); > > > + uint32_t bits = BTF_INT_BITS(*int_type); > > > + int ret = 0; > > > + > > > + /* if this is bit field */ > > > + if (bit_offset || BTF_INT_OFFSET(*int_type) || > > > + BITS_PER_BYTE_MASKED(bits)) { > > > + btf_dumper_int_bits(*int_type, bit_offset, data, jw, > > > + is_plain_text); > > > + return ret; > > > + } > > > + > > > + switch (BTF_INT_ENCODING(*int_type)) { > > > + case 0: > > > + if (BTF_INT_BITS(*int_type) == 64) > > > + jsonw_printf(jw, "%lu", *((uint64_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 32) > > > + jsonw_printf(jw, "%u", *((uint32_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 16) > > > + jsonw_printf(jw, "%hu", *((uint16_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 8) > > > + jsonw_printf(jw, "%hhu", *((uint8_t *)data)); > > > + else > > > + btf_dumper_int_bits(*int_type, bit_offset, data, jw, > > > + is_plain_text); > > > + break; > > > + case BTF_INT_SIGNED: > > > + if (BTF_INT_BITS(*int_type) == 64) > > > + jsonw_printf(jw, "%ld", *((int64_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 32) > > > + jsonw_printf(jw, "%d", *((int32_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 16) > > > > Please drop the double space. Both for 16 where it makes no sense and > > for 8 where it's marginally useful but not really. > > > > > + jsonw_printf(jw, "%hd", *((int16_t *)data)); > > > + else if (BTF_INT_BITS(*int_type) == 8) > > > + jsonw_printf(jw, "%hhd", *((int8_t *)data)); > > > + else > > > + btf_dumper_int_bits(*int_type, bit_offset, data, jw, > > > + is_plain_text); > > > + break; > > > + case BTF_INT_CHAR: > > > + if (*((char *)data) == '\0') > > > + jsonw_null(jw); > > > > Mm.. I don't think 0 char is equivalent to null. > Yes, thanks. Will fix. > > > > > > + else if (isprint(*((char *)data))) > > > + jsonw_printf(jw, "\"%c\"", *((char *)data)); > > > > This looks very suspicious. So if I see a "6" for a char field it's > > either a 6 ('\u0006') or a 54 ('6')... > It will always be 54. May be I missed your point. Could you explain why > it would be other than 54? Ah, I think I missed that %c is in quotes... > > > + else > > > + if (is_plain_text) > > > + jsonw_printf(jw, "%hhx", *((char *)data)); This seems to be missing a "0x" prefix? > > > + else > > > + jsonw_printf(jw, "%hhd", *((char *)data)); > > > > ... I think you need to always print a string, and express it as > > \u00%02hhx for non-printable. > Okay that makes sense Yeah, IDK, char can be used as a byte as well as a string. In eBPF it may actually be more likely to just be used as a raw byte buffer... Either way I think it may be nice to keep it consistent, at least for the JSON output could we do either always ints or always characters?