LinuxLists.cc - [RFC] Per file OOM badness

2018-01-18 16:49:12

Subject: [RFC] Per file OOM badness

Hi, this series is a revised version of an RFC sent by Christian König
a few years ago. The original RFC can be found at
https://lists.freedesktop.org/archives/dri-devel/2015-September/089778.html

This is the same idea and I've just adressed his concern from the original RFC
and switched to a callback into file_ops instead of a new member in struct file.

Thanks,
Andrey

2018-01-18 16:50:34

by Andrey Grodzovsky

[permalink] [raw]

Subject: [PATCH 3/4] drm/gem: adjust per file OOM badness on handling buffers

Large amounts of VRAM are usually not CPU accessible, so they are not mapped
into the processes address space. But since the device drivers usually support
swapping buffers from VRAM to system memory we can still run into an out of
memory situation when userspace starts to allocate to much.

This patch gives the OOM another hint which process is
holding how many resources.

Signed-off-by: Andrey Grodzovsky <[email protected]>
---
drivers/gpu/drm/drm_file.c | 12 ++++++++++++
drivers/gpu/drm/drm_gem.c | 8 ++++++++
include/drm/drm_file.h | 4 ++++
3 files changed, 24 insertions(+)

diff --git a/drivers/gpu/drm/drm_file.c b/drivers/gpu/drm/drm_file.c
index b3c6e99..626cc76 100644
--- a/drivers/gpu/drm/drm_file.c
+++ b/drivers/gpu/drm/drm_file.c
@@ -747,3 +747,15 @@ void drm_send_event(struct drm_device *dev, struct drm_pending_event *e)
spin_unlock_irqrestore(&dev->event_lock, irqflags);
}
EXPORT_SYMBOL(drm_send_event);
+
+long drm_oom_badness(struct file *f)
+{
+
+ struct drm_file *file_priv = f->private_data;
+
+ if (file_priv)
+ return atomic_long_read(&file_priv->f_oom_badness);
+
+ return 0;
+}
+EXPORT_SYMBOL(drm_oom_badness);
diff --git a/drivers/gpu/drm/drm_gem.c b/drivers/gpu/drm/drm_gem.c
index 01f8d94..ffbadc8 100644
--- a/drivers/gpu/drm/drm_gem.c
+++ b/drivers/gpu/drm/drm_gem.c
@@ -264,6 +264,9 @@ drm_gem_object_release_handle(int id, void *ptr, void *data)
drm_gem_remove_prime_handles(obj, file_priv);
drm_vma_node_revoke(&obj->vma_node, file_priv);

+ atomic_long_sub(obj->size >> PAGE_SHIFT,
+ &file_priv->f_oom_badness);
+
drm_gem_object_handle_put_unlocked(obj);

return 0;
@@ -299,6 +302,8 @@ drm_gem_handle_delete(struct drm_file *filp, u32 handle)
idr_remove(&filp->object_idr, handle);
spin_unlock(&filp->table_lock);

+ atomic_long_sub(obj->size >> PAGE_SHIFT, &filp->f_oom_badness);
+
return 0;
}
EXPORT_SYMBOL(drm_gem_handle_delete);
@@ -417,6 +422,9 @@ drm_gem_handle_create_tail(struct drm_file *file_priv,
}

*handlep = handle;
+
+ atomic_long_add(obj->size >> PAGE_SHIFT,
+ &file_priv->f_oom_badness);
return 0;

err_revoke:
diff --git a/include/drm/drm_file.h b/include/drm/drm_file.h
index 0e0c868..ac3aa75 100644
--- a/include/drm/drm_file.h
+++ b/include/drm/drm_file.h
@@ -317,6 +317,8 @@ struct drm_file {

/* private: */
unsigned long lock_count; /* DRI1 legacy lock count */
+
+ atomic_long_t f_oom_badness;
};

/**
@@ -378,4 +380,6 @@ void drm_event_cancel_free(struct drm_device *dev,
void drm_send_event_locked(struct drm_device *dev, struct drm_pending_event *e);
void drm_send_event(struct drm_device *dev, struct drm_pending_event *e);

+long drm_oom_badness(struct file *f);
+
#endif /* _DRM_FILE_H_ */
--
2.7.4

2018-01-18 16:50:34

by Andrey Grodzovsky

[permalink] [raw]

Subject: [PATCH 4/4] drm/amdgpu: Use drm_oom_badness for amdgpu.

Signed-off-by: Andrey Grodzovsky <[email protected]>
---
drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 1 +
1 file changed, 1 insertion(+)

diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
index 46a0c93..6a733cdc8 100644
--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
@@ -828,6 +828,7 @@ static const struct file_operations amdgpu_driver_kms_fops = {
#ifdef CONFIG_COMPAT
.compat_ioctl = amdgpu_kms_compat_ioctl,
#endif
+ .oom_file_badness = drm_oom_badness,
};

static bool
--
2.7.4

2018-01-18 16:50:38

by Andrey Grodzovsky

[permalink] [raw]

Subject: [PATCH 2/4] oom: take per file badness into account

Try to make better decisions which process to kill based on
per file OOM badness

Signed-off-by: Andrey Grodzovsky <[email protected]>
---
mm/oom_kill.c | 23 +++++++++++++++++++++++
1 file changed, 23 insertions(+)

diff --git a/mm/oom_kill.c b/mm/oom_kill.c
index 29f8555..825ed52 100644
--- a/mm/oom_kill.c
+++ b/mm/oom_kill.c
@@ -49,6 +49,8 @@
#define CREATE_TRACE_POINTS
#include <trace/events/oom.h>

+#include <linux/fdtable.h>
+
int sysctl_panic_on_oom;
int sysctl_oom_kill_allocating_task;
int sysctl_oom_dump_tasks = 1;
@@ -182,6 +184,21 @@ static bool is_dump_unreclaim_slabs(void)
}

/**
+ * oom_file_badness - add per file badness
+ * @points: pointer to summed up badness points
+ * @file: tasks open file
+ * @n: file descriptor id (unused)
+ */
+static int oom_file_badness(const void *points, struct file *file, unsigned n)
+{
+ if (file->f_op->oom_file_badness)
+ *((long *)points) += file->f_op->oom_file_badness(file);
+
+ return 0;
+}
+
+
+/**
* oom_badness - heuristic function to determine which candidate task to kill
* @p: task struct of which task we should calculate
* @totalpages: total present RAM allowed for page allocation
@@ -222,6 +239,12 @@ unsigned long oom_badness(struct task_struct *p, struct mem_cgroup *memcg,
*/
points = get_mm_rss(p->mm) + get_mm_counter(p->mm, MM_SWAPENTS) +
mm_pgtables_bytes(p->mm) / PAGE_SIZE;
+
+ /*
+ * Add how much memory a task uses in opened files, e.g. device drivers.
+ */
+ iterate_fd(p->files, 0, oom_file_badness, &points);
+
task_unlock(p);

/*
--
2.7.4

2018-01-18 16:52:51

by Andrey Grodzovsky

[permalink] [raw]

Subject: [PATCH 1/4] fs: add OOM badness callback in file_operatrations struct.

This allows device drivers to specify an additional badness for the OOM
when they allocate memory on behalf of userspace.

Signed-off-by: Andrey Grodzovsky <[email protected]>
---
include/linux/fs.h | 1 +
1 file changed, 1 insertion(+)

diff --git a/include/linux/fs.h b/include/linux/fs.h
index 511fbaa..938394a 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -1728,6 +1728,7 @@ struct file_operations {
u64);
ssize_t (*dedupe_file_range)(struct file *, u64, u64, struct file *,
u64);
+ long (*oom_file_badness)(struct file *);
} __randomize_layout;

struct inode_operations {
--
2.7.4

2018-01-18 17:03:50

by Michal Hocko

[permalink] [raw]

Subject: [RFC] Per file OOM badness

Subject: [PATCH 3/4] drm/gem: adjust per file OOM badness on handling buffers

Subject: [PATCH 4/4] drm/amdgpu: Use drm_oom_badness for amdgpu.

Subject: [PATCH 2/4] oom: take per file badness into account

Subject: [PATCH 1/4] fs: add OOM badness callback in file_operatrations struct.

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Attachments:

Subject: RE: [RFC] Per file OOM badness

Subject: RE: [RFC] Per file OOM badness

Subject: Re: [PATCH 3/4] drm/gem: adjust per file OOM badness on handling buffers

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Attachments:

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [PATCH 4/4] drm/amdgpu: Use drm_oom_badness for amdgpu.

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [PATCH 4/4] drm/amdgpu: Use drm_oom_badness for amdgpu.

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness

Subject: Re: [RFC] Per file OOM badness