nvidia-docker2.0 GPU 隔离

通过环境变量来隔离

NVIDIA_VISIBLE_DEVICES

容器可以使用哪些 GPU

0,1,2 GPU 编号，多个逗号隔开
all: 所有 GPU，nvidia 官方的镜像默认是这个选项
none: 没有 GPU，但是容器内部会映射 GPU 驱动
empty: nvidia-container-runtime will have the same behavior as runc.

NVIDIA_DRIVER_CAPABILITIES

This option controls which driver libraries/binaries will be mounted inside the container.

compute: required for CUDA and OpenCL applications,
compat32: required for running 32-bit applications,
graphics: required for running OpenGL and Vulkan applications,
utility: required for using nvidia-smi and NVML,
video: required for using the Video Codec SDK.

还有几个环境变量未做测试，可以点击进入 GitHub 查看。

几个感兴趣的问题可以参考下：

Is OpenGL supported?

No, OpenGL is not supported at the moment and there is no plan to support OpenGL+GLX in the near future.
OpenGL+EGL however will be supported and this issue will be updated accordingly.
If you are a NGC subscriber and require GLX for your workflow, please fill out a feature request for support consideration.

Do you support CUDA Multi Process Service (a.k.a. MPS)?

No, MPS is not supported at the moment. However we plan on supporting this feature in the future, and this issue will be updated accordingly.

Do you support running a GPU-accelerated X server inside the container?

No, running a X server inside the container is not supported at the moment and there is no plan to support it in the near future (see also OpenGL support).

Why is `nvidia-smi` inside the container not listing the running processes?

nvidia-smi and NVML are not compatible with PID namespaces.
We recommend monitoring your processes on the host or inside a container using --pid=host.

Can I limit the GPU resources (e.g. bandwidth, memory, CUDA cores) taken by a container?

No. Your only option is to set the GPU clocks at a lower frequency before starting the container.

What do I have to install in my container images?

Library dependencies vary from one application to another. In order to make things easier for developers, we provide a set of official images to base your images on.

Can I use the GPU during a container build (i.e. `docker build`)?

Yes, as long as you configure your Docker daemon to use the nvidia runtime as the default, you will be able to have build-time GPU support. However, be aware that this can render your images non-portable (see also invalid device function).

The official CUDA images are too big, what do I do?

The devel image tags are large since the CUDA toolkit ships with many libraries, a compiler and various command-line tools.
As a general rule of thumb, you shouldn’t ship your application with its build-time dependencies. We recommend to use multi-stage builds for this purpose. Your final container image should use our runtime or base images.
As of CUDA 9.0 we now ship a base image tag which bundles the strict minimum of dependencies.

Do you support Kubernetes?

Since Kubernetes 1.8, the recommended way is to use our official device plugin. Note that this is still alpha support.

Zabbix For Nvidia Gpu Discovery

自动发现规则创建模板老生常谈，创建模版，模板名为“” ， [图片] 然后创建应用集，自动发现规则；名称：自定义，我们设置 discover gpu 类型：zabbix 客户端（主动式），PS：“became not supported: Timeout while executing a shell script ..

测评 | 矩池云上架 RTX 2080 Ti 八卡机开箱

[图片] 大家好，福利君今天给给大家带来的是一则消息。矩池云将上架了超微八卡 GPU 服务器，全新的机器组合，可靠的服务品质。产品性能在这里引用 Lambda Labs 基于 FP32 对多 GPU 扩展训练性能评测的数据。两张 RTX 2080 Ti 是一张 RTX 2080 Ti 的 1.8 倍；四张 RT ..

矩池云 | 新冠肺炎防控：肺炎 CT 检测

连日来，新型冠状病毒感染的肺炎疫情，牵动的不仅仅是全武汉、全湖北，更是全国人民的心，大家纷纷以自己独特的方式为武汉加油！我们相信坚持下去，终会春暖花开。今天让我们以简单实用的神经网络模型，来检测肺炎的 CT 影像。第一步：导入我们需要的库 from keras.preprocessing.image import ..

矩池云 | 搭建浅层神经网络 "Hello world"

作为图像识别与机器视觉界的 'hello world!' ， MNIST ('Modified National Institute of Standards and Technology') 数据集有着举足轻重的地位。基本上每本人工智能、机器学习相关的书上都以它作为开始。下面我们会用 TensorFlow 搭建一个 ..

Google Colab 免费的 GPU 教程

现在，你可以开发深度学习与应用谷歌 Colaboratory 在的免费特斯拉 K80 GPU -使用 Keras，Tensorflow 和 PyTorch。 Google Colab 是 Google 为 AI 开发人员提供的免费云服务,借助 Colab 可以免费在 GPU 上开发深度学习应用程序。意思就是 Goog ..

2080 Ti 莫名起火，英伟达承认 GPU 有缺陷

RTX 2080 Ti，英伟达新一代图灵架构 GPU，因为独特而鲜明的外观，一直以来被大家戏称为“燃气灶”。现在这个昵称总算名副其实了。 [图片] 昨天，2080 Ti 用户 shansoft 正在上网，只是简单地浏览网页，没有做其他任何事情。突然，电脑突然黑屏自动关机了。不明所以的他往机箱里一看，不得了：2080 ..

欢迎来到这里！

我们正在构建一个小众社区，大家在这里相互信任，以平等 • 自由 • 奔放的价值观进行分享交流。最终，希望大家能够找到与自己志同道合的伙伴，共同成长。

关于

nvidia-docker2.0 GPU 隔离

NVIDIA_VISIBLE_DEVICES

NVIDIA_DRIVER_CAPABILITIES

Is OpenGL supported?

Do you support CUDA Multi Process Service (a.k.a. MPS)?

Do you support running a GPU-accelerated X server inside the container?

Why is `nvidia-smi` inside the container not listing the running processes?

Can I limit the GPU resources (e.g. bandwidth, memory, CUDA cores) taken by a container?

What do I have to install in my container images?

Can I use the GPU during a container build (i.e. `docker build`)?

The official CUDA images are too big, what do I do?

Do you support Kubernetes?

相关帖子

Zabbix For Nvidia Gpu Discovery

测评 | 矩池云上架 RTX 2080 Ti 八卡机开箱

矩池云 | 新冠肺炎防控：肺炎 CT 检测

矩池云 | 搭建浅层神经网络 "Hello world"

Google Colab 免费的 GPU 教程

大家平时用到 GPU 计算吗？

2080 Ti 莫名起火，英伟达承认 GPU 有缺陷

欢迎来到这里！

nvidia-docker2.0 GPU 隔离

NVIDIA_VISIBLE_DEVICES

NVIDIA_DRIVER_CAPABILITIES

Is OpenGL supported?

Do you support CUDA Multi Process Service (a.k.a. MPS)?

Do you support running a GPU-accelerated X server inside the container?

Why is nvidia-smi inside the container not listing the running processes?

Can I limit the GPU resources (e.g. bandwidth, memory, CUDA cores) taken by a container?

What do I have to install in my container images?

Can I use the GPU during a container build (i.e. docker build)?

The official CUDA images are too big, what do I do?

Do you support Kubernetes?

相关帖子

Zabbix For Nvidia Gpu Discovery

测评 | 矩池云上架 RTX 2080 Ti 八卡机开箱

矩池云 | 新冠肺炎防控：肺炎 CT 检测

矩池云 | 搭建浅层神经网络 "Hello world"

Google Colab 免费的 GPU 教程

大家平时用到 GPU 计算吗？

2080 Ti 莫名起火，英伟达承认 GPU 有缺陷

欢迎来到这里！

Why is `nvidia-smi` inside the container not listing the running processes?

Can I use the GPU during a container build (i.e. `docker build`)?