Commit Graph

10 Commits (1c012f1c4966dfe6124192abafeba892ae9510a0)

Author SHA1 Message Date
Rohit Agarwal 5139bb580a Update nvidia-gpu-device-plugin to apps/v1 and use RollingUpdate updateStrategy.
Even though RollingUpdate is the default updateStrategy, we need to
specify it explicitly here because otherwise updating from
extensions/v1beta1 to apps/v1 doesn't change the updateStrategy.
2018-05-24 17:53:13 -07:00
Rohit Agarwal 81443e5d92 Remove hostNetwork and hostPID from nvidia-gpu-device-plugin manifest.
They are not needed.
2018-04-11 12:22:09 -07:00
Rohit Agarwal a38e98f907 Add wildcard toleration to nvidia-gpu-device-plugin.
This addon runs only on nodes with GPUs and it's essential for their
proper function.
2018-04-06 17:36:48 -07:00
Jiaying Zhang 9a05af5502 Update gke nvidia-gpu-device-plugin to the latest version that supports
both v1alpha and v1beta1 device plugin versions.
Re-enables nvidia-gpus e2e test after verifying the test passes now.
2018-02-26 14:08:58 -08:00
Tim Hockin 3586986416 Switch to k8s.gcr.io vanity domain
This is the 2nd attempt.  The previous was reverted while we figured out
the regional mirrors (oops).

New plan: k8s.gcr.io is a read-only facade that auto-detects your source
region (us, eu, or asia for now) and pulls from the closest.  To publish
an image, push k8s-staging.gcr.io and it will be synced to the regionals
automatically (similar to today).  For now the staging is an alias to
gcr.io/google_containers (the legacy URL).

When we move off of google-owned projects (working on it), then we just
do a one-time sync, and change the google-internal config, and nobody
outside should notice.

We can, in parallel, change the auto-sync into a manual sync - send a PR
to "promote" something from staging, and a bot activates it.  Nice and
visible, easy to keep track of.
2018-02-07 21:14:19 -08:00
Rohit Agarwal a73382566b Update nvidia-gpu-device-plugin addon.
This includes changes from GoogleCloudPlatform/container-engine-accelerators#38 and GoogleCloudPlatform/container-engine-accelerators#37
2017-12-12 20:53:27 -08:00
Rohit Agarwal cb220a17e3 Update nvidia-gpu-device-plugin addon.
This includes changes from GoogleCloudPlatform/container-engine-accelerators#33
2017-11-30 17:40:30 -08:00
Rohit Agarwal d7341749ff nvidia-gpu-device-plugin daemonset should tolerate nvidia.com/gpu taint.
It is expected that nodes with extended resources attached will be
tainted with the resouce name, so that we can create dedicated nodes.
If ExtendedResourceToleration admission controller is enabled, pods
requesting such resources will automatically tolerate such taints.
nvidia-gpu-device-plugin daemonset doesn't request such resources but
still needs to run on such nodes, so it needs this toleration.
2017-11-29 11:31:42 -08:00
Jiaying Zhang 4a1a205109 Changes nvidia-gpu device plugin addon config settings:
- Runs as system critical pod
- Makes resource limits to match its resource requets
- Modifies test/e2e/scheduling/nvidia-gpus.go to cope with the recent
change of running the device plugin as a system addon.
- The resource settings of the addon is based on the test results
from 8 nvidia-tesla-k80 gpus.
2017-11-20 17:32:53 -08:00
Rohit Agarwal cf292754ba Run nvidia-gpu device-plugin daemonset as an addon on GCE nodes that have nvidia GPUs attached. 2017-11-02 12:58:29 -07:00