This repository has been archived by the owner on May 12, 2021. It is now read-only.

We should allow default memory for VM to be < 256 MB #2987

Closed
egernst opened this issue Sep 24, 2020 · 0 comments
Assignees: egernst
Labels: bug Incorrect behaviour

Comments


egernst commented Sep 24, 2020

Description of problem

In the toml configuration, set the default memory size to 128 MB and start a VM.
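A minimal repro sketch of that configuration (assuming the usual Kata configuration.toml location and the default_memory key under the QEMU hypervisor section; the exact path can differ per install):

```toml
# Sketch: /etc/kata-containers/configuration.toml (hypothetical path)
[hypervisor.qemu]
# Ask for a 128 MB base VM. With today's behaviour this value is not
# honored: the VM comes up sized from the 2048 MB default instead.
default_memory = 128
```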

Expected result

The VM memory should be the default memory size plus the memory requested by the container workload(s).

Actual result

The VM memory is 2048 MB plus the memory requested.

Further information

In the general case with QEMU this isn't a major problem: we just eat the extra page-table overhead and only consume pages when they are actually needed, and the memory cgroup in the guest should make sure only the requested amount is used, not everything that is available.

However, this becomes very problematic when you use preallocated memory (which is only supported by QEMU today?). In that case the VMM gets OOM-killed very quickly, since the host-side memory cgroup (created by the kubelet) limits the entire sandbox to the requests plus pod overhead, which is on the order of 160 MB.
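To make the failure mode concrete, a hedged sketch of the problematic combination (assuming the enable_mem_prealloc and default_memory keys of the standard QEMU section; values are illustrative):

```toml
[hypervisor.qemu]
# Preallocation backs the whole default_memory with host pages as soon as
# the VM starts...
enable_mem_prealloc = true
# ...so a 2048 MB default immediately exceeds a ~160 MB pod-level memory
# cgroup limit (requests + pod overhead) and the VMM is OOM-killed.
default_memory = 2048
```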

I think it would be safest to let the user decide the defaultMemory and not enforce a minimum. I expect many deployments will be run with a default memory request for containers that don't specify one, which should make this all feasible.

egernst added the bug (Incorrect behaviour) and needs-review (Needs to be assessed by the team) labels on Sep 24, 2020
egernst self-assigned this on Sep 24, 2020
egernst added a commit to egernst/runtime that referenced this issue Sep 24, 2020
Currently, we enforce a lower limit of 256 MB for the defaultMemorySize.

In the general case with QEMU this isn't a major problem, since we just
eat the extra page-table overhead and only consume pages when needed.
The memory cgroup in the guest should make sure only the requested
amount is utilized, not what is actually available.

However, this becomes very problematic when preallocated memory is used.
In the k8s case the VMM gets OOM-killed very quickly, since the host's
memory cgroup (created by the kubelet) limits the entire sandbox to the
requests plus pod overhead (on the order of 140 MB). We should allow the
administrator of kata to set a better default value, aligned much more
closely with what's used for PodOverhead (in the kube case).

Let's lower the limit from 256 to 8.

Fixes: kata-containers#2987

Signed-off-by: Eric Ernst <[email protected]>
egernst removed the needs-review (Needs to be assessed by the team) label on Sep 24, 2020
egernst added a commit to egernst/runtime that referenced this issue Oct 12, 2020
Currently, we enforce a lower limit of 256 MB for the defaultMemorySize.

In the general case with QEMU this isn't a major problem, since we just
eat the extra page-table overhead and only consume pages when needed.
The memory cgroup in the guest should make sure only the requested
amount is utilized, not what is actually available.

However, this becomes very problematic when preallocated memory is used.
In the k8s case the VMM gets OOM-killed very quickly, since the host's
memory cgroup (created by the kubelet) limits the entire sandbox to the
requests plus pod overhead (on the order of 140 MB). We should allow the
administrator of kata to set a better default value, aligned much more
closely with what's used for PodOverhead (in the kube case).

Let's remove the artificial limit in kata and leave it up to the end
user to pick an appropriate non-default value, if desired.

Fixes: kata-containers#2987

Signed-off-by: Eric Ernst <[email protected]>

test

Signed-off-by: Eric Ernst <[email protected]>
egernst added a commit to egernst/runtime that referenced this issue Oct 12, 2020
chavafg pushed a commit to chavafg/runtime-1 that referenced this issue Oct 16, 2020
(cherry picked from commit ab7f18d)
jcvenegas pushed a commit to jcvenegas/runtime that referenced this issue Oct 19, 2020
jcvenegas pushed a commit to jcvenegas/runtime that referenced this issue Oct 19, 2020
jcvenegas pushed a commit to jcvenegas/runtime that referenced this issue Oct 20, 2020