Changes

Jump to: navigation, search

Resource shortage

5,689 bytes added, 19:34, 8 October 2013
m
cpuunits: intentation
Sometimes you see a strange fails failures from some programs inside your [[VE|Virtual Environmentcontainer]]. In some cases it means one of the resources controlled by OpenVZ has hit the limit.
The first thing to do is to check the contents of the /proc/user_beancounters file in your [[VEcontainer]]. The last column of output is the fail counter. Each time a resource hit hits the limit, the fail counter is increasingincremented. So, if you see non-zero values in the failcnt column that means something is wrong.
There are two ways to fix the situation: reconfugure reconfigure (in some cases recompile) the application, or change the resource management settings.
== [[UBC ]] parameters ==
UBC stands for '''u'''ser '''b'''ean'''c'''ounters &mdash; this Here is a set an example of limits and guarantees contolled per VE. You can see the current usage [[UBC]] values and limits by examining obtained from <code>/proc/user_beancounters </code> file. Here is the example of /proc/user_beancounters inside VE with ID of in container 123:
<pre>
# cat /proc/user_beancounters
</pre>
Note You can see if you hit the limit for some [[UBC]] parameters by analyzing the last column (named <code>failcnt</code>) &mdash; it . It shows a number of fails failures for this counter, i.e. a number of times the a parameter hit the limit. Usually what you need to do is to increase the parameter in question. But you need to do it carefully, and here is how.
<ol>
<li>Get the current values for the parameter's barrier and limit. For example, we want to increase kmemsize values from the above example. From <code>/proc/user_beancounters </code> we see that <code>kmemsize </code> barrier is 2752512, and its limit is 2936012.
</li>
<li>Increase the values. Say, we want to increase double <code>kmemsize by 1.5 times</code>. This is how it can be done using built-in bash arithmetics:
<pre>
# vzctl set 123 --kmemsize $((2752512*1.52)):$((2936012*1.52)) --save
</pre>
By using the <ttcode>--save</ttcode> flag, we denote indicate we want to both apply the new settings setting to the running VE, container and save them it in the configuration file (from which they the settings will be taken during next VE container start).
</li>
<li>Check the new configuration. Issue the following command:
<pre>
# vzcfgvalidate /etc/vz/conf/123.conf
</pre>
If something is wrong, you need to fix it as suggested by the utility.
</li>
</ol>
 
For more in-depth explanation of different parameters, their meaning and how to set them properly, see [[setting UBC parameters]].
 
== Disk quota ==
To check if your [[container]] exceeded its disk quota, use the following commands (inside a container):
<pre>
# df
Filesystem 1K-blocks Used Available Use% Mounted on
simfs 1048576 327664 720912 32% /
# df -i
Filesystem Inodes IUsed IFree IUse% Mounted on
simfs 200000 18857 181143 10% /
</pre>
 
The first command shows disk space usage and the second command shows the inodes usage (you can roughly use the inodes count as a number of files/directories on your system).
 
If one of the commands shows a usage of 100% you have exceeded one of the disk quota limits.
 
You can increase the limit from the host system ([[CT0]] [[aka]] [[VE0]]) only. This is how:
<ol>
<li>Get the current values for disk quota:
<pre># vzquota stat 123
resource usage softlimit hardlimit grace
1k-blocks 327664 1048576 1153434
inodes 18857 200000 220000
</pre>
</li>
 
<li>To increase the disk space quota, use vzctl set --diskspace. For example, we want to increase it by a factor of 2:
<pre>
vzctl set 123 --diskspace $(( 1048576*2 )):$(( 1153434*2 )) --save
</pre>
or you can use G notation for gigabyte
<pre>
vzctl set 123 --diskspace 20G:25G --save
</pre>
Here is 20 gigabyte barrier and 25 gigabyte limit diskspace saved. It can be checked by
<pre>
# vzctl exec 123 df -h
Filesystem Size Used Avail Use% Mounted on
simfs 20G 2.1G 18G 11% /
tmpfs 443M 0 443M 0% /lib/init/rw
tmpfs 443M 0 443M 0% /dev/shm
</pre>
</li>
 
<li>To increase the disk inodes quota, use <tt>vzctl set --diskinodes</tt>. For example, we want to increase it by a factor of 3:
<pre>
vzctl set 123 --diskinodes $(( 200000*3 )):$(( 220000*3 )) --save
</pre>
</li>
</ol>
 
Applying a different diskinodes configuration is applied instantly and does not require a restart of the container.
 
You can also find the number of free inodes with
 
<pre>
# vzctl exec 123 df -i
</pre>
{{Note|shell does not support floating-point arithmetic, i.e. you can not use expressions like <code>$((&nbsp;220000*1.5&nbsp;))</code>. To use floating point, try <code>bc</code> instead, something like this: <code><nowiki>$(echo&nbsp;220000*1.5&nbsp;|&nbsp;bc)</nowiki></code>.}}
 
== CPU ==
 
There are two parameters controlling fair CPU scheduler in OpenVZ: cpuunits and cpulimit.
 
=== cpuunits ===
Cpuunits are set via
<pre>
vzctl set 101 --cpuunits 1000 --save
</pre>
For example. If you set a cpuunit for one container to a value and set a cpuunit on another container to a different value, the time allotted to each of the containers will be the ratio of the two units. Let's use a real example.
 
We did the following:
<pre>
vzctl set 101 --cpuunits 1000 --save
vzctl set 102 --cpuunits 2000 --save
vzctl set 103 --cpuunits 3000 --save
</pre>
 
If we started a CPU intensive application on each CT, then 103 would be given 3 times as much cpu time as 101 and 102 would get twice as much as 101, but some fraction of what 103 got. Here's how to determine what the real ratios are.
 
Add the three units, 1000+2000+3000 = 6000
 
* 101 gets 1000/6000 or 1/6th of the time. (16%)
* 102 gets 2000/6000 or 1/3rd of the time. (34%)
* 103 gets 3000/6000 or 1/2 of the time. (50%)
 
To summarize: those units are proportional to each other. To say it more strict, to the sum of all CTs units, plus the host system, please don't forget that one. So indeed, units of 1 1 1 1 are the same as 200 200 200 200 or 8888 8888 8888 8888.
 
You may wonder why there's the tool vzcpucheck, which returns an absolute number called the "power of the node". The thing is, when you move a CT from one box to another, it could be problematic if you use different scales and different CPUs.
 
So vzcpucheck tries to work around that by inventing something called 'power of the node' which it gets from /proc/cpuinfo I guess (haven't checked it). If it shows a power of the node 10000 and you distribute that among all the CTs on the node, and then move one CT to another node which had cpuunits set in the same manner, that CT will have about the same CPU units it had on the old node.
 
=== cpulimit ===
 
The cpulimit parameter sets the absolute maximum limit for a container to a percent value. For instance:
<pre>
vzctl set 101 --cpulimit 25 --save
</pre>
says that container 101 cannot ever have more than 25 percent of a CPU even if the CPU is idle for the other 75% of the time. The limit is calculated as a percentage of a single CPU, not as a percentage of the server's CPU resources as a whole. In other words, if you have more than one CPU, you can set a cpulimit > 100. In a quad-core server, setting cpulimit to 100 permits a container to consume one entire core (and not 100% of the server).
 
CPU limits are only available in rhel5-based and rhel6-based kernels and they behave a bit differently in them.
 
In the rhel5 kernel the limit has a container-wide meaning. That said if you have e.g. a container of 2 CPUS with the 100% cpulimit set, this container's usage of CPUs can be 100%/0% or 50%/50% or any other values, whose sum is 100%.
 
In the rhel6 kernel the applied limit is divided between onlince CPUs proportionally and a busy CPU cannot borrow time from an idle one. I.e. with a 2 CPUs container and 100% limit set the usage of each CPU cannot exceed 50% in any case.
 
[[Category: Troubleshooting]]
[[Category: HOWTO]]
[[Category: UBC]]
[[Category: Resource management]]
[[Category: Disk quota]]

Navigation menu