[ale] shared research server help

Jim Kinney jim.kinney at gmail.com
Wed Oct 4 18:48:59 EDT 2017


ulimit is a way to set soft and hard limits on resource usage, including
the RAM a process can consume.
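
As a rough illustration of the same mechanism from code, here is a minimal
sketch that caps the address space of one child process. It assumes Linux and
Python 3; the helper name run_with_memory_cap and the 1 GiB figure are made up
for the example. Note that this kind of limit applies to each process
separately, not to the sum of everything a user is running.

```python
import resource
import subprocess
import sys

GIB = 1024 ** 3


def run_with_memory_cap(argv, cap_bytes):
    """Run argv as a child process whose address space is capped at cap_bytes.

    Hypothetical helper for illustration only. The cap is per process;
    it does not add up usage across all of a user's processes.
    """
    def apply_cap():
        # Runs in the child between fork and exec: set the soft and hard
        # RLIMIT_AS (virtual address space) limits to the same value.
        resource.setrlimit(resource.RLIMIT_AS, (cap_bytes, cap_bytes))

    return subprocess.run(argv, preexec_fn=apply_cap)


if __name__ == "__main__":
    # Example: a child that tries to allocate 2 GiB under a 1 GiB cap.
    # The allocation should fail inside the child, giving a nonzero exit code.
    result = run_with_memory_cap(
        [sys.executable, "-c", "x = bytearray(2 * 1024 ** 3)"], GIB
    )
    print("child exit code:", result.returncode)
```

The shell equivalent of the setrlimit call is roughly "ulimit -v" with the
limit given in KiB, run before starting the program.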

On Wed, 2017-10-04 at 17:32 -0500, Todor Fassl wrote:
> I manage a group of research servers for grad students at a university.
> The grad students use these machines to do the research for their Ph.D.
> theses. The problem is that they pretty regularly kill off each other's
> programs by using up all the RAM. Most of the machines have 256 GB of RAM.
> One kid uses 200 GB and another 100 GB and one or the other, often both,
> die. Sometimes they bring the machines down by hogging the CPU or using
> up all the RAM. Well, the machines never crash but they might as well be
> down.
> 
> We really, really don't want to force them to use a scheduling system
> like Slurm. They are just learning and they might run the same piece of
> code 20 times in an hour.
> 
> Is there a way to set a limit on the amount of RAM all of a user's
> processes can use? If so, we were thinking of setting it at 50% of the
> on-board RAM. Then it would take 3 students together to trash a machine.
> It might still happen but it would be a lot more infrequent.
> 
> Any other suggestions? Anything at all? Just keep in mind that we really 
> want to keep it easy for the students to play around.
> 
> 