NERSC logo National Energy Research Scientific Computing Center
  A DOE Office of Science User Facility
  at Lawrence Berkeley National Laboratory
 

NERSC Announcements Message Archive

Select: [all-announcements] [users] [franklin] [bassi] [jacquard] [davinci] [nug] [managers]

[ Back ]

Subject: Seaborg Status
Author: Jim Craw <craw_at_nersc.gov>
Date: 2001-11-02 15:45:04
Hello SP users: I wish to apologize to our Seaborg users for the recent instability problems. Especially over the last few weeks. As of last Monday NERSC elevated the GPFS problems to a "CRITS IT" level (IBM talk for customer site goes Critical). As of Wednesday, we have installed fixes for several known problems (e.g. random nodes crashing due to GPFS error, GPFS terminating on interactive nodes, GPFS limits, etc...). The only remaining related problem is regarding executables that get left in an unusable state when GPFS restarts after getting "terminated". We have not experienced this problem since (fixes were put on system) Wednesday and IBM has now identified the problem and is in the process of generating and testing a fix. We hope to get the fix early next week and test it out on our development system before planning to install it on Seaborg. So in the meantime, the system has definitely stabilized, both H/W and S/W wise. The load has picked up but turnaround still looks good. Please go ahead and submit more jobs if you like/can. Sorry for any inconvenience caused. Regards, Jim Craw Computational Systems Group Lead

LBNL Home
Page last modified: Fri, 05 Dec 2008 19:17:25 GMT
Page URL: http://www.nersc.gov/nusers/announcements/message_text.php
Web contact: webmaster@nersc.gov
Computing questions: consult@nersc.gov

Privacy and Security Notice
DOE Office of Science