CUMULVS: Collaborative User Migration, User Library for Visualization and Steering

CUMULVS is a software infrastructure for the development of collaborative environments. It supports interactive visualization and remote computational steering of distributed applications by multiple collaborators, and provides a mechanism for constructing fault-tolerant, migrating applications in heterogeneous distributed computing environments.

CUMULVS provides:

CUMULVS is part of the ACTS Toolkit as originated in the DOE 2000 Initiative.

Check out the FY 99 Highlights, the FY 98 Highlights, and the FY 97 Highlights.


The CUMULVS 1.1.1 Software is available for FREE DOWNLOAD.

Latest ``Unofficial'' Pre-Release of CUMULVS 1.3.0.alpha2 Distribution WITH CHECKPOINTING, and NEW AUTOCONF INSTALL SCRIPTS! (in gzipped tar format).
(Also available in compressed tar format, uuencoded format, and a highly toxic zip format for Windows.)

Latest ``Unofficial'' Pre-Release of CUMULVS 1.2.0.beta3 Distribution WITH CHECKPOINTING (in gzipped tar format).
(Also available in compressed tar format, uuencoded format, and a highly toxic zip format for Windows.)

CUMULVS 1.1.1 Distribution WITH CHECKPOINTING (in gzipped tar format).
(Also available in compressed tar format, uuencoded format, and a highly toxic zip format for Windows.)

Here's a PDF version of the most recent CUMULVS Tutorial Slides from the ACTS Toolkit Workshop, August 27, 2004. (*** This is the most up-to-date documentation on the CUMULVS user library interface! ***)

And here's the original version of the user's guide (sorry, still needs work...).


CUMULVS is currently based on the Parallel Virtual Machine (PVM) message-passing libraries and run-time system, but CUMULVS works with applications that use PVM, MPI, Global Arrays, or any other concurrent run-time environment. (You will need to install PVM for the CUMULVS internals...)

You can download the PVM software for FREE.


Publications & Presentations:

Univ. Louisville
J. A. Kohl, ``Interacting with High-Performance Scientific Simulations Using CUMULVS: Visualization, Computational Steering, and Fault Tolerance,'' Graduate Colloquium, Department of Electrical and Computer Engineering / CSE, University of Louisville, Louisville, KY, October 7, 1999.
SPDT 98
J. A. Kohl, P. M. Papadopoulos, ``Efficient and Flexible Fault Tolerance and Migration of Scientific Simulations Using CUMULVS,'' 2nd SIGMETRICS Symposium on Parallel and Distributed Tools, Welches, OR, August 1998.
(Presentation Slides in Postscript or HTML.)
HICSS-31
P. M. Papadopoulos, J. A. Kohl, B. D. Semeraro, ``CUMULVS: Extending a Generic Steering and Visualization Middleware for Application Fault-Tolerance,'' Proceedings of the 31st Hawaii International Conference on System Sciences (HICSS-31), Kona, Hawaii, January 1998.
IJHPCA
G. A. Geist, J. A. Kohl, P. M. Papadopoulos, ``CUMULVS: Providing Fault-Tolerance, Visualization and Steering of Parallel Applications,'' International Journal of High Performance Computing Applications, Volume 11, Number 3, August 1997, pp. 224-236.
CCC '97
J. A. Kohl, P. M. Papadopoulos, ``Fault-Tolerance and Reconfigurability Using CUMULVS,'' Oral Presentation, Cluster Computing Conference, Emory University, Atlanta, GA, March 9-11, 1997.
Lyon, France
G. A. Geist, J. A. Kohl, P. M. Papadopoulos, ``CUMULVS: Providing Fault-Tolerance, Visualization and Steering of Parallel Applications,'' Environment and Tools for Parallel Scientific Computing Workshop at Domaine de Faverges-de-la-Tour, Lyon, France, August 21-23, 1996.
ORNL Review Article (Alternate Link)
J. A. Kohl, ``High-Performance Computers: Innovative Assistants to Science,'' ORNL Review, Special Issue on Advanced Computing, Vol. 30, Nos. 3 & 4, 1997.


External Publications & Projects:

CUMULVS for "MxN" Parallel Data Redistribution
CUMULVS with Global Arrays
Remote Visualization over Standard Network Connections
Check out the cool CUMULVS work going on at NCSA!

Project Status:

ORNL is developing a state-of-the-art system called CUMULVS for assisting in the development of parallel and distributed applications. CUMULVS allows scientists to easily incorporate fault tolerance, interactive visualization and computational steering into their applications. The system is a valuable new tool for use in many large scientific simulations because it allows the scientist to visually monitor large data fields of an ongoing computation and to remotely control algorithmic and model parameters while the application is running. In addition, CUMULVS provides a simple way to incorporate checkpointing and distributed task migration inside large applications. This facility supports automatic recovery / restart of application tasks, even across heterogeneous architecture and topology boundaries.

CUMULVS provides several important features for the computational scientist. It handles the details of collecting and sending distributed data fields to, and receiving steering parameters from, multiple dynamically attached viewers. The viewers provide a uniform global view of data, even if the data is decomposed across many distributed tasks. CUMULVS manages all aspects of the dynamic attachment and detachment of multiple viewers to a running simulation. Viewers can be commercial packages such as AVS, public domain software such as Tcl/Tk, or customized viewers for specific application domains.

CUMULVS produces time-coherent views of application data that could potentially be changing asynchronously on parallel computers all across the nation. CUMULVS ensures the coherency of steering parameter updates when multiple collaborators are viewing and steering the application at the same time. And changes to steering parameters are coordinated across the application task so that updates are applied at a consistent time step in each task.

The CUMULVS system has been used in an acoustic wave propagation application and a First Principles materials science application, allowing the application developers to see for the first time the progress of their solutions during the computation.

In the acoustic wave propagation application (shown above), the user can adjust input parameters, such as shot location and intensity, on the fly. In both applications, CUMULVS supplied the distributed checkpointing and process migration capabilities required to run over the wide-area GII testbed.

Using these two applications, CUMULVS was demonstrated at Supercomputing '95 where it won the award for best interface and fault tolerance in the High Performance Computing Challenge. A demonstration of on-the-fly reconfiguration via CUMULVS checkpointing, using a simulation of air flow around an airfoil, won the Silver Medal for Innovation at Supercomputing '96. The development of CUMULVS was funded by DOE's Mathematical, Information, and Computational Sciences Division (MICS) and by DOE Basic Energy Sciences.


Impact:

In light of the growing emphasis towards computer simulation and remote control of Virtual Laboratory experiments, CUMULVS could impact the entire scientific community by supplying a collaborative infrastructure for computational steering, interactive visualization and fault tolerance in distributed environments.

Computational steering has the potential to revolutionize computational experiments by allowing scientists to interactively explore (steer) a simulation in time and/or space. Computational steering helps a scientist or engineer to concentrate more on the science than on the computer. Through the use of interaction, the computer will become a more useful tool to the engineer, allowing experimentation and real time exploration of a design space. Instead of the typical simulation mode - manually setting input parameters, computing results, storing data off to disk, visualizing the results via a separate visualization package, then starting again from the beginning - computational steering allows the scientist or engineer to ``close the loop'' and allow interactive manipulation of algorithmic and model parameters beyond their initial values.


For more details on CUMULVS or questions,
email: cumulvs at msr dot csm dot ornl dot gov

Distributed Computing Group
Computer Science and Mathematics Division
Oak Ridge National Laboratory
P.O. Box 2008, Bldg 6012, MS 6367
Oak Ridge, TN 37831-6367

Research supported by the Mathematics, Information and Computational Sciences Office, Office of Advanced Scientific Computing Research, Office of Science, U. S. Department of Energy, under contract No. DE-AC05-00OR22725 with UT-Battelle, LLC.

http://www.csm.ornl.gov/cs/cumulvs.html
Last modified: October 25, 2004 by Kohl.