Links:
xCAT: Extreme Cluster Administration Toolkit, a toolkit for the deployment and administration of Linux clusters. Its features are based on user requirements, and many of them take advantage of IBM xSeries hardware.
Condor: a software system that runs on a cluster of workstations to harness wasted CPU cycles.
DIPC: Distributed Inter-Process Communication. It enables you to build and program multi-computers.
genders: Genders is a static cluster configuration database used for cluster configuration management. It is used by a variety of tools and scripts for management of large clusters.
IBM Distributed Lock Manager: provides an implementation of the classic VAX Cluster locking semantics for a Linux cluster.
LAMP Application Server: The LAMPAS project is a combination of common open source tools that provides a unified system from which administrators, developers, and other parties can manage a large application cluster. The underlying platform is LAMP based.
Linux Cluster Manager: Linux Cluster Manager is designed as a Beowulf cluster setup and management tool. LCM can run bulk commands, give real-time performance and status monitoring, search running processes, and provide system imaging at a file or block level over the network.
MAT: an easy-to-use, network-enabled UNIX configuration and monitoring tool. It provides an integrated tool for many common system administration tasks, including backups and replication. It includes a warning system for potential system problems and graphing of many common system parameters.
Maui Cluster Scheduler: Maui is an advanced job scheduler for use on clusters and supercomputers. It is an optimized and configurable tool capable of supporting a large array of scheduling policies, dynamic priorities, extensive reservations, and fairshare. It is currently in use at hundreds of leading government, academic, and commercial sites throughout the world. It improves the manageability and efficiency of machines ranging from clusters of a few processors to multi-teraflop supercomputers.
Moab Cluster Suite (commercial): Moab Cluster Suite is a cluster workload management solution that integrates scheduling, management, and reporting of cluster workloads.
MOSIX Grid and Cluster Management: MOSIX is a management system for Linux clusters and organizational grids that provides a Single-System Image. In a MOSIX-based system, there is no need to modify or link applications with any library, copy files, log in to remote nodes, or even assign processes to different nodes; it is all done automatically. Just "fork and forget", like in an SMP.
openMosixview: a cluster-management GUI for openMosix clusters that contains useful applications for monitoring and administration. It is a complete rewrite of Mosixview.
openSSI webView: a simple and easy-to-use openSSI cluster monitoring system. Its goal is to provide a quick overview of the cluster state by graphing vital functions and graphically representing key figures.
OSCAR Cluster: OSCAR (Open Source Cluster Application Resources) is a snapshot of the best known methods for building, programming, and using clusters. It consists of a fully integrated and easy-to-install software bundle designed for high-performance cluster computing. Everything needed to install, build, maintain, and use a Linux cluster is included in the suite, making it unnecessary to download or even install any individual software packages on your cluster.
Paje: The Pajé generic tool provides interactive and scalable behavioral visualizations of parallel and distributed applications, helping to capture the dynamics of their executions. Because it is generic, it can be used unchanged in a large variety of contexts.
pconsole: an administrative tool for working with clusters of machines. pconsole allows you to connect to each node of your cluster simultaneously, and you can type your administrative commands in a specialized window that 'multiplies' the input to each of the connections you have opened.
PCP: a system for replicating files on multiple nodes of a PC cluster. Replication is done by building an n-ary tree of TCP sockets and using parallelized, pipelined data transfers which use RSA authentication.
quattor: quattor is a system administration toolkit providing a powerful, portable, and modular tool suite for the automated installation, configuration, and management of clusters and farms running UNIX derivatives like Linux and Solaris.
radmind: radmind is a suite of Unix command-line tools and a server designed to remotely administer the file systems of multiple Unix machines.
synctool: synctool is a cluster administration tool that keeps configuration files synchronized across all nodes in a cluster. Nodes may be part of a logical group or class, in which case they need a particular subset of configuration files. synctool can restart daemons when their relevant configuration files have been changed. It can also be used for patch management or other system administration tasks.
TORQUE: TORQUE is a resource manager providing control over batch jobs and distributed compute nodes.
Wackamole: Wackamole is an application that helps make a cluster highly available. It manages a set of virtual IPs that should be available to the outside world at all times.
Zillion: The Zillion Project is a distributed computing project based on GNUstep. Jobs can be created from simple template projects and submitted through a simple submission tool.