Links:
Cerebro
Cerebro is a collection of cluster monitoring tools and libraries.

cexec
cexec is a toolkit for building ad-hoc clusters out of standard Unix hosts and regular Unix applications.

chpox
chpox provides transparent checkpointing and restarting of processes on Linux clusters.

Enhanced Cluster Tools (ECT) for Linux
Enhanced Cluster Tools (ECT) for Linux is a set of additional tools for the enhancement of Cluster Systems Management (CSM) Technology.

FreeDCE
FreeDCE is the Distributed Computing Environment. It is a set of Application Programming Interface (API) libraries and servers that allow application writers to create distributed, secure applications for the enterprise. It was developed by the Open Software Foundation (now The Open Group) out of several distributed computing technologies from several sources.

FreeHA
FreeHA is a free, portable, easy to use (compared to alternatives) program to run nodes in a 'High Availability' cluster. Its initial target is to run on any and all UNIXen, rather than any particular one.

Growler
Growler is a component-oriented framework aimed at distributed and collaborative visualization and computational steering. It is released under the NASA Open Source Agreement. Growler's distributed object and event architecture is suitable for application to high-performance LAN environments as well as the Internet. Its features include strong integration with C++, selective distributed reference counting, and efficient well-typed event channels for local and remote event broadcast, with support for low-latency/low-perturbation events via a loss-tolerant "state-based" event model. There is also support for event transmission over InfiniBand and for event sizes greater than 4 gigabytes.

HA-Tools
The HA-Tools provide some programs to improve shell scripting, especially in a High Availability environment. The halockrun program provides a simple and reliable way to implement locking in shell scripts (a typical use of halockrun is to prevent cron jobs from running simultaneously), and the hatimerun program provides a time-out mechanism.

ifmetric
ifmetric sets the metrics of all IPv4 routes attached to a given network interface at once. This may be used to change the priority of routing IPv4 traffic over the interface. Lower metrics correlate with higher priorities.

JumboMem
JumboMem gives unmodified binaries transparent access to memory spread across multiple computers. The goal is to improve the performance of memory-hungry applications by replacing accesses to a slow paging device (a disk) with accesses to fast RAM located across a high-speed network. Unlike other memory servers, JumboMem does not need administrator privileges either to install or to run.

MP-MPICH
MP-MPICH is a multi-platform uniform MPI implementation, based on MPICH and SCI-MPICH, resulting in a high-performance, consistent MPI across both Ethernet and SCI networks in a hybrid environment.

Oscar
Open Source Cluster Application Resource: a snapshot of the best known methods for building, programming, and using clusters. It consists of a fully integrated and easy to install software bundle designed for high-performance cluster computing.

paexec
paexec distributes the given tasks (read from stdin) across several CPUs or machines in a network.

PeaceMaker
PeaceMaker is a failover/clustering engine based on a message passing protocol over TCP/IP between two or more nodes. It can monitor node faults based on interface polling status, node health and sanity status, and messages passed through an external plugin attached to a plugin module. Node faults or state change events can be handled through external programs or TCP/IP notification.

Piranha
Piranha, a component of Red Hat's HA Server product, is a complete open source solution for Linux providing High Availability (HA), IP service monitoring and failover, and load balancing.

popsneakerconfig
popsneakerconfig lets you conveniently manage the configuration file popsneakerrc. It allows you to create rules with a wizard, and can optimize your rules and highlight invalid rules.

Riverdrums Load Balancer
The Riverdrums Load Balancer is a no-frills, event-driven load balancer aimed at clarity and efficiency.

slurm
SLURM is an open-source resource manager designed for Linux clusters of all sizes. It provides three key functions. First, it allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (typically a parallel job) on a set of allocated nodes. Finally, it arbitrates conflicting requests for resources by managing a queue of pending work.

Strike HA
Strike HA is a high availability system inspired by the Linux HA Heartbeat project and FreeHA. It is designed to be easy but highly configurable. It features auto fail-back, packet signing, integrated IP and MAC take-over ability, and monitoring tools.

Syzygy
Syzygy is designed to run a wide range of VR applications on a PC cluster with equal or greater performance than an SGI Onyx, the traditional platform for high-end VR.

Warewulf
Warewulf is a distribution of tools that are designed to aid in the implementation of Beowulf-style clusters. Presently the largest component is the slave node builder.
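
Example: the HA-Tools entry above describes two ideas, running a job only while holding a lock (so that overlapping cron jobs do not run at the same time) and bounding its run time with a time-out. The sketch below illustrates that general pattern in Python on a Unix host. It does not use or reproduce halockrun or hatimerun, whose options are not described here; the function name run_locked and the lock file path are hypothetical, chosen only for illustration.

```python
import fcntl
import os
import subprocess
import sys
import tempfile


def run_locked(lock_path, command, timeout_seconds):
    """Run command only if lock_path is free (hypothetical helper).

    Returns the command's exit code, or None if another run already
    holds the lock. If the command exceeds timeout_seconds it is killed
    and subprocess.TimeoutExpired propagates to the caller.
    """
    with open(lock_path, "w") as lock_file:
        try:
            # Non-blocking exclusive lock: fail at once if it is already held.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return None
        # The lock is released when lock_file is closed at the end of the
        # with block, including when the time-out exception is raised.
        result = subprocess.run(command, timeout=timeout_seconds)
        return result.returncode


if __name__ == "__main__":
    # Assumed lock location for the demo; a real job would use a fixed path.
    lock = os.path.join(tempfile.gettempdir(), "example-job.lock")
    code = run_locked(lock, [sys.executable, "-c", "print('job ran')"], 30)
    print("skipped: already running" if code is None else f"exit code {code}")
```

A cron job wrapped this way is skipped when the previous run is still holding the lock, rather than queued behind it; whether skipping or waiting is the better behavior depends on the job.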