Ganglia
Ganglia is a scalable, real-time cluster monitoring
environment that collects cluster statistics in an open and
well-defined XML
format.
It is based on a
hierarchical design targeted at federations of clusters. It relies on a
multicast-based listen/announce protocol to monitor state within
clusters and uses a tree of point-to-point connections amongst
representative cluster nodes to federate clusters and aggregate their
state.
Ganglia leverages widely used technologies such as XML for data
representation, XDR for compact, portable data transport, and RRDtool
for data storage and visualization. It uses carefully engineered data
structures and algorithms to achieve very low per-node overheads and
high concurrency. The implementation is robust, has been ported to an
extensive set of operating systems and processor architectures, and is
currently in use on over 500 clusters around the world. It has been
used to link clusters across university campuses and around the world
and can scale to handle clusters with 2000 nodes.
Features include:
- Ganglia Monitoring Daemon - a multi-threaded daemon which
runs on each cluster node you want to monitor
- Ganglia Meta Daemon - consolidates data on the
head node
- Ganglia PHP Web Front-end
- rrdtool - consolidates data on the head node

Return
to Monitoring Home Page
Last Updated Tuesday, November 23 2010 @ 02:00 PM EST |