6 Top Data Analysis Tools for Big Data
Big Data is an all-inclusive term that refers to data sets so
large and complex that they need to be processed by specially designed
hardware and software tools. The data sets are typically of the order
of tera or exabytes in size. These data sets are created from a diverse
range of sources: sensors that gather climate information, publicly
available information such as magazines, newspapers, articles. Other
examples where big data is generated include purchase transaction
records, web logs, medical records, military surveillance, video and
image archives, and large-scale e-commerce.
There is a heightened interest in Big Data and Big Data
analysis and the implications they have for businesses. Big Data
analysis is the process of examining huge quantities of data to find
patterns, correlations, and other useful information that can help
firms become more responsive to change, and to make better informed
decisions.
Big Data analysis can be performed with data mining software.
However, the unstructured data sources used for big data analysis are
not necessarily suitable for investigation by traditional data mining
software.
This is the second in our series identifying the finest open
source software for Big Data. This feature highlights the finest data
analysis tools. Hopefully, there will be something of interest for
anyone who needs to analyse huge volumes of unstructured data.
Now, let's explore the 6 data analysis tools at hand. For
each title we have compiled its own portal page, a full description
with an in-depth analysis of its features, together with links to
relevant resources and reviews.
Data Analysis Tools
|
| Hadoop |
Distributed processing of large data sets across
clusters of computers |
| HPCC
Systems |
Designed
for the enterprise to resolve Big Data
challenges |
| Storm |
Distributed and fault-tolerant realtime computation |
| Apache
Drill |
Distributed
system for interactive analysis of
large-scale datasets |
| Rapid
Miner |
Knowledge discovery in databases, machine learning, and
data mining |
| Pentaho |
Enterprise
reporting, analysis, dashboard, data mining,
workflow and more |
Return to our complete collection of Group
Tests, identifying the finest Linux software.
Last Updated Thursday, April 11 2013 @ 03:36 PM EDT |