The Linux Portal Site
Polars is a DataFrame interface on top of an OLAP query engine implemented in Rust using the Apache Arrow columnar format.
datatable is a Python package for manipulating 2-dimensional tabular data structures (aka data frames).
Modin is a drop-in replacement for pandas. While pandas is single-threaded, Modin speeds up your workflows by using all available CPU cores.
pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools.
NumPy is the fundamental package for scientific computing with Python.
SciPy (pronounced “Sigh Pie”) is a Python-based ecosystem of open-source software for mathematics, science, and engineering.
Vaex is a program and Python library to visualize and explore large tabular datasets. It can calculate statistics such as mean, sum, count, and standard deviation on an N-dimensional grid.
HoloViews is an open-source Python library designed to make data analysis and visualization seamless and simple.
Dask is a flexible parallel computing library for analytics. It splits a Python job into tasks and schedules them across multiple cores or machines.
Optimus is a framework for profiling, cleaning, and processing data and for running machine learning in a distributed fashion using Apache Spark (PySpark).
yt is an open-source Python package for analyzing and visualizing volumetric data. yt focuses on driving physically meaningful inquiry.
AWS Data Wrangler extends the power of the pandas library to AWS, connecting DataFrames with AWS data-related services.
The R Project for Statistical Computing (R) is a free software environment for statistical computing and graphics.
MOA is a software environment for implementing algorithms and running experiments for online learning from evolving data streams.
Orange is a component-based framework for machine learning and data mining. It includes a range of data visualization and exploration techniques.
Environment for DeveLoping KDD-Applications Supported by Index-Structures (ELKI) is a data mining software framework.
DataMelt is an environment for scientific computation, data analysis and data visualization.
Weka (Waikato Environment for Knowledge Analysis) is a popular, comprehensive suite of machine learning software written in Java.
Rattle provides a GNOME-based open source interface to R functionality for binary classification tasks and data mining.
SU2 is a suite of tools for the numerical solution of partial differential equations (PDEs) and for performing PDE-constrained optimization.