Apache Zeppelin is an open source web-based notebook that enables interactive data analytics. It’s a data analytics and visualization platform that helps data engineers, data analysts and data scientists increase their productivity. Users can develop, organize, execute, and share data code and visualize results without reverting to the command line or needing to know the cluster details.
This software brings data exploration, visualization, sharing and collaboration features to Spark.
You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more.
Key Features
- Browser-based, notebook-style editor.
- Interactive snippet-at-a-time experience.
- Supports many interpreters including Apache Spark, Python, JDBC, Markdown, Shell and R, as well as Alluxio, Beam, BigQuery, Cassandra, Elasticsearch, Flink, Geode, HBase, HDFS, Hive, Ignite, Kylin, Lens, Livy, Pig, PostgreSQL, Scalding, and Scio. Apache Zeppelin’s interpreter concept allows any language or data-processing backend to be plugged into Zeppelin.
- Built-in Apache Spark support offering five interpreters:
  1. SparkInterpreter – creates a SparkContext and provides a Scala environment.
  2. PySparkInterpreter – provides a Python environment.
  3. SparkRInterpreter – provides an R environment with SparkR support.
  4. SparkSQLInterpreter – provides a SQL environment.
  5. DepInterpreter – dependency loader.
- Automatic SparkContext and SQLContext injection.
- Runtime jar dependency loading from the local filesystem or a Maven repository.
- Job cancellation and progress display.
- Data visualization including pivot charts, dynamic forms, and more. Output from any language backend can be recognized and visualized.
- Dynamically create input forms in your notebook. You can use form templates with a simple template language, or the forms can be created programmatically.
- Collaborate by sharing your notebook URL. Apache Zeppelin broadcasts any changes in real time.
- Matplotlib integration with the Python and PySpark interpreters.
- Multi-user support with LDAP, with a separate interpreter running scope per user or per note.
- Provides several REST APIs for interaction and remote activation of Zeppelin functionality.
- Official Docker images are available.
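
As an illustration of the visualization feature listed above: Zeppelin's display system treats interpreter output that begins with `%table` as tabular data, with columns separated by tabs and rows separated by newlines. The following is a minimal Python sketch of producing such output. The helper name `to_zeppelin_table` is hypothetical (it is not part of Zeppelin), and replacing embedded tabs and newlines with spaces is an assumption made here to keep the rows well-formed.

```python
def to_zeppelin_table(header, rows):
    """Format a header and rows as text for Zeppelin's %table display type.

    Hypothetical helper for illustration only; not a Zeppelin API.
    """
    def clean(value):
        # Tabs and newlines act as delimiters, so replace them inside cells
        # (an assumption of this sketch, not documented Zeppelin behaviour).
        return str(value).replace("\t", " ").replace("\n", " ")

    lines = ["\t".join(clean(h) for h in header)]
    for row in rows:
        lines.append("\t".join(clean(cell) for cell in row))
    # The leading "%table " marker tells the notebook how to render the text.
    return "%table " + "\n".join(lines)


# Sample data; in a notebook paragraph, printing this string should let
# Zeppelin render it as a table that can be switched to a chart view.
output = to_zeppelin_table(["name", "size"], [["sun", 100], ["moon", 10]])
print(output)
```

Because the marker is plain text, the same approach should work from any interpreter that can print a string, which is what allows output from different language backends to share the same visualizations.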
Website: zeppelin.apache.org
Support: Documentation, Wiki, GitHub Code Repository, Mailing Lists
Developer: Apache Software Foundation
License: Apache License 2.0

Apache Zeppelin is written in Java.
Related Software
| Notebook software | |
|---|---|
| JupyterLab | The next generation user interface for Project Jupyter |
| RStudio | Integrated development environment (IDE) for R |
| Jupyter Notebook | Web-based notebook environment for interactive computing |
| Positron | Next-generation data science IDE |
| marimo | Reactive Python notebook |
| Apache Zeppelin | Multi-purpose notebook |
| IPython | Rich architecture for interactive computing |
| Polynote | Experimental polyglot notebook environment |
| nteract | Notebooks on your Desktop |
| Pluto | Simple reactive notebooks for Julia |
| Pretzel | Billed as a modern replacement for Jupyter Notebooks |
| Spark Notebook | Interactive and reactive data science using Scala and Spark |
| BeakerX | Kernels and extensions to the Jupyter interactive computing environment |
Read our verdict in the software roundup.

