pyjanitor – extend pandas with readable data-cleaning functions

pyjanitor is a Python library that extends pandas with a collection of convenient, readable data-cleaning functions.

It provides a cleaner API for common preprocessing tasks, with support for method chaining so that dataframe transformations can be written as clear, sequential pipelines.

This is free and open source software.

Key Features

  • Extends pandas with additional data-cleaning and preprocessing functions.
  • Supports method chaining for readable dataframe transformation workflows.
  • Cleans column names, including support for multi-index columns.
  • Removes empty rows and columns from dataframes.
  • Identifies duplicate entries in datasets.
  • Adds, removes, and renames dataframe columns.
  • Coalesces multiple columns into a single column.
  • Converts dates from Excel, MATLAB, and Unix formats to Python datetime format.
  • Includes helper functions for splitting data into features and targets for machine learning.
  • Provides experimental submodules for finance, biology, chemistry, engineering, and PySpark.
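
To make the chaining idea concrete, here is a minimal sketch written in plain pandas rather than with pyjanitor itself. The helpers `tidy_column_names`, `drop_empty`, and `excel_serial_to_datetime` are hypothetical names invented for this illustration, as are the sample data and column names; they are not pyjanitor's implementation. pyjanitor exposes comparable steps as DataFrame methods (for example `clean_names` and `remove_empty`, available after `import janitor`), so the `.pipe(...)` calls below would typically become direct method calls.

```python
import pandas as pd


# Hypothetical helpers in plain pandas, for illustration only.
# They approximate the kind of steps pyjanitor offers as DataFrame methods.

def tidy_column_names(df: pd.DataFrame) -> pd.DataFrame:
    """Lowercase column names and replace spaces with underscores."""
    return df.rename(
        columns=lambda name: str(name).strip().lower().replace(" ", "_")
    )


def drop_empty(df: pd.DataFrame) -> pd.DataFrame:
    """Drop rows and columns that contain only missing values."""
    return df.dropna(axis="index", how="all").dropna(axis="columns", how="all")


def excel_serial_to_datetime(df: pd.DataFrame, column: str) -> pd.DataFrame:
    """Convert Excel serial day numbers (1900 date system) to datetimes."""
    converted = pd.to_datetime(df[column], unit="D", origin="1899-12-30")
    return df.assign(**{column: converted})


# Made-up sample data: one fully empty row and one fully empty column.
raw = pd.DataFrame(
    {
        "Order Date": [45292, 45293, None],
        "Customer Name": ["Ada", "Grace", None],
        "Notes": [None, None, None],
    }
)

# Each step returns a new dataframe, so the pipeline reads top to bottom.
cleaned = (
    raw
    .pipe(tidy_column_names)
    .pipe(drop_empty)
    .pipe(excel_serial_to_datetime, column="order_date")
)

print(cleaned)
```

Because every step takes a dataframe and returns a new one, steps can be reordered or removed without touching the rest of the pipeline, which is the readability benefit that method chaining aims for.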

Website: github.com/pyjanitor-devs/pyjanitor
Developer: pyjanitor-devs
License: MIT License

pyjanitor is written in Python.

