Skip to content

rapidstats:

PyPI version PyPI - Downloads License: MIT Tests pre-commit Code style: black Imports: isort

Documentation

What is it?

rapidstats is a minimal library that implements fast statistical routines in Rust and Polars. Currently, its core purpose is to provide the Bootstrap class. Unlike scipy.stats.bootstrap, the Bootstrap class contains specialized functions (for e.g. confusion matrix, ROC-AUC, and so on) implemented in Rust as well as a generic interface for any Python callable. This makes it significantly faster than passing in a callable on the Python side. For example, bootstrapping confusion matrix statistics can be up to 40x faster.

This library is in an alpha state. Although all functions are tested against existing libraries, use at your own risk. The API is subject to change very frequently.

Usage:

Dependencies

rapidstats has a minimal set of dependencies. It only depends on Polars and tqdm (for progress bars). You may install pyarrow (pip install rapidstats[pyarrow]) to allow functions to take NumPy arrays, Pandas objects, and other objects that may be converted through Arrow.

Installing

The easiest way is to install rapidstats is from PyPI using pip:

pip install rapidstats

Notes

The purpose of rapidstats is not to replace scipy or scikit-learn, which would be a massive undertaking. At the moment, only functions that are not easily achievable using existing libraries or functions that are significantly more performant (e.g. the core loop is implemented in Rust) will be added.