logml.eda.artifacts.missingness

Classes

MissingnessSummary()

Wrapper for dataset missingness statistics:

class logml.eda.artifacts.missingness.MissingnessSummary

Bases: object

Wrapper for dataset missingness statistics:

  • missing values stats per row

  • missing values stats per column

  • complete datasets per columns subsets

  • pairwise distances on top of nan features

  • columns order by nan features similarity

LABEL = 'missingness'
missing_values_per_row: Dict[str, pandas.core.frame.DataFrame]
missing_values_per_column: Dict[str, pandas.core.frame.DataFrame]
complete_dataset: Dict[str, pandas.core.frame.DataFrame]
pairwise_nan_distances: numpy.ndarray
similarity_order: List[str]