logml.eda.manager

Functions

topological_sort_items(artifact_ids)

Sorts artifact producers in topological order with respect to dependencies.

Classes

EDAArtifactsManager(objective_cfg, global_params)

High-level utility for working with EDA artifacts producers.

class logml.eda.manager.EDAArtifactsManager(objective_cfg: logml.configuration.modeling.ModelingTaskSpec, global_params: dict, eda_cfg: Optional[logml.configuration.eda.EDAArtifactsGenerationSection] = None, logger: Optional[logging.Logger] = None, dataset_metadata: Optional[logml.data.metadata.DatasetMetadata] = None)

Bases: object

High-level utility for working with EDA artifacts producers.

get_required_producers(artifact_ids: Optional[List[str]] = None) List[str]

Goes through a given list of required artifacts and pulls up all dependencies.

Example: input: [‘correlation’] output: [‘metadata’, ‘correlation’]

produce(dataframe: pandas.core.frame.DataFrame, required_artifacts: Optional[List[str]] = None)

For a given dataframe runs a list of required EDA artifacts producers.

logml.eda.manager.topological_sort_items(artifact_ids)

Sorts artifact producers in topological order with respect to dependencies.