logml.configuration.report
- class logml.configuration.report.BaselineKitWorkflowSection
Bases:
pydantic.main.BaseModel
Defines workflow for BaselineKit.
Show JSON schema
{ "title": "BaselineKitWorkflowSection", "description": "Defines workflow for BaselineKit.", "type": "object", "properties": { "produce_and_execute_strata_notebooks": { "title": "Produce And Execute Strata Notebooks", "description": "Enables creation and execution of strata-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.", "default": true, "type": "boolean" }, "produce_and_execute_global_notebooks": { "title": "Produce And Execute Global Notebooks", "description": "Enables creation and execution of global-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.", "default": true, "type": "boolean" }, "generate_report": { "title": "Generate Report", "description": "Enables BaselineKit report generation: TOC is created, strata-level notebooks are moved to the target global folder, rendering with Jupyterbook. The main purpose of this option is to enable parallelization of report generation process, additionally it allows a user to manually re-run some notebooks and regenerate the report having \"produce_and_execute_*\" flags turned off.", "default": true, "type": "boolean" } } }
- Fields
- field produce_and_execute_strata_notebooks: bool = True
Enables creation and execution of strata-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.
- field produce_and_execute_global_notebooks: bool = True
Enables creation and execution of global-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.
- field generate_report: bool = True
Enables BaselineKit report generation: TOC is created, strata-level notebooks are moved to the target global folder, rendering with Jupyterbook. The main purpose of this option is to enable parallelization of report generation process, additionally it allows a user to manually re-run some notebooks and regenerate the report having “produce_and_execute_*” flags turned off.
- class logml.configuration.report.MasterSummarySection
Bases:
pydantic.main.BaseModel
Defines which sections to include to the Master Summary.
Show JSON schema
{ "title": "MasterSummarySection", "description": "Defines which sections to include to the Master Summary.", "type": "object", "properties": { "enable": { "title": "Enable", "description": "Enables Master Summary section that contains highlights from multiple sections.", "default": false, "type": "boolean" }, "eda": { "title": "Eda", "description": "Turns on EDA section highlights. This requires EDA artifacts to be generated (see `ed` section.)", "default": false, "type": "boolean" }, "feature_importance": { "title": "Feature Importance", "description": "Enables Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_feature_importance": { "title": "Survival Feature Importance", "description": "Enables Survival Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "baseline_modeling": { "title": "Baseline Modeling", "description": "Enables Model Search/Selection summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_analysis": { "title": "Survival Analysis", "description": "Enables Survival Analysis summary for a given survival problem \"setup id\".", "default": "", "type": "string" } } }
- Fields
- field enable: bool = False
Enables Master Summary section that contains highlights from multiple sections.
- field eda: bool = False
Turns on EDA section highlights. This requires EDA artifacts to be generated (see ed section.)
- field feature_importance: str = ''
Enables Feature Importance summary for a given modeling problem “setup id”.
- field survival_feature_importance: str = ''
Enables Survival Feature Importance summary for a given modeling problem “setup id”.
- field baseline_modeling: str = ''
Enables Model Search/Selection summary for a given modeling problem “setup id”.
- field survival_analysis: str = ''
Enables Survival Analysis summary for a given survival problem “setup id”.
- list_modeling_problems() List[str]
Returns all mentioned modeling problems.
- list_survival_problems() List[str]
Returns all mentioned survival problems.
- class logml.configuration.report.ReportStructure
Bases:
pydantic.main.BaseModel
Defines BaselineKit report structure (topics) and expected params. Please refer to EligibleBaselineKitNotebooks for details on supported fields.
Show JSON schema
{ "title": "ReportStructure", "description": "Defines BaselineKit report structure (topics) and expected params.\nPlease refer to `EligibleBaselineKitNotebooks` for details on supported fields.", "type": "object", "properties": { "master_summary": { "title": "Master Summary", "default": { "enable": false, "eda": false, "feature_importance": "", "survival_feature_importance": "", "baseline_modeling": "", "survival_analysis": "" }, "allOf": [ { "$ref": "#/definitions/MasterSummarySection" } ] }, "eda": { "title": "Eda", "description": "Enables multiple EDA sections: dataset overview, numericals/categoricals analysis, dimensionality reduction, correlation analysis, etc.", "default": false, "type": "boolean" }, "modeling": { "title": "Modeling", "description": "Target modeling problems/setups for which Modeling Report sections should be produced.", "default": [], "type": "array", "items": { "type": "string" } }, "cross_strata_fi_summary": { "title": "Cross Strata Fi Summary", "description": "Target modeling problems/setups for which cross-strata analysis/comparison sections should be produced. NOTE: survival feature importance is supported as well.", "default": [], "type": "array", "items": { "type": "string" } }, "survival_analysis": { "title": "Survival Analysis", "description": "Target survival problems/setups for which survival analysis sections should be produced.", "default": [], "type": "array", "items": { "type": "string" } }, "greedy_split": { "title": "Greedy Split", "description": "Target analysis items (referenced by \"name\") of \"greedy_split\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "rnaseq_differential_expression": { "title": "Rnaseq Differential Expression", "description": "Target analysis items (referenced by \"name\") of \"rnaseq_differential_expression\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "rnaseq_enrichment_analysis": { "title": "Rnaseq Enrichment Analysis", "description": "Target analysis items (referenced by \"name\") of \"rnaseq_enrichment_analysis\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "report_diagnostics": { "title": "Report Diagnostics", "description": "Enables report diagnostic view.", "default": true, "type": "boolean" }, "report_summary": { "title": "Report Summary", "description": "Enables report summary view.", "default": true, "type": "boolean" } }, "definitions": { "MasterSummarySection": { "title": "MasterSummarySection", "description": "Defines which sections to include to the Master Summary.", "type": "object", "properties": { "enable": { "title": "Enable", "description": "Enables Master Summary section that contains highlights from multiple sections.", "default": false, "type": "boolean" }, "eda": { "title": "Eda", "description": "Turns on EDA section highlights. This requires EDA artifacts to be generated (see `ed` section.)", "default": false, "type": "boolean" }, "feature_importance": { "title": "Feature Importance", "description": "Enables Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_feature_importance": { "title": "Survival Feature Importance", "description": "Enables Survival Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "baseline_modeling": { "title": "Baseline Modeling", "description": "Enables Model Search/Selection summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_analysis": { "title": "Survival Analysis", "description": "Enables Survival Analysis summary for a given survival problem \"setup id\".", "default": "", "type": "string" } } } } }
- field master_summary: logml.configuration.report.MasterSummarySection = MasterSummarySection(enable=False, eda=False, feature_importance='', survival_feature_importance='', baseline_modeling='', survival_analysis='')
- field eda: bool = False
Enables multiple EDA sections: dataset overview, numericals/categoricals analysis, dimensionality reduction, correlation analysis, etc.
- field modeling: List[str] = []
Target modeling problems/setups for which Modeling Report sections should be produced.
- field cross_strata_fi_summary: List[str] = []
Target modeling problems/setups for which cross-strata analysis/comparison sections should be produced. NOTE: survival feature importance is supported as well.
- field survival_analysis: List[str] = []
Target survival problems/setups for which survival analysis sections should be produced.
- field greedy_split: List[str] = []
Target analysis items (referenced by “name”) of “greedy_split” type.
- field rnaseq_differential_expression: List[str] = []
Target analysis items (referenced by “name”) of “rnaseq_differential_expression” type.
- field rnaseq_enrichment_analysis: List[str] = []
Target analysis items (referenced by “name”) of “rnaseq_enrichment_analysis” type.
- field report_diagnostics: bool = True
Enables report diagnostic view.
- field report_summary: bool = True
Enables report summary view.
- is_notebook_enabled(notebook_alias: str) bool
Returns true when notebook enabled.
- list_modeling_problems() List[str]
Returns all mentioned modeling problems.
- list_survival_problems() List[str]
Returns all mentioned modeling problems.
- class logml.configuration.report.ReportSection
Bases:
pydantic.main.BaseModel
Defines BaselineKit section.
Show JSON schema
{ "title": "ReportSection", "description": "Defines BaselineKit section.", "type": "object", "properties": { "enable": { "title": "Enable", "description": "Enables BaselineKit report generation: underlying Jupyter notebooks generation (for target scopes) + report rendering via JupyterBook.", "default": true, "type": "boolean" }, "report_structure": { "title": "Report Structure", "default": { "master_summary": { "enable": false, "eda": false, "feature_importance": "", "survival_feature_importance": "", "baseline_modeling": "", "survival_analysis": "" }, "eda": false, "modeling": [], "cross_strata_fi_summary": [], "survival_analysis": [], "greedy_split": [], "rnaseq_differential_expression": [], "rnaseq_enrichment_analysis": [], "report_diagnostics": true, "report_summary": true }, "allOf": [ { "$ref": "#/definitions/ReportStructure" } ] }, "workflow": { "title": "Workflow", "default": { "produce_and_execute_strata_notebooks": true, "produce_and_execute_global_notebooks": true, "generate_report": true }, "allOf": [ { "$ref": "#/definitions/BaselineKitWorkflowSection" } ] } }, "definitions": { "MasterSummarySection": { "title": "MasterSummarySection", "description": "Defines which sections to include to the Master Summary.", "type": "object", "properties": { "enable": { "title": "Enable", "description": "Enables Master Summary section that contains highlights from multiple sections.", "default": false, "type": "boolean" }, "eda": { "title": "Eda", "description": "Turns on EDA section highlights. This requires EDA artifacts to be generated (see `ed` section.)", "default": false, "type": "boolean" }, "feature_importance": { "title": "Feature Importance", "description": "Enables Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_feature_importance": { "title": "Survival Feature Importance", "description": "Enables Survival Feature Importance summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "baseline_modeling": { "title": "Baseline Modeling", "description": "Enables Model Search/Selection summary for a given modeling problem \"setup id\".", "default": "", "type": "string" }, "survival_analysis": { "title": "Survival Analysis", "description": "Enables Survival Analysis summary for a given survival problem \"setup id\".", "default": "", "type": "string" } } }, "ReportStructure": { "title": "ReportStructure", "description": "Defines BaselineKit report structure (topics) and expected params.\nPlease refer to `EligibleBaselineKitNotebooks` for details on supported fields.", "type": "object", "properties": { "master_summary": { "title": "Master Summary", "default": { "enable": false, "eda": false, "feature_importance": "", "survival_feature_importance": "", "baseline_modeling": "", "survival_analysis": "" }, "allOf": [ { "$ref": "#/definitions/MasterSummarySection" } ] }, "eda": { "title": "Eda", "description": "Enables multiple EDA sections: dataset overview, numericals/categoricals analysis, dimensionality reduction, correlation analysis, etc.", "default": false, "type": "boolean" }, "modeling": { "title": "Modeling", "description": "Target modeling problems/setups for which Modeling Report sections should be produced.", "default": [], "type": "array", "items": { "type": "string" } }, "cross_strata_fi_summary": { "title": "Cross Strata Fi Summary", "description": "Target modeling problems/setups for which cross-strata analysis/comparison sections should be produced. NOTE: survival feature importance is supported as well.", "default": [], "type": "array", "items": { "type": "string" } }, "survival_analysis": { "title": "Survival Analysis", "description": "Target survival problems/setups for which survival analysis sections should be produced.", "default": [], "type": "array", "items": { "type": "string" } }, "greedy_split": { "title": "Greedy Split", "description": "Target analysis items (referenced by \"name\") of \"greedy_split\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "rnaseq_differential_expression": { "title": "Rnaseq Differential Expression", "description": "Target analysis items (referenced by \"name\") of \"rnaseq_differential_expression\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "rnaseq_enrichment_analysis": { "title": "Rnaseq Enrichment Analysis", "description": "Target analysis items (referenced by \"name\") of \"rnaseq_enrichment_analysis\" type.", "default": [], "type": "array", "items": { "type": "string" } }, "report_diagnostics": { "title": "Report Diagnostics", "description": "Enables report diagnostic view.", "default": true, "type": "boolean" }, "report_summary": { "title": "Report Summary", "description": "Enables report summary view.", "default": true, "type": "boolean" } } }, "BaselineKitWorkflowSection": { "title": "BaselineKitWorkflowSection", "description": "Defines workflow for BaselineKit.", "type": "object", "properties": { "produce_and_execute_strata_notebooks": { "title": "Produce And Execute Strata Notebooks", "description": "Enables creation and execution of strata-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.", "default": true, "type": "boolean" }, "produce_and_execute_global_notebooks": { "title": "Produce And Execute Global Notebooks", "description": "Enables creation and execution of global-level notebooks for BaselineKit report. The main purpose of this option is to enable parallelization of report generation process.", "default": true, "type": "boolean" }, "generate_report": { "title": "Generate Report", "description": "Enables BaselineKit report generation: TOC is created, strata-level notebooks are moved to the target global folder, rendering with Jupyterbook. The main purpose of this option is to enable parallelization of report generation process, additionally it allows a user to manually re-run some notebooks and regenerate the report having \"produce_and_execute_*\" flags turned off.", "default": true, "type": "boolean" } } } } }
- Fields
- field enable: bool = True
Enables BaselineKit report generation: underlying Jupyter notebooks generation (for target scopes) + report rendering via JupyterBook.
- field report_structure: logml.configuration.report.ReportStructure = ReportStructure(master_summary=MasterSummarySection(enable=False, eda=False, feature_importance='', survival_feature_importance='', baseline_modeling='', survival_analysis=''), eda=False, modeling=[], cross_strata_fi_summary=[], survival_analysis=[], greedy_split=[], rnaseq_differential_expression=[], rnaseq_enrichment_analysis=[], report_diagnostics=True, report_summary=True)
- field workflow: logml.configuration.report.BaselineKitWorkflowSection = BaselineKitWorkflowSection(produce_and_execute_strata_notebooks=True, produce_and_execute_global_notebooks=True, generate_report=True)