zensols.deeplearn package¶
Subpackages¶
- zensols.deeplearn.batch package
- Submodules
- zensols.deeplearn.batch.domain module
Batch
Batch.STATES
Batch.__init__()
Batch.attributes
Batch.batch_stash
Batch.data_points
Batch.deallocate()
Batch.get_label_classes()
Batch.get_label_feature_vectorizer()
Batch.get_labels()
Batch.has_labels
Batch.id
Batch.keys()
Batch.size()
Batch.split_name
Batch.state_name
Batch.to()
Batch.torch_config
Batch.write()
DataPoint
DefaultBatch
- zensols.deeplearn.batch.interface module
- zensols.deeplearn.batch.mapping module
BatchFeatureMapping
BatchFeatureMapping.__init__()
BatchFeatureMapping.get_attributes()
BatchFeatureMapping.get_field_map_by_attribute()
BatchFeatureMapping.get_field_map_by_feature_id()
BatchFeatureMapping.label_attribute_name
BatchFeatureMapping.label_feature_id
BatchFeatureMapping.label_vectorizer_manager
BatchFeatureMapping.manager_mappings
BatchFeatureMapping.write()
ConfigBatchFeatureMapping
FieldFeatureMapping
ManagerFeatureMapping
- zensols.deeplearn.batch.meta module
- zensols.deeplearn.batch.multi module
- zensols.deeplearn.batch.stash module
BatchStash
BatchStash._process()
BatchStash.__init__()
BatchStash.batch_data_point_sets
BatchStash.batch_feature_mappings
BatchStash.batch_limit
BatchStash.batch_metadata
BatchStash.batch_size
BatchStash.batch_type
BatchStash.clear()
BatchStash.clear_all()
BatchStash.create_batch()
BatchStash.data_point_id_sets_path
BatchStash.data_point_type
BatchStash.deallocate()
BatchStash.decoded_attributes
BatchStash.load()
BatchStash.model_torch_config
BatchStash.populate_batch_feature_mapping()
BatchStash.prime()
BatchStash.split_stash_container
BatchStash.vectorizer_manager_set
- Module contents
- zensols.deeplearn.dataframe package
- Submodules
- zensols.deeplearn.dataframe.batch module
- zensols.deeplearn.dataframe.util module
- zensols.deeplearn.dataframe.vectorize module
DataframeFeatureVectorizerManager
DataframeFeatureVectorizerManager.__init__()
DataframeFeatureVectorizerManager.batch_feature_mapping
DataframeFeatureVectorizerManager.column_to_feature_id()
DataframeFeatureVectorizerManager.dataset_metadata
DataframeFeatureVectorizerManager.exclude_columns
DataframeFeatureVectorizerManager.get_flattened_features_shape()
DataframeFeatureVectorizerManager.include_columns
DataframeFeatureVectorizerManager.label_attribute_name
DataframeFeatureVectorizerManager.label_col
DataframeFeatureVectorizerManager.label_shape
DataframeFeatureVectorizerManager.prefix
DataframeFeatureVectorizerManager.stash
DataframeFeatureVectorizerManager.write()
DataframeMetadata
- Module contents
- zensols.deeplearn.layer package
- Submodules
- zensols.deeplearn.layer.conv module
Convolution1DLayerFactory
Convolution1DLayerFactory.C_in
Convolution1DLayerFactory.F
Convolution1DLayerFactory.L_in
Convolution1DLayerFactory.__init__()
Convolution1DLayerFactory.create_batch_norm_layer()
Convolution1DLayerFactory.create_conv_layer()
Convolution1DLayerFactory.create_pool_layer()
Convolution1DLayerFactory.in_channels
Convolution1DLayerFactory.kernel_filter
Convolution1DLayerFactory.out_channels
Convolution1DLayerFactory.pool_kernel_filter
Convolution2DLayerFactory
Convolution2DLayerFactory.D
Convolution2DLayerFactory.F
Convolution2DLayerFactory.H
Convolution2DLayerFactory.H_out
Convolution2DLayerFactory.K
Convolution2DLayerFactory.W
Convolution2DLayerFactory.W_out
Convolution2DLayerFactory.W_row
Convolution2DLayerFactory.X_col
Convolution2DLayerFactory.__init__()
Convolution2DLayerFactory.create_batch_norm_layer()
Convolution2DLayerFactory.create_conv_layer()
Convolution2DLayerFactory.create_pool_layer()
Convolution2DLayerFactory.depth
Convolution2DLayerFactory.height
Convolution2DLayerFactory.kernel_filter
Convolution2DLayerFactory.n_filters
Convolution2DLayerFactory.pool_kernel_filter
Convolution2DLayerFactory.width
ConvolutionLayerFactory
ConvolutionLayerFactory.P
ConvolutionLayerFactory.S
ConvolutionLayerFactory.__init__()
ConvolutionLayerFactory.clone()
ConvolutionLayerFactory.create_batch_norm_layer()
ConvolutionLayerFactory.create_conv_layer()
ConvolutionLayerFactory.create_pool_layer()
ConvolutionLayerFactory.dim
ConvolutionLayerFactory.iter_layers()
ConvolutionLayerFactory.next_layer()
ConvolutionLayerFactory.out_conv_shape
ConvolutionLayerFactory.out_pool_shape
ConvolutionLayerFactory.padding
ConvolutionLayerFactory.pool_padding
ConvolutionLayerFactory.pool_stride
ConvolutionLayerFactory.stride
ConvolutionLayerFactory.validate()
- zensols.deeplearn.layer.crf module
- zensols.deeplearn.layer.linear module
- zensols.deeplearn.layer.recur module
RecurrentAggregation
RecurrentAggregationNetworkSettings
RecurrentAggregationNetworkSettings.__init__()
RecurrentAggregationNetworkSettings.aggregation
RecurrentAggregationNetworkSettings.bidirectional
RecurrentAggregationNetworkSettings.get_module_class_name()
RecurrentAggregationNetworkSettings.hidden_size
RecurrentAggregationNetworkSettings.input_size
RecurrentAggregationNetworkSettings.network_type
RecurrentAggregationNetworkSettings.num_layers
- zensols.deeplearn.layer.recurcrf module
RecurrentCRF
RecurrentCRFNetworkSettings
RecurrentCRFNetworkSettings.__init__()
RecurrentCRFNetworkSettings.bidirectional
RecurrentCRFNetworkSettings.decoder_settings
RecurrentCRFNetworkSettings.get_module_class_name()
RecurrentCRFNetworkSettings.hidden_size
RecurrentCRFNetworkSettings.input_size
RecurrentCRFNetworkSettings.network_type
RecurrentCRFNetworkSettings.num_labels
RecurrentCRFNetworkSettings.num_layers
RecurrentCRFNetworkSettings.score_reduction
RecurrentCRFNetworkSettings.to_recurrent_aggregation()
- Module contents
- zensols.deeplearn.model package
- Submodules
- zensols.deeplearn.model.analyze module
- zensols.deeplearn.model.batchiter module
- zensols.deeplearn.model.executor module
ModelExecutor
ModelExecutor.ATTR_EXP_META
ModelExecutor.__init__()
ModelExecutor.batch_iterator
ModelExecutor.batch_stash
ModelExecutor.config
ModelExecutor.config_factory
ModelExecutor.criterion_optimizer_scheduler
ModelExecutor.cross_fold_batch_stash
ModelExecutor.cross_fold_dataset_stash
ModelExecutor.cross_fold_result_manager
ModelExecutor.cross_fold_result_path
ModelExecutor.cross_validate()
ModelExecutor.dataset_split_names
ModelExecutor.dataset_stash
ModelExecutor.deallocate()
ModelExecutor.deallocate_batches()
ModelExecutor.debug
ModelExecutor.feature_stash
ModelExecutor.get_model_parameter()
ModelExecutor.get_network_parameter()
ModelExecutor.intermediate_results_path
ModelExecutor.load()
ModelExecutor.model
ModelExecutor.model_exists
ModelExecutor.model_manager
ModelExecutor.model_result_report
ModelExecutor.model_settings
ModelExecutor.name
ModelExecutor.net_settings
ModelExecutor.predict()
ModelExecutor.progress_bar
ModelExecutor.progress_bar_cols
ModelExecutor.reset()
ModelExecutor.result_manager
ModelExecutor.result_path
ModelExecutor.set_model_parameter()
ModelExecutor.set_network_parameter()
ModelExecutor.test()
ModelExecutor.torch_config
ModelExecutor.train()
ModelExecutor.train_manager
ModelExecutor.train_production()
ModelExecutor.update_path
ModelExecutor.write()
ModelExecutor.write_model()
ModelExecutor.write_settings()
- zensols.deeplearn.model.facade module
ModelFacade
ModelFacade.__init__()
ModelFacade.batch_metadata
ModelFacade.batch_stash
ModelFacade.cache_batches
ModelFacade.class_explorer
ModelFacade.clear()
ModelFacade.config
ModelFacade.config_factory
ModelFacade.configure_cli_logging()
ModelFacade.configure_default_cli_logging()
ModelFacade.configure_jupyter()
ModelFacade.cross_fold_batch_stash
ModelFacade.cross_fold_result_manager
ModelFacade.cross_validate()
ModelFacade.dataset_stash
ModelFacade.deallocate()
ModelFacade.debug()
ModelFacade.dropout
ModelFacade.epochs
ModelFacade.executor
ModelFacade.executor_name
ModelFacade.feature_stash
ModelFacade.get_batch_metrics()
ModelFacade.get_cross_fold_batch_metrics()
ModelFacade.get_encode_sparse_matrices()
ModelFacade.get_predictions_factory()
ModelFacade.get_result_analyzer()
ModelFacade.get_result_reporter()
ModelFacade.get_singleton()
ModelFacade.label_attribute_name
ModelFacade.last_result
ModelFacade.learning_rate
ModelFacade.load_from_path()
ModelFacade.model_config
ModelFacade.model_result_reporter_class
ModelFacade.model_settings
ModelFacade.net_settings
ModelFacade.persist_result()
ModelFacade.plot_result()
ModelFacade.predict()
ModelFacade.predictions_dataframe_factory_class
ModelFacade.progress_bar
ModelFacade.progress_bar_cols
ModelFacade.reload()
ModelFacade.remove_metadata_mapping_field()
ModelFacade.result_manager
ModelFacade.result_name
ModelFacade.set_encode_sparse_matrices()
ModelFacade.stop_training()
ModelFacade.test()
ModelFacade.train()
ModelFacade.train_production()
ModelFacade.update_model_config_factory()
ModelFacade.vectorizer_manager_set
ModelFacade.write()
ModelFacade.write_result()
ModelFacade.writer
- zensols.deeplearn.model.format module
LatexPerformanceMetricsDumper
PerformanceMetricsDumper
PerformanceMetricsDumper.__init__()
PerformanceMetricsDumper.by_label_columns
PerformanceMetricsDumper.by_label_dataframe
PerformanceMetricsDumper.capitalize()
PerformanceMetricsDumper.facade
PerformanceMetricsDumper.format_thousand()
PerformanceMetricsDumper.majority_label_res_id
PerformanceMetricsDumper.name_replace
PerformanceMetricsDumper.precision
PerformanceMetricsDumper.sort_column
PerformanceMetricsDumper.summary_columns
PerformanceMetricsDumper.summary_dataframe
PerformanceMetricsDumper.write()
- zensols.deeplearn.model.manager module
- zensols.deeplearn.model.meta module
- zensols.deeplearn.model.module module
- zensols.deeplearn.model.multilabel module
- zensols.deeplearn.model.optimizer module
- zensols.deeplearn.model.pack module
ModelPacker
ModelUnpacker
ModelUnpacker.__init__()
ModelUnpacker.config_factory
ModelUnpacker.facade
ModelUnpacker.facade_source
ModelUnpacker.install_model()
ModelUnpacker.installed_model_path
ModelUnpacker.installed_version
ModelUnpacker.installer
ModelUnpacker.model_config_overwrites
ModelUnpacker.model_packer_name
ModelUnpacker.write()
SubsetConfig
- zensols.deeplearn.model.pred module
- zensols.deeplearn.model.sequence module
- zensols.deeplearn.model.trainmng module
- zensols.deeplearn.model.wgtexecutor module
WeightedModelExecutor
WeightedModelExecutor.__init__()
WeightedModelExecutor.clear()
WeightedModelExecutor.get_class_weights()
WeightedModelExecutor.get_label_counts()
WeightedModelExecutor.get_label_statistics()
WeightedModelExecutor.use_weighted_criterion
WeightedModelExecutor.weighted_split_name
WeightedModelExecutor.weighted_split_path
- Module contents
- zensols.deeplearn.result package
- Submodules
- zensols.deeplearn.result.compare module
- zensols.deeplearn.result.domain module
ClassificationMetrics
DatasetResult
EpochResult
EpochResult.__init__()
EpochResult.batch_ids
EpochResult.batch_labels
EpochResult.batch_losses
EpochResult.batch_outputs
EpochResult.batch_predictions
EpochResult.clone()
EpochResult.end()
EpochResult.index
EpochResult.losses
EpochResult.n_data_points
EpochResult.split_type
EpochResult.update()
EpochResult.write()
Metrics
ModelResult
ModelResult.RUNS
ModelResult.__init__()
ModelResult.clone()
ModelResult.config
ModelResult.contains_results
ModelResult.context
ModelResult.decoded_attributes
ModelResult.get_intermediate()
ModelResult.get_num_runs()
ModelResult.last_test
ModelResult.last_test_name
ModelResult.model_settings
ModelResult.name
ModelResult.net_settings
ModelResult.non_empty_dataset_result
ModelResult.reset()
ModelResult.reset_runs()
ModelResult.test
ModelResult.train
ModelResult.validation
ModelResult.write()
ModelResult.write_result_statistics()
ModelResultError
ModelType
MultiLabelClassificationMetrics
MultiLabelClassificationMetrics.__init__()
MultiLabelClassificationMetrics.accuracy
MultiLabelClassificationMetrics.context
MultiLabelClassificationMetrics.create_metrics()
MultiLabelClassificationMetrics.dataframes
MultiLabelClassificationMetrics.multi_labels
MultiLabelClassificationMetrics.n_correct
MultiLabelClassificationMetrics.write()
MultiLabelScoreMetrics
NoResultError
PredictionMetrics
ResultContext
ResultsContainer
ResultsContainer.FLOAT_TYPES
ResultsContainer.__init__()
ResultsContainer.ave_loss
ResultsContainer.classification_metrics
ResultsContainer.clone()
ResultsContainer.contains_results
ResultsContainer.context
ResultsContainer.end()
ResultsContainer.is_ended
ResultsContainer.is_started
ResultsContainer.labels
ResultsContainer.max_loss
ResultsContainer.metrics
ResultsContainer.min_loss
ResultsContainer.model_type
ResultsContainer.multi_label_classification_metrics
ResultsContainer.n_iterations
ResultsContainer.n_outcomes
ResultsContainer.prediction_metrics
ResultsContainer.predictions
ResultsContainer.start()
ScoreMetrics
- zensols.deeplearn.result.hypsig module
AnovaSignificanceTest
ChiSquareEvaluation
ChiSquareEvaluation.__init__()
ChiSquareEvaluation.adjusted_residuals
ChiSquareEvaluation.associated
ChiSquareEvaluation.contingency_table
ChiSquareEvaluation.contribs
ChiSquareEvaluation.dof
ChiSquareEvaluation.expected
ChiSquareEvaluation.pearson_residuals
ChiSquareEvaluation.raw_residuals
ChiSquareEvaluation.write()
ChiSquareEvaluation.write_associated()
ChiSquareSignificanceTest
Evaluation
McNemarSignificanceTest
SignificanceError
SignificanceTest
SignificanceTestData
SignificanceTestData.__init__()
SignificanceTestData.a
SignificanceTestData.alpha
SignificanceTestData.b
SignificanceTestData.contingency_table
SignificanceTestData.correct_table
SignificanceTestData.gold_col
SignificanceTestData.id_col
SignificanceTestData.null_hypothesis
SignificanceTestData.pred_col
SignificanceTestSuite
StudentTTestSignificanceTest
WilcoxSignificanceTest
- zensols.deeplearn.result.manager module
ArchivedResult
ModelResultManager
ModelResultManager.__init__()
ModelResultManager.create_results_stash()
ModelResultManager.dump()
ModelResultManager.file_pattern
ModelResultManager.file_regex
ModelResultManager.get_grapher()
ModelResultManager.get_last_id()
ModelResultManager.get_next_graph_path()
ModelResultManager.get_next_json_path()
ModelResultManager.get_next_model_path()
ModelResultManager.get_next_text_path()
ModelResultManager.model_path
ModelResultManager.name
ModelResultManager.parse_file_name()
ModelResultManager.results_stash
ModelResultManager.save_json
ModelResultManager.save_json_result()
ModelResultManager.save_plot
ModelResultManager.save_plot_result()
ModelResultManager.save_text
ModelResultManager.save_text_result()
ModelResultManager.to_file_name()
- zensols.deeplearn.result.plot module
- zensols.deeplearn.result.pred module
MultiLabelPredictionsDataFrameFactory
PredictionsDataFrameFactory
PredictionsDataFrameFactory.CORRECT_COL
PredictionsDataFrameFactory.ID_COL
PredictionsDataFrameFactory.LABEL_COL
PredictionsDataFrameFactory.METRICS_DF_COLUMNS
PredictionsDataFrameFactory.METRICS_DF_MACRO_COLUMNS
PredictionsDataFrameFactory.METRICS_DF_MICRO_COLUMNS
PredictionsDataFrameFactory.METRICS_DF_WEIGHTED_COLUMNS
PredictionsDataFrameFactory.METRIC_COLUMNS
PredictionsDataFrameFactory.METRIC_DESCRIPTIONS
PredictionsDataFrameFactory.PREDICTION_COL
PredictionsDataFrameFactory.TEST_METRIC_COLUMNS
PredictionsDataFrameFactory.VALIDATION_METRIC_COLUMNS
PredictionsDataFrameFactory.__init__()
PredictionsDataFrameFactory.batch_limit
PredictionsDataFrameFactory.column_names
PredictionsDataFrameFactory.data_point_transform
PredictionsDataFrameFactory.dataframe
PredictionsDataFrameFactory.dataframe_describer
PredictionsDataFrameFactory.epoch_result
PredictionsDataFrameFactory.label_vectorizer_name
PredictionsDataFrameFactory.majority_label_metrics_describer
PredictionsDataFrameFactory.metric_metadata
PredictionsDataFrameFactory.metrics_dataframe_describer
PredictionsDataFrameFactory.metrics_to_series()
PredictionsDataFrameFactory.name
PredictionsDataFrameFactory.result
PredictionsDataFrameFactory.source
PredictionsDataFrameFactory.stash
SequencePredictionsDataFrameFactory
- zensols.deeplearn.result.report module
- Module contents
- zensols.deeplearn.vectorize package
- Submodules
- zensols.deeplearn.vectorize.domain module
- zensols.deeplearn.vectorize.manager module
EncodableFeatureVectorizer
FeatureVectorizerManager
FeatureVectorizerManager.ATTR_EXP_META
FeatureVectorizerManager.MANAGER_SEP
FeatureVectorizerManager.__init__()
FeatureVectorizerManager.configured_vectorizers
FeatureVectorizerManager.deallocate()
FeatureVectorizerManager.feature_ids
FeatureVectorizerManager.get()
FeatureVectorizerManager.items()
FeatureVectorizerManager.keys()
FeatureVectorizerManager.torch_config
FeatureVectorizerManager.transform()
FeatureVectorizerManager.values()
FeatureVectorizerManager.write()
FeatureVectorizerManagerSet
FeatureVectorizerManagerSet.ATTR_EXP_META
FeatureVectorizerManagerSet.__init__()
FeatureVectorizerManagerSet.deallocate()
FeatureVectorizerManagerSet.feature_ids
FeatureVectorizerManagerSet.get()
FeatureVectorizerManagerSet.get_vectorizer()
FeatureVectorizerManagerSet.get_vectorizer_names()
FeatureVectorizerManagerSet.keys()
FeatureVectorizerManagerSet.names
FeatureVectorizerManagerSet.values()
FeatureVectorizerManagerSet.write()
TransformableFeatureVectorizer
- zensols.deeplearn.vectorize.util module
- zensols.deeplearn.vectorize.vectorizers module
AggregateEncodableFeatureVectorizer
AggregateEncodableFeatureVectorizer.DEFAULT_PAD_LABEL
AggregateEncodableFeatureVectorizer.DESCRIPTION
AggregateEncodableFeatureVectorizer.__init__()
AggregateEncodableFeatureVectorizer.create_padded_tensor()
AggregateEncodableFeatureVectorizer.delegate
AggregateEncodableFeatureVectorizer.delegate_feature_id
AggregateEncodableFeatureVectorizer.pad_label
AggregateEncodableFeatureVectorizer.size
AttributeEncodableFeatureVectorizer
CategoryEncodableFeatureVectorizer
IdentityEncodableFeatureVectorizer
MaskFeatureContext
MaskFeatureVectorizer
NominalEncodedEncodableFeatureVectorizer
NominalMultiLabelEncodedEncodableFeatureVectorizer
OneHotEncodedEncodableFeatureVectorizer
SeriesEncodableFeatureVectorizer
- Module contents
Submodules¶
zensols.deeplearn.cli module¶
Command line entry point to the application using the application CLI.
- class zensols.deeplearn.cli.BatchReport(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
- labels = 3¶
- none = 1¶
- split = 2¶
- stats = 4¶
- class zensols.deeplearn.cli.ClearType(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
Indicates what type of data to delete (clear).
- batch = 2¶
- none = 1¶
- source = 3¶
- class zensols.deeplearn.cli.FacadeApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
Deallocatable
Base class for applications that use
ModelFacade
.- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'option_overrides': {'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)¶
-
cache_global_facade:
bool
= True¶ Whether to globally cache the facade returned by
get_cached_facade()
.
-
config:
Configurable
¶ The config used to create facade instances.
-
config_factory_args:
Dict
[str
,Any
]¶ The arguments given to the
ImportConfigFactory
, which could be useful for reloading all classes while debugingg.
-
config_overwrites:
Configurable
= None¶ A configurable that clobbers any configuration in
config
for those sections/options set.
- get_cached_facade(**kwargs) ModelFacade ¶
Return a created facade that is cached in this application instance.
- Return type:
ModelFacade
-
model_config_overwrites:
Configurable
= None¶ Configuration that is injected into the model loaded by the
model.ModelManager
.
- class zensols.deeplearn.cli.FacadeApplicationFactory(package_resource, app_config_resource='resources/app.conf', children_configs=None, reload_factory=False, reload_pattern=None, error_handler=None)[source]¶
Bases:
ApplicationFactory
This is a utility class that creates instances of
FacadeApplication
. It’s only needed if you need to create a facade without wanting invoke the command line attached to the applications.It does this by only invoking the first pass applications so all the correct initialization happens before returning factory artifacts.
There mst be a
FacadeApplication.facade_name
entry in the configuration tied to an instance ofFacadeApplication
.- See:
- __init__(package_resource, app_config_resource='resources/app.conf', children_configs=None, reload_factory=False, reload_pattern=None, error_handler=None)¶
- class zensols.deeplearn.cli.FacadeApplicationManager(cli_harness, cli_args_fn=<function FacadeApplicationManager.<lambda>>, reset_torch=True, allocation_tracking=False, logger_name='notebook', default_logging_level='WARNING', progress_bar_cols=120, config_overwrites=<factory>)[source]¶
Bases:
Writable
A very high level client interface making it easy to configure and run models from an interactive environment such as a Python REPL or a Jupyter notebook (see
JupyterManager
)- __init__(cli_harness, cli_args_fn=<function FacadeApplicationManager.<lambda>>, reset_torch=True, allocation_tracking=False, logger_name='notebook', default_logging_level='WARNING', progress_bar_cols=120, config_overwrites=<factory>)¶
-
allocation_tracking:
Union
[bool
,str
] = False¶ Whether or not to track resource/memory leaks. If set to
stack
, the stack traces of the unallocated objects will be printed. If set tocounts
only the counts will be printed. If set toTrue
only the unallocated objects without the stack will be printed.
- cleanup(include_cuda=True, quiet=False)[source]¶
Report memory leaks, run the Python garbage collector and optionally empty the CUDA cache.
- Parameters:
include_cuda (
bool
) – ifTrue
clear the GPU cachequiet (
bool
) – do not report unallocated objects, regardless of the setting ofallocation_tracking
- cli_args_fn()¶
Creates the arguments used to create the facade from the application factory.
-
cli_harness:
CliHarness
¶ The CLI harness used to create the facade application.
- config(section, **kwargs)[source]¶
Add overwriting configuration used when creating the facade.
- Parameters:
section (
str
) – the section to be overwritten (or added)kwargs – the key/value pairs used as the section data to overwrite
- See:
-
config_overwrites:
Dict
[str
,Dict
[str
,str
]]¶ Clobbers any configuration set by
config()
for those sections/options set.
- create_facade(*args, **kwargs)[source]¶
Create and return a facade. This deallocates and cleans up state from any previous facade creation as a side effect.
- Parameters:
args – given to the
cli_args_fn
function to create arguments passed to the CLI- Return type:
-
default_logging_level:
str
= 'WARNING'¶ If set, then initialize the logging system using this as the default
logging level. This is the upper case logging name such as
WARNING
.
- property facade: ModelFacade¶
The current facade for this notebook instance.
- Returns:
the existing facade, or that created by
create_facade()
if it doesn’t already exist
- run(display_results=True)[source]¶
Train, test and optionally show results.
- Parameters:
display_results (
bool
) – ifTrue
, write and plot the results
- show_leaks(output='counts', fail=True)[source]¶
Show all resources/memory leaks in the current facade. First, this deallocates the facade, then prints any lingering objects using
Deallocatable
.Important:
allocation_tracking
must be set toTrue
for this to work.
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, include_model=False, include_metadata=False, include_settings=False, **kwargs)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
- class zensols.deeplearn.cli.FacadeBatchApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
FacadeApplication
Create and analyze batches.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'batch': {'option_includes': {'clear_type', 'limit', 'report'}}}, 'option_overrides': {'clear': {'short_name': None}, 'clear_type': {'long_name': 'ctype', 'short_name': None}, 'limit': {'short_name': None}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'report': {'short_name': None}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)¶
- batch(limit=None, clear_type=ClearType.none, report=BatchReport.none)[source]¶
Create batches if not already, print statistics on the dataset.
- Parameters:
clear_type (
ClearType
) – what to delete to force recreatelimit (
int
) – the number of batches to createreport (
BatchReport
) – the type of report to generate
- class zensols.deeplearn.cli.FacadeInfoApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, feature_stash_name='feature_stash')[source]¶
Bases:
FacadeApplication
Contains methods that provide information about the model via the facade.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'print_information': 'info'}, 'option_excludes': {'feature_stash_name'}, 'option_overrides': {'debug_value': {'long_name': 'execlevel', 'short_name': None}, 'info_item': {'long_name': 'item', 'short_name': 'i'}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, feature_stash_name='feature_stash')¶
- debug(debug_value=None)[source]¶
Debug the model.
- Parameters:
debug_value (
int
) – the executor debugging level
-
feature_stash_name:
str
= 'feature_stash'¶ The section name of the stash to write for
InfoItem.feature
.
- class zensols.deeplearn.cli.FacadeModelApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, use_progress_bar=False)[source]¶
Bases:
FacadeApplication
Test, train and validate models.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'early_stop': {'name': 'stop', 'option_includes': {}}, 'train_production': 'trainprod'}, 'option_overrides': {'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'result_name': {'long_name': 'desc', 'short_name': None}, 'use_progress_bar': {'long_name': 'progress', 'short_name': 'p'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, use_progress_bar=False)¶
- test(model_path=None, result_name=None)[source]¶
Test an existing model the model and dump the results of the test.
- train(result_name=None)[source]¶
Train the model and dump the results, including a graph of the train/validation loss.
- Parameters:
result_name (
str
) – a descriptor used in the results
- train_production()[source]¶
Train, test the model on train and test datasets, then dump the results with a graph.
- class zensols.deeplearn.cli.FacadePackageApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, packer=None)[source]¶
Bases:
FacadeApplication
Contains methods that package models.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'option_excludes': {'packer'}, 'option_overrides': {'archive_model_dir': {'long_name': 'archdir', 'metavar': 'DIR'}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'train_model_dir': {'long_name': 'modeldir', 'metavar': 'DIR'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, packer=None)¶
-
packer:
ModelPacker
= None¶ The model packer used to create the model distributions from this app.
- class zensols.deeplearn.cli.FacadePredictApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
FacadeApplication
,_DataDescriberProcessor
An applicaiton that provides prediction funtionality.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'predictions': 'respreds'}, 'option_overrides': {'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- class zensols.deeplearn.cli.FacadeResultApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
FacadeApplication
,_DataDescriberProcessor
Contains methods that dump previous results.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'compare_results': 'rescmp', 'result_ids': 'resids', 'run': 'resrun', 'summary': 'ressum'}, 'option_overrides': {'describe': {'short_name': None}, 'include_validation': {'long_name': 'validation', 'short_name': None}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_file': {'long_name': 'outfile', 'short_name': 'o'}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'report_type': {'long_name': 'report'}, 'sort': {'short_name': 's'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)¶
- class zensols.deeplearn.cli.Format(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
- csv = 4¶
- json = 2¶
- latex = 5¶
- render = 6¶
- txt = 1¶
- yaml = 3¶
- class zensols.deeplearn.cli.InfoItem(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
Indicates what information to dump in
FacadeInfoApplication.print_information()
.- batch = 6¶
- config = 4¶
- feature = 5¶
- meta = 1¶
- model = 3¶
- param = 2¶
- class zensols.deeplearn.cli.JupyterManager(cli_harness, cli_args_fn=<function FacadeApplicationManager.<lambda>>, reset_torch=True, allocation_tracking=False, logger_name='notebook', default_logging_level='WARNING', progress_bar_cols=120, config_overwrites=<factory>, reduce_logging=False, browser_width=95)[source]¶
Bases:
FacadeApplicationManager
A facade application manager that provides additional convenience functionality.
- __init__(cli_harness, cli_args_fn=<function FacadeApplicationManager.<lambda>>, reset_torch=True, allocation_tracking=False, logger_name='notebook', default_logging_level='WARNING', progress_bar_cols=120, config_overwrites=<factory>, reduce_logging=False, browser_width=95)¶
- create_facade(*args, **kwargs)[source]¶
Create and return a facade. This deallocates and cleans up state from any previous facade creation as a side effect.
- Parameters:
args – given to the
cli_args_fn
function to create arguments passed to the CLI- Return type:
zensols.deeplearn.clicv module¶
Cross-fold validation application classes.
- class zensols.deeplearn.clicv.CrossValidationReport(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
- results = 1¶
- stats = 2¶
- class zensols.deeplearn.clicv.FacadeCrossValidateBatchApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
_FacadeCrossValidateApplication
,FacadeBatchApplication
Create and analyze batches.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'batch': {'option_includes': {'clear_type', 'limit', 'report'}}, 'cross_validate_batch': {'name': 'cvalbatch'}}, 'option_overrides': {'clear': {'short_name': None}, 'clear_type': {'long_name': 'ctype', 'short_name': None}, 'limit': {'short_name': None}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'report': {'short_name': None}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)¶
- cross_validate_batch(limit=None, clear_type=ClearType.none, report=BatchReport.none)[source]¶
Create cross-validation batches if not already, print statistics on the dataset.
- Parameters:
clear_type (
ClearType
) – what to delete to force recreatelimit (
int
) – the number of batches to createreport (
BatchReport
) – the type of report to generate
- class zensols.deeplearn.clicv.FacadeCrossValidateModelApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, use_progress_bar=False)[source]¶
Bases:
FacadeModelApplication
Test, train and validate models.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'cross_validate': 'cvalrun', 'early_stop': {'name': 'stop', 'option_includes': {}}, 'train_production': 'trainprod'}, 'option_excludes': {'CLASS_INSPECTOR'}, 'option_overrides': {'model_path': {'long_name': 'model', 'short_name': None}, 'n_iterations': {'long_name': 'iters', 'short_name': None}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'result_name': {'long_name': 'desc', 'short_name': None}, 'use_progress_bar': {'long_name': 'progress', 'short_name': 'p'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None, use_progress_bar=False)¶
- cross_validate(n_iterations=1)[source]¶
Cross validate the model and dump the results.
- Parameters:
n_iterations (
int
) – the number of train/test iterations per fold
- use_progress_bar: bool = False¶
Display the progress bar.
- class zensols.deeplearn.clicv.FacadeCrossValidateResultApplication(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)[source]¶
Bases:
_FacadeCrossValidateApplication
,FacadeResultApplication
Cross validation results.
- CLI_META = {'mnemonic_excludes': {'clear_cached_facade', 'create_facade', 'deallocate', 'get_cached_facade'}, 'mnemonic_overrides': {'compare_results': 'rescmp', 'cross_validate': 'cvalressum', 'result_ids': 'resids', 'run': 'resrun', 'summary': 'ressum'}, 'option_overrides': {'describe': {'short_name': None}, 'include_validation': {'long_name': 'validation', 'short_name': None}, 'model_path': {'long_name': 'model', 'short_name': None}, 'out_file': {'long_name': 'outfile', 'short_name': 'o'}, 'out_format': {'long_name': 'format', 'short_name': 'f'}, 'report_type': {'long_name': 'report'}, 'sort': {'short_name': 's'}}}¶
Tell the command line app API to igonore subclass and client specific use case methods.
- __init__(config, facade_name='facade', model_path=None, config_factory_args=<factory>, config_overwrites=None, cache_global_facade=True, model_config_overwrites=None)¶
- cross_validate(report, out_file=None, out_format=None)[source]¶
Create a summary of all archived results.
- Parameters:
report (
CrossValidationReport
) – the type of report to generateout_file (
Path
) – the output path or-
for standard outout_format (
Format
) – the output format
zensols.deeplearn.domain module¶
This file contains classes that configure the network and classifier runs.
- class zensols.deeplearn.domain.ActivationNetworkSettings(name, config_factory, torch_config, activation)[source]¶
Bases:
NetworkSettings
A network settings that contains a activation setting and creates a activation layer.
- __init__(name, config_factory, torch_config, activation)¶
- class zensols.deeplearn.domain.BatchNormNetworkSettings(name, config_factory, torch_config, batch_norm_d, batch_norm_features)[source]¶
Bases:
NetworkSettings
A network settings that contains a batchnorm setting and creates a batchnorm layer.
- __init__(name, config_factory, torch_config, batch_norm_d, batch_norm_features)¶
-
batch_norm_d:
int
¶ The dimension of the batch norm or
None
to disable. Based on this one of the following is used as a layer:torch.nn.BatchNorm1d
torch.nn.BatchNorm2d
torch.nn.BatchNorm3d
- property batch_norm_layer¶
- class zensols.deeplearn.domain.DatasetSplitType(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]¶
Bases:
Enum
Indicates an action on the model, which is first trained, validated, then tested.
Implementation note: for now
test
is used for both testing the model and ad-hoc prediction- test = 3¶
- train = 1¶
- validation = 2¶
- exception zensols.deeplearn.domain.DeepLearnError[source]¶
Bases:
APIError
Raised for any frame originated error.
- __annotations__ = {}¶
- __module__ = 'zensols.deeplearn.domain'¶
- class zensols.deeplearn.domain.DropoutNetworkSettings(name, config_factory, torch_config, dropout)[source]¶
Bases:
NetworkSettings
A network settings that contains a dropout setting and creates a dropout layer.
- __init__(name, config_factory, torch_config, dropout)¶
- property dropout_layer¶
- exception zensols.deeplearn.domain.EarlyBailError[source]¶
Bases:
DeepLearnError
Convenience used for helping debug the network.
- __annotations__ = {}¶
- __module__ = 'zensols.deeplearn.domain'¶
- exception zensols.deeplearn.domain.ModelError[source]¶
Bases:
DeepLearnError
Raised for any model related error.
- __annotations__ = {}¶
- __module__ = 'zensols.deeplearn.domain'¶
- class zensols.deeplearn.domain.ModelSettings(name, config_factory, model_name, path, learning_rate, epochs, append_model_path=None, max_consecutive_increased_count=9223372036854775807, nominal_labels=True, labels=None, batch_iteration_class_name=None, criterion_class_name=None, optimizer_class_name=None, optimizer_params=None, clip_gradient_threshold=None, scale_gradient_params=None, scheduler_class_name=None, scheduler_params=None, reduce_outcomes='argmax', shuffle_training=False, batch_limit=9223372036854775807, batch_iteration='cpu', prediction_mapper_name=None, cache_batches=True, gc_level=0, observer_manager=<factory>, store_model_result='test', store_report=True)[source]¶
Bases:
Writeback
,PersistableContainer
This configures and instance of
ModelExecutor
. This differes fromNetworkSettings
in that it configures the model parameters, and not the neural network parameters.Another reason for these two separate classes is data in this class is not needed to rehydrate an instance of
torch.nn.Module
.The loss function strategy across parameters
nominal_labels
,criterion_class
andoptimizer_class
, must be consistent. The defaults uses nominal labels, which means a single integer, rather than one hot encoding, is used for the labels. Most loss function, including the defaulttorch.nn.CrossEntropyLoss`
uses nominal labels. The optimizer defaults totorch.optim.Adam
.However, if
nominal_labels
is set toFalse
, it is expected that the label output is aLong
one hot encoding of the class label that must be decoded withBatchIterator._decode_outcomes()
and uses a loss function such astorch.nn.BCEWithLogitsLoss
, which applies a softmax over the output to narow to a nominal.If the
criterion_class
is left as the default, the class the corresponding class across these two is selected based onnominal_labels
.Note: Instances of this class are pickled as parts of the results in
zensols.deeplearn.result.domain.ModelResult
, so they must be able to serialize. However, they are not used to restore the executor or model, which are instead, recreated from the configuration for each (re)load (see the package documentation for more information).- See:
- __init__(name, config_factory, model_name, path, learning_rate, epochs, append_model_path=None, max_consecutive_increased_count=9223372036854775807, nominal_labels=True, labels=None, batch_iteration_class_name=None, criterion_class_name=None, optimizer_class_name=None, optimizer_params=None, clip_gradient_threshold=None, scale_gradient_params=None, scheduler_class_name=None, scheduler_params=None, reduce_outcomes='argmax', shuffle_training=False, batch_limit=9223372036854775807, batch_iteration='cpu', prediction_mapper_name=None, cache_batches=True, gc_level=0, observer_manager=<factory>, store_model_result='test', store_report=True)¶
-
append_model_path:
str
= None¶ Whether and how to append the model’s name to the end of
path
. If this value isverbatim
, append the model name as provided withmodel_name
, ifnormalize
usenormalize_name()
to normalize it, and ifNone
do not append anything.
-
batch_iteration:
str
= 'cpu'¶ How the batches are buffered, which is one of:
gpu
, buffers all data in the GPUcpu
, which means keep all batches in CPU memory (the default)buffered
which means to buffer only one batch at a time (only for very large data).
-
batch_iteration_class_name:
InitVar
= None¶ A string fully qualified class name of type
BatchIterator
. This must be set to a class such asScoredBatchIterator
to handle descrete states in the output layer such as terminating CRF states. The default isBatchIterator
, which expects continuous output layers.
-
batch_limit:
Union
[int
,float
] = 9223372036854775807¶ The max number of batches to train, validate and test on, which is useful for limiting while debuging; defaults to sys.maxsize. If this value is a float, it is assumed to be a number between [0, 1] and the number of batches is multiplied by the value.
-
clip_gradient_threshold:
float
= None¶ Parameters passed to
torch.nn.utils.clip_grad_value_()
to clip gradients above this threshold.
-
criterion_class_name:
InitVar
= None¶ The loss function class name (see class doc).
-
gc_level:
int
= 0¶ The frequency by with the garbage collector is invoked. The higher the value, the more often it will be run during training, testing and validation.
-
labels:
Tuple
[str
,...
] = None¶ The list of labels for the model. This is currently only used for multi-label classification so the results and give more useful information.
-
max_consecutive_increased_count:
int
= 9223372036854775807¶ The maximum number of times the validation loss can increase per epoch before the executor “gives up” and early stops training.
-
nominal_labels:
bool
= True¶ True
if using numbers to identify the class as an enumeration rather than a one hot encoded array.
- property normal_model_name: str¶
Return the normalized
model_name
usingnormalize_name()
.
- static normalize_name(name)[source]¶
Normalize the name in to a string that is more file system friendly. This is used for the
model_name
by API components that write data to the file system about the model this class configures such asModelResultManager
.- Return type:
- Returns:
the normalized name
-
observer_manager:
ModelObserverManager
¶ The model observer used by the entire train, test, validation process.
-
optimizer_class_name:
InitVar
= None¶ The optimization algorithm class name (see class doc).
-
optimizer_params:
Dict
[str
,Any
] = None¶ The parameters given as
**kwargs
when creating the optimizer. Do not add the learning rate, instead seelearning_rate
.
-
prediction_mapper_name:
str
= None¶ Creates data points from a client for the purposes of prediction. This value is the string class name of an instance of
PredictionMapper
used to create predictions. While optional, if not set, ad-hoc predictions (i.e. from the command line) can not be created.Instances of
PredictionMapper
are created and managed in theModelFacade
.
-
reduce_outcomes:
str
= 'argmax'¶ The method by which the labels and output is reduced. The output is optionally reduced, which is one of the following:
argmax
: uses the index of the largest value, which is used for classification models and the defaultsoftmax
: just likeargmax
but applies a softmaxnone
: return the identity.otherwise, a class name whose instances takes the tensor batch as input and returns the the replacement batch tensor as the output
-
scale_gradient_params:
Dict
[str
,Union
[float
,bool
]] = None¶ Parameters passed to
torch.nn.utils.clip_grad_norm_()
to scale the gradient norm.
-
scheduler_class_name:
str
= None¶ The fully qualified class name of the learning rate scheduler used for the optimizer (if not
None
) such as:torch.optim.lr_scheduler.StepLR
or,torch.optim.lr_scheduler.ReduceLROnPlateau
.
- See:
-
scheduler_params:
Dict
[str
,Any
] = None¶ The parameters given as
**kwargs
when creating the scheduler (if any).- See:
-
shuffle_training:
bool
= False¶ If
True
shuffle the training data set split before the training process starts. The shuffling only happens once for all epocs.
-
store_model_result:
str
= 'test'¶ Whether to store the
ModelResult
instance in the state file, which is one of:test
: only tested models, as apposed to usingtrain_production()
always
: always save results, even in production modelsnever
: there will be no training or validation results in output
The results are also stored as
.dat
files in the results directory.- See:
- class zensols.deeplearn.domain.NetworkSettings(name, config_factory, torch_config)[source]¶
Bases:
Writeback
,PersistableContainer
A container settings class for network models. This abstract class must return the fully qualified (with module name) PyTorch model (`torch.nn.Module`) that goes along with these settings. An instance of this class is saved in the model file and given back to it when later restored.
Note: Instances of this class are pickled as parts of the results in
zensols.deeplearn.result.domain.ModelResult
, so they must be able to serialize. However, they are not used to restore the executor or model, which are instead, recreated from the configuration for each (re)load (see the package documentation for more information).- See:
- __init__(name, config_factory, torch_config)¶
-
config_factory:
ConfigFactory
¶ The configuration factory used to create the module.
- create_module(*args, **kwargs)[source]¶
Create a new instance of the network model.
- Return type:
Module
- abstract get_module_class_name()[source]¶
Returns the fully qualified class name of the module to create by
ModelManager
. This module takes as the first parameter an instance of this class.Important: This method is not used for nested modules. You must declare specific class names in the configuration for those nested class naems.
- Return type:
-
torch_config:
TorchConfig
¶ The GPU configuration for
torch.nn.Module
layers.
zensols.deeplearn.observer module¶
Contains a simple but effective observer pattern set of classes for training, testing and validating models.
- class zensols.deeplearn.observer.DumperObserver(events=<factory>, flatten=True, flatten_short_classes=True, output_file=PosixPath('dumper-observer.csv'), file_mode='append', trigger_events=<factory>, trigger_callers=None, mkdir=True, add_columns=None)[source]¶
Bases:
RecorderObserver
A class that dumps all data when certain events are received as a CSV to the file sytsem.
- __init__(events=<factory>, flatten=True, flatten_short_classes=True, output_file=PosixPath('dumper-observer.csv'), file_mode='append', trigger_events=<factory>, trigger_callers=None, mkdir=True, add_columns=None)¶
-
add_columns:
Dict
[str
,Any
] = None¶ Additional columns to add to the data frame across all rows if given.
-
file_mode:
str
= 'append'¶ If
append
, then append data to the output .CSV file. Otherwise, ifoverwrite
then overwrite the data.
-
output_file:
Path
= PosixPath('dumper-observer.csv')¶ The path to where the (flattened data) is written.
- class zensols.deeplearn.observer.FilterModelObserver(delegate, include_events=<factory>)[source]¶
Bases:
ModelObserver
Filters messages from the client to a delegate observer.
- __init__(delegate, include_events=<factory>)¶
-
delegate:
ModelObserver
¶ The delegate observer to notify on notifications from this observer.
- class zensols.deeplearn.observer.LogModelObserver(logger=<Logger zensols.deeplearn.observer.event (WARNING)>, level=20, add_context_format='{event}: {context}')[source]¶
Bases:
ModelObserver
Logs notifications to
logging
system.- __init__(logger=<Logger zensols.deeplearn.observer.event (WARNING)>, level=20, add_context_format='{event}: {context}')¶
-
add_context_format:
str
= '{event}: {context}'¶ If not
None
, use the string to format the log message.
-
logger:
Logger
= <Logger zensols.deeplearn.observer.event (WARNING)>¶ The logger that receives notifications.
- class zensols.deeplearn.observer.ModelObserver[source]¶
Bases:
ABC
Recipient of notifications by the model framework.
- class zensols.deeplearn.observer.ModelObserverManager(observers=<factory>)[source]¶
Bases:
object
- __init__(observers=<factory>)¶
-
observers:
List
[ModelObserver
]¶ A list of observers that get notified of all model lifecycle and process events.
- class zensols.deeplearn.observer.RecorderObserver(events=<factory>, flatten=True, flatten_short_classes=True)[source]¶
Bases:
ModelObserver
Records notifications and provides them as output.
- __init__(events=<factory>, flatten=True, flatten_short_classes=True)¶
-
flatten:
bool
= True¶ Whether or not make the caller and context in to a strings before storing them in
events
.
-
flatten_short_classes:
bool
= True¶ If
True
, then only use the class name sans module. Otherwise, use the fully qualified class name.
- zensols.deeplearn.observer.event_logger = <Logger zensols.deeplearn.observer.event (WARNING)>¶
`.LogModelObserver.
- Type:
Event logger for the
- Type:
class
- zensols.deeplearn.observer.mod_logger = <Logger zensols.deeplearn.observer.status (WARNING)>¶
Logger for this module.
zensols.deeplearn.torchconfig module¶
CUDA access and utility module.
- class zensols.deeplearn.torchconfig.CudaInfo[source]¶
Bases:
Writable
A utility class that provides information about the CUDA configuration for the current (hardware) environment.
- class zensols.deeplearn.torchconfig.TorchConfig(use_gpu=True, data_type=torch.float32, cuda_device_index=None, device_name=None)[source]¶
Bases:
PersistableContainer
,Writable
A utility class that provides access to CUDA APIs. It provides information on the current CUDA configuration and convenience methods to create, copy and modify tensors. These are handy for any given CUDA configuration and can back off to the CPU when CUDA isn’t available.
- __init__(use_gpu=True, data_type=torch.float32, cuda_device_index=None, device_name=None)[source]¶
Initialize this configuration.
- Parameters:
use_gpu (
bool
) – whether or not to use CUDA/GPUdata_type (
type
) – the default data type to use when creating new tensors in this configurationcuda_device_index (
int
) – the CUDA device to use, which defaults to 0 if CUDA ifuse_gpu
isTrue
device_name (
str
) – the string name of the device to use (i.e.cpu
ormps
); if provided, overridescuda_device_index
- cat(*args, **kwargs)[source]¶
Concatenate tensors in to one tensor using
torch.cat
.- Return type:
Tensor
- static close(a, b)[source]¶
Return whether or not two tensors are equal. This does an exact cell comparison.
- Return type:
- property cpu_device: torch.device¶
Return the CPU CUDA device, which is the device type configured to utilize the CPU (rather than the GPU).
- cross_entropy_pad(size)[source]¶
Create a padded tensor of size
size
using the repeated padignore_index
.- Return type:
Tensor
- property cuda_configs: Tuple[TorchConfig]¶
Return a new set of configurations, one for each CUDA device.
- property cuda_device_index: int | None¶
Return the CUDA device index if CUDA is being used for this configuration. Otherwise return
None
.
- property device: device¶
Return the torch device configured.
- static empty_cache()[source]¶
Empty the CUDA torch cache. This releases memory in the GPU and should not be necessary to call for normal use cases.
- static equal(a, b)[source]¶
Return whether or not two tensors are equal. This does an exact cell comparison.
- Return type:
- float(*args, **kwargs)[source]¶
Return a new tensor using
torch.tensor
as a float type.- Return type:
Tensor
- property float_type: Type¶
Return the float type that represents this configuration, converting to the corresponding precision from integer if necessary.
- Returns:
the float that represents this data, or
None
if neither float nor int
- from_iterable(array)[source]¶
Return a one dimenstional tensor created from
array
using the type and device in the current instance configuration.- Return type:
Tensor
- from_numpy(arr)[source]¶
Return a new tensor generated from a numpy aray using
torch.from_numpy
. The array type is converted if necessary.- Return type:
Tensor
- classmethod get_random_seed()[source]¶
Get the cross system random seed, meaning the seed applied to CUDA and the Python random library.
- Return type:
- classmethod get_random_seed_context()[source]¶
Return the random seed context given to
set_random_seed()
to restore across models for consistent results.
- static in_memory_tensors()[source]¶
Returns all in-memory tensors and parameters.
- See:
show_leaks()
- Return type:
List
[Tensor
]
- classmethod init(spawn_multiproc='spawn', seed_kwargs={})[source]¶
Initialize the PyTorch framework. This includes:
Configuration of PyTorch multiprocessing so subprocesses can access the GPU, and
Setting the random seed state.
The needs to be initialized at the very beginning of your program.
Example:
def main(): from zensols.deeplearn import TorchConfig TorchConfig.init()
Note: this method is separate from
set_random_seed()
because that method is called by the framework to reset the seed after a model is unpickled.- See:
torch.multiprocessing
- See:
- int(*args, **kwargs)[source]¶
Return a new tensor using
torch.tensor
as a int type.- Return type:
Tensor
- property int_type: Type¶
Return the int type that represents this configuration, converting to the corresponding precision from integer if necessary.
- Returns:
the int that represents this data, or
None
if neither int nor float
- property numpy_data_type: Type[dtype]¶
Return the numpy type that corresponds to this instance’s configured
data_type
.
- same_device(tensor_or_model)[source]¶
Return whether or not a tensor or model is in the same memory space as this configuration instance.
- Return type:
- classmethod set_random_seed(seed=0, disable_cudnn=True, rng_state=True)[source]¶
Set the random number generator for PyTorch.
- Parameters:
- See:
- See:
- property tensor_class: Type[dtype]¶
Return the class type based on the current configuration of this instance. For example, if using
torch.float32
on the GPU,torch.cuda.FloatTensor
is returned.
- to(tensor_or_model)[source]¶
Copy the tensor or model to the device this to that of this configuration.
- Return type:
Union
[Module
,Tensor
]
- classmethod to_cpu_deallocate(*arrs)[source]¶
Safely copy detached memory to the CPU and delete local instance (possibly GPU) memory to speed up resource deallocation. If the tensor is already on the CPU, it’s simply passed back. Otherwise the tensor is deleted.
This method is robust with
None
, which are skipped and substituted asNone
in the output.
- to_type(arr)[source]¶
Convert the type of the given array to the type of this instance.
- Return type:
Tensor
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
- write_device_tensors(writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>)[source]¶
Like
write_in_memory_tensors()
, but filter on this instance’s device.- Parameters:
filter_device – if given, write only tensors matching this device
- See:
- class zensols.deeplearn.torchconfig.printopts(**kwargs)[source]¶
Bases:
object
Object used with a
with
scope that sets options, then sets them back.Example:
with printopts(profile='full', linewidth=120): print(tensor)
- DEFAULTS = {'edgeitems': 3, 'linewidth': 80, 'precision': 4, 'profile': 'default', 'sci_mode': None, 'threshold': 1000}¶
zensols.deeplearn.torchtype module¶
CUDA access and utility module.
- class zensols.deeplearn.torchtype.TorchTypes[source]¶
Bases:
object
A utility class to convert betwen numpy and torch classes. It also provides metadata for types that make other conversions, such as same precision cross types (i.e. int64 -> float64).
- FLOAT_TO_INT = {torch.float16: torch.int16, torch.float32: torch.int32, torch.float64: torch.int64}¶
- FLOAT_TYPES = frozenset({torch.float16, torch.float32, torch.float64})¶
- INT_TO_FLOAT = {torch.int16: torch.float16, torch.int32: torch.float32, torch.int64: torch.float64}¶
- INT_TYPES = frozenset({torch.int16, torch.int32, torch.int64})¶
- NAME_TO_TYPE = {'bool': {'cpu': <class 'torch.BoolTensor'>, 'desc': 'Boolean', 'gpu': <class 'torch.cuda.BoolTensor'>, 'name': 'bool', 'numpy': <class 'bool'>, 'types': {torch.bool}}, 'float16': {'cpu': <class 'torch.HalfTensor'>, 'desc': '16-bit floating point', 'gpu': <class 'torch.cuda.HalfTensor'>, 'name': 'float16', 'numpy': <class 'numpy.float16'>, 'sparse': <class 'torch.sparse.HalfTensor'>, 'types': {torch.float16}}, 'float32': {'cpu': <class 'torch.FloatTensor'>, 'desc': '32-bit floating point', 'gpu': <class 'torch.cuda.FloatTensor'>, 'name': 'float32', 'numpy': <class 'numpy.float32'>, 'sparse': <class 'torch.sparse.FloatTensor'>, 'types': {torch.float32}}, 'float64': {'cpu': <class 'torch.DoubleTensor'>, 'desc': '64-bit floating point', 'gpu': <class 'torch.cuda.DoubleTensor'>, 'name': 'float64', 'numpy': <class 'numpy.float64'>, 'sparse': <class 'torch.sparse.DoubleTensor'>, 'types': {torch.float64}}, 'int16': {'cpu': <class 'torch.ShortTensor'>, 'desc': '16-bit integer (signed)', 'gpu': <class 'torch.cuda.ShortTensor'>, 'name': 'int16', 'numpy': <class 'numpy.int16'>, 'sparse': <class 'torch.sparse.ShortTensor'>, 'types': {torch.int16}}, 'int32': {'cpu': <class 'torch.IntTensor'>, 'desc': '32-bit integer (signed)', 'gpu': <class 'torch.cuda.IntTensor'>, 'name': 'int32', 'numpy': <class 'numpy.int32'>, 'sparse': <class 'torch.sparse.IntTensor'>, 'types': {torch.int32}}, 'int64': {'cpu': <class 'torch.LongTensor'>, 'desc': '64-bit integer (signed)', 'gpu': <class 'torch.cuda.LongTensor'>, 'name': 'int64', 'numpy': <class 'numpy.int64'>, 'sparse': <class 'torch.sparse.LongTensor'>, 'types': {torch.int64}}, 'int8': {'cpu': <class 'torch.CharTensor'>, 'desc': '8-bit integer (signed)', 'gpu': <class 'torch.cuda.CharTensor'>, 'name': 'int8', 'numpy': <class 'numpy.int8'>, 'sparse': <class 'torch.sparse.CharTensor'>, 'types': {torch.int8}}, 'uint8': {'cpu': <class 'torch.ByteTensor'>, 'desc': '8-bit integer (unsigned)', 'gpu': <class 'torch.cuda.ByteTensor'>, 'name': 'uint8', 'numpy': <class 'numpy.uint8'>, 'sparse': <class 'torch.sparse.ByteTensor'>, 'types': {torch.uint8}}}¶
A map of type to metadata.
- TYPES = [{'cpu': <class 'torch.FloatTensor'>, 'desc': '32-bit floating point', 'gpu': <class 'torch.cuda.FloatTensor'>, 'name': 'float32', 'numpy': <class 'numpy.float32'>, 'sparse': <class 'torch.sparse.FloatTensor'>, 'types': {torch.float32}}, {'cpu': <class 'torch.DoubleTensor'>, 'desc': '64-bit floating point', 'gpu': <class 'torch.cuda.DoubleTensor'>, 'name': 'float64', 'numpy': <class 'numpy.float64'>, 'sparse': <class 'torch.sparse.DoubleTensor'>, 'types': {torch.float64}}, {'cpu': <class 'torch.HalfTensor'>, 'desc': '16-bit floating point', 'gpu': <class 'torch.cuda.HalfTensor'>, 'name': 'float16', 'numpy': <class 'numpy.float16'>, 'sparse': <class 'torch.sparse.HalfTensor'>, 'types': {torch.float16}}, {'cpu': <class 'torch.ByteTensor'>, 'desc': '8-bit integer (unsigned)', 'gpu': <class 'torch.cuda.ByteTensor'>, 'name': 'uint8', 'numpy': <class 'numpy.uint8'>, 'sparse': <class 'torch.sparse.ByteTensor'>, 'types': {torch.uint8}}, {'cpu': <class 'torch.CharTensor'>, 'desc': '8-bit integer (signed)', 'gpu': <class 'torch.cuda.CharTensor'>, 'name': 'int8', 'numpy': <class 'numpy.int8'>, 'sparse': <class 'torch.sparse.CharTensor'>, 'types': {torch.int8}}, {'cpu': <class 'torch.ShortTensor'>, 'desc': '16-bit integer (signed)', 'gpu': <class 'torch.cuda.ShortTensor'>, 'name': 'int16', 'numpy': <class 'numpy.int16'>, 'sparse': <class 'torch.sparse.ShortTensor'>, 'types': {torch.int16}}, {'cpu': <class 'torch.IntTensor'>, 'desc': '32-bit integer (signed)', 'gpu': <class 'torch.cuda.IntTensor'>, 'name': 'int32', 'numpy': <class 'numpy.int32'>, 'sparse': <class 'torch.sparse.IntTensor'>, 'types': {torch.int32}}, {'cpu': <class 'torch.LongTensor'>, 'desc': '64-bit integer (signed)', 'gpu': <class 'torch.cuda.LongTensor'>, 'name': 'int64', 'numpy': <class 'numpy.int64'>, 'sparse': <class 'torch.sparse.LongTensor'>, 'types': {torch.int64}}, {'cpu': <class 'torch.BoolTensor'>, 'desc': 'Boolean', 'gpu': <class 'torch.cuda.BoolTensor'>, 'name': 'bool', 'numpy': <class 'bool'>, 'types': {torch.bool}}]¶
A list of dicts containig conversions between types.
Module contents¶
A deep learning framework to make training, validating and testing models with PyTorch easier.