zensols.deeplearn.model package¶
Submodules¶
zensols.deeplearn.model.analyze module¶
Analysis tools for batch sets and result comparison.
- class zensols.deeplearn.model.analyze.BatchMetrics(stash)[source]¶
Bases:
object
Provides analysis and metrics on a set of batches.
- __init__(stash)¶
- get_label_dataframe()[source]¶
Return a dataframe of the labels of each batch’s data point set.
- Return type:
- Returns:
columns of the batch ID, data point ID, dataset split, and the label(s)
- get_label_variance()[source]¶
Return a dataframe of the label standard deviation across batches. This calculates the standard deviation of the portion of labels across the batches to give a measure of the extent to which labels are imbalanced.
- Return type:
-
stash:
Stash
¶ The stash containing the instances of
Batch
, usually aBatchStash
.
- class zensols.deeplearn.model.analyze.DataComparison(key, previous, current, compare_df)[source]¶
Bases:
Dictable
Contains the results from two runs used to compare. The data in this object is used to compare the validation loss from a previous run to a run that’s currently in progress. This is provided along with the performance metrics of the runs when written with
write()
.- __init__(key, previous, current, compare_df)¶
-
compare_df:
DataFrame
¶ A dataframe with the validation loss from the previous and current results and that difference.
-
current:
ModelResult
¶ The current results, which is probably a model currently running.
-
key:
str
¶ The results key used with a
ModelResultManager
.
-
previous:
ModelResult
¶ The previous resuls of the model from a previous run.
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>)[source]¶
Write this instance as either a
Writable
or as aDictable
. If class attribute_DICTABLE_WRITABLE_DESCENDANTS
is set asTrue
, then use thewrite()
method on children instead of writing the generated dictionary. Otherwise, write this instance by first creating adict
recursively usingasdict()
, then formatting the output.If the attribute
_DICTABLE_WRITE_EXCLUDES
is set, those attributes are removed from what is written in thewrite()
method.Note that this attribute will need to be set in all descendants in the instance hierarchy since writing the object instance graph is done recursively.
- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
- class zensols.deeplearn.model.analyze.ResultAnalyzer(executor, previous_results_key, cache_previous_results)[source]¶
Bases:
object
Load results from a previous run of the
ModelExecutor
and a more recent run. This run is usually a currently running model to compare the results during training. This might provide meaningful information such as whether to early stop training.- __init__(executor, previous_results_key, cache_previous_results)¶
-
cache_previous_results:
bool
¶ If
True
, globally cache the previous results to avoid having to reload each time.
- property comparison: DataComparison¶
Load the results data and create a comparison instance read to write or jsonify.
- property current_results: Tuple[ModelResult, ModelResult]¶
Return the current results (see class docs).
-
executor:
ModelExecutor
¶ The executor (not the running executor necessary) that will load the results if not already loadded.
- property previous_results: ModelResult¶
Return the previous results (see class docs).
zensols.deeplearn.model.batchiter module¶
Contains a class to assist in the batch loop during training, validation and testing.
- class zensols.deeplearn.model.batchiter.BatchIterator(executor, logger)[source]¶
Bases:
object
This class assists in the batch loop during training, validation and testing. Any special handling of a model related to its loss function can be overridden in this class.
- _decode_outcomes(outcomes)[source]¶
Transform the model output (and optionally the labels) that will be added to the
EpochResult
, which composes aModelResult
.This implementation returns :py:meth:~`Tensor.argmax`, which are the indexes of the max value across columns.
- Return type:
Tensor
- __init__(executor, logger)¶
-
executor:
InitVar
¶ The owning executor.
- iterate(model, optimizer, criterion, batch, epoch_result, split_type)[source]¶
Train, validate or test on a batch. This uses the back propogation algorithm on training and does a simple feed forward on validation and testing.
One call of this method represents a single batch iteration
- Parameters:
model (
BaseNetworkModule
) – the model to excerciseoptimizer (
Optimizer
) – the optimization algorithm (i.e. adam) to iteratecriterion – the loss function (i.e. cross entropy loss) used for the backward propogation step
batch (
Batch
) – contains the data to test, predict, and optionally the labels for training and validationepoch_result (
EpochResult
) – to be populated with the results of this epoch’s runsplit_type (
DatasetSplitType
) – indicates if we’re training, validating or testing
- Return type:
Tensor
- Returns:
the singleton tensor containing the loss
zensols.deeplearn.model.executor module¶
This file contains the network model and data that holds the results.
- class zensols.deeplearn.model.executor.ModelExecutor(config_factory, config, name, model_settings, net_settings, dataset_stash, dataset_split_names, cross_fold_dataset_stash=None, result_path=None, cross_fold_result_path=None, update_path=None, intermediate_results_path=None, progress_bar=False, progress_bar_cols=None)[source]¶
Bases:
PersistableContainer
,Deallocatable
,Writable
This class creates and uses a network to train, validate and test the model. This class is either configured using a
ConfigFactory
or is unpickled withModelManager
. If the later, it’s from a previously trained (and possibly tested) state.Typically, after creating a nascent instance,
train()
is called to train the model. This returns the results, but the results are also available via theResultManager
using themodel_manager
property. To load previous results, useexecutor.result_manager.load()
.During training, the training set is used to train the weights of the model provided by the executor in the
model_settings
, then validated using the validation set. When the validation loss is minimized, the following is saved to disk:Settings:
net_settings
,model_settings
,the model weights,
the results of the training and validation thus far,
the entire configuration (which is later used to restore the executor),
random seed information, which includes Python, Torch and GPU random state.
After the model is trained, you can immediately test the model with
test()
. To be more certain of being able to reproduce the same results, it is recommended to load the model withmodel_manager.load_executor()
, which loads the last instance of the model that produced a minimum validation loss.- See:
- See:
- See:
zensols.deeplearn.model.ModelSettings
- __init__(config_factory, config, name, model_settings, net_settings, dataset_stash, dataset_split_names, cross_fold_dataset_stash=None, result_path=None, cross_fold_result_path=None, update_path=None, intermediate_results_path=None, progress_bar=False, progress_bar_cols=None)¶
- property batch_iterator: BatchIterator¶
The train manager that assists with the training process.
- property batch_stash: DatasetSplitStash¶
The stash used to obtain the data for training and testing. This stash should have a training, validation and test splits. The names of these splits are given in the
dataset_split_names
.
-
config:
Configurable
¶ The configuration used in the configuration factory to create this instance.
-
config_factory:
ConfigFactory
¶ The configuration factory that created this instance.
- property criterion_optimizer_scheduler: Tuple[L1Loss, Optimizer, Any]¶
Return the loss function and descent optimizer.
- property cross_fold_batch_stash: DatasetSplitStash¶
The stash (like
batch_stash
) that contains cross-validation batches.- See:
-
cross_fold_dataset_stash:
DatasetSplitStash
= None¶ The stash that holds the cross validation folds, or
None
if no cross validation support is needed.
- property cross_fold_result_manager: ModelResultManager¶
Return the manager used for controlling the life cycle of the cross-fold validation results generated by this executor.
-
cross_fold_result_path:
Path
= None¶ If not
None
, a path to a directory where the results of the cross-fold validation.- See:
obj:result_path
- cross_validate(n_iterations)[source]¶
Cross validate the model storing the results in
cross_fold_result_path
. The folds are taken from the mini-batches taken fromcross_fold_dataset_stash
. The training batches in each iteration’s (n_iterations
) fold are shuffled.Just as with split dataset training, the training set is used for training and the validation set is used for validation with a batched set of data created by a class such as
zensols.dataset.split.StratifiedCrossFoldSplitKeyContainer
.- Parameters:
n_iterations (
int
) – the number of train/test iterations per fold- Return type:
-
dataset_split_names:
List
[str
]¶ train, validation, test (see
_get_dataset_splits()
)- Type:
The list of split names in the
dataset_stash
in the order
-
dataset_stash:
DatasetSplitStash
¶ The split data set stash that contains the
BatchStash
, which contains the batches on which to train and test.
- property feature_stash: Stash¶
The stash used to generate the feature, which is not to be confused with the batch source stash``batch_stash``.
-
intermediate_results_path:
Path
= None¶ If this is set, then save the model and results to this path after validation for each training epoch.
- load()[source]¶
Clear all results and trained state and reload the last trained model from the file system.
- Return type:
Module
- Returns:
the model that was loaded and registered in this instance of the executor
- property model: BaseNetworkModule¶
Get the PyTorch module that is used for training and test.
- property model_exists: bool¶
Return whether the executor has a model.
- Returns:
True
if the model has been trained or loaded
- property model_manager: ModelManager¶
Return the manager used for controlling the train of the model.
- property model_result_report: str | None¶
A human readable summary of the model’s performance.
- Returns:
the report if the model has been trained
-
model_settings:
ModelSettings
¶ The configuration of the model.
-
net_settings:
NetworkSettings
¶ The settings used to configure the network.
- predict(batches)[source]¶
Create predictions on ad-hoc data.
-
progress_bar_cols:
int
= None¶ The number of console columns to use for the text/ASCII based progress bar.
- property result_manager: ModelResultManager¶
Return the manager used for controlling the life cycle of the results generated by this executor.
-
result_path:
Path
= None¶ If not
None
, a path to a directory where the results are to be dumped; the directory will be created if it doesn’t exist when the results are generated.
- set_model_parameter(name, value)[source]¶
Safely set a parameter of the model, found in
model_settings
. This makes the corresponding update in the configuration, so that when it is restored (i.e for test) the parameters are consistent with the trained model. The value is converted to a string as the configuration representation stores all data values as strings.Important:
eval
syntaxes are not supported, and probably not the kind of values you want to set a parameters with this interface anyway.
- set_network_parameter(name, value)[source]¶
Safely set a parameter of the network, found in
network_settings
. This makes the corresponding update in the configuration, so that when it is restored (i.e for test) the parameters are consistent with the trained network. The value is converted to a string as the configuration representation stores all data values as strings.Important:
eval
syntaxes are not supported, and probably not the kind of values you want to set a parameters with this interface anyway.
- test(result_name=None)[source]¶
Test the model.
- Parameters:
result_name (
str
) – a descriptor used in the results, which is useful when making incremental hyperparameter changes to the model- Return type:
- property torch_config: TorchConfig¶
Return the PyTorch configuration used to convert models and data (usually GPU) during training and test.
- train(result_name=None)[source]¶
Train the model.
- Parameters:
result_name (
str
) – a descriptor used in the results, which is useful when making incremental hyperparameter changes to the model- Return type:
- property train_manager: TrainManager¶
Return the train manager that assists with the training process.
- train_production(result_name=None)[source]¶
Train and test the model on the training and test datasets. This is used for a “production” model that is used for some purpose other than evaluation.
- Parameters:
result_name (
str
) – a descriptor used in the results, which is useful when making incremental hyperparameter changes to the model- Return type:
-
update_path:
Path
= None¶ The path to check for commands/updates to make while training. If this is set, and the file exists, then it is parsed as a JSON file. If the file cannot be parsed, or 0 size etc., then the training is (early) stopped.
If the file can be parsed, and there is a single
epoch
dict entry, then the current epoch is set to that value.
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, include_settings=False, include_model=False)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
zensols.deeplearn.model.facade module¶
Client entry point to the model.
- class zensols.deeplearn.model.facade.ModelFacade(config, config_factory=<property object>, progress_bar=True, progress_bar_cols='term', executor_name='executor', writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, predictions_dataframe_factory_class=<class 'zensols.deeplearn.result.pred.PredictionsDataFrameFactory'>, model_result_reporter_class=<class 'zensols.deeplearn.result.report.ModelResultReporter'>, result_name=None)[source]¶
Bases:
PersistableContainer
,Writable
This class provides easy to use client entry points to the model executor, which trains, validates, tests, saves and loads the model.
More common attributes, such as the learning rate and number of epochs, are properties that dispatch to
executor
. For the others, go directly to the property.- __init__(config, config_factory=<property object>, progress_bar=True, progress_bar_cols='term', executor_name='executor', writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, predictions_dataframe_factory_class=<class 'zensols.deeplearn.result.pred.PredictionsDataFrameFactory'>, model_result_reporter_class=<class 'zensols.deeplearn.result.report.ModelResultReporter'>, result_name=None)¶
- property batch_metadata: BatchMetadata¶
Return the batch metadata used on the executor.
- See:
zensols.deepnlp.model.module.EmbeddingNetworkSettings
- property batch_stash: BatchStash¶
The stash used to encode and decode batches by the executor.
- property class_explorer: FacadeClassExplorer¶
-
config:
Configurable
¶ The configuraiton used to create the facade, and used to create a new configuration factory to load models.
- property config_factory¶
The configuration factory used to create this facade, or
None
if no factory was used.
- configure_cli_logging(log_level=None)[source]¶
“Configure command line (or Python REPL) debugging. Each facade can turn on name spaces that make sense as useful information output for long running training/testing iterations.
This calls “meth:_configure_cli_logging to collect the names of loggers at various levels.
- static configure_default_cli_logging(log_level=30)[source]¶
Configure the logging system with the defaults.
- configure_jupyter(log_level=30, progress_bar_cols=120)[source]¶
Configures logging and other configuration related to a Jupyter notebook. This is just like
configure_cli_logging()
, but adjusts logging for what is conducive for reporting in Jupyter cells.;param log_level: the default logging level for the logging system
- Parameters:
progress_bar_cols (
int
) – the number of columns to use for the progress bar
- property cross_fold_batch_stash: BatchStash¶
The stash used to encode and decode batches by the executor.
- property cross_fold_result_manager: ModelResultManager¶
Return the executor’s cross-fold validation result manager.
- cross_validate(n_iterations=1)[source]¶
Perform a cross validation of using the additional resources configured in the
executor
.- Parameters:
n_iterations (
int
) – the number of train/test iterations per fold- Return type:
- property dataset_stash: DatasetSplitStash¶
The stash used to encode and decode batches split by dataset.
- debug(debug_value=True)[source]¶
Debug the model by setting the configuration to debug mode and invoking a single forward pass. Logging must be configured properly to get the output, which is typically just invoking
logging.basicConfig()
.
- property executor: ModelExecutor¶
A cached instance of the executor tied to the instance of this class.
-
executor_name:
str
= 'executor'¶ The configuration entry name for the executor, which defaults to
executor
.
- property feature_stash: Stash¶
The stash used to generate the feature, which is not to be confused with the batch source stash
batch_stash
.
- get_batch_metrics()[source]¶
Return a class that performs metrics on the
batch_stash
.- Return type:
- get_cross_fold_batch_metrics()[source]¶
Return a class that performs metrics on the
batch_stash
.- Return type:
- static get_encode_sparse_matrices()[source]¶
Return whether or not sparse matricies are encoded.
- See:
set_sparse()
- Return type:
- get_predictions_factory(column_names=None, transform=None, batch_limit=9223372036854775807, name=None)[source]¶
Generate a predictions factory from the test data set.
- Parameters:
column_names (
List
[str
]) – the list of string column names for each data item the list returned fromdata_point_transform
to be added to the results for each label/predictiontransform (
Callable
[[DataPoint
],tuple
]) – a function that returns a tuple, each with an element respective ofcolumn_names
to be added to the results for each label/prediction; ifNone
(the default),str
used (see the Iris Jupyter Notebook example)batch_limit (
int
) – the max number of batche of results to outputname (
str
) – the name/ID (name of the file sans extension in the results directory) of the previously archived saved results to fetch orNone
to get the last resultkwargs – additional keyword arguments used to create the factory
- Return type:
- get_result_analyzer(key=None, cache_previous_results=False)[source]¶
Return a results analyzer for comparing in flight training progress.
- Return type:
- get_result_reporter(cross_fold=False, include_validation=True)[source]¶
Return an instance of the summary report generator.
- Return type:
- classmethod get_singleton(*args, **kwargs)[source]¶
Return the singleton application using
args
andkwargs
as initializer arguments.- Return type:
- property label_attribute_name¶
Get the label attribute name.
- property last_result: ModelResult¶
The last recorded result during an
ModelExecutor.train()
orModelExecutor.test()
invocation is used.
- classmethod load_from_path(path, *args, **kwargs)[source]¶
Construct a new facade from the data saved in a persisted model file. This uses the
ModelManager.load_from_path()
to reconstruct the returned facade, which means some attributes are taken from default if not taken from*args
or**kwargs
.- Parameters:
path (
Path
) – the path of the model file on the file systemmodel_config_overwrites – a
Configurable
used to overrwrite configuration in the model package configargs – passed through to the initializer of invoking class
cls
kwargs – passed through to the initializer of invoking class
cls
- Return type:
- Returns:
a new instance of a
ModelFacade
- See:
- property model_config: Configurable¶
The configurable packaged with the model if this facade was created with :mth:`load_from_path`.
- model_result_reporter_class¶
The class used to report result summaries over all the runs.
alias of
ModelResultReporter
- property model_settings: ModelSettings¶
Return the executor’s model settings.
- property net_settings: NetworkSettings¶
Return the executor’s network settings.
- persist_result()[source]¶
Save the last recorded result during an
Executor.train()
orExecutor.test()
invocation to disk. Optionally also save a plotted graphics file to disk as well whenpersist_plot_result
is set toTrue
.Note that in Jupyter notebooks, this method has the side effect of plotting the results in the cell when
persist_plot_result
isTrue
.- Parameters:
persist_plot_result – if
True
, plot and save the graph as a PNG file to the results directory
- plot_result(result=None, save=False, show=False)[source]¶
Plot results and optionally save and show them. If this is called in a Jupyter notebook, the plot will be rendered in a cell.
- Parameters:
result (
ModelResult
) – the result to plot, or ifNone
, uselast_result()
save (
bool
) – ifTrue
, save the plot to the results directory with the same naming as the last data resultsshow (
bool
) – ifTrue
, invokematplotlib
’sshow
function to visualize in a non-Jupyter environment
- Return type:
- Returns:
the result used to graph, which comes from the executor when none is given to the invocation
- predictions_dataframe_factory_class¶
The factory class used to create predictions, or the string of the configuration section. If the latter, the factory is created using
config_factory
.alias of
PredictionsDataFrameFactory
-
progress_bar_cols:
Union
[str
,int
] = 'term'¶ The number of console columns to use for the text/ASCII based progress bar. If the value is
term
, then use the terminal width.
- remove_metadata_mapping_field(attr)[source]¶
Remove a field by attribute if it exists across all metadata mappings.
This is useful when a very expensive vectorizer slows down tasks, such as prediction, on a single run of a program. For this use case, override
predict()
to call this method before calling the superpredict
method.
- property result_manager: ModelResultManager¶
Return the executor’s result manager.
-
result_name:
str
= None¶ A descriptor used in the results, which is useful when making incremental hyperparameter changes to the model.
- static set_encode_sparse_matrices(use_sparse=False)[source]¶
If called before batches are created, encode all tensors the would be encoded as dense rather than sparse when
use_sparse
isFalse
. Oherwise, tensors will be encoded as sparse where it makes sense on a per vectorizer basis.
- stop_training()[source]¶
Early stop training if the model is currently training. This invokes the
TrainManager.stop()
, communicates to the training process to stop on the next check.- Returns:
True
if the application is configured to early stop and the signal has not already been given
- train()[source]¶
Train and test or just debug the model depending on the configuration.
- Return type:
- train_production()[source]¶
Like
train()
but uses the test dataset in addition to the training dataset to train the model.- Return type:
- update_model_config_factory(path=None, config_factory=None)[source]¶
Update the model’s configuration factory and configuration that was previously used to train the model. The modification is made in the model’s state file.
Important: the caller is responsible for creating the
executor
model (i.e. either directly withModelExecutor._get_or_create_model()
or indirectly withModelExector.write_model()
).- Parameters:
path (
Path
) – the path of the model directory, which defaults to the model data directory (rather than the model results directory)config_factory (
ConfigFactory
) – the configuration factory that will be replaced, which defaults toconfig_factory
- property vectorizer_manager_set: FeatureVectorizerManagerSet¶
Return the vectorizer manager set used for the facade. This is taken from the executor’s batch stash.
- write(depth=0, writer=None, include_executor=True, include_metadata=True, include_settings=True, include_model=True, include_config=False, include_object_graph=False)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
- write_result(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, include_settings=False, include_converged=False, include_config=False)[source]¶
Load the last set of results from the file system and print them out. The result to print is taken from
last_result
- Parameters:
depth (
int
) – the number of indentation levelswriter (
TextIOBase
) – the data sinkinclude_settings (
bool
) – whether or not to include model and network settings in the outputinclude_config (
bool
) – whether or not to include the configuration in the output
-
writer:
TextIOBase
= <_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>¶ The writer to this in methods like
train()
, andtest()
for writing performance metrics results and predictions orNone
to not output them.
zensols.deeplearn.model.format module¶
Contains a class to write performance metrics.
- class zensols.deeplearn.model.format.LatexPerformanceMetricsDumper(facade, summary_columns=('mF1t', 'mPt', 'mRt', 'MF1t', 'MPt', 'MRt'), by_label_columns=('mF1', 'mP', 'mR', 'MF1', 'MP', 'MR', 'acc', 'count'), name_replace=None, sort_column='mF1', majority_label_res_id=True, precision=3, results_dir=PosixPath('results/perf'), config_dir=PosixPath('../config'))[source]¶
Bases:
PerformanceMetricsDumper
Writes model performance metrics in data formats then used to import to the LaTeX typesetting system used by the Zensols build framework. The class writes a YAML configuration used by mklatextbl.py script in the Zensols Build repo, which generates a LaTeX table. The output is a
.sty` style file with the table, which is included with ``usepackage
and then added with a command.- See:
- See:
- __init__(facade, summary_columns=('mF1t', 'mPt', 'mRt', 'MF1t', 'MPt', 'MRt'), by_label_columns=('mF1', 'mP', 'mR', 'MF1', 'MP', 'MR', 'acc', 'count'), name_replace=None, sort_column='mF1', majority_label_res_id=True, precision=3, results_dir=PosixPath('results/perf'), config_dir=PosixPath('../config'))¶
-
config_dir:
Path
= PosixPath('../config')¶ The path to the YAML configuration files used by the
mklatextbl.py
Zensols LaTeX table generator.
- dump_by_label()[source]¶
Dump per label of metrics of the highest performing model to a LaTeX mktable YAML and CSV files.
- class zensols.deeplearn.model.format.PerformanceMetricsDumper(facade, summary_columns=('mF1t', 'mPt', 'mRt', 'MF1t', 'MPt', 'MRt'), by_label_columns=('mF1', 'mP', 'mR', 'MF1', 'MP', 'MR', 'acc', 'count'), name_replace=None, sort_column='mF1', majority_label_res_id=True, precision=3)[source]¶
Bases:
Writable
Formats performance metrics, which can be used in papers.
- __init__(facade, summary_columns=('mF1t', 'mPt', 'mRt', 'MF1t', 'MPt', 'MRt'), by_label_columns=('mF1', 'mP', 'mR', 'MF1', 'MP', 'MR', 'acc', 'count'), name_replace=None, sort_column='mF1', majority_label_res_id=True, precision=3)¶
-
by_label_columns:
Tuple
[str
] = ('mF1', 'mP', 'mR', 'MF1', 'MP', 'MR', 'acc', 'count')¶ The columns used in the by-label report.
-
facade:
ModelFacade
¶ The facade used to fetch previously written results.
-
majority_label_res_id:
Union
[str
,bool
] = True¶ Indicates how to create (if any) the majority label performance metrics. If a string, use as the result id (
res_id
) of previous result set used to compute the majority label statitics to include in the summary. IfTrue
use the results from the last tested model. IfNone
the majority label is not added.
-
name_replace:
Tuple
[str
,str
] = None¶ If provided, a tuple of
(regular expression, replacement)
string given tore.sub()
in the name column of generated tables.
-
sort_column:
str
= 'mF1'¶ The column to sort, with the exception of the majority label, which is always first.
-
summary_columns:
Tuple
[str
] = ('mF1t', 'mPt', 'mRt', 'MF1t', 'MPt', 'MRt')¶ The columns used in the summary report.
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>, indent=0)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
zensols.deeplearn.model.manager module¶
A utility class to help with a ModelExecutor
life cycle.
- class zensols.deeplearn.model.manager.ModelManager(path, config_factory, model_executor_name=None, persist_random_seed_context=True, keep_last_state_dict=False)[source]¶
Bases:
object
This class manages the lifecycle of an instance of a
ModelExecutor
. This class is mostly used by the executor to control it’s lifecycle. However, a client can also use an instance of this to revive a model that’s been saved to disk with theModelResultManager
- See ModelExecutor:
- __init__(path, config_factory, model_executor_name=None, persist_random_seed_context=True, keep_last_state_dict=False)¶
-
config_factory:
ConfigFactory
¶ The configuration factory to be used to create the
ModelExecutor
.
-
keep_last_state_dict:
bool
= False¶ Whether to store the PyTorch module state in attribute
last_saved_state_dict
.
- load_executor(config_overwrites=None)[source]¶
Load the model the last saved model from the disk. This is used load an instance of a
ModelExecutor
with all previous state completely in tact. It does this by using an instance ofzensols.config.factory.Configurable
and azensols.config.factory.ImportConfigFactory
to reconstruct the executor and it’s state by recreating all instances.After the executor has been recreated with the factory, the previous model results and model weights are restored.
- Parameters:
load_factory – whether to load the configuration factory from the check point; which you probably don’t want when loading from
load_from_path()
- Return type:
ModelExecutor
- Returns:
an instance of
ModelExecutor
- See:
zensols.deeplearn.model.ModelExecutor
- classmethod load_from_path(path)[source]¶
Load and return an instance of this class from a previously saved model. This method exists to recreate a
ModelManager
from a saved file from scratch. The returned model manager can be used to create the executor orModelFacade
using :obj:config_factory
.- Parameters:
path (
Path
) – points to the model file persisted with_save_executor()
- Return type:
- Returns:
an instance of
ModelManager
that was used to save the executor pointed bypath
zensols.deeplearn.model.meta module¶
Explore the facade in memory object graph
- class zensols.deeplearn.model.meta.FacadeClassExplorer(*args, **kwargs)[source]¶
Bases:
ClassExplorer
A class explorer that includes interesting and noteable framework classes to print.
zensols.deeplearn.model.module module¶
Base class PyTorch module and utilities.
- class zensols.deeplearn.model.module.BaseNetworkModule(net_settings, sub_logger=None)[source]¶
Bases:
DebugModule
,PersistableContainer
A utility base network module that contains ubiquitous, but optional layers, such as dropout and batch layeres, activation, etc.
- abstract _forward(x, *args, **kwargs)[source]¶
The model’s forward implementation. Normal backward semantics are no different.
- Parameters:
- Return type:
Tensor
- __init__(net_settings, sub_logger=None)[source]¶
Initialize.
- Parameters:
net_settings (
NetworkSettings
) – contains common layers such as droput and batch normalizationsub_logger (
Logger
) – used to log activity in this module so they logged module comes from some parent model
- property device: device¶
Return the device on which the model is configured.
zensols.deeplearn.model.multilabel module¶
A class to reduce multilabel output.
- class zensols.deeplearn.model.multilabel.MultiLabelOutcomeReducer(model_settings, apply_softmax=False)[source]¶
Bases:
object
Reduces mult-label classification predictions by clamping to
[0, 1]
and converting the tensor to longs.- __init__(model_settings, apply_softmax=False)¶
-
apply_softmax:
bool
= False¶ Whether the application of the softmax is applied to the predictions before boxed between [0, 1] and rounded.
-
model_settings:
ModelSettings
¶ Configures the model.
zensols.deeplearn.model.optimizer module¶
Base classes for overriding optimizers.
- class zensols.deeplearn.model.optimizer.ModelResourceFactory[source]¶
Bases:
ABC
An abstract factory that creates either an optimizer or criteria by the
ModelExecutor
. This is more of a marker class for other sub classes to be configured and created by anImportConfigFactory
.
zensols.deeplearn.model.pack module¶
Model packaging and distribution.
- class zensols.deeplearn.model.pack.ModelPacker(version, executor, installer=None)[source]¶
Bases:
_PackerBase
Creates distribution model packages by creating a zip file of everything needed to by a client to use the model.
- __init__(version, executor, installer=None)¶
-
executor:
ModelExecutor
¶ The result manager used to obtain the results and model to package.
- class zensols.deeplearn.model.pack.ModelUnpacker(version, config_factory, installer, model_packer_name='deeplearn_model_packer', facade_source='facade', model_config_overwrites=None)[source]¶
Bases:
_PackerBase
,PersistableContainer
,Writable
Unpacks a model created by
ModelPacker
. Intances of this class should be deallocated withDeallocatable()
after using it, which in turn deallocates thefacade
.- __init__(version, config_factory, installer, model_packer_name='deeplearn_model_packer', facade_source='facade', model_config_overwrites=None)¶
-
config_factory:
ConfigFactory
¶ Used to get the model facade class when
facade_source
is a string.
- property facade: ModelFacade¶
The cached facade from installed model. This installs the model if isn’t already.
- Returns:
a model facade that allows the caller to deallocate
-
facade_source:
Union
[str
,Type
[ModelFacade
]] = 'facade'¶ `.ModelFacade.
- Type:
The client facade section name or the
- Type:
class
- install_model(**kwargs) Path ¶
Install the model if it isn’t already and return a path to it.
- Return type:
- property installed_version: str¶
The version of the model installed on the file system per the
model.version
file.
-
installer:
Installer
¶ Used to create a path to the model file.
-
model_config_overwrites:
Configurable
= None¶ Configuration that overwrites the packaged model configuration.
-
model_packer_name:
str
= 'deeplearn_model_packer'¶ The section of the
ModelPacker
used to create the model. This defaults to the section in resource libraryresources/cli-pack.conf
.
- write(depth=0, writer=<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>)[source]¶
Write the contents of this instance to
writer
using indentiondepth
.- Parameters:
depth (
int
) – the starting indentation depthwriter (
TextIOBase
) – the writer to dump the content of this writable
- class zensols.deeplearn.model.pack.SubsetConfig(config_factory, sections, options, option_delim=':')[source]¶
Bases:
DictionaryConfig
A
Configurable
that takes a subset of the application configuration. This is useful to pass toModelFacade.load_from_path()
to merge application into the packed model’s configuration.- __init__(config_factory, sections, options, option_delim=':')[source]¶
Initialize the instance.
- Parameters:
config_factory (
ConfigFactory
) – the application config and factoryoptions (
Tuple
[str
,...
]) – a list of<section>:<option>
, each of which is added to the subsetoption_delim (
str
) – the string used to delimit sections and options inoptions
- default_section¶
- serializer¶
zensols.deeplearn.model.pred module¶
Contains classs for creating predictions from a trained model.
- class zensols.deeplearn.model.pred.PredictionMapper(datas, batch_stash)[source]¶
Bases:
PersistableContainer
Used by a top level client to create features used to create instances of
DataPoint
and map label classes from nominal IDs to their string representations.- _create_data_point(cls, feature)[source]¶
Create a data point. This base implementation creates it with the passed parameters.
- Parameters:
cls (
Type
[DataPoint
]) – the data point class of which to make an instancestash – to be set as the batch stash on the data point and the caller
feature (
Any
) – to be set as the third argument and generate from_create_features()
- Return type:
- __init__(datas, batch_stash)¶
- batch_stash: BatchStash¶
“The batch stash used to own batches and data points created by this instance.
- property batches: List[Batch]¶
Create a prediction batch that is detached from any stash resources, except this instance that created it. This creates a tuple of features, each of which is used to create a
DataPoint
.
- datas: Tuple[Any, ...]¶
The input data to create ad-hoc predictions.
- abstract map_results(result)[source]¶
Map ad-hoc prediction results from the
ModelExecutor
to an instance that makes sense to the client.- Parameters:
result (
ResultsContainer
) – contains the predictions produced by the model aspredictions_raw
- Return type:
- Returns:
a first class instance suitable for easy client consumption
zensols.deeplearn.model.sequence module¶
Sequence modules for sequence models.
- class zensols.deeplearn.model.sequence.SequenceBatchIterator(executor, logger)[source]¶
Bases:
BatchIterator
Expects outputs as a list of lists of labels of indexes. Examples of use cases include CRFs (e.g. BiLSTM/CRFs).
- __init__(executor, logger)¶
- class zensols.deeplearn.model.sequence.SequenceNetworkContext(split_type, criterion)[source]¶
Bases:
object
The forward context for the
SequenceNetworkModule
. This is used inSequenceNetworkModule._forward()
to provide the module additional information needed to score the model and produce the loss.- __init__(split_type, criterion)¶
-
criterion:
Module
¶ The criterion used to create the loss. This is provided for modules that produce the loss in the forward phase with the :meth:`torch.nn.module.forward` method.
-
split_type:
DatasetSplitType
¶ The split type, which informs the module when decoding to produce outputs or using the forward pass to prod.
- class zensols.deeplearn.model.sequence.SequenceNetworkModule(net_settings, sub_logger=None)[source]¶
Bases:
BaseNetworkModule
A module that has a forward training pass and a separate scoring phase. Examples include layers with an ending linear CRF layer, such as a BiLSTM CRF. This module has a
decode
method that returns a 2D list of integer label indexes of a nominal class.The context provides additional information needed to train, test and use the module.
- See:
zensols.deeplearn.layer.RecurrentCRFNetwork
- abstract _forward(batch, context)[source]¶
The forward pass, which either trains the model and creates the loss and/or decodes the output for testing and evaluation.
- Parameters:
batch (
Batch
) – the batch to train, validate or test oncontext (
SequenceNetworkContext
) – contains the additional information needed for scoring and decoding the sequence
- Return type:
- class zensols.deeplearn.model.sequence.SequenceNetworkOutput(predictions, loss=None, score=None, labels=None, outputs=None)[source]¶
Bases:
Deallocatable
The output from :clas:`.SequenceNetworkModule` modules.
- __init__(predictions, loss=None, score=None, labels=None, outputs=None)[source]¶
Initialize the output of a sequence NN.
- Parameters:
predictions (
Union
[List
[List
[int
]],Tensor
]) – list of list predictions to convert in to a 1-D tensor if given and not already a tensor; if a tensor, the shape must also be 1-Dloss (
Tensor
) – the loss tensorscore (
Tensor
) – the score given by the CRF’s Verterbi algorithmlabels (
Union
[List
[List
[int
]],Tensor
]) – list of list gold labels to convert in to a 1-D tensor if given and not already a tensoroutputs (
Tensor
) – the logits from the model
zensols.deeplearn.model.trainmng module¶
Contains a class to assist in the training of the ModelExecutor
.
- class zensols.deeplearn.model.trainmng.TrainManager(status_logger, progress_logger, update_path, max_consecutive_increased_count, progress_bar_number_width=6)[source]¶
Bases:
object
The class is used to assist in the training of the
ModelExecutor
. It updates validation loss and helps with early stopping decisions. It also watches for a file on the file system to provide instructions on what to do in the next epoch.- __init__(status_logger, progress_logger, update_path, max_consecutive_increased_count, progress_bar_number_width=6)¶
- get_status()[source]¶
Return the epoch to set in the training loop of the
ModelExecutor
.- Return type:
-
progress_bar_number_width:
int
= 6¶ The string width of the train/validation loss metrics in the progress bar, which needs to be greater than 4.
-
progress_logger:
Logger
¶ The logger to record progress updates during training. This is used only when the progress bar is turned off (see
ModelFacade._configure_cli_logging()
).
- stop()[source]¶
Stops the execution of training the model. Currently this is done by creating a file the executor monitors.
- Return type:
- Returns:
True
if the application is configured to early stop and the signal has not already been given
- update_loss(valid_epoch_result, train_epoch_result, ave_valid_loss)[source]¶
Update the training and validation loss.
zensols.deeplearn.model.wgtexecutor module¶
A class that weighs labels non-uniformly.
- class zensols.deeplearn.model.wgtexecutor.WeightedModelExecutor(config_factory, config, name, model_settings, net_settings, dataset_stash, dataset_split_names, cross_fold_dataset_stash=None, result_path=None, cross_fold_result_path=None, update_path=None, intermediate_results_path=None, progress_bar=False, progress_bar_cols=None, weighted_split_name='train', weighted_split_path=None, use_weighted_criterion=True)[source]¶
Bases:
ModelExecutor
A class that weighs labels non-uniformly. This class uses invert class sampling counts to help the minority label.
- __init__(config_factory, config, name, model_settings, net_settings, dataset_stash, dataset_split_names, cross_fold_dataset_stash=None, result_path=None, cross_fold_result_path=None, update_path=None, intermediate_results_path=None, progress_bar=False, progress_bar_cols=None, weighted_split_name='train', weighted_split_path=None, use_weighted_criterion=True)¶
- get_class_weights(**kwargs) Tensor ¶
Compute invert class sampling counts to return the weighted class.
- Return type:
Tensor
- get_label_statistics()[source]¶
Return a dictionary whose keys are the labels and values are dictionaries containing statistics on that label.
-
use_weighted_criterion:
bool
= True¶ If
True
, use the class weights in the initializer of the criterion. Setting this toFalse
effectively disables this class.
-
weighted_split_path:
InitVar
= None¶ The path to the cached weithed labels.
Module contents¶
Classes that train, validate and test a model. The values of the
predictions are made available using
zensols.deeplearn.model.Executor.get_predictions
.