Generate, split into folds or train/test and cache a dataset 0.0.12

Released under the MIT

Generate, split into folds or train/test and cache a dataset.

Installation

To install, add the following dependency to your project or build file:

[com.zensols.ml/dataset "0.0.12"]

Namespaces

zensols.dataset.db

Preemptively compute a dataset (i.e. features from natural language utterances) and store them in Elasticsearch. This is useful for use with training, testing, validating and development machine learning models.

zensols.dataset.elsearch

A client simple wrapper for an Elasticsearch wrapper. You probably want use the more client friendly zensols.dataset.db.

zensols.dataset.thaw

Exactly like zensols.dataset.db but use the file system.

zensols.dataset.version

Public variables and functions: