Generate, split into folds or train/test and cache a dataset 0.0.12

Released under the MIT

Generate, split into folds or train/test and cache a dataset.

Installation

To install, add the following dependency to your project or build file:

[com.zensols.ml/dataset "0.0.12"]

Namespaces

zensols.dataset.db

Preemptively compute a dataset (i.e. features from natural language utterances) and store them in Elasticsearch. This is useful for use with training, testing, validating and development machine learning models.

zensols.dataset.elsearch

A client simple wrapper for an Elasticsearch wrapper. You probably want use the more client friendly zensols.dataset.db.

Public variables and functions:

aggregation
buckets
create-context
create-index
delete-document
delete-index
delete-mapping
describe
document-by-id
document-count
document-ids
documents
exists?
put-document
recreate-index
search
search-literal
with-context

zensols.dataset.thaw

Exactly like zensols.dataset.db but use the file system.

Public variables and functions:

default-connection-inst
ids
instance-by-id
instances
instances-count
set-default-connection
thaw-connection
with-connection

zensols.dataset.version

Public variables and functions:

gitmsg
gitref
timestamp
version

Generated by Codox

Generate, split into folds or train/test and cache a dataset 0.0.12

Project

Namespaces

Generate, split into folds or train/test and cache a dataset 0.0.12

Released under the MIT

Installation

Namespaces

zensols.dataset.db

zensols.dataset.elsearch

zensols.dataset.thaw

zensols.dataset.version