Skip to content

Datasets

OutlierDetectionData provides access to collections of outlier detection datasets. The following collections are currently supported:

  • ODDS, Outlier Detection DataSets, Shebuti Rayana, 2016
  • ELKI, On the Evaluation of Unsupervised Outlier Detection, Campos et al., 2016
  • TSAD, The UCR Time Series Archive, Dau et al., 2018

For the TSAD collection, the class with the least members is chosen as the anomaly class and all other classes are defined as normal. If there are multiple classes, the lexically first class is chosen.

The following methods are defined for all collections.

list

# OutlierDetectionData.listFunction.

list([prefix])

List the available datasets in a dataset collection optionally given a prefix.

Parameters

prefix::Union{Regex, AbstractString}

Regex or string used to filter the datasets.

load

# OutlierDetectionData.loadFunction.

load(dataset)

Load a given dataset from a dataset collection.

Parameters

name::AbstractString

Name of the dataset to load.