Configure STL Dataset (STL)
The single-task learning dataset is a single dataset which has training, validation and test data.
STL dataset is a sub-config under the index config of:
To configure a custom STL dataset, you need to create a YAML file in stl_dataset/
folder. Below shows examples of the STL dataset config.
Example
configs
βββ __init__.py
βββ entrance.yaml
βββ index
β βββ example_stl_expr.yaml
β βββ ...
βββ stl_dataset
β βββ stl_mnist.yaml
...
configs/index/example_stl_expr.yaml
defaults:
...
- /stl_dataset: stl_mnist.yaml
...
configs/stl_dataset/stl_mnist.yaml
_target_: clarena.stl_datasets.STLMNIST
root: data/MNIST
validation_percentage: 0.1
batch_size: 64
Supported STL Datasets & Required Config Fields
In CLArena, we have implemented many STL datasets as Python classes in clarena.stl_datasets
module that you can use for your experiments and evaluations.
To choose a STL dataset, assign the _target_
field to the class name of the STL dataset. For example, to use the MNIST dataset, set _target_
field to clarena.stl_datasets.MNIST
. Each STL dataset has its own hyperparameters and configurations, which means it has its own required fields. The required fields are the same as the arguments of the class specified by _target_
. The arguments of each STL dataset class can be found in API documentation.
<i class=βbi bi-bookβ></i> API Reference (STL Datasets) <i class=βfa-brands fa-githubβ></i> Source Code (STL Datasets)
Below is the full list of supported STL datasets. We only support image classification datasets. Note that the βSTL Datasetβ is exactly the class name that the _target_
field is assigned.
STL Dataset | Description | Required Config Fields |
---|---|---|
STLMNIST | MNIST dataset. The MNIST dataset is a collection of handwritten digits. It consists of 60,000 training and 10,000 test images of handwritten digit images (10 classes), each 28x28 grayscale image. | Same as MNIST class arguments |