Configure MTL Dataset (MTL)

Modified

August 16, 2025

The multi-task learning dataset consists of multiple datasets corresponding to multiple learning tasks, each of which has their own training, validation and test data.

MTL dataset is a sub-config under the experiment index config (MTL). To configure a custom MTL dataset, you need to create a YAML file in mtl_dataset/ folder.

MTL dataset can also be converted from continual learning dataset. To do this, specify /cl_dataset field rather than /mtl_dataset field in the experiment index config (MTL), and create a YAML file in cl_dataset/ folder. Please refer to Configure CL Dataset (CL Main) for detailed instructions.

Below shows examples of the MTL dataset config.

Example

configs
├── __init__.py
├── entrance.yaml
├── experiment
│   ├── example_stl_train.yaml
│   └── ...
├── mtl_dataset
│   └── multi_domain_sentiment.yaml
...

configs/experiment/example_mtl_train.yaml

defaults:
  ...
    - /mtl_dataset: multi_domain_sentiment.yaml
  ...

configs/mtl_dataset/multi_domain_sentiment.yaml

_target_: clarena.mtl_datasets.MultiDomainSentiment
(TBC)

Supported MTL Datasets & Required Config Fields

All CL datasets in CLArena can be converted into MTL datasets. Please refer to Supported CL Datasets & Required Config Fields.

In CLArena, we also implemented many MTL datasets as Python classes in clarena.mtl_datasets module that you can use for your experiment.

To choose a MTL dataset, assign the _target_ field to the class name of the MTL dataset. For example, to use the Multi-Domain Sentiment dataset, set _target_ field to clarena.mtl_datasets.MultiDomainSentiment. Each MTL dataset has its own hyperparameters and configurations, which means it has its own required fields. The required fields are the same as the arguments of the class specified by _target_. The arguments of each MTL dataset class can be found in API documentation.

API Reference (MTL Datasets) Source Code (MTL Datasets)

Below is the full list of supported MTL datasets. We only support image classification datasets. Note that the “MTL Dataset” is exactly the class name that the _target_ field is assigned.

MTL Dataset	Description	Required Config Fields
MultiDomainSentiment (TBC)	TBC.	Same as PermutedArabicHandwrittenDigits class arguments

MultiDomainSentiment

(TBC)

TBC.

Same as PermutedArabicHandwrittenDigits class arguments