Multi-task synthetic dataset

Name: Multi-task synthetic dataset
Creator: Seyedsaman Emami
Published: 2025-12-10T17:08:49.619Z
Keywords: Multi-Objective Parameter Optimization, Linear Regression, Binary Classification

Emami, Seyedsaman; Martínez Muñoz, Gonzalo; Hernández Lobato, Daniel

doi:10.17632/r2mnkjfmh3.2

Published: 10 December 2025 | Version 2 | DOI: 10.17632/r2mnkjfmh3.2

Contributors: Seyedsaman Emami, Gonzalo Martínez Muñoz, Daniel Hernández Lobato

Description

A collection of synthetic datasets for evaluating multi-task learning and transfer learning algorithms in both regression and binary classification settings. It consists of 100 independently generated batches, each initialized with a distinct random seed to promote diversity across tasks. Every batch contains 10 tasks, two of which are designated outliers. Each task has 300 training and 1,000 test instances, and each instance has five input features. Class representation is balanced, and variation between tasks is controlled by a weighting parameter w = 0.9.
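
The layout described above can be pictured with a minimal Python sketch. It is not the authors' generator (that lives in the GitHub repository listed under Steps to reproduce). The function name `make_batch`, the linear task functions, the way w mixes a shared and a task-specific component, the treatment of outlier tasks, and the noise level are all assumptions made for illustration; only the counts (10 tasks, 2 outliers, 300 training and 1,000 test instances, 5 features, w = 0.9) come from the description.

```python
import numpy as np

# Sizes taken from the dataset description.
N_BATCHES = 100
N_TASKS = 10
N_OUTLIERS = 2
N_TRAIN, N_TEST = 300, 1000
N_FEATURES = 5
W = 0.9  # weighting parameter; how it is used below is an assumption


def make_batch(seed, classification=False):
    """Build one hypothetical batch of tasks (illustrative, not the authors' code)."""
    rng = np.random.default_rng(seed)
    common = rng.normal(size=N_FEATURES)  # component shared by regular tasks (assumed)
    tasks = []
    for t in range(N_TASKS):
        specific = rng.normal(size=N_FEATURES)
        is_outlier = t >= N_TASKS - N_OUTLIERS
        # Assumption: regular tasks mix shared and task-specific coefficients
        # with weight W; outlier tasks ignore the shared component entirely.
        coef = specific if is_outlier else W * common + (1 - W) * specific
        X = rng.normal(size=(N_TRAIN + N_TEST, N_FEATURES))
        score = X @ coef
        if classification:
            # Thresholding at the median yields balanced labels over the task.
            y = (score > np.median(score)).astype(int)
        else:
            # Assumed small Gaussian noise on the regression target.
            y = score + 0.1 * rng.normal(size=score.shape)
        tasks.append({
            "X_train": X[:N_TRAIN], "y_train": y[:N_TRAIN],
            "X_test": X[N_TRAIN:], "y_test": y[N_TRAIN:],
            "is_outlier": is_outlier,
        })
    return tasks
```

Under these assumptions, calling `make_batch(seed)` once per seed for `N_BATCHES` distinct seeds would give a collection with the same shape as the one described here, in either the regression or the classification variant.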

Steps to reproduce

To reproduce the dataset, please refer to the corresponding GitHub repository available at https://github.com/GAA-UAM/R-MTGB

Institutions

Universidad Autónoma de Madrid
