Skip to content

semipy.datasets.get_stl10

    semipy.datasets.cifar.get_stl10(num_labelled: int = 5000,
                                    num_unlabelled: int = 100000,
                                    valid_proportion: float = 0.0,
                                    path: str = './data',
                                    augmentation: bool = False,
                                    include_labelled: bool = False) -> dict

Downloads STL10 dataset from torchvision and transforms it into a Semi-Supervised dataset (with labelled and unlabelled samples).

Parameters

  • num_labelled (int) - Number of labelled items to include in the dataset. Default: 5 000
  • num_unlabelled (int) - Number of unlabelled items to include in the dataset. Default: 100 000
  • valid_proportion (float) - Size of the validation set defined by a proportion with respect to the number of labelled items. Default: 0.0
  • path (str) - Path to download and store data samples. Default: './data'
  • augmentation (bool) - To use augmentation or not when sampling in the dataset. Default: False
  • include_labelled (bool) - To include labelled items into unlabelled set as additional data. Default: False