On a jupyter notebook with Tensorflow-2.0.0, a train-validation-test split of 80-10-10 was performed in this way:
import tensorflow_datasets as tfds
from os import getcwd
splits = tfds.Split.ALL.subsplit(weighted=(80, 10, 10))
filePath = f"{getcwd()}/../tmp2/"
splits, info = tfds.load('fashion_mnist', with_info=True, as_supervised=True, split=splits, data_dir=filePath)
However, when trying to run the same code locally I get the error
AttributeError: type object 'Split' has no attribute 'ALL'
I have seen I can create two sets in this way:
splits, info = tfds.load('fashion_mnist', with_info=True, as_supervised=True, split=['train[:80]','test[80:90]'], data_dir=filePath)
but I do not know how I can add a third set.
tfds.Split.ALL.subsplit
ortfds.Split.TRAIN.subsplit
apparently are deprecated and no longer supported.Some of the datasets are already split between train and test. In this case I found the following solution (using for example the fashion MNIST dataset):
EDIT AFTER COMMENTS
The previous code had some errors. First of all, this official link says:
but when I tried it did not work.
all
would be helpful but there is an alternative. The error in the previous code is that the%
must be used and that it must be specified for each set. I modified the code in this way:which prints: