I have a .tfrecords
file and I want to extract, see the images in the file and augment them.
I am using https://colab.research.google.com
TensorFlow version: 2.3.0
And for the following code
raw_dataset = tf.data.TFRecordDataset("*path.tfrecords")
for raw_record in raw_dataset.take(1):
example = tf.train.Example()
example.ParseFromString(raw_record.numpy())
print(example)
I am facing the following output:
features {
feature {
key: "depth"
value {
int64_list {
value: 3
}
}
}
feature {
key: "height"
value {
int64_list {
value: 333
}
}
}
feature {
key: "image_raw"
value {
bytes_list {
value:
}
}
}
feature {
key: "label"
value {
int64_list {
value: 16
}
}
}
feature {
key: "width"
value {
int64_list {
value: 500
}
}
}
}
Here is a simple code that can extract your .tfrecord images as .png format.
To run next codes you need to install one time pip modules through
pip install tensorflow tensorflow_addons pillow numpy matplotlib
.First image from dataset:
Below is code for drawing number of images per each label. Labels inside
max_32_set.tfrecords
file are represented as integers (not string names), probably names of labels are located in separate small file with meta information about dataset.Plot for
max_32_set.tfrecords
:Next code does augmentation using gaussian noise and gaussian blur, augmented tfrecord dataset is saved to
max_32_set.augmented.tfrecords
file:Example augmented images:
Original:
Noised:
Blurred:
Noised-blurred:
Noised-blurred-mirrored:
Number of images per label after augmentation (exactly balanced 30 images per label):
Same augmentation as above but for the case of input and output folders with labeled images, instead of TFRecordDataset, change
c_inp_dir
andc_out_dir
to your folders paths: