I want to train a Mask R-CNN model in Google Colab using transfer learning. For that, I'm utilizing the coco.h5
dataset. I installed Mask R-CNN with !pip install mrcnn-colab
. I noticed that the following code does not load the weights:
model.load_weights(COCO_MODEL_PATH, by_name=True)
. The names are right and by_name=False
results in the same problem. I can confirm this by checking with the following lines:
from mrcnn import visualize
visualize.display_weight_stats(model)
This displays the same values both before and after loading (I just show the first 10 layers):
I believe I've found the solution to this problem. It involves the following lines of code:
import tensorflow.compat.v1 as tf
tf.disable_v2_behavior()
tf.compat.v1.get_default_graph()
This solution is often recommended because Mask R-CNN actually requires TensorFlow 1.X, whereas the latest TensorFlow version is 2.X, and Colab doesn't support TensorFlow 1.X. Therefore, I used this solution, which unfortunately results in the load_weights
function not working.
I managed to adjust my code so that import tensorflow.compat.v1
is not necessary and used the modified model.py
and utils.py
code from https://github.com/ahmedfgad/Mask-RCNN-TF2/tree/master, which requires a Python version lower than 3.10 (the standard in Colab).
For the Python downgrade, I used the following commands:
!apt-get update -y
!update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.7 1
!update-alternatives --config python3
!apt install python3-pip
!apt install python3.7-distutils
This resulted in the installation of another Python version, but I am unable to use it in Colab. Colab always defaults to using Python 3.10. This can be confirmed by running the following code:
import sys
print("User Current Version:-", sys.version)
which results in the following output:
User Current Version:- 3.10.12 (main, Jun 11 2023, 05:26:28) [GCC 11.4.0]
Therefore, I created a new runtime in Colab with Python 3.7.6 as follows:
!wget -O mini.sh https://repo.anaconda.com/miniconda/Miniconda3-py37_4.8.2-Linux-x86_64.sh
!chmod +x mini.sh
!bash ./mini.sh -b -f -p /usr/local
!conda install -q -y jupyter
!conda install -q -y google-colab -c conda-forge
!python -m ipykernel install --name "py37" --user
After switching to this runtime, I upgraded the Python version to 3.7.11, which I actually needed:
!conda install python=3.7.11 -y
With these adjustments, I can load the weights; however, I am limited to using the CPU. The reason for this limitation is that the CUDA version of Colab is not compatible with this Python version, and I was unable to achieve a downgrade. Additionally, the new runtime solution often necessitates frequent restart runtime actions, as it tends to freeze when I click the run button. So, regarding this problem, I have the following questions:
- How can I downgrade the CUDA version to 10.1? I've already tried various approaches, but I always come to the conclusion that it's not possible in Colab.
- Is it possible to force Colab to use a previously installed Python version?
- Is there an alternative to the
import tensorflow.compat.v1 as tf
code that allows loading the weights?
You can use this implementation which is built on top of the original Mask R-CNN repo to support TF2. This repository allows to train and test the Mask R-CNN model with TensorFlow 2.14.0, and Python 3.10.12.
You can also use it on Google Colab (current colab environment also uses Python 3.10.12 and TF 2.14.0) and it's working without any issues on GPU. Please make sure your runtime is using the GPU:
and then follow these exact steps:
And then use this snippet to load weights into the Mask R-CNN model:
If you could properly follow all those steps, you should be able to load the pre-trained weights without any issue and verify the change in weights with:
which prints out:
Here's a snippet to visualize the predictions from the pre-trained Mask R-CNN:
which yields: