I had around 360 images splitted %25 as validation data. I could train Deeplabv3 with those images without any issue. Later on I have added around 40 images with labeled images. But the model now started to give validation loss always nan. Sometimes it gives at very first epoch some validation loss value but starting by second epoch the validation loss is always nan. The strange thing is I can still train Unet or any other model with the same data, without having no problem. And Later I discarded those 40 images and trained Deeplabv3 and worked again without any issue. I have checked labels and everything from those images and looks like there is no problem with new images. Any idea about what could cause this issue ?
Deeplabv3 validation loss is nan
580 Views Asked by Dan Py At
1
There are 1 best solutions below
Related Questions in PYTHON
- new thread blocks main thread
- Extracting viewCount & SubscriberCount from YouTube API V3 for a given channel, where channelID does not equal userID
- Display images on Django Template Site
- Difference between list() and dict() with generators
- How can I serialize a numpy array while preserving matrix dimensions?
- Protractor did not run properly when using browser.wait, msg: "Wait timed out after XXXms"
- Why is my program adding int as string (4+7 = 47)?
- store numpy array in mysql
- how to omit the less frequent words from a dictionary in python?
- Update a text file with ( new words+ \n ) after the words is appended into a list
- python how to write list of lists to file
- Removing URL features from tokens in NLTK
- Optimizing for Social Leaderboards
- Python : Get size of string in bytes
- What is the code of the sorted function?
Related Questions in DEEP-LEARNING
- [Caffe]: Check failed: ShapeEquals(proto) shape mismatch (reshape not set)
- Caffe net.predict() outputs random results (GoogleNet)
- Implementation of convolutional sparse coding in deep networks frameworks
- Matlab example code for deep belief network for classification
- Two errors while running Caffe
- How to speed up caffe classifer in python
- Caffe Framework Runtest Core dumped error
- Scan function from Theano replicates non_sequences shared variables
- Why bad accuracy with neural network?
- Word2Vec Sentiment Classification with R and H2O
- What is gradInput and gradOutput in Torch7's 'nn' package?
- Error while drawing net in Caffe
- How does Caffe determine the number of neurons in each layer?
- Conclusion from PCA of dataset
- Google Deep Dream art: how to pick a layer in a neural network and enhance it
Related Questions in COMPUTER-VISION
- OpenCV algorithm of contours searching and creation of bounding rectagle
- How to make sense (handle) when computes logarithm of zero in prior information
- Matlab code crashes and gives error: Dimension of matrices being concatenated are not consistent
- Haar Cascade classifier does not detect faces in simple frontal pictures
- Face cropping using facial landmarks
- qtimer and opencv running slow
- Simple RGB to Gray program crashes
- Estimating pose of one camera given another with known baseline
- dealing with dimensions in scikit-learn tree.decisiontreeclassifier
- converting matlab code to c code readiness error
- MATLAB ConnectedComponentLabeler does not work in for loop
- Finding camera position without calibration
- StereoSGBM cannot handle negative minDisparity
- How to speed up caffe classifer in python
- HOG Feature extraction
Related Questions in CONV-NEURAL-NETWORK
- Using Convolution Neural Net with Lasagne in Python error
- How to prepare data for torch7 deep learning convolutional neural network example?
- additive Gaussian noise in Tensorflow
- Same output in neural network for each input after training
- ConvNet : Validation Loss not strongly decreasing but accuracy is improving
- Tensor flow affecting multiprocessing/threading
- Inceptionv3 Transfer Learning on Torch
- Transfer weights from caffe to tensorflew
- Lasagne NN strange behavior with accuracy and weight convergence
- Multiple outputs in Keras gives value error
- How to use feature maps of CNN to localize obect on the image?
- Why Validation Error Rate remain same value?
- How to create LMDB files for semantic segmentation?
- Training model to recognize one specific object (or scene)
- Restoring saved TensorFlow model to evaluate on test set
Related Questions in DEEPLAB
- How to train a DeepLab model with a dataset that contains different size of images without unifying their sizes?
- Keras: Value error while running deepLabV3+ with ResNet50 backbone
- ResNet50 with multiple channels for DeepLabV3+ model has layer incompatibility issues
- Keras model.fit() returns large array of int with node sparse_categorical_crossentropy error
- Refine segmentation mask based on contours of image
- Deeplabv3 validation loss is nan
- Unable to load tflite deeplab segmentation model in android application. Error: ByteBuffer is not a valid flatbuffer model
- Unable to test and deploy a deeplabv3-mobilenetv2 tensorflow-lite segmentation model for inference
- running deeplab v3+ with tensorRT
- Deeplab v3+ Shape mismatch in tuple component
- Why do all images resulting by DeepLabV3+ become black only?
- DeepLab v3 - how to process images with any size ratio. (h * w)
- deeplabv3+ mean subtraction implementated?
- How to train deeplabv3 on custom dataset on pytorch?
- Deeplabv3 re-train result is skewed for non-square images
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Assuming you haven't solved this or moved on, and if you're using a tf.keras implementation of Deeplabv3, check your DilatedSpatialPyramidPooling layer and in the convolution block of that layer, either comment out BatchNormalization, or surround it with Flatten and Reshape like this
Seems like there might be some weird behaviour with batch norm and spatial dimensions such as (1,1,num_channels) but I'm not entirely sure why. It solved the issue for me, however.