I had around 360 images, with 25% split off as validation data. I could train Deeplabv3 on those images without any issue. Later I added around 40 more labeled images, and now the model always gives nan for the validation loss. Sometimes the very first epoch produces a validation loss value, but from the second epoch on the validation loss is always nan. The strange thing is that I can still train Unet or any other model on the same data without any problem. When I later discarded those 40 images and trained Deeplabv3 again, it worked without any issue. I have checked the labels and everything else for those images, and there seems to be no problem with the new images. Any idea what could cause this issue?
Deeplabv3 validation loss is nan
Asked by Dan Py
1 Answer
Assuming you haven't solved this or moved on, and if you're using a tf.keras implementation of Deeplabv3, check your DilatedSpatialPyramidPooling layer. In the convolution block of that layer, either comment out BatchNormalization or wrap it between a Flatten layer and a Reshape layer.
There seems to be some odd behaviour when batch norm is applied to tensors with spatial dimensions such as (1,1,num_channels), but I'm not entirely sure why. It solved the issue for me, however.
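A minimal sketch of the Flatten/Reshape workaround, not necessarily the exact code used in this answer. It assumes a `convolution_block` helper in the style of common tf.keras Deeplabv3 implementations; the `flatten_for_bn` flag, the filter count, and the input shape are hypothetical choices made for illustration. The flattening path only works when the spatial size is static, as it is for the (1,1,num_channels) output of the image-pooling branch.

```python
import tensorflow as tf
from tensorflow.keras import layers


def convolution_block(x, num_filters=256, kernel_size=3, dilation_rate=1,
                      flatten_for_bn=False):
    """Conv -> BatchNorm -> ReLU, optionally flattening around BatchNorm."""
    x = layers.Conv2D(num_filters, kernel_size, dilation_rate=dilation_rate,
                      padding="same", use_bias=False)(x)
    if flatten_for_bn:
        # Remember the static (height, width, channels) shape, e.g. (1, 1, C).
        spatial_shape = tuple(x.shape[1:])
        x = layers.Flatten()(x)             # (batch, H * W * C)
        x = layers.BatchNormalization()(x)  # batch norm on a 2D tensor
        x = layers.Reshape(spatial_shape)(x)  # back to (batch, H, W, C)
    else:
        x = layers.BatchNormalization()(x)
    return layers.ReLU()(x)


# Hypothetical image-pooling branch: pool the feature map down to 1x1,
# then apply the block with the workaround enabled.
inputs = tf.keras.Input(shape=(32, 32, 64))
pooled = layers.AveragePooling2D(pool_size=(32, 32))(inputs)  # (1, 1, 64)
pooled = convolution_block(pooled, kernel_size=1, flatten_for_bn=True)
model = tf.keras.Model(inputs, pooled)
```

Only the branch that produces the 1x1 feature map needs `flatten_for_bn=True`; the dilated branches can keep the plain BatchNormalization path.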