I am working on a deep learning model for "Age Invariant Face detection". I started with preprocessing the data which included face cropping via MTCNN, image contrast correction and sharpening. I then moved to build the model architecture, I designed a custom sequential CNN model. The training and validation losses were close to 1.9 but the model was taking approx 8 hrs for one epoch. I needed fast results so I moved to try on transfer learning. I read a paper which said Resnet-18 performed well on the task so I went with Resnet-18 first. Tried on tuning it by varying learning rates, but for no good. Accuracy wasn't increasing beyond 40%. I then went with Resnet-50, currently, I am working on it. I tried fine-tuning by changing the learning rate manually, tried with different optimizers Adam, SGD. Used learning rate decay. My model started to overfit so then I added dropout, image augmentation to training data and again added weight_decay as it was still overfitting. Right now there is no overfit but the losses aren't decreasing. I am trying on doing my experiments over it, reading many articles per day but feels like I'm stuck.
Fine tuning a model in transfer learning
607 Views Asked by Ankita Prasad At
1
There are 1 best solutions below
Related Questions in DEEP-LEARNING
- [Caffe]: Check failed: ShapeEquals(proto) shape mismatch (reshape not set)
- Caffe net.predict() outputs random results (GoogleNet)
- Implementation of convolutional sparse coding in deep networks frameworks
- Matlab example code for deep belief network for classification
- Two errors while running Caffe
- How to speed up caffe classifer in python
- Caffe Framework Runtest Core dumped error
- Scan function from Theano replicates non_sequences shared variables
- Why bad accuracy with neural network?
- Word2Vec Sentiment Classification with R and H2O
- What is gradInput and gradOutput in Torch7's 'nn' package?
- Error while drawing net in Caffe
- How does Caffe determine the number of neurons in each layer?
- Conclusion from PCA of dataset
- Google Deep Dream art: how to pick a layer in a neural network and enhance it
Related Questions in DATA-SCIENCE
- How access a downloaded library that is not showing up?
- Convert groupby.DataFrameGroupBy object to a dictionary in Python
- How can I detect keypresses using accelerometer/gyroscope data?
- Multiple Linear Regression handle NA
- Input/output error while copying from hadoop file system to local
- Removing duplicated values with missing values in a dataframe
- R editing dataframe based on column value
- PredictionIO Universal Recommender
- Pandas : TypeError: float() argument must be a string or a number
- Text classification algorithms which are not Naive?
- adding row generated inside a loop to a new data frame
- How to read multiple line elements in Spark , where each record of log is starting with yyyy-MM-dd format and each record of log is multi-line?
- Pandas merge duplicate DataFrame columns preserving column names
- How to plot multiple graphs in one chart using pygal?
- Removing non-English words from text using Python
Related Questions in CONV-NEURAL-NETWORK
- Using Convolution Neural Net with Lasagne in Python error
- How to prepare data for torch7 deep learning convolutional neural network example?
- additive Gaussian noise in Tensorflow
- Same output in neural network for each input after training
- ConvNet : Validation Loss not strongly decreasing but accuracy is improving
- Tensor flow affecting multiprocessing/threading
- Inceptionv3 Transfer Learning on Torch
- Transfer weights from caffe to tensorflew
- Lasagne NN strange behavior with accuracy and weight convergence
- Multiple outputs in Keras gives value error
- How to use feature maps of CNN to localize obect on the image?
- Why Validation Error Rate remain same value?
- How to create LMDB files for semantic segmentation?
- Training model to recognize one specific object (or scene)
- Restoring saved TensorFlow model to evaluate on test set
Related Questions in RESNET
- how to pretrain my image using resnet50 in mask-rcnn
- Keras model.predict always 0
- Defining model in Keras
- Error with to_proto(caffe-master/python/caffe/net_spec.py)-- 'LayerParameter' object has no attribute 'num_output'
- Tensorflow 2 Object Detection with OpenCV C++
- Fine tuning a model in transfer learning
- Detecting the location (latitude and longitude) of utility poles in an area using Resnet
- Accuracy and Recall is the same for resnet implementation pytorch
- What should I do when I'm getting an mAP of 0.000 using keras-retinanet / resnet50?
- Layer.get_weights() returns wrong output with shape(1, 1, 1, 2080, 1536)
- Adding Dropout layer after every activation layer in a pre-trained resnet model in tensorflow-2
- TypeError: ('Keyword argument not understood:', 'input') of ResNet
- Transformer Image captioning model produces just padding rather than a caption
- Incompatible shapes while using triplet loss and pre-trained resnet
- SimCLR/ResNet18 : last fractional batch mecanism not functional ? (tensor shapes incompatible)
Related Questions in TRANSFER-LEARNING
- Implementing Transfer Learning using Pegasus for Text Summarization generating junk characters
- How to add new classes to tensorflow.js model in transfer learning?
- Prediction for single image file using tensor flow transfer learning model
- Can this be considered overfitting?
- why am I getting error in transfer learning?
- Fine tuning a model in transfer learning
- How should I improve my accuracy in Transfer learning?
- Do you have any ideas to proceed custom size image using MobileNet?
- WARNING:tensorflow:Can save best model only with val_accuracy available, skipping
- ValueError: The input must have 3 channels; got `input_shape=(200, 200, 1)`
- Minimize the small loss to zero loss in transfer learning
- In Transfer Learning ValueError: Failed to convert a NumPy array to a Tensor
- How to do fine tuning on TFlite model
- Confusion Matrix in transfer learning with keras
- Keras AttributeError: 'Functional' object has no attribute 'shape'
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Apparently, this is one of the very good cases where feature engineering comes to the light. Try as much as you can to make your images use less computational power by apply some classic feature engineering methods. Using MTCNN is a really good approach and I would deffo look into Feature Engineering, also here is a link that could provide you some AHAA moments performace wise. All the best