Paddy Doctor: Paddy Disease Classification
Introduction:
This time, I don't want to write the hurdles in fastbook and how to overcome them because once you move to notebook 6 or 7 it shows that you have the strong mindset to overcome by yourself and finish the book. Here after we have to practise what we learn by participating in Kaggle Competitions.
Problem statement:
Identify the type of disease present in paddy leaf images.
Understanding the dataset
Labels are given in a separate csv file, train data set contains 10 folders each containing an image of paddy disease with respect to folder name, test data contain the images we have to predict.
Model Implementation
Once we Explore the dataset, the next step is to build a data block for the problem. We can create using two ways they are:
We may map the label of the disease name to the image by csv file.
Or we may label the data according to the folder name.
What I chose is mapping the label of the images with respect to folder names. Once the data loaders are ready we can check the train and validation set. If it shows no error, our data block is built well. For this problem I tried 3 different models, they are Efficient B0, Resnet34, Resnet18. I tried everything that I learned till now with these models, like discriminative learning rate rate, unfreezing and so on.
Once the model is working, predict the label with test data. And prepare the data for the submission file and submit it.
Problem faced and how I overcame
Building a model with resnet architecture comes handy for efficient net not being able to use. I searched through the fastbook and the notebook of great people of fastai. I got to know about the timm and the issue was solved.
Why I am saying this, because once you start working on a problem you may face some issues which you may never come across. This type of issue helps to build your knowledge and problem solving abilities.
Conclusion
The idea of the notebook is not to build a great model, it is to build an end to end model for the kaggle and implement what we learn till now. This type of solving problems in the Kaggle give us a mental boost to complete the book.
Note:
You can check out my notebook of this competition in Kaggle. And also my Github for more datascience related projects and works.
Comments
Post a Comment