Mammalian Species Detection Using a Cascade of Unet and SqueezeNet

Show simple item record

dc.contributor.author Njeru, Michael
dc.contributor.author Ciira, Wa Maina
dc.contributor.author Kibet, Langat
dc.date.accessioned 2021-11-02T13:57:20Z
dc.date.available 2021-11-02T13:57:20Z
dc.date.issued 2021-10-25
dc.identifier.citation M. Njeru, C. Maina and K. Langat, "Mammalian Species Detection Using a Cascade of Unet and SqueezeNet," 2021 IEEE AFRICON, 2021, pp. 1-5, doi: 10.1109/AFRICON51333.2021.9570950. en_US
dc.identifier.uri 10.1109/AFRICON51333.2021.9570950
dc.identifier.uri http://repository.dkut.ac.ke:8080/xmlui/handle/123456789/4903
dc.description.abstract Monitoring of wild animals has taken different approaches with an aim to provide vital information used in animal protection in their natural habitats. To recognize animal species without human trackers requires machine learning models that extract specie's features from an image. This project proposes a method of counting animals in an image and specifying the species of each animal using Unet and a variant of the SqueezeNet model. To train the Unet model, images and corresponding masks are used as the training data. Different optimizers are applied to each model. During inference, Unet outputs a binary mask with ones where an animal is detected and zeros elsewhere. SqueezeNet model is trained with images corresponding to six classes: bushbuck, impala, llama, warthog, waterbuck, and zebra. Three variants of the SqueezeNet model have been trained. The first contains the original backbone while the other two have the original backbone with an additional fire module. In one model the Fire module is similar to the Fire modules of the original backbone while in the other model, the extra fire module contains batch normalization layers. The trained models show that Unet trained with Nadam optimizer achieves the highest dice coefficient while the SqueezeNet with an extra Fire module containing batch norm layers and RMSprop optimizer achieves the highest training accuracy. The combined system containing the two models takes an image and outputs the image with bounding boxes around each animal and the corresponding animal species. The system achieves both counting and recognition of the species for each image placed at the input. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.title Mammalian Species Detection Using a Cascade of Unet and SqueezeNet en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account