A Comparative Analysis for 2D Object Recognition: A Case Study with Tactode Puzzle-Like Tiles

dc.contributor.author Daniel Queirós Silva en
dc.contributor.author Armando Sousa en
dc.contributor.author Costa,V en
dc.contributor.other 5152 en
dc.contributor.other 8276 en
dc.date.accessioned 2023-05-04T09:38:48Z
dc.date.available 2023-05-04T09:38:48Z
dc.date.issued 2021 en
dc.description.abstract <jats:p>Object recognition represents the ability of a system to identify objects, humans or animals in images. Within this domain, this work presents a comparative analysis among different classification methods aiming at Tactode tile recognition. The covered methods include: (i) machine learning with HOG and SVM; (ii) deep learning with CNNs such as VGG16, VGG19, ResNet152, MobileNetV2, SSD and YOLOv4; (iii) matching of handcrafted features with SIFT, SURF, BRISK and ORB; and (iv) template matching. A dataset was created to train learning-based methods (i and ii), and with respect to the other methods (iii and iv), a template dataset was used. To evaluate the performance of the recognition methods, two test datasets were built: tactode_small and tactode_big, which consisted of 288 and 12,000 images, holding 2784 and 96,000 regions of interest for classification, respectively. SSD and YOLOv4 were the worst methods for their domain, whereas ResNet152 and MobileNetV2 showed that they were strong recognition methods. SURF, ORB and BRISK demonstrated great recognition performance, while SIFT was the worst of this type of method. The methods based on template matching attained reasonable recognition results, falling behind most other methods. The top three methods of this study were: VGG16 with an accuracy of 99.96% and 99.95% for tactode_small and tactode_big, respectively; VGG19 with an accuracy of 99.96% and 99.68% for the same datasets; and HOG and SVM, which reached an accuracy of 99.93% for tactode_small and 99.86% for tactode_big, while at the same time presenting average execution times of 0.323 s and 0.232 s on the respective datasets, being the fastest method overall. This work demonstrated that VGG16 was the best choice for this case study, since it minimised the misclassifications for both test datasets.</jats:p> en
dc.identifier P-00T-PT1 en
dc.identifier.uri http://dx.doi.org/10.3390/jimaging7040065 en
dc.identifier.uri https://repositorio.inesctec.pt/handle/123456789/13711
dc.language eng en
dc.rights info:eu-repo/semantics/openAccess en
dc.title A Comparative Analysis for 2D Object Recognition: A Case Study with Tactode Puzzle-Like Tiles en
dc.type en
dc.type Publication en
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
P-00T-PT1.pdf
Size:
15.23 MB
Format:
Adobe Portable Document Format
Description: