Image Augmentation for Object Image Classification Based On Combination of Pre-Trained CNN and SVM

Yoshihiro Shima

doi:10.1088/1742-6596/1004/1/012001

Journal of Physics: Conference Series

Paper • The following article is Open access

Image Augmentation for Object Image Classification Based On Combination of Pre-Trained CNN and SVM

Yoshihiro Shima¹

Published under licence by IOP Publishing Ltd
Journal of Physics: Conference Series, Volume 1004, 2nd International Conference on Machine Vision and Information Technology (CMVIT 2018) 23–25 February 2018, Hong Kong Citation Yoshihiro Shima 2018 J. Phys.: Conf. Ser. 1004 012001 DOI 10.1088/1742-6596/1004/1/012001

Download Article PDF

Article metrics

1655 Total downloads

Author e-mails

shima@ee.meisei-u.ac.jp

Author affiliations

¹ School of Science and Engineering, Meisei University, Hino, Tokyo, 191-8506, Japan

Buy this article in print

Journal RSS

Sign up for new issue notifications

Abstract

Neural networks are a powerful means of classifying object images. The proposed image category classification method for object images combines convolutional neural networks (CNNs) and support vector machines (SVMs). A pre-trained CNN, called Alex-Net, is used as a pattern-feature extractor. Alex-Net is pre-trained for the large-scale object-image dataset ImageNet. Instead of training, Alex-Net, pre-trained for ImageNet is used. An SVM is used as trainable classifier. The feature vectors are passed to the SVM from Alex-Net. The STL-10 dataset are used as object images. The number of classes is ten. Training and test samples are clearly split. STL-10 object images are trained by the SVM with data augmentation. We use the pattern transformation method with the cosine function. We also apply some augmentation method such as rotation, skewing and elastic distortion. By using the cosine function, the original patterns were left-justified, right-justified, top-justified, or bottom-justified. Patterns were also center-justified and enlarged. Test error rate is decreased by 0.435 percentage points from 16.055% by augmentation with cosine transformation. Error rates are increased by other augmentation method such as rotation, skewing and elastic distortion, compared without augmentation. Number of augmented data is 30 times that of the original STL-10 5K training samples. Experimental test error rate for the test 8k STL-10 object images was 15.620%, which shows that image augmentation is effective for image category classification.

Export citation and abstract BibTeX RIS

Previous article in issue

Next article in issue

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.

Image Augmentation for Object Image Classification Based On Combination of Pre-Trained CNN and SVM

Article metrics

Share this article

Author e-mails

Author affiliations

Abstract