Data classification using sparse coding based active learning

24th Signal Processing and Communication Application Conference (SIU), Zonguldak, Türkiye, 16 - 19 Mayıs 2016, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu.2016.7495836
Basıldığı Şehir: Zonguldak
Basıldığı Ülke: Türkiye
Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Dictionary learning is used in signal, image, audio and video processing applications to represent signals by a sparse set of atoms where sparse representations are managed for the problems of compression, denoising, feature extraction and data classification. In many machine learning applications, classifier ensembles are shown to be superior than their single classifier counterparts. On the other hand, possibility of access to huge amount of unlabeled data has been increased along with getting easy access to data. Active learning, which is proposed for this type of problems, is a learning method in which the most informative instances from the unlabeled data are chosen, then labeled by an oracle and after then added to the training set. In this study, proposed ensemble based active learning algorithm (Active_RDL) learns a dictionary for each class by use of random feature subspaces and in each iteration after labeling the most informative instances they are added to the training set. In the test phase, a class label is assigned by considering the majority of the outputs from ensemble classifiers. The final decision of the ensembles is given by the majority of the dictionary outputs. Active_RDL method is compared with ensemble based support vector machine (Active_RSVM) on ten different benchmarks from UCI repository. In the test results, proposed Active_RDL method outperforms Active_RSVM.