Classifying text streams by keywords using classifier ensemble

Yang, Baoguo, Zhang, Yang and Li, Xue (2011) Classifying text streams by keywords using classifier ensemble. Data & Knowledge Engineering, 70 9: 775-793. doi:10.1016/j.datak.2011.05.002

Author Yang, Baoguo
Zhang, Yang
Li, Xue
Title Classifying text streams by keywords using classifier ensemble
Journal name Data & Knowledge Engineering   Check publisher's open access policy
ISSN 0169-023X
Publication date 2011-09
Sub-type Article (original research)
DOI 10.1016/j.datak.2011.05.002
Volume 70
Issue 9
Start page 775
End page 793
Total pages 19
Place of publication Amsterdam, The Netherlands
Publisher Elsevier
Collection year 2012
Language eng
Abstract Traditional approaches for text data stream classification usually require the manual labeling of a number of documents, which is an expensive and time consuming process. In this paper, to overcome this limitation, we propose to classify text streams by keywords without labeled documents so as to reduce the burden of labeling manually. We build our base text classifiers with the help of keywords and unlabeled documents to classify text streams, and utilize classifier ensemble algorithms to cope with concept drifting in text data streams. Experimental results demonstrate that the proposed method can build good classifiers by keywords without manual labeling, and when the ensemble based algorithm is used, the concept drift in the streams can be well detected and adapted, which performs better than the single window algorithm.
Keyword Text stream classification
Concept drift
Classifier ensemble
Knowledge acquisition
Q-Index Code C1
Q-Index Status Confirmed Code
Institutional Status UQ
Additional Notes Available online 1 June 2011. Published with tag of Editorial.

Document type: Journal Article
Sub-type: Article (original research)
Collections: Official 2012 Collection
School of Information Technology and Electrical Engineering Publications
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 5 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 7 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Sun, 28 Aug 2011, 01:22:03 EST by System User on behalf of School of Information Technol and Elec Engineering