Semi-supervised co-training and active learning based approach for multi-view intrusion detection

Please use this identifier to cite or link to this item: https://doi.org/10.1145/1529282.1529735

DC Field	Value
dc.title	Semi-supervised co-training and active learning based approach for multi-view intrusion detection
dc.contributor.author	Mao C.-H.
dc.contributor.author	Lee H.-M.
dc.contributor.author	Parikh D.
dc.contributor.author	Chen T.
dc.contributor.author	Huang S.-Y.
dc.date.accessioned	2018-08-21T05:02:03Z
dc.date.available	2018-08-21T05:02:03Z
dc.date.issued	2009
dc.identifier.citation	Mao C.-H., Lee H.-M., Parikh D., Chen T., Huang S.-Y. (2009). Semi-supervised co-training and active learning based approach for multi-view intrusion detection. Proceedings of the ACM Symposium on Applied Computing : 2042-2048. ScholarBank@NUS Repository. https://doi.org/10.1145/1529282.1529735
dc.identifier.isbn	9781605581668
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/146192
dc.description.abstract	Although immense amounts of data are available from networks and hosts, only a very small proportion of this data is labeled, owing to the cost of obtaining expert labels. This is a significant bottleneck for developing supervised intrusion detection systems, which rely solely on labeled data. Even though the remaining unlabeled data is collected from real network environments and hence potentially holds valuable information for intrusion detection, such systems cannot exploit it. In this work, we leverage both labeled and unlabeled data. Intrusion detection tasks also lend themselves naturally to a multi-view scenario, and can benefit significantly if these multiple views are combined meaningfully. In this paper, we propose a co-training framework for intrusion detection; co-training is a semi-supervised learning method that can both utilize unlabeled data and combine multi-view data. We also employ an active learning framework in which statistically ambiguous parts of the unlabeled data are identified and can then be labeled by an expert. This keeps expert labeling to a minimum while ensuring that the labels obtained from the expert are the most informative. In our experiments, we demonstrate that leveraging the unlabeled data with our proposed method significantly reduces the error rate compared to using the labeled data alone. In addition, our proposed multi-view method has a lower error rate than using a single view. Copyright 2009 ACM.
dc.source	Scopus
dc.subject	Active learning
dc.subject	Co-training
dc.subject	Intrusion detection
dc.subject	Multi-view
dc.subject	Semi-supervised learning
dc.type	Conference Paper
dc.contributor.department	OFFICE OF THE PROVOST
dc.contributor.department	DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi	10.1145/1529282.1529735
dc.description.sourcetitle	Proceedings of the ACM Symposium on Applied Computing
dc.description.page	2042-2048
dc.published.state	published
Appears in Collections:	Staff Publications
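
The abstract describes two mechanisms: co-training, where classifiers trained on different views label unlabeled data for a shared pool, and active learning, where statistically ambiguous samples are referred to an expert. The following is a minimal sketch of that general idea, not the authors' implementation. The nearest-centroid scorer, the synthetic two-view data, and all names (`fit_centroids`, `attack_score`, `co_train`, `expert_queries`) are assumptions made for illustration only.

```python
import numpy as np

def fit_centroids(X, y):
    """Stand-in classifier (assumption): one mean vector per class, 0 = normal, 1 = attack."""
    return np.stack([X[y == c].mean(axis=0) for c in (0, 1)])

def attack_score(centroids, X):
    """Soft score in (0, 1) for class 1, derived from distances to the two centroids."""
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return 1.0 / (1.0 + np.exp(np.clip(d[:, 1] - d[:, 0], -50, 50)))

def co_train(view_a, view_b, labels, rounds=5, per_round=4):
    """Each view in turn pseudo-labels the unlabeled samples it is most confident about.

    `labels` uses -1 for unlabeled samples; the returned copy has some of them filled in.
    """
    y = labels.copy()
    for _ in range(rounds):
        for view in (view_a, view_b):
            unlabeled = np.flatnonzero(y == -1)
            if unlabeled.size == 0:
                break
            known = y != -1
            p = attack_score(fit_centroids(view[known], y[known]), view[unlabeled])
            top = np.argsort(-np.abs(p - 0.5))[:per_round]  # farthest from 0.5 = most confident
            y[unlabeled[top]] = (p[top] > 0.5).astype(int)
    return y

def expert_queries(view_a, view_b, y, n_queries=3):
    """Pick the still-unlabeled samples whose scores are closest to 0.5 across both views."""
    known = y != -1
    unlabeled = np.flatnonzero(~known)
    margin = np.zeros(unlabeled.size)
    for view in (view_a, view_b):
        p = attack_score(fit_centroids(view[known], y[known]), view[unlabeled])
        margin += np.abs(p - 0.5)
    return unlabeled[np.argsort(margin)[:n_queries]]

# Synthetic data (assumption): two feature views of the same 200 samples.
rng = np.random.default_rng(0)
n = 200
truth = rng.integers(0, 2, size=n)
view_a = rng.normal(loc=truth[:, None] * 2.0, scale=1.0, size=(n, 3))
view_b = rng.normal(loc=truth[:, None] * 2.0, scale=1.0, size=(n, 4))

# Only five samples per class start out with expert labels.
labels = np.full(n, -1)
seed = np.concatenate([np.flatnonzero(truth == 0)[:5], np.flatnonzero(truth == 1)[:5]])
labels[seed] = truth[seed]

y = co_train(view_a, view_b, labels)
queries = expert_queries(view_a, view_b, y)
```

In this sketch the two views feed a single shared label pool, so a confident prediction from one view becomes training data for the other. The samples returned by `expert_queries` are the ones both views find hardest to call, which is where an expert label would be expected to help most.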

Files in This Item:

There are no files associated with this item.
