Please use this identifier to cite or link to this item:
https://doi.org/10.1145/1529282.1529735
DC Field | Value | |
---|---|---|
dc.title | Semi-supervised co-training and active learning based approach for multi-view intrusion detection | |
dc.contributor.author | Mao C.-H. | |
dc.contributor.author | Lee H.-M. | |
dc.contributor.author | Parikh D. | |
dc.contributor.author | Chen T. | |
dc.contributor.author | Huang S.-Y. | |
dc.date.accessioned | 2018-08-21T05:02:03Z | |
dc.date.available | 2018-08-21T05:02:03Z | |
dc.date.issued | 2009 | |
dc.identifier.citation | Mao C.-H., Lee H.-M., Parikh D., Chen T., Huang S.-Y. (2009). Semi-supervised co-training and active learning based approach for multi-view intrusion detection. Proceedings of the ACM Symposium on Applied Computing : 2042-2048. ScholarBank@NUS Repository. https://doi.org/10.1145/1529282.1529735 | |
dc.identifier.isbn | 9781605581668 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/146192 | |
dc.description.abstract | Although there is immense data available from networks and hosts, a very small proportion of this data is labeled due to the cost of obtaining expert labels. This proves to be a significant bottleneck for developing supervised intrusion detection systems that rely solely on labeled data. In spite of the data being collected from real network environments and hence potentially holding valuable information for intrusion detection, such systems cannot exploit the remaining unlabeled data. In this work, we intelligently leverage both labeled and unlabeled data. Also, intrusion detection tasks naturally lend themselves to a multi-view scenario, and can benefit significantly if these multiple views are combined meaningfully. In this paper, we propose a co-training framework for intrusion detection, which is a semi-supervised learning method and can not only utilize unlabeled data, but can also combine multi-view data. We also employ an active learning framework where statistically ambiguous parts of the unlabeled data are identified, which can then be labeled by an expert. This allows for minimal expert labeling while ensuring that the labels obtained from the expert are most informative. In our experiments, we demonstrate that leveraging the unlabeled data using our proposed method significantly reduces the error rate compared to using the labeled data alone. In addition, our proposed multi-view method has a lower error rate than using a single view. Copyright 2009 ACM. | |
dc.source | Scopus | |
dc.subject | Active learning | |
dc.subject | Co-training | |
dc.subject | Intrusion detection | |
dc.subject | Multi-view | |
dc.subject | Semi-supervised learning | |
dc.type | Conference Paper | |
dc.contributor.department | OFFICE OF THE PROVOST | |
dc.contributor.department | DEPARTMENT OF COMPUTER SCIENCE | |
dc.description.doi | 10.1145/1529282.1529735 | |
dc.description.sourcetitle | Proceedings of the ACM Symposium on Applied Computing | |
dc.description.page | 2042-2048 | |
dc.published.state | published | |
Appears in Collections: | Staff Publications |
Files in This Item:
There are no files associated with this item.