Rule extraction from support vector machines: An overview of issues and application in credit scoring | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://doi.org/10.1007/978-3-540-75390-2_2

Title:	Rule extraction from support vector machines: An overview of issues and application in credit scoring
Authors:	Martens, D. Huysmans, J. Setiono, R. Vanthienen, J. Baesens, B.
Issue Date:	2008
Citation:	Martens, D.,Huysmans, J.,Setiono, R.,Vanthienen, J.,Baesens, B. (2008). Rule extraction from support vector machines: An overview of issues and application in credit scoring. Studies in Computational Intelligence 80 : 33-63. ScholarBank@NUS Repository. https://doi.org/10.1007/978-3-540-75390-2_2
Abstract:	Innovative storage technology and the rising popularity of the Internet have generated an ever-growing amount of data. In this vast amount of data much valuable knowledge is available, yet it is hidden. The Support Vector Machine (SVM) is a state-of-the-art classification technique that generally provides accurate models, as it is able to capture non-linearities in the data. However, this strength is also its main weakness, as the generated non-linear models are typically regarded as incomprehensible black-box models. By extracting rules that mimic the black box as closely as possible, we can provide some insight into the logics of the SVM model. This explanation capability is of crucial importance in any domain where the model needs to be validated before being implemented, such as in credit scoring (loan default prediction) and medical diagnosis. If the SVM is regarded as the current state-of-the-art, SVM rule extraction can be the state-of-the-art of the (near) future. This chapter provides an overview of recently proposed SVM rule extraction techniques, complemented with the pedagogical Artificial Neural Network (ANN) rule extraction techniques which are also suitable for SVMs. Issues related to this topic are the different rule outputs and corresponding rule expressiveness; the focus on high dimensional data as SVM models typically perform well on such data; and the requirement that the extracted rules are in line with existing domain knowledge. These issues are explained and further illustrated with a credit scoring case, where we extract a Trepan tree and a RIPPER rule set from the generated SVM model. The benefit of decision tables in a rule extraction context is also demonstrated. Finally, some interesting alternatives for SVM rule extraction are listed. © 2008 Springer-Verlag Berlin Heidelberg.
Source Title:	Studies in Computational Intelligence
URI:	http://scholarbank.nus.edu.sg/handle/10635/42574
ISBN:	9783540753896
ISSN:	1860949X
DOI:	10.1007/978-3-540-75390-2_2
Appears in Collections:	Staff Publications

Show full item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Altmetric

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.