Please use this identifier to cite or link to this item: https://doi.org/10.1021/ci049869h
DC FieldValue
dc.titleEffect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents
dc.contributor.authorXue, Y.
dc.contributor.authorLi, Z.R.
dc.contributor.authorYap, C.W.
dc.contributor.authorSun, L.Z.
dc.contributor.authorChen, X.
dc.contributor.authorChen, Y.Z.
dc.date.accessioned2014-12-02T06:52:40Z
dc.date.available2014-12-02T06:52:40Z
dc.date.issued2004-09
dc.identifier.citationXue, Y., Li, Z.R., Yap, C.W., Sun, L.Z., Chen, X., Chen, Y.Z. (2004-09). Effect of molecular descriptor feature selection in support vector machine classification of pharmacokinetic and toxicological properties of chemical agents. Journal of Chemical Information and Computer Sciences 44 (5) : 1630-1638. ScholarBank@NUS Repository. https://doi.org/10.1021/ci049869h
dc.identifier.issn00952338
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/114320
dc.description.abstractStatistical-learning methods have been developed for facilitating the prediction of pharmacokinetic and toxicological properties of chemical agents. These methods employ a variety of molecular descriptors to characterize structural and physicochemical properties of molecules. Some of these descriptors are specifically designed for the study of a particular type of properties or agents, and their use for other properties or agents might generate noise and affect the prediction accuracy of a statistical learning system. This work examines to what extent the reduction of this noise can improve the prediction accuracy of a statistical learning system. A feature selection method, recursive feature elimination (RFE), is used to automatically select molecular descriptors for support vector machines (SVM) prediction of P-glycoprotein substrates (P-gp), human intestinal absorption of molecules (HIA), and agents that cause torsades de pointes (TdP), a rare but serious side effect. RFE significantly reduces the number of descriptors for each of these properties thereby increasing the computational speed for their classification. The SVM prediction accuracies of P-gp and HIA are substantially increased and that of TdP remains unchanged by RFE. These prediction accuracies are comparable to those of earlier studies derived from a selective set of descriptors. Our study suggests that molecular feature selection is useful for improving the speed and, in some cases, the accuracy of statistical learning methods for the prediction of pharmacokinetic and toxicological properties of chemical agents.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1021/ci049869h
dc.sourceScopus
dc.typeArticle
dc.contributor.departmentCOMPUTATIONAL SCIENCE
dc.contributor.departmentBIOLOGICAL SCIENCES
dc.contributor.departmentSINGAPORE-MIT ALLIANCE
dc.description.doi10.1021/ci049869h
dc.description.sourcetitleJournal of Chemical Information and Computer Sciences
dc.description.volume44
dc.description.issue5
dc.description.page1630-1638
dc.description.codenJCISD
dc.identifier.isiut000224185900013
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.