Bao, Y.Datta, A.INFORMATION SYSTEMS2014-07-042014-07-042012Bao, Y.,Datta, A. (2012). Summarization of corporate risk factor disclosure through topic modeling. International Conference on Information Systems, ICIS 2012 1 : 701-719. ScholarBank@NUS Repository.9781627486040https://scholarbank.nus.edu.sg/handle/10635/78363In this paper, we propose a novel problem of summarizing textual corporate risk factor disclosure, which aims to simultaneously infer the risk types across corpus and assign each risk factor to its most probable risk type. To solve the problem, we develop a variation of LDA topic model called Sent-LDA. The variational EM learning algorithm, which guarantees fast convergence, is derived and implemented for our model. Experiments show that our model is much more efficient and effective than LDA for solving our proposed problem. Specifically, our model is 50 times faster than LDA in the same conditions, and generates better topics for summarization than LDA. Our model is visualized in a publicly available system.Risk factor disclosureSummarizationTopic modelingVariational EMSummarization of corporate risk factor disclosure through topic modelingConference PaperNOT_IN_WOS