Please use this identifier to cite or link to this item: https://doi.org/10.1007/978-3-642-13654-2_28
DC FieldValue
dc.titleKairos: Proactive harvesting of research paper metadata from scientific conference web sites
dc.contributor.authorHänse, M.
dc.contributor.authorKan, M.-Y.
dc.contributor.authorKarduck, A.P.
dc.date.accessioned2013-07-04T08:24:48Z
dc.date.available2013-07-04T08:24:48Z
dc.date.issued2010
dc.identifier.citationHänse, M.,Kan, M.-Y.,Karduck, A.P. (2010). Kairos: Proactive harvesting of research paper metadata from scientific conference web sites. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 6102 LNCS : 226-235. ScholarBank@NUS Repository. <a href="https://doi.org/10.1007/978-3-642-13654-2_28" target="_blank">https://doi.org/10.1007/978-3-642-13654-2_28</a>
dc.identifier.isbn3642136532
dc.identifier.issn03029743
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/41322
dc.description.abstractWe investigate the automatic harvesting of research paper metadata from recent scholarly events. Our system, Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled with fields of metadata that correspond to individual papers. Using event date metadata extracted from the conference website, Kairos proactively harvests metadata about the individual papers soon after they are made public. We use a Maximum Entropy classifier to classify uniform resource locators (URLs) as scientific conference websites and use Conditional Random Fields (CRF) to extract individual paper metadata from such websites. Experiments show an acceptable measure of classification accuracy of over 95% for each of the two components. © 2010 Springer-Verlag.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1007/978-3-642-13654-2_28
dc.sourceScopus
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1007/978-3-642-13654-2_28
dc.description.sourcetitleLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.description.volume6102 LNCS
dc.description.page226-235
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.