The study of science at the level of the individual scholar requires the disambiguation of author names. Creating authors' publication oeuvres involves matching a list of unique author names to the names used in publication databases. Despite recent progress in the development of unique author identifiers, e.g., ORCID, VIVO, or DAI, author disambiguation remains a key problem in large-scale bibliometric analysis using data from multiple databases. This study introduces and tests a new methodology called seed + expand for semi-automatic bibliographic data collection for a given set of individual authors. Specifically, we identify the oeuvres of a set of Dutch full professors during the period 1980-2011. In particular, we combine author records from a Dutch National Research Information System (NARCIS) with publication records from the Web of Science. Starting with an initial list of 8,378 names, we identify 'seed publications' for each author using five different approaches. Subsequently, we 'expand' the set of publications using three different approaches. The different approaches are compared, and the resulting oeuvres are evaluated on precision and recall using a 'gold standard' dataset of authors for which verified publications in the period 2001-2010 are available.
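The evaluation step described above, comparing a collected oeuvre with verified publications, can be illustrated with a minimal sketch. It assumes that each oeuvre is represented as a set of publication identifiers; the function name `precision_recall` and the identifiers are hypothetical and are not taken from the study.

```python
def precision_recall(retrieved, verified):
    """Return (precision, recall) of a retrieved oeuvre against a verified one."""
    retrieved, verified = set(retrieved), set(verified)
    # Publications that were both retrieved and verified as the author's.
    true_positives = len(retrieved & verified)
    # Precision: share of retrieved publications that are correct.
    precision = true_positives / len(retrieved) if retrieved else 0.0
    # Recall: share of verified publications that were retrieved.
    recall = true_positives / len(verified) if verified else 0.0
    return precision, recall


# Hypothetical publication identifiers, for illustration only.
retrieved = {"pub1", "pub2", "pub3", "pub4"}
verified = {"pub2", "pub3", "pub4", "pub5", "pub6"}
precision, recall = precision_recall(retrieved, verified)
```

In this toy case three of the four retrieved publications are verified, while two verified publications are missed, so precision is higher than recall.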
Jankowski, N.W.; Scharnhorst, A.; Tatum, C.C.; Tatum, Z. 2012
Enhancing publications has a long history but is accelerating as authors and publishers explore electronic tablets as devices for dissemination and presentation. Enhancement of scholarly publications, in contrast, more often takes place in a Web environment and is coupled with the presentation of supplementary materials related to research. The approach to enhancing scholarly publications presented in this article goes a step further and involves the interlinking of the “objects” of a document: datasets, supplementary materials, secondary analyses, and post-publication interventions. This approach connects the user-centricity of Web 2.0 with the Semantic Web. It aims at facilitating long-term content structure through standardized formats intended to improve interoperability between concepts and terms within and across knowledge domains. We explored this conception of enhancement on a small set of books prepared for traditional academic publishers. While the project was primarily an exercise in development, the conclusion section of the article reflects on areas where conceptual and empirical studies could be initiated to complement this new direction in scholarly publishing.