Persistent URL of this record https://hdl.handle.net/1887/3192382
Documents
-
- Download
- Title pages_Abstract_Contents
- open access
-
- Download
- Chapter 3
- open access
- Full text at publishers site
-
- Download
- Chapter 4
- open access
- Full text at publishers site
-
- Download
- Chapter 5
- open access
- Full text at publishers site
-
- Download
- Chapter 6
- open access
- Full text at publishers site
-
- Download
- Bibliography_Acronyms
- open access
-
- Download
- Summary in English
- open access
-
- Download
- Summary in Dutch
- open access
-
- Download
- Propositions
- open access
In Collections
This item can be found in the following collections:
Knowledge extraction from archives of natural history collections
This thesis describes computational methods for knowledge extraction from archives of natural history collections---here referring to handwritten manuscripts and hand-drawn illustrations. As we are dealing with heterogeneous real-world data, the task becomes exceptionally challenging. Small samples and a long-tailed distribution, sometimes with very fine-grained distinctions between classes, hamper model learning. Prior knowledge is therefore needed to bootstrap the learning process. Moreover, archival content can be difficult to interpret and integrate, and should therefore be formally described for data...Show moreNatural history collections provide invaluable sources for researchers with different disciplinary backgrounds, aspiring to study the geographical distribution of flora and fauna across the globe as well as other evolutionary processes. They are of paramount importance for mapping out long-term changes: from culture, to ecology, to how natural history is practiced.
This thesis describes computational methods for knowledge extraction from archives of natural history collections---here referring to handwritten manuscripts and hand-drawn illustrations. As we are dealing with heterogeneous real-world data, the task becomes exceptionally challenging. Small samples and a long-tailed distribution, sometimes with very fine-grained distinctions between classes, hamper model learning. Prior knowledge is therefore needed to bootstrap the learning process. Moreover, archival content can be difficult to interpret and integrate, and should therefore be formally described for data integration within and across collections. By serving extracted knowledge to the Semantic Web, collections are made amenable for research and integration with other biodiversity resources on the Web.
Show less
- All authors
- Stork, L.
- Supervisor
- Plaat, A.; Verbeek, F.J.
- Co-supervisor
- Wolstencroft, K.J.
- Committee
- Terras, M.M.; Andel, T.R. van; Kleijn, H.C.M.; Lew, M.S.K.; Boer, V. de; Weber, A.
- Qualification
- Doctor (dr.)
- Awarding Institution
- Leiden Institute of Advanced Computer Science (LIACS), Faculty of Science, Leiden University
- Date
- 2021-07-01
Funding
- Sponsorship
- This work is supported by the Netherlands Organisation for Scientific Research (NWO) and Brill publishers, grant number 652.001.001 (the Making Sense of Illustrated Handwritten Archives project).