Persistent URL of this record https://hdl.handle.net/1887/3220882
Documents
-
- Download
- Title pages_Contents-List of Symbols
- open access
-
- Download
- Appendices_Bibliography
- open access
-
- Download
- Summary in Dutch
- open access
-
- Download
- Summary in English
- open access
-
- Download
- Summary in Portuguese
- open access
-
- Download
- Propositions
- open access
In Collections
This item can be found in the following collections:
Robust rules for prediction and description
Rules provide a simple form of storing and sharing information about the world. As humans, we use rules every day, such as the physician that diagnoses someone with flu, represented by "if a person has either a fever or sore throat (among others), then she has the flu.". Even though an individual rule can only describe simple events, several aggregated rules can represent more complex scenarios, such as the complete set of diagnostic rules employed by a physician.
The use of rules spans many fields in computer science, and in this dissertation, we focus on rule-based models for machine learning and data mining. Machine learning focuses on learning the model that best predicts future (previously unseen) events from historical data. Data mining aims to find interesting patterns in the available...Show moreIn this work, we attempt to answer the question: "How to learn robust and interpretable rule-based models from data for machine learning and data mining, and define their optimality?".
Rules provide a simple form of storing and sharing information about the world. As humans, we use rules every day, such as the physician that diagnoses someone with flu, represented by "if a person has either a fever or sore throat (among others), then she has the flu.". Even though an individual rule can only describe simple events, several aggregated rules can represent more complex scenarios, such as the complete set of diagnostic rules employed by a physician.
The use of rules spans many fields in computer science, and in this dissertation, we focus on rule-based models for machine learning and data mining. Machine learning focuses on learning the model that best predicts future (previously unseen) events from historical data. Data mining aims to find interesting patterns in the available data.
To answer our question, we use the Minimum Description Length (MDL) principle, which allows us to define the statistical optimality of rule-based models. Furthermore, we empirically show that this formulation is highly competitive for real-world problems.
Show less
- All authors
- Manuel Proenca, H.
- Supervisor
- Bäck, T.W.H.
- Co-supervisor
- Leeuwen, M. van
- Committee
- Plaat, A.; Mentens, N.; Grünwald, P.D.; Siebes, A.P.J.M.; Vreeken, J.
- Qualification
- Doctor (dr.)
- Awarding Institution
- Leiden Institute of Advanced Computer Science (LIACS), Faculty of Science, Leiden University
- Date
- 2021-10-26
- Title of host publication
- SIKS Dissertation Series
- ISBN (print)
- 9789463327923
Publication Series
- Name
- 2021-23
Funding
- Sponsorship
- NWO; GE Research Bengaluru