This study presents an agent-based simulation model exploring the patterns of presence and absence of Late Pleistocene Neanderthals in western Europe. HomininSpace implements a parameterized... Show moreThis study presents an agent-based simulation model exploring the patterns of presence and absence of Late Pleistocene Neanderthals in western Europe. HomininSpace implements a parameterized generic demographic and social model of hominin dispersal while avoiding parameter value biases and explicitly modelled handicaps. Models are simulated through time within a high-resolution environment where reconstructed temperatures and precipitation levels influence the carrying capacity of the landscape. Model parameter values are assigned and varied automatically while optimizing the match with Neanderthal archaeology using a Genetic Algorithm (GA) inspired by the processes of natural selection. The system is able to traverse the huge parameter space that is created by the complete set of all possible parameter value combinations to find those values that will result in a simulation that matches well with archaeological data in the form of radiometrically obtained presence data. Show less
Part of a series of digital guest lectures from Leiden University scholars for use in secondary school education. For more information, see:https://www.universiteitleiden.nl/gastlessen/cursussen... Show morePart of a series of digital guest lectures from Leiden University scholars for use in secondary school education. For more information, see:https://www.universiteitleiden.nl/gastlessen/cursussen/digitale-gastlessen/artificial-intelligence Show less
Mining time series is a machine learning subfield that focuses on a particular data structure, where variables are measured over (short or long) periods of time. In this thesis we focus on... Show moreMining time series is a machine learning subfield that focuses on a particular data structure, where variables are measured over (short or long) periods of time. In this thesis we focus on multivariate time series, with multiple vari- ables measured over the same period of time. In most cases, such variables are collected at different sampling rates. When combined, these variables can be explored with machine learning methods for multiple purposes.Firstly, we consider the possibility of unsupervised learning. In this case, we propose a pattern recognition method that discovers subsets of variables that show consistent behavior in a number of shared time segments. Fur- thermore, when in a supervised setting, given a dependent variable (target),we propose a method that aggregates independent variables into meaningful features.Additionally to the methods above, we provide two tools in the form of Software as a Service, where users without programming background can intuitively follow the learning and testing methodologies for both methods.Finally, we present an applied study of machine learning to improve speed skating athletes performance. Here, we make a deep analysis of historical data, in order to help optimize performance results. Show less
Many databases do not consist of a single table of fixed dimensions, but of objects that are related to each other: the databases are relational, or structured. We study the discovery of patterns... Show moreMany databases do not consist of a single table of fixed dimensions, but of objects that are related to each other: the databases are relational, or structured. We study the discovery of patterns in such data. In our approach, a data analyst specifies constraints on patterns that she believes to be of interest, and the computer searches for patterns that satisfy these constraints. An important constraint on which we focus, is the constraint that a pattern should have a significant number of occurrences in the data. Constraints like this allow the search to be performed reasonably efficiently. We develop algorithms for searching ppatterns taht are represented in formal first order logic, tree data structures and graph data structures. We perform experiments in which these algorithms, and algorithms proposed by other researchers, are compared with each other, and study which properties determine the efficiency of the algorithms. As a result, we are able to develop more efficient algorithms. As application we study the discovery of fragments in molecular datasets. The aim is to discover fragments that relate the structure of molecules to their activity. Show less