Cell migration research has become a high-content field. However, the quantitative information encapsulated in these complex and high-dimensional datasets is not fully exploited due to the... Show moreCell migration research has become a high-content field. However, the quantitative information encapsulated in these complex and high-dimensional datasets is not fully exploited due to the diversity of experimental protocols and non-standardised output formats. In addition, typically the datasets are not open for reuse. Making the data open and Findable, Accessible, Interoperable, and Reusable (FAIR) will enable meta-analysis, data integration, and data mining. Standardised data formats and controlled vocabularies are essential for building a suitable infrastructure for that purpose but are not available in the cell migration domain. We here present standardisation efforts by the Cell Migration Standardisation Organization, CMSO, an open community-driven organisation to facilitate the development of standards for cell migration data. This work will foster the development of improved algorithms and tools, and enable secondary analysis of public datasets, ultimately unlocking new knowledge of the complex biological process of cell migration. Show less
Compositions of tree-walking tree transducers form a hierarchy with respect to the number of transducers in the composition. As main technical result it is proved that any such composition can be... Show moreCompositions of tree-walking tree transducers form a hierarchy with respect to the number of transducers in the composition. As main technical result it is proved that any such composition can be realized as a linear bounded composition, which means that the sizes of the intermediate results can be chosen to be at most linear in the size of the output tree. This has consequences for the expressiveness and complexity of the translations in the hierarchy. First, if the computed translation is a function of linear size increase, i.e., the size of the output tree is at most linear in the size of the input tree, then it can be realized by just one, deterministic, tree-walking tree transducer. For compositions of deterministic transducers it is decidable whether or not the translation is of linear size increase. Second, every composition of deterministic transducers can be computed in deterministic linear time on a RAM and in deterministic linear space on a Turing machine, measured in the sum of the sizes of the input and output tree. Similarly, every composition of nondeterministic transducers can be computed in simultaneous polynomial time and linear space on a nondeterministic Turing machine. Their output tree languages are deterministic context-sensitive, i.e., can be recognized in deterministic linear space on a Turing machine. The membership problem for compositions of nondeterministic translations is nondeterministic polynomial time and deterministic linear space. The membership problem for the composition of a nondeterministic and a deterministic tree-walking tree translation (for a nondeterministic IO macro tree translation) is log-space reducible to a context-free language, whereas the membership problem for the composition of a deterministic and a nondeterministic tree-walking tree translation (for a nondeterministic OI macro tree translation) is possibly NP-complete. Show less
Baltissen, G.; Kaag, M.M.A.; Lodder, A.; Steel, G. 2019
We evaluated the effectiveness of using language models, that were pre-trained in one domain, as the basis for a classification model in another domain: Dutch book reviews. Pre-trained language... Show moreWe evaluated the effectiveness of using language models, that were pre-trained in one domain, as the basis for a classification model in another domain: Dutch book reviews. Pre-trained language models have opened up new possibilities for classification tasks with limited labelled data, because representation can be learned in an unsupervised fashion. In our experiments we have studied the effects of training set size (100-1600 items) on the prediction accuracy of a ULMFiT classifier, based on a language models that we pre-trained on the Dutch Wikipedia. We also compared ULMFiT to Support Vector Machines, which is traditionally considered suitable for small collections. We found that ULMFiT outperforms SVM for all training set sizes and that satisfactory results (~90%) can be achieved using training sets that can be manually annotated within a few hours. We deliver both our new benchmark collection of Dutch book reviews for sentiment classification as well as the pre-trained Dutch language model to the community. Show less
Snellen, I.A.G.; Albrecht, S.; Anglada-Escude, G.; Baraffe, I.; Baudoz, P.; Benz, W.; ... ; Visser, P. de 2019
In this white paper, we recommend the European Space Agency plays a proactive role in developing a global collaborative effort to construct a large high-contrast imaging space telescope, e.g. as... Show moreIn this white paper, we recommend the European Space Agency plays a proactive role in developing a global collaborative effort to construct a large high-contrast imaging space telescope, e.g. as currently under study by NASA. Such a mission will be needed to characterize a sizable sample of temperate Earth-like planets in the habitable zones of nearby Sun-like stars and to search for extraterrestrial biological activity. We provide an overview of relevant European expertise, and advocate ESA to start a technology development program towards detecting life outside the Solar system. Show less