Dario Malchiodi — Università degli Studi di Milano

A learning algorithm for fuzzy sets that processes data labeled with their membership degrees has been proposed in [Malchiodi and Pedrycz, 2013; Malchiodi, 2019a]. This algorithm has been applied to axiom mining in the Semantic Web [Malchiodi and Tettamanzi, 2018] and to the selection of negative examples in bioinformatics [Frasca and Malchiodi, 2017; Frasca and Malchiodi, 2016]. The approach has been extended in [Cermenati et al., 2020] to the simultaneous induction of several fuzzy sets, and in [Malchiodi and Zanaboni, 2019] to shadowed sets.

in collaboration with Prof. Zanaboni (Università degli Studi di Milano), Prof. Pedrycz (University of Alberta)

Knowledge induced via machine learning techniques is often encoded and stored in a distributed fashion within models learnt from data. It can therefore be difficult to give a qualitative interpretation of the obtained results. Moreover, this typically results in bandwidth and storage capacity issues when resources are limited. A possible solution to these problems consists in reducing the amount of space needed to store such models after they have been trained. Some compression techniques for neural networks obtained via deep learning are currently under investigation within the research project Multicriteria Data Structures and Algorithms: from compressed to learned indexes, and beyond, funded by the Italian Ministry of Education and Research under the PRIN initiative [Marinò et al., 2021]. Their implementation is described in [Marinò et al., 2021].

in collaboration with Prof. Frasca (Università degli Studi di Milano)

Searching for potential axioms within a set of formulas is a particularly demanding problem from a computational viewpoint. Inducing such axioms from formulas labeled via a precomputed fitness measure, obtained by processing a knowledge base from the Semantic Web field, has been studied using learning algorithms for fuzzy sets [Malchiodi and Tettamanzi, 2018] and kernel-based regression techniques [Malchiodi et al., 2018]. The dependency of the results on the learning algorithm used, and on the dimensionality reduction technique employed to encode axioms as numerical vectors, has been investigated in [Malchiodi et al., 2020].

in collaboration with Prof. Da Costa Pereira, Prof. Tettamanzi (Université de la Côte d'Azur)

The application of supervised machine learning methods in bioinformatics requires selecting, among the data not labeled as positive, those items that represent reliable negative examples, that is, excluding entities on which no experiments have been conducted. In [Frasca and Malchiodi, 2017; Frasca and Malchiodi, 2016] this negative selection problem has been tackled using a ranking based on membership functions of fuzzy sets, while [Frasca et al., 2017; Boldi et al., 2018] propose an encoding of the available data that supports the negative selection process in the problem of protein function prediction. Finally, a similar procedure has been proposed in [Frasca et al., 2019] for the problem of gene prioritization.

in collaboration with Prof. Frasca (Università degli Studi di Milano)

[Casiraghi et al., 2020] and [Esposito et al., 2021] describe the application of machine learning techniques to the problem of predicting the severity of COVID-19 in patients entering emergency departments (EDs).

in collaboration with Prof. Valentini (Università degli Studi di Milano), Prof. Casiraghi (Università degli Studi di Milano), Prof. Frasca (Università degli Studi di Milano)

Some machine learning and statistical data analysis techniques have been adapted to deal with problems in the veterinary and forensic fields. In particular, [Galizzi et al., 2021] and [Bagardi et al., 2021] describe the application of statistical methods to classify the incidence of cardiovascular factors in the death of dogs undergoing specific therapy, while [Casali et al., 2021] discusses a pilot study on the application of classification algorithms to predict the type of vehicle involved in a collision with a pedestrian.

in collaboration with Prof. Zanaboni (Università degli Studi di Milano)

Machine learning models typically start from a labeled sample whose elements are processed homogeneously (that is, each element has the same importance). In [Malchiodi, 2008] the general model of data quality-based learning was proposed. In this model it is possible to associate with each of the available data items a numerical quantification of its importance relative to the remaining data. This model was applied to the problem of classification through Support Vector Machines, both in its linear [Apolloni and Malchiodi, 2006] and kernel-based versions [Apolloni et al., 2007]. A first analysis of the performance of these applications has been undertaken both theoretically [Apolloni et al., 2007] and experimentally [Malchiodi, 2009]. Some preliminary applications in the bioinformatics field are described in [Malchiodi et al., 2010]. A similar approach has also been applied to the regression problem in [Apolloni et al., 2010; Malchiodi et al., 2009; Apolloni et al., 2005] and to unbalanced learning in [Malchiodi, 2013b].

Several types of learning algorithms have been designed, implemented and analyzed. In particular, [Malchiodi and Legnani, 2014] proposes an improvement of the support vector-based classification algorithms dealing both with partially labeled data and with uncertain labels, while [Malchiodi and Pedrycz, 2013] introduces a learning algorithm for membership functions of fuzzy sets. The latter approach has been extended in [Malchiodi and Zanaboni, 2019] to shadowed sets.

Concerning tertiary-level teaching, two publications have been produced: a manual for a software package for automatic computations and an exercise textbook on operating systems [Malchiodi, 2007; Malchiodi, 2015]. For a wider audience, [Monga et al., 2017] is centered around Alan Turing, and [Malchiodi, 2019a] describes possible future evolutions of fuzzy-based technologies.