C2. High dimensional, Educational and Functional data

17:20 - 18:30, Aula 10


Chair: Giovanni Boscaino


Analysis of Brain-Body Physiological Rhythm Using Functional Graphical Models


Rita Fici, Luigi Augugliaro and Ernst C. Wit


Abstract: This paper presents an analysis of physiological data derived from a recent investigation on network physiology, adopting the conceptual framework that views the human organism as a complex network of interacting organs. The study explores coordinated interactions among organs using functional conditional Gaussian Graphical Models (fcGGM). Organ functions are modelled as networks with individual regulatory mechanisms, forming a broader system through continuous interactions. The focus of the analysis is on the interactions within and between two subnetworks: brain activity and a composite network comprising the RR interval of the electrocardiographic waveform, respiration amplitude and blood volume pulse.

Click here to view the abstract.

A comparison of scalable estimation methods for large-scale logistic regression models with crossed random effects


Ruggero Bellio and Cristiano Varin


Abstract: Parameter estimation of generalized linear models with crossed random effects for large-scale settings is hampered by challenging numerical hindrances. This contribution focuses on logistic regression with crossed-random intercepts and it investigates the properties of two estimation methods for which a scalable software implementation exists, namely the all-row-column and penalized quasi- likelihood methods. The results of a simulation study for sparse settings inspired by e-commerce data, with sample sizes up to \(10^6\) suggest that the all-row-column method is preferable over penalized quasi-likelihood.

Click here to view the abstract.

Single-cell Sequencing Data: Critical Analysis and Definition of Statistical Models


Antonino Gagliano, Gianluca Sottile, Nicolina Sciaraffa, Claudia Coronnello and Luigi Augugliaro


Abstract: In the past decade, advances in single-cell RNA sequencing technologies have radically improved the comprehension of cell biology. Pseudo-time computation has revealed greater heterogeneity in cell differentiation, which is important for the study of various diseases. In this paper we evaluate pseudo-time calculation methodologies and propose statistical models for analyzing the relationship between pseudo-time and gene expression.

Click here to view the abstract.

Investigating the association between high school outcomes and university enrolment choice: a machine learning approach


Andrea Priulla, Alessandro Albano, Nicoletta D’Angelo, Massimo Attanasio


Abstract: This paper investigates how proficiency in mathematics and Italian tests in high school affect university enrolment choices in Italy. We distinguish between students from scientific and humanistic backgrounds, providing valuable insights into their enrolment choices. We employ gradient boosting methodology, adjusting for students’ sociodemographic characteristics and previous educational attainment. Results shed light on the interplay between student performance, sex, and the type of high school attended in shaping enrolment choices.

Click here to view the abstract.

 

A work by Gianluca Sottile

(on behalf of the local organizing committee)