Thesis - Image and Sound Processing Lab

Proposals

Audio

Field	Contact	Title	Thesis type	Description
Acoustics	Parrinelli (Speri S.p.A.) / Antonacci	Experimental Comparison of Measurement Methods and Sound Sources for Measuring Reverberation Time and Impact Sound Insulation	Full / short thesis	The thesis conducts a comparative analysis of the various measurement techniques used to determine two parameters in the field of building acoustics: reverberation time and impact sound insulation. The study focuses on the use and evaluation of the different sound sources specified in the related UNI EN ISO standards. By conducting measurement campaigns in typical environments, organizing the results into a database, and analyzing the data, the project aims to identify the advantages and limitations of each method. The objective is to evaluate how different sound sources affect the reliability and precision of the results, identifying the pros, cons, and limitations of each method to define an optimal approach for acoustic assessments.
Beamforming	Antonacci / Cohen	Robust covariance estimation for intelligibility-oriented MVDR beamforming	Full / short thesis	This thesis investigates robust covariance-matrix estimation for MVDR beamforming in noisy and reverberant environments. MVDR beamforming depends critically on accurate estimates of speech and noise spatial covariance matrices. Classical approaches often rely on VAD-based noise updates, but this can fail during speech presence and in nonstationary noise. The project will start from classical MVDR, delay-and-sum, and VAD-based covariance estimation, then develop a soft-VAD / coherence-aware covariance update. Spatial coherence can be used as a reliability cue, inspired by dual-microphone coherence-based enhancement, while VAD-free covariance estimation provides the main technical baseline. The proposed method will be evaluated against classical and neural-mask-inspired MVDR baselines. Recent neural beamforming work shows that the quality of speech/noise spatial covariance matrices remains a central issue even in modern systems. The expected contribution is an interpretable, lightweight beamforming strategy that improves robustness without requiring large-scale end-to-end training.
Beamforming	Antonacci / Cohen	Intelligibility-aware speech enhancement via improved SNR/noise estimation	Full / short thesis	This thesis focuses on intelligibility-aware speech enhancement through improved a priori SNR and noise PSD estimation. Classical enhancement methods such as Wiener filtering and spectral subtraction can reduce noise but often fail to improve intelligibility because they introduce speech distortions that are not captured by MSE- or SNR-based objectives. Previous works explicitly identify this gap and analyze distortion types that affect intelligibility. The project will implement classical decision-directed SNR estimation, then improve it with adaptive smoothing, onset-aware correction, or a lightweight data-driven estimator. The starting point is the work by Nicolson and Paliwal referenced below, which estimates the SNR using a data-driven approach through learned nonlinear mappings. The theoretical analysis of decision-directed SNR estimation will guide the design, especially in low-SNR and transient conditions. The expected contribution is an interpretable enhancement method that preserves speech onsets and avoids excessive attenuation, evaluated with both quality and intelligibility metrics.
Beamforming	Antonacci / Cohen	DRR-guided dereverberation and reverberation-aware enhancement	Full / short thesis	Abstract: This thesis studies reverberation-aware speech enhancement using the direct-to-reverberant ratio (DRR) as an acoustic reliability cue. In reverberant rooms, aggressive denoising or beamforming may distort speech when the direct-path component is weak. The project will estimate DRR from single- or multichannel signals and use it to adapt dereverberation or post-filtering strength. Previous works estimate DRR using a spatial correlation model that decomposes the microphone-array input into direct and reverberant components. Other works relate perceived reverberation to T60, room spectral variance, and DRR. WPE will be used as the main dereverberation baseline because it is a standard blind dereverberation method based on long-term linear prediction. The proposed contribution is a DRR-guided enhancement pipeline that adapts processing strength according to reverberation severity, rather than applying a fixed dereverberation or post-filtering strategy.
Beamforming	Antonacci / Cohen	Objective intelligibility and quality evaluation for beamforming/enhancement	Full / short thesis	This thesis develops an evaluation and tuning protocol for speech enhancement and beamforming systems using objective intelligibility and quality metrics. The motivation is that noise reduction, SNR improvement, and perceptual intelligibility are not equivalent. STOI predicts intelligibility of noisy and time-frequency weighted speech using short-time temporal-envelope correlations. The project will compare intrusive metrics such as STOI/ESTOI with non-intrusive neural metrics such as DNSMOS and TorchAudio-Squim. Time-varying quality will also be considered, since post-filtering and beamforming artifacts may occur locally rather than uniformly across an utterance. The expected contribution is a practical evaluation framework and a metric-aware tuning strategy for enhancement systems, showing when classical metrics agree or disagree and how this affects algorithm selection.
Musical Acoustics	Pezzoli, Antonacci	Modeling and analysis of the soundboard in keyboard instruments.	Full thesis, Short thesis	Keyboard instruments (e.g., pianos and harpsichord) are characterized by complex sound radiation caused by the soundboard and the enclosure. Replicating such behaviour is fundamental especially in the case of ancient instruments that cannot be played anymore. Through simulation of several soundboard designs these thesis aim at: analysing the relationship between the soundboard designa and vibroacoustic response thereof; generate extensive simulation datasets of soundboards,; develop surrogate digital models of the instruments using machine learning. [Required knowledge: vibroacoustics, COMSOL Multiphysics, machine learning]
Musical Acoustics	Antonacci / Cillo (University of Stuttgart)	Numerical Prediction of Modal Radiation Efficiency in Classical Guitars	Full thesis	[Thesis abroad at the Institute of Engineering and Computational Mechanics (ITM), University of Stuttgart, Germany.] The acoustical character of a classical guitar arises from the complex coupling between the structural vibrations of its wooden plates and the surrounding air. However, the mechanical properties of tonewoods show significant variability, even within the same species, leading to differences in modal frequencies, shapes, and radiation characteristics between instruments of identical geometry. To achieve consistent acoustic performance, such material variability can be compensated through geometry adjustments guided by virtual prototyping. The goal of this thesis is to develop a Boundary Element (BE) model capable of predicting the radiation efficiency of the structural vibration modes of a classical guitar. Using mode shapes obtained from finite element analysis as input, the BE model will evaluate the acoustic radiation into the surrounding air, accounting for both the vibrating body and the air flow through the soundhole. The study will quantify how each mode contributes to the total radiated sound power, investigate the interaction between structural and air-borne radiation, and identify modes most relevant for perceptual sound consistency. The results will provide a foundation for integrating radiation-based weighting into geometry optimization of guitars. Required knowledge: vibration analysis, acoustics, familiarity with the finite element method.
Audio signal processing	Pezzoli	Modeling and characterization of sound source directivities	Full thesis, Short thesis	The directivity is an inherent property of every sound source (e.g., a musical instrument). The goal of this thesis is to define suitable models for the directivity of sound sources which can be used when simulating directivities. Long or short thesis depends on the depth and novelty of the analysis. [Required knowledge: machine learning, basic knowledge of statistical signal processing, spherical harmonics decomposition of sound field].
Audio Signal Processing	Bernardini, Pezzoli	Spatial audio in networked music performance applications	Full thesis	In this thesis, novel approaches to the processing of multichannel audio contents will be investigated. In particular, the main goal is to enhance the immersivity in the context of networked music performance. [Required knowledge: audio signal processing, ambisonics, binaural rendering] [Additional skills: VR]
3D audio	Pezzoli, Greco	Novel approach to parametric soundfield reconstruction	Full thesis	Investigate a method for reconstructing sound fields at arbitrary locations using data from spatially distributed microphone arrays. Optimized for reverberant environments, this approach models the acoustic scene through parameters that define direct and diffuse sound components, capturing source location and directivity. This enables precise reconstruction, with parameters estimated in a relative coordinate system to support scalable and distributed processing.
3D audio	Pezzoli	Physics-informed deep prior for sound field reconstruction	Full thesis	The estimation of sound field provided by neural networks such has deep prior, can potentially diverge from the underlying physics. This thesis aims at defining novel paradigm for sound field reconstruciton that leverage on the generational power of neural networks and prior knowledge of physics. [Required knowledge: deep learning, acoustics] [Thesis in collaboration with Prof. S. Koyama of the National Institute of Informatics - Tokyo.]
3D audio	Pezzoli	Physics-informed deep kernel interpolation for sound field reconstruction	Full thesis	Sound field reconstruction is at the base of several spatial audio applications. Deep kernel learning has potential application for the reconstruction of the sound field thanks to the possibility of adopting physics-informed neural network in order to impose prior knowledge of the acoustics. The thesis aims to develop novel deep kernel models for the reconstruction of acoustic fields. [Required knowledge: deep learning, acoustics] [Thesis in collaboration with Prof. S. Koyama of the National Institute of Informatics - Tokyo.]
Audio Signal Processing	Mezza, Bernardini	Deep Packet Loss Concealment for Speech and Music	Full thesis	Audio communications over the Internet have become an integral part of everyday life. However, speed is often prioritized over reliability in order to respect strict real-time constraints. Consequently, short audio segments (packets) risk being severely delayed or lost. We recently developed deep Packet Loss Concealment (PLC) methods, as well as hybrid PLC algorithms combining signal-processing and deep-learning techniques in a synergistic way. In this thesis, we will explore its performance on speech and/or music signals. The thesis won't deal with network-related and other IP-related aspects. [Required knowledge: signal processing theory; practical experience with deep learning.]
Audio signal processing	Pezzoli	Modeling and characterization of HRTF	Full thesis, Short thesis	The HRTFs are individualized acoustic characteristics of human ears. The goal of this thesis is to define suitable models for the HRTFs which can be used when simulating sound fields. Long or short thesis depends on the depth and novelty of the analysis. [Required knowledge: machine learning, basic knowledge of statistical signal processing, spherical harmonics decomposition of sound field].
3D audio	Pezzoli, Antonacci	Nearfield filter for spherical microphone array recordings	Full thesis	Spherical Microphone Arrays (SMA) are very suitable for binaural rendering and in general for spatial audio applications. In this thesis we are interested in developing new methods to filter the undesired signals in a nearfield region of the SMA.
Audio forensics	Bestagini, Antonacci	Detection of text-to-speech algorithms	Full thesis	Nowadays, text-to-speech and voice conversion algorithms are able to produce very realistic speech signals, which can easily trick human ear. Moreover, this technology is in rapid evolution and it is not possible to take in account all new synthesis methods. It is necessary to develop effective synthetic speech detection systems able to work in open-set scenarios.
Audio signal processing	Pezzoli, Antonacci	Deep learning solution for localization of acoustic sources in the spherical harmonics domain	Full thesis	The spherical harmonics representation of the sound field is a widely adopted description of spatial sound. The goal of this thesis is to devise deep learning solutions that exploit the spherical harmonics representation for the analysis of the acoustic field e.g., localization of acoustic sources. [Required knowledge: spherical harmonics decomposition of sound field, theoretical knowledge and pratical experience with deep learning]
Audio signal processing	Giampiccolo, Bernardini	Investigating the Correlation between Respiratory volumes and Speaking Voices	Full thesis	The projects is in collaboration with Bioengineering researchers and medical doctors from Ospedale Sacco. The aim is to study how volumes acquired through 3D cams are correlated to speaking voices for patients affected by different pathologies (SLA, Parkinson's disease, etc.)
Audio signal processing	Giampiccolo, Bernardini	Gradient Descent Methods for the Emulation of Nonlinear Audio Circuits	Full thesis	The thesis concerns the study and implementation of gradient methods for the emulation of audio circuits in the Wave Digital domain. In fact, in presence of multiple nonlinear elements, iterative methods are needed to find the solution of nonlinear circuits. Over the past few years, several iterative techniques have been considered, namely fixed-point, Netwon-Raphson methods, etc. We propose to change paradigm and explore new ways for solving circuits in order to find the cheapest one as this is desired for the real-time emulation of audio circuits in the context of Virtual Analog applications.
Audio signal processing	Giampiccolo, Bernardini	Virtual Analog Modeling, Audio Circuit Emulation, Physical Modeling Sound Synthesis through Wave Digital Filters	Full thesis
Musical Acoustics	Pezzoli, Antonacci	Machine learning techniques for Nearfield Acoustic Holography	Full/short thesis	Recent data-driven based NAH methods can predict the vibrational behavior on sources from the acquisition of the radiated sound field. Nevertheless, several limitations are still reducing the perfomance of data-driven NAH in real scenarios. This thesis aims at extending the recent NAH solutions with various strategies (transfer learning, low-rank adaptation etc) in order to tune the networks with different data and improving the model with specific physical priors to reconstruct the vibrational content with an unsupervised approach (long thesis). [knowledge of Deep Learning required]
Audio Signal Processing	Miotello, Pezzoli	AI and Data-driven Techniques for Spatial Audio Applications	Full thesis	Spatial audio integrates principles from acoustics and signal processing to accurately capture, model, and reproduce sound in complex 3D environments. As immersive technologies such as virtual and augmented reality, telecommunication, and acoustic simulation continue to evolve, the demand for robust, efficient and accurate methods to analyze and process complex acoustic phenomena is rapidly growing. We offer thesis opportunities in spatial audio that focus on building innovative frameworks based on modern data-driven and deep learning techniques. Potential research areas include room impulse response estimation, sound field reconstruction, sound source directivity processing, multichannel sound source separation, sound field separation, head-related transfer functions personalization and upsampling. These projects will explore cutting-edge techniques such as generative diffusion models, physics-informed neural networks, equivariant architectures, neural fields, and other advanced representations. If you are interested in these topics or wish to discuss specific research directions, please contact us for more details: federico.miotello@polimi.it, mirco.pezzoli@polimi.it. [Required knowledge: audio signal processing, deep learning (eventually)]
Audio Signal Processing	Miotello, Bernardini	Advances in Differential Beamforming and Signal Enhancement	Full thesis	Differential arrays exploit closely spaced transducers to approximate acoustic pressure differentials, achieving nearly frequency-invariant beam patterns, particularly advantageous in MEMS-based applications. However, conventional differential beamforming approaches often face limitations due to their inherent linear nature and rigid array geometry constraints. We propose thesis opportunities that investigate advanced strategies to overcome these challenges, exploring both filter design and signal enhancement techniques. These projects will explore the potential of data-driven techniques, such as neural networks, alongside traditional model-based approaches to improve the performance and flexibility of differential arrays in acoustic applications. Topics may include neural network-based beamforming filter design and optimization, adaptive signal enhancement pipelines, and hybrid frameworks that integrate physical modeling with machine learning. If you are interested in these topics or wish to discuss specific research directions, please contact us for more details: federico.miotello@polimi.it, alberto.bernardini@polimi.it. [Required knowledge: audio signal processing, deep learning (eventually)]
Audio Signal Processing	Miotello, Marazzi, Pezzoli	Audio-Based Signal Processing for Health Monitoring and Diagnosis	Full/short thesis	This thesis explores the application of audio signal processing techniques for healthcare monitoring and diagnostic support by analyzing physiological sounds such as speech, breathing, coughing, and heartbeats. The study aims to develop methods for preprocessing and extracting meaningful features from audio signals and to apply machine learning or deep learning models for classification and anomaly detection. By leveraging widely available recording devices, the research seeks to enable non-invasive, cost-effective, and remote health monitoring solutions. The work will evaluate the effectiveness and robustness of different signal processing and modeling approaches, ultimately contributing to the development of scalable audio-based healthcare systems and highlighting their potential and limitations in real-world applications. If you are interested in these topics or wish to discuss specific research directions, please contact us for more details: federico.miotello@polimi.it, alice.marazzi@polimi.it, mirco.pezzoli@polimi.it. [Required knowledge: audio signal processing, deep learning (eventually)]

Image and Video

Field	Contact	Title	Thesis type	Description
Image/video forensics	Bestagini, Mandelli, Cannas	Detect and localize image and video manipulations	Full	Images and videos can be manipulated in many different ways (e.g., object insertion and removal, local retouching, laundering operations, etc.). We are interested in developing methods to detect and localize possible editing operations on images and videos.
Image/video forensics	Bestagini, Mandelli, Cannas	Distinguish original videos from DeepFakes	Full	DeepFake videos can be maliciously spread online. We are interested in developing techniques to detect whether a video is a DeepFake or not, why a detector says a video is fake, and understand which DeepFake generation software has been used to create a video.
Image/video forensics	Bestagini, Mandelli, Cannas	Assess the authenticity of satellite images	Full	Satellites can acquire visual data with different sensors. We are interested in developing techniques that verify whether an overhead image has been edited or not.
Image/video forensics	Bestagini, Mandelli, Cannas	Forensic analysis of scientific images	Full	Scientific publications in the life science area typically contain charcateristic kinds of images to showcase the achieved results (e.g., western blots, microscopy acquisitions, etc.). As these images differ from natural photographs, we are interested in developing novel techniques to detect possible scientific image forgery operations.
Image processing	Bestagini, Mandelli, Giganti	Enhancement of emission maps	Full	Accurate BVOC emission maps are crucial for understanding their effects on air quality and climate, yet existing maps often lack the spatial resolution needed for detailed analysis. This thesis proposes using Super-Resolution Neural Networks (SRNNs) to enhance these maps by generating high-resolution data from low-resolution inputs. SRNNs can capture finer spatial details and improve the accuracy of emission maps, bridging gaps in sparse data to support high-precision environmental modeling.
Spatiotemporal processing	Bestagini, Mandelli, Giganti	Spatiotemporal analysis of climate data	Full	Climate data analysis is hindered by complex patterns and frequent data gaps. This thesis proposes using Spatiotemporal Graph Neural Networks (STGNNs) to improve climate forecasting and data imputation by capturing spatial and temporal relationships. By testing STGNNs for predicting future climate variables and filling missing data, this research aims to enhance data accuracy and reliability in climate modeling.

Geophysics

Contact	Title	Thesis type	Description
Tubaro, Bestagini	Improving Full Waveform Inversion with CNNs	Full/short	Full Waveform Inversion reconstructs the subsurface velocities from a set of measurements. It is very expensive, time-consuming and prone to a number of tips and tricks for avoiding local minima, numerical instability and optimization errors.
Tubaro, Bestagini	Denoising and Interpolation of seismic data through CNNs	Full/short	The amount of data is constantly increasing and the areas of interest are more and more complex to analyze. Moreover, they require a subsurface mapping at increasingly higher resolution and higher fidelity. Can CNNs help this process?
Tubaro, Bestagini	Machine Learning guided Seismic Interpretation	Full/short	Human experts visually inspects seismic images looking for subsurface features. On the other hand, Machine Learning techniques have proven to be effective in image segmentation (i.e., recognizing objects and targets from a set of pixels). Can we merge these two worlds?

Currently on-going

Expand list

Field	Supervisor	Topic	Student(s)
Room Acoustics	Giampiccolo, Bernardini, Weinzierl	Difference Limen of a Room-Acoustical Predictor of Timbre	Eleonora Serra
Audio/Bio signal processing	Giampiccolo, Bernardini, Lo Mauro	Analysis of Opera Singer's Respiratory Signals vs. Audio recordings	Mateo Vitalone
Audio signal processing	Giampiccolo, Bernardini	Modeling Guitar Cabinet Impulse Responses via DFDN	Francesco Panettieri
Audio signal processing	Giampiccolo, Bernardini	Modeling Circuits with a High Number of Nonlinearities in the WD Domain	Mattia Dalla Costa
Audio signal processing	Giampiccolo, Bernardini	Biparametric Wave Digital Filters for Time-Varying Circuit Modeling	Alessandro Lillo
Audio signal processing	Giampiccolo, Bernardini	Phase interpolation for loudspeaker simulation	Carmen Frieda Franci
Audio signal processing	Giampiccolo, Bernardini	Learning Reverberation from Audio files with Differentiable Feedback Delay Networks	Giuliano Di Lorenzo
Audio signal processing	Giampiccolo, Bernardini	Optimizing WDFs for FPGA	Youssef Abouelazm
Audio signal processing	Giampiccolo, Bernardini	Modeling Digital ICs in Wave Digital Filters	Stefano Polimeno
Audio Signal Processing	Giampiccolo, Bernardini	Quantum Neural Networks for Virtual Analog Modeling	Carlo Macrì
Audio signal processing	Pezzoli, Miotello	Development of acoustic simulation framework for GPU	Yuhang Chen
Musical Acoustics	Pezzoli, Antonacci	Physics-informed Neural networks for estimating material parameters in wood plates	Francesca Benesso
Music informatics	Sarti, Mezza, Bernardini	Unsupervised selection of harmonic complexity metrics	Giorgio De Luca
Musical Acoustics	Gonzalez, Antonacci	Random variation of guitar bracings	Mattia Vanessa
Musical acoustics	Gonzalez, Antonacci	Metamaterials for guitarmaking	Gabriele Marelli, Mattia Lercari
Musical acoustics / AI	Gonzalez, Antonacci	AI-powered pick up: making guitars sound great again	Emanuele Voltini
Music Informatics	Sarti, Comanducci	HandMonizer, personalized digital musical instrument design	Antonios Pappas
Deep Learning for audio	Ronchini, Comanducci	Balance between performance end carbon footspring of state-of-the-art deep learning systems for audio domain applications	Riccardo Passoni
DCASE	Comanducci	Bioacoustic detection	Nicolò Pisanu
Spatial audio	Miotello, Pezzoli	Latent space sound field reconstruction	Riccardo Tocci
Audio processing for biomedical applications	Miotello, Malvermi, Pezzoli	Pediatric heart murmur detection	Alberto Bollino
Spatial audio	Miotello, Pezzoli	Directivity reconstruction using neural processes	Elena Molinari
Spatial audio	Miotello, Pezzoli	Differentiable solver for sound field reconstruction	Alinda Gercek
Spatial audio	Miotello. Pezzoli, Bernardini	Development of an Immersive Network Music Performances framework	Giuseppe Longo
Spatial audio	Miotello, Malvermi, Pezzoli	Acoustic measurements in the new spatial audio room of Politecnico di Milano campus in Cremona	Madhav Gopi
Spatial audio	Greco, Miotello, Pezzoli	HRTF modeling using vMF distribution	Maksim Stephanov
Audio processing for biomedical applications	Marazzi, Miotello, Pezzoli	Heart-lung sound separation using deep learning models	Pietro Callandrone
Audio processing for biomedical applications	Marazzi, Miotello, Pezzoli	Respiratory sound classification using pre-trained models	Lorenzo Bianco

Past (from 2017)

Expand list

Field	Supervisor	Title	Student(s)	Link
Biomedical Signal Processing	Lo Mauro, Giampiccolo, Bernardini	Integrated analysis of respiratory-phonatory functions: normative patterns across sex and age	Bianca Zocco	https://www.politesi.polimi.it/handle/10589/251617
Audio signal processing	Giampiccolo, Bernardini	Kolmogorov-Arnold Networks for Virtual Analog Modeling	Enrico Torres	https://www.politesi.polimi.it/handle/10589/253655
Music informatics	Giampiccolo, Bernardini	Pattern matching in polyphonic music: the Chamfer distance	Antonio Rizzitiello	https://www.politesi.polimi.it/handle/10589/247040
Audio signal processing	Pezzoli	Parametric virtual microphone synthesis with spatial coherence constraints for sound field reconstruction	Silvio Attolini
Audio signal processing	Pezzoli	VR-PTOLEMAIC: a framework for the subjective assessment of spatial audio algorithms in virtual environments	Francesca del Gaudio
Musical acoustics	Pezzoli, Malvermi	A PINN-based approach for displacement reconstruction in thin orthotropic plates	Riccardo Sebastiani Croce
Audio signal processing	Pezzoli, Ostan, Miotello	Spatial audio in networked music performance applications	Guido Elli
Generative AI	Giampiccolo, Antonacci	MIDI-Mistral: Controllable Tranformer-based MIDI Generation for Bar and Track Infilling	Davide Rizzotti	https://www.politesi.polimi.it/handle/10589/235511
Generative AI for audio	Comanducci, Ronchini	Adding temporal information and event order modeling to generative models for audio/music	Marco Furio Colombo
Generative AI for audio/music	Ronchini, Comanducci	Generative Controllable Neural Audio Synthesis	Simone Marcucci
DCASE	Ronchini, Comanducci, Cobos	Sound Event Detection and Localization using Mel-FSGCC	Federico Angelo Luigi Ferreri
Generative AI for audio/music	Ronchini, Comanducci	Timbre Transfer	Guglielmo Fraticcioli
Audio forensics	Negroni, Salvi, Bestagini	Anomaly Detection and Localization for Speech Deepfakes via Feature Pyramid Matching	Emma Coletta
Audio forensics	Salvi, Bestagini	Enhanced text-to-speech synthesis for adversarial attacks	Jiayan Cui	https://www.politesi.polimi.it/handle/10589/223309
Audio forensics	Salvi, Leonzio, Bestagini	DeepMetric: enhancing synthetic speech detection through support tracks generation	Alessandro Orsatti	https://www.politesi.polimi.it/handle/10589/218339
Audio forensics	Salvi, Bestagini	Voice-spoofing detection via low-level acoustic features and anti-fraud ML methods	Stefano Antonio Amico	https://www.politesi.polimi.it/handle/10589/230281
Audio forensics	Salvi, Leonzio, Bestagini	Recording device model identification: an experimental analysis of forensic and anti-forensic techniques	Claudio Eutizi	https://www.politesi.polimi.it/handle/10589/219728
Multimedia forensics	Salvi, Mandelli	Exploiting visual and audio features for multimodal deepfake detection	Alessandra Moro	https://www.politesi.polimi.it/handle/10589/223375
Audio forensics	Leonzio, Negroni, Salvi, Bestagini	Exploring signal purification against adversarial attacks for speech deepfake detection	Alfredo Brusca	https://www.politesi.polimi.it/handle/10589/230974
Multimedia forensics	Salvi, Bestagini	Video deepfake detection through head pose estimation	Federica Zezza	https://www.politesi.polimi.it/handle/10589/226744
Audio forensics	Leonzio, Negroni, Salvi, Bestagini	Adversarial attacks against speech deepfake detectors	Wendy Edda Wang	https://www.politesi.polimi.it/handle/10589/230960
Audio forensics	Leonzio, Salvi, Bestagini	Audio Deepfake Splicing Detection and Localization based on Forensic Similarity	Viola Negroni
Audio signal processing	Massi, Giampiccolo, Bernardini	Explicit Vector Wave Digital Filter Modeling of Time-Varying Circuits with a Single Bipolar Junction Transistor	Shijie Yang	https://www.politesi.polimi.it/handle/10589/231500
Audio signal processing	Giampiccolo, Bernardini	An Automatic Audio VST Generator Based on Wave Digital Filters	Stefano Ravasi	https://www.politesi.polimi.it/handle/10589/230385
Audio signal processing	Giampiccolo, Bernardini	Modeling circuits with multiple N-port nonlinearities in the wave digital domain	Sebastian Gafencu	https://www.politesi.polimi.it/handle/10589/227734
Audio signal processing	Giampiccolo, Bernardini	Wave digital neural network-based models of MOSFETs for virtual analog applications	Marco Ferrè	https://www.politesi.polimi.it/handle/10589/223020
Audio signal processing	Massi, Giampiccolo, Bernardini	Data-Driven Parameter Estimation of a Piezoelectric MEMS Loudspeaker using Lumped-Element Models	Lelio Casale	https://www.politesi.polimi.it/handle/10589/231523
Audio signal processing	Giampiccolo, Massi, Bernardini	Vacuum Tubes Modeling by means of Neural Networks in the Wave Digital Domain	Genis Casanova
Musical acoustics	Pezzoli, Malvermi	PINN based vibroacoustic analysis	Federico Zese
3D audio	Pezzoli, Greco	Localization of sound sources using spherical harmonics	Silvia Messena
Musical acoustics	Pezzoli, Malvermi	Statistical charcterization of directivity	Gian Marco Ricci
Space-time audio	Pezzoli, Comanducci	Generative Models for HRTF prediction	Juan Camilo Albarracín Sánchez
Space-time audio	Pezzoli, Miotello	Spherical microphone array upsampling	Ferdinando Terminiello
3D audio	Pezzoli, Malvermi	Neural Network-based representation of sound source directivity	Edoardo Morena
Musical acoustics	Pezzoli	Nearfield Acoustic Holography solver based on Physics-Informed Neural Network	Xinmeng Luan
Space-time audio	Pezzoli, Miotello	Real-time microphone array rendering framework for binaural reproduction	Paolo Ostan
Music Informatics	Comanducci, Mezza	Impact of velocity on drum patterns perceived complexity	Gabriele Maucione
Audio signal processing	Giampiccolo, Bernardini	Wave Digital Models of Nonlinear Piezoelectric Loudspeakers	Armando Boemio	https://www.politesi.polimi.it/handle/10589/218000
Music Informatics	Comanducci, Ronchini, Zanoni	Personalized Music Generation using text-to-music models	Gabriele Perego
Space-time audio	Pezzoli	Analysis of the directivity of sound sources	Hou Hin Au-Yeung
Audio signal processing	Bernardini, Giampiccolo, Albertini	Application of antiderivative antialiasing to MOSFET elements in wave digital filters	Christian Parra	https://www.politesi.polimi.it/handle/10589/214898
Music Informatics	Zanoni, Comanducci	Procedural Music Generation For Video games	Francesco Zumerle	https://www.politesi.polimi.it/handle/10589/210809
Audio signal processing	Bernardini, Giampiccolo, Mezza	On the Use of Fundamental Frequency Estimation for Virtual Bass Enhancement	Fabio Spreafico	https://www.politesi.polimi.it/handle/10589/210018
Image forensics	Bestagini, Mandelli	Manipulation detection for scientific images	Giovanni Zanocco
Video forensics	Bestagini, Cannas	Deepfake video detection through multi-look analysis	Adriano Bonfantini
Video processing	Bestagini, Redondi	Automatic video analysis of badminton matches	Ivan Motasov
Space-time audio	Bernardini, Giampiccolo, Mezza	Designing of Scattering Delay Networks Via Automatic Differentiation	Francesco Boarino	https://www.politesi.polimi.it/handle/10589/211644
Audio signal processing	Bernardini, Giampiccolo	A Wave Digital Extended Fixed-Point Method for Virtual Analog Applications	Davide Marin Pasin	https://www.politesi.polimi.it/handle/10589/212614
Space-time audio	Antonacci, Pezzoli	DIRECTION OF ARRIVAL ESTIMATION USING CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH RELATIVE HARMONIC COEFFICIENTS AND TRIPLET LOSS IN NOISY AND REVERBERATING ENVIRONMENTS	Luca Cattaneo	https://www.politesi.polimi.it/handle/10589/208311
Musical Acoustics	Ripamonti, Malvermi, Gonzalez	Experimental Validation for data-driven Near-field Acoustic Holography	Alessio Lampis
Musical Acoustics	Antonacci, Malvermi	Improved sensors for low-cost Vibrometric Kit	Fabio Guarnieri
Audio signal processing	Bernardini, Giampiccolo	A Wave Digital Hierarchical Quasi-Newton Method for Virtual Analog Modeling	Luca Gobbato	https://www.politesi.polimi.it/handle/10589/198537
Musical Acoustics	Sarti, Paoletti, Adali, Malvermi	Acoustic Characterization of materials	Marco Donzelli
Music Informatics	Zanoni, Comanducci	Deep Learning-based Timbre Transfer	Silvio Pol	https://www.politesi.polimi.it/handle/10589/189682
Audio signal processing	Antonacci, Pezzoli, Borra	A perceptual evaluation of sound field reconstruction algorithms	Miriam Papagno	https://www.politesi.polimi.it/handle/10589/186341
Audio signal processing	Bernardini, Giampiccolo	Characterization of Small-Size Loudspeakers for Mobile Applications	Samuele Buonassisi	https://www.politesi.polimi.it/handle/10589/189746
Image forensics	Bestagini, Cannas	Enhanced Amplitude SAR Imagery Splicing Localization through Land Cover Mapping Techniques	Emanuele Intagliata
Geophysics	Bestagini, Lipari	Salt Segmentation of Geophysical Images through Explainable CNNs	Francesco Maffezzoli
Music informatics	Sarti, Borrelli	Connecting NN to bio-metric signals	Joep Rene Wulms
Audio forensics	Bestagini, Salvi, Borrelli	A metric learning approach for splicing localization based on synthetic speech detection	Francesco Castelli	https://www.politesi.polimi.it/handle/10589/184332
Audio forensics	Bestagini, Salvi, Borrelli	Combining automatic speaker verification and prosody analysis for synthetic speech detection	Luigi Attorresi	https://www.politesi.polimi.it/handle/10589/187094
Music informatics	Zanoni, Borrelli	Social interaction based music recommendation system	Carlo Pulvirenti
Music informatics	Bestagini, Cuccovillo	Speech fingerprinting and matching for content retrieval	Laura Colzani	https://www.politesi.polimi.it/handle/10589/187212
Musical Acoustics	Antonacci, Olivieri	Towards white-box data-driven methods for Near-field Acoustic Holography	Hagar Kafri
Video forensics	Bestagini	A CNN-based detector for video frame-rate interpolation	Simone Mariani	https://www.politesi.polimi.it/handle/10589/186433
Image/video processing	Bestagini	Audio-video techniques for the analysis of players behaviour in Badminton matches	Samuele Bosi	https://www.politesi.polimi.it/handle/10589/186571
Video forensics	Bestagini, Mandelli	Forensic detection of deepfakes generated through video-to-video translation	Carmelo Fascella	https://www.politesi.polimi.it/handle/10589/182988
Audio signal processing	Bernardini, Mezza, Giampiccolo	Wave Digital Filter Modeling of Audio Circuits with Hysteresis Nonlinearities using Neural Networks	Oliviero Massi	https://www.politesi.polimi.it/handle/10589/186739
Music informatics	Antonacci, Pezzoli, Comanducci	Deep Prior Audio Inpainting	Federico Miotello
Audio signal processing	Bestagini, Buccoli	Low-latency speaker recognition	Francesco Salani
Video forensics	Bestagini, Bonettini	A Data Driven Approach to Deepfake Detection via Feature Analysis Based on Limited Data	Bingyang Hu
Space-time audio	Antonacci, Borrelli, Borra	Beamforming and Speaker Identification through Deep Neural Networks	Matteo Scerbo	https://www.politesi.polimi.it/handle/10589/176160
Music informatics	Sarti, Borrelli	Harmonic complexity estimation of jazz music	Giovanni Agosti
Audio forensics	Antonacci, Borrelli	A model selection method for room shape classification based on mono speech signals	Gabriele Antonacci	https://www.politesi.polimi.it/handle/10589/179887
Audio forensics	Bestagini	Audio splicing detection and localization based on recording device cues	Daniele Ugo Leonzio	https://www.politesi.polimi.it/handle/10589/179424
Audio forensics	Bestagini	Speaker-Independent Microphone Identification via Blind Channel Estimation in Noisy Condition	Antonio Giganti	https://www.politesi.polimi.it/handle/10589/179420
Audio forensics	Bestagini, Borrelli	Synthetic Speech Detection through Convolutional Neural Networks in Noisy Environments	Eleonora Landini	https://www.politesi.polimi.it/handle/10589/179458
Audio forensics	Bestagini, Borrelli, Salvi	Synthetic speech detection based on sentiment analysis	Emanuele Conti	https://www.politesi.polimi.it/handle/10589/177968
Multimedia forensics	Bestagini, Salvi, Borrelli	Audio-video deepfake detection through emotion recognition	Jacopo Gino	https://www.politesi.polimi.it/handle/10589/179037
Audio signal processing	Sarti, Giampiccolo, Bernardini	Parallel Wave Digital Implementations of Nonlinear Audio Circuits	Natoli Antonino	https://www.politesi.polimi.it/handle/10589/178037
Musical Acoustics	Antonacci, Malvermi	Data driven methods for frequency response functions interpolation	Matteo Acerbi	https://www.politesi.polimi.it/handle/10589/170179
Audio forensics	Bestagini, Mandelli	Time-Scaling Detection in Audio Recordings	Michele Pilia	https://www.politesi.polimi.it/handle/10589/173711
Audio forensics	Bestagini, Borrelli	Speech Intelligibility Parameters Estimation Through Convolutional Neural Networks	Mattia Papa	https://www.politesi.polimi.it/handle/10589/173756
Audio forensics	Antonacci	Closed and open set classification of real and AI synthesised speech	Michelangelo Medori	https://www.politesi.polimi.it/handle/10589/170094
Audio forensics	Antonacci	An approach to room volume estimation from single-channel speech signals based on neural networks	Castelnuovo Carlo	https://www.politesi.polimi.it/handle/10589/164749
Audio forensics	Bestagini	Audio Splicing Detection and Localization Based on Acoustic Cues	Capoferri Davide	https://www.politesi.polimi.it/handle/10589/164950
Audio processing	Sarti, Comanducci	Audio frame reconstruction from incomplete observations using Deep Learning techniques	Schils Minh Cédric	https://matheo.uliege.be/handle/2268.2/10138
Audio processing	Sarti, Bernardini	Wave Digital Modeling and Simulation of Nonlinear Electromagnetic Circuits	Giampiccolo Riccardo	https://www.politesi.polimi.it/handle/10589/153994
Audio processing	Sarti, Bernardini	Antiderivative Antialiasing in Nonlinear Wave Digital Filters	Albertini Davide	https://www.politesi.polimi.it/handle/10589/152934
Audio processing	Sarti, Bernardini	Wave Digital Implementation of Nonlinear Audio Circuits based on the Scattering Iterative Method	Proverbio Alessandro	https://www.politesi.polimi.it/handle/10589/152323
Audio processing	Antonacci	A system for super resolution vibrometric analysis through convolutional neural networks	Campagnoli Chiara	https://www.politesi.polimi.it/handle/10589/152613
Audio processing	Antonacci	Development of a low-cost platform for acoustic and vibrometric analysis on lutherie products, with a special focus on the estimation of the elastic parameters of the tonewood	Villa Luca	https://www.politesi.polimi.it/handle/10589/150531
Audio processing	Bestagini	DNN based post-filtering for quality improvement of AMR-WB decoded speech	Gupta Kishan	https://www.politesi.polimi.it/handle/10589/151000
Audio processing	Sarti	Studio sull'implementazione degli algoritmi per il musical instruments ed il sound reinforcement basato su un processore multicore	Aretino Michele	https://www.politesi.polimi.it/handle/10589/139079
Audio processing	Sarti, Bernardini	Modeling nonlinear 3-terminal devices in the wave digital domain	Vergani Alessio Emanuele	https://www.politesi.polimi.it/handle/10589/133184
Forensics	Bestagini	Convolutional and recurrent neural networks for video tampering detection and localization	Cannas Edoardo Daniele	https://www.politesi.polimi.it/handle/10589/149900
Forensics	Bestagini	A study on Bagging-Voronoi algorithm for tampering localization	Cereghetti Corinne Elena	https://www.politesi.polimi.it/handle/10589/141725
Forensics	Bestagini	JPEG-based forensics through convolutional neural networks	Bonettini Nicolò	https://www.politesi.polimi.it/handle/10589/133727
Forensics	Bestagini	Analysis of different footprints for JPEG compression detection	Chen Ke	https://www.politesi.polimi.it/handle/10589/132721
Geophysics	Bestagini	Landmine detection on GPR data employing convolutional autoencoder	Testa Giuseppe	https://www.politesi.polimi.it/handle/10589/142106
Image and video	Marcon, Paracchini	A novel tomographic approach for an early detection of multiple myeloma progression	Andrea Leggio
Image and video	Marcon, Paracchini	Limited angle computed tomography reconstruction with deep learning enhancement	Erbol Kasenov, Girolamo Gerace
Image and video	Marcon	Upper body postural assessment for common dentistry visual aids	Trotta Emilio	https://www.politesi.polimi.it/handle/10589/145563
Image and video	Tubaro	Real-time tracking of electrode during deep-brain surgery	Dilauro Valerio	https://www.politesi.polimi.it/handle/10589/144685
Image and video	Marcon	Analytical estimation of the error on the radius of industrial pipes	Lazzarin Sara	https://www.politesi.polimi.it/handle/10589/144394
Image and video	Marcon	3D reconstruction from stereo video acquired from odontoiatric microscope	Spatafora Leonardo	https://www.politesi.polimi.it/handle/10589/143780
Image and video	Marcon	Denoising and classification of hyperspectral X-ray images for food quality assessment	Re Marco	https://www.politesi.polimi.it/handle/10589/142922
Image and video	Marcon	A computer vision approach for assessment of dental bracket removal	Behnami Arezoo	https://www.politesi.polimi.it/handle/10589/142362
Image and video	Marcon	Sistema per il rilevamento automatico di contaminanti alimentari basato su immagini iperspettrali	Ramoni Francesco	https://www.politesi.polimi.it/handle/10589/135891
Image and video	Marcon	Postural assessment in dentistry by computer vision	Pignatelli Nicola	https://www.politesi.polimi.it/handle/10589/135030
Multimedia forensics	Bestagini, Mandelli	A Multi-Modal Approach to Forensic Audio-Visual Device Identification	Davide Dal Cortivo	https://www.politesi.polimi.it/handle/10589/175593
Music informatics	Sarti, Bernardini, Borrelli, Mezza	Estimating Harmonic Complexity of Chord Sequences using Transformer Networks	Cecilia Morato
Music informatics	Zanoni, Comanducci	Modeling Harmonic Complexity in Automatic Music Generation using Conditional Variational Autoencoders	Davide Gioiosa
Music informatics	Sarti, Borrelli, Comanducci	Cellular music : a novel music-generation platform based on an evolutionary paradigm	Matteo Manzolini	https://www.politesi.polimi.it/handle/10589/167291
Music informatics	Sarti, Borrelli	Music emotion detection. A framework based on electrodermal activities.	Gioele Pozzi	https://www.politesi.polimi.it/handle/10589/152931
Music informatics	Sarti, Comanducci	Techniques for mitigating the impact of latency in Networked Music Performance (NMP) through adaptive metronomes	Battello Riccardo	https://www.politesi.polimi.it/handle/10589/152923
Music information retrieval	Sarti	Musical instrument recognition: a transfer learning approach	Molgora Andrea	https://www.politesi.polimi.it/handle/10589/147383
Music information retrieval	Sarti	Unsupervised domain adaptation for deep learning based acoustic scene classification	Mezza Alessandro Ilic	https://www.politesi.polimi.it/handle/10589/145573
Music information retrieval	Antonacci	An investigation of piano transcription algorithm for jazz music	Marzorati Giorgio	https://www.politesi.polimi.it/handle/10589/144745
Music information retrieval	Sarti	Automatic playlist generation using recurrent neural network	Irene Rosilde Tatiana	https://www.politesi.polimi.it/handle/10589/142101
Music information retrieval	Sarti	A personalized metric for music similarity using Siamese deep neural networks	Sala Federico	https://www.politesi.polimi.it/handle/10589/139078
Music information retrieval	Sarti	Learning a personalized similarity metric for musical content	Carloni Luca	https://www.politesi.polimi.it/handle/10589/139076
Music information retrieval	Sarti	Beat tracking using recurrent neural network : a transfer learning approach	Fiocchi Davide	https://www.politesi.polimi.it/handle/10589/139073
Music information retrieval	Sarti	Python-based framework for managing a base of complex data for music information retrieval	Avocone Giuseppe	https://www.politesi.polimi.it/handle/10589/138449
Music information retrieval	Sarti	Individual semantic modeling for music information retrieval	Ansidei Pietro	https://www.politesi.polimi.it/handle/10589/137160
Music information retrieval	Sarti	Chord sequences : evaluating the effect of complexity on preference	Foscarin Francesco	https://www.politesi.polimi.it/handle/10589/136448
Music information retrieval	Sarti	Audio features compensation based on coding bitrate	Tavella Maria Stella	https://www.politesi.polimi.it/handle/10589/134607
Musical Acoustics	Antonacci	Modal analysis and optimization of the top plate of string instruments through a parametric control of their shape	Salvi Davide	https://www.politesi.polimi.it/handle/10589/166557
Musical Acoustics	Antonacci, Pezzoli, Malvermi	An approach for Near-field Acoustic Holography based on Convolutional Autoencoders	Olivieri Marco	https://www.politesi.polimi.it/handle/10589/167039
Space-time audio	Antonacci, Borra	A parametric approach to virtual miking with distributed microphone arrays	Marco Langè
Space-time audio	Antonacci, Pezzoli, Borra, Bernardini	A Deep Prior Approach to Room Impulse Response Interpolation	Davide Perini	https://www.politesi.polimi.it/handle/10589/175583
Space-time audio	Antonacci, Comanducci	Interpreting Deep Neural Networks Models for Acoustic Source Localization using Layer-wise Relevance Propagation	Alessandro Montali	https://www.politesi.polimi.it/handle/10589/169239
Space-time audio	Antonacci, Borra, Bernardini	Analysis of Uniform Linear Arrays of Differential Microphones	Bertuletti Ivan	https://www.politesi.polimi.it/handle/10589/154604
Space-time audio	Sarti	A geometrical method of 3D sound spatialization for virtual reality applications	Iamele Jacopo	https://www.politesi.polimi.it/handle/10589/143770
Space-time audio	Antonacci	Convolutional neural networks applied to space-time audio processing applications	Comanducci Luca	https://www.politesi.polimi.it/handle/10589/139077
Space-time audio	Canclini	Denoising in the spherical harmonic domain of sound scenes acquired by compact arrays	Borrelli Clara	https://www.politesi.polimi.it/handle/10589/139075
Space-time audio	Antonacci	Simulazione di sistemi complessi. Case study : l'altoparlante a tromba	Moscara Francesco	https://www.politesi.polimi.it/handle/10589/139074
Space-time audio	Sarti, Bernardini	Steerable differential microphone arrays	Lovatello Jacopo	https://www.politesi.polimi.it/handle/10589/139072
Space-time audio	Antonacci	A plenacoustic approach to sound scene manipulation	Picetti Francesco	https://www.politesi.polimi.it/handle/10589/138430
Space-time audio	Antonacci	Reconstruction of the soundfield in arbitrary locations using the distributed ray space transform	Pezzoli Mirco	https://www.politesi.polimi.it/handle/10589/136447
Space-time audio	Sarti	A method for HRTF personalization : weighted sparse representation synthesis of HRTFs	Zhu Mo	https://www.politesi.polimi.it/handle/10589/135952
Space-time audio	Antonacci	Robust parametric spatial audio processing using beamforming techniques	Milano Guendalina	https://www.politesi.polimi.it/handle/10589/134609
Space-time audio	Antonacci	Estimation of singing voice quality through microphone in air and contact microphone	Landini Roberta	https://www.politesi.polimi.it/handle/10589/134604
Musical Acoustics	Antonacci, Malvermi	Mechanical parameter estimation for vibrometric analysis and development of a low-cost platform for violin making	Federico Simeon	https://www.politesi.polimi.it/handle/10589/170995
Space-time audio	Antonacci, Comanducci	3D audio with irregular microphone setups using deep learning	Davide Mori	https://www.politesi.polimi.it/handle/10589/175608
Space-time audio	Antonacci, Comanducci	Personalized Sound Zone Generation using Deep Learning	Roberto Alessandri	https://www.politesi.polimi.it/handle/10589/203852