A paper accepted to IEEE International Conference on Acoustics, Speech, and Signal Processing  (ICASSP), 2024