Journal of Information Technology & Software Engineering

Journal of Information Technology & Software Engineering
Open Access

ISSN: 2165- 7866

+44 1300 500008

Shanta Maharjan

Department of Electronics and Computer Engineering, IOE Thapathali Campus, Kathmandu, Nepal

Publications
  • Review Article   
    Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model
    Author(s): Sankalpa Rijal*, Rajan Neupane, Saroj Prasad Mainali, Shishir Kumar Regmi and Shanta Maharjan

    Cocktail party problem is the scenario where it is difficult to separate or distinguish individual speaker form a mixed speech from several speakers. There have been several researches going on in this field but the size and complexity of model is being traded off with the accuracy and robustness of speech separation. “Monaural multi-speaker speech separation” presents a speech-separation model based on the transformer architecture and its efficient forms. The model has been trained with the LibriMix dataset containing diverse speakers’ utterances. The model separates 2 distinct speaker sources from a mixed audio input. The developed model approaches the reduction in computational complexity of the speech separation model, with minimum tradeoff with the performance of prevalent speech separation model and it has shown significant movement towards that goal. This proj.. View More»

    Abstract PDF

Top