Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
__init__.py		__init__.py
config.yaml		config.yaml
test.py		test.py
train.py		train.py

README.md

E009 - Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

This repository contains the PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing", the Sheffield entry E009 for the first Clarity enhancement challenge (CEC1). The system consists of a Conv-TasNet based denoising module, and a finite-inpulse-response (FIR) filter based amplification module. A differentiable approximation to the Cambridge MSBG model released in the CEC1 is used in the loss function.

Train

To build the overall system, the multi-channel Conv-TasNet based denoising module is trained in the first stage, and the FIR based amplification module is trained in the second stage. The FIR amplification module is dependent on the listener ID. To run the script, specify path.cec1_root as the CEC1 data path, and path.exp_folder as the experiment directory.

References

[1] Luo Y, Mesgarani N. Conv-tasnet: Surpassing ideal time–frequency magnitude masking for speech separation[J]. IEEE/ACM transactions on audio, speech, and language processing, 2019, 27(8): 1256-1266.
[2] Andersen A H, de Haan J M, Tan Z H, et al. Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions[J]. Speech Communication, 2018, 102: 1-13.
[3] Taal, C. H., Hendriks, R. C., Heusdens, R., & Jensen, J. An algorithm for intelligibility prediction of time–frequency weighted noisy speech. IEEE Transactions on Audio, Speech, and Language Processing, 19(7), 2125-2136.
[4] Zhang, J., Zorilă, C., Doddipatla, R., & Barker, J. On end-to-end multi-channel time domain speech separation in reverberant environments. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6389-6393). IEEE.

Citation

If you use this work, please cite:

@article{tutwo,
  title={A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing},
  author={Tu, Zehai and Zhang, Jisi and Ma, Ning and Barker, Jon},
  year={2021},
  booktitle={The Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2021)},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

e009_sheffield

e009_sheffield

README.md

README.md

init.py

init.py

config.yaml

config.yaml

test.py

test.py

train.py

train.py

README.md

E009 - Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

Train

References

Citation

Files

e009_sheffield

Directory actions

More options

Directory actions

More options

Latest commit

History

e009_sheffield

Folders and files

parent directory

E009 - Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

Train

References

Citation