asteroid.data.whamr_dataset module

class asteroid.data.whamr_dataset.WhamRDataset(json_dir, task, sample_rate=8000, segment=4.0, nondefault_nsrc=None)

Bases: torch.utils.data.Dataset

Dataset class for WHAMR source separation and speech enhancement tasks.

Parameters:
  • json_dir (str) – The path to the directory containing the json files.
  • task (str) –

    One of 'sep_clean', 'sep_noisy', 'sep_reverb' or 'sep_reverb_noisy'.

    • 'sep_clean' for two-speaker clean (anechoic) source separation.
    • 'sep_noisy' for two-speaker noisy (anechoic) source separation.
    • 'sep_reverb' for two-speaker clean reverberant source separation.
    • 'sep_reverb_noisy' for two-speaker noisy reverberant source separation.
  • sample_rate (int, optional) – The sampling rate of the wav files.
  • segment (float, optional) – Length of the segments used for training, in seconds. If None, use full utterances (e.g. for test).
  • nondefault_nsrc (int, optional) – Number of sources in the training targets. If None, defaults to one for enhancement tasks and two for separation tasks.
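The parameters above can be sketched as a small helper that validates the task name before building the constructor arguments. This is a minimal sketch, not part of Asteroid: `whamr_kwargs`, `make_whamr_dataset` and the directory paths are hypothetical names chosen for illustration, and the import of `WhamRDataset` is deferred so the validation can run without the library installed.

```python
# Task names as listed in the parameter description above.
VALID_TASKS = ("sep_clean", "sep_noisy", "sep_reverb", "sep_reverb_noisy")


def whamr_kwargs(json_dir, task, sample_rate=8000, segment=4.0, nondefault_nsrc=None):
    """Hypothetical helper: check `task` and collect the constructor arguments."""
    if task not in VALID_TASKS:
        raise ValueError(f"task must be one of {VALID_TASKS}, got {task!r}")
    return {
        "json_dir": json_dir,
        "task": task,
        "sample_rate": sample_rate,
        "segment": segment,
        "nondefault_nsrc": nondefault_nsrc,
    }


def make_whamr_dataset(**kwargs):
    """Hypothetical helper: build the dataset (requires asteroid to be installed)."""
    from asteroid.data.whamr_dataset import WhamRDataset

    return WhamRDataset(**kwargs)


# The paths below are placeholders; point them at your own json directories.
train_kwargs = whamr_kwargs("path/to/json/tr", "sep_reverb_noisy")
# segment=None requests full utterances, e.g. for testing.
test_kwargs = whamr_kwargs("path/to/json/tt", "sep_reverb_noisy", segment=None)
```

With Asteroid installed, `make_whamr_dataset(**train_kwargs)` would then construct the training set from 4-second segments at 8 kHz.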
References

“WHAMR!: Noisy and Reverberant Single-Channel Speech Separation”, Maciejewski et al. 2020

dataset_name = 'WHAMR'

__getitem__(idx)

Gets a mixture/sources pair.

Returns: mixture, vstack([source_arrays])
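To make the returned pair concrete, the sketch below computes the item shapes one would expect from the documented `sample_rate` and `segment` parameters. It assumes the mixture is a single-channel 1-D array and that the sources are stacked row-wise, as `vstack` suggests; `expected_item_shapes` is a hypothetical helper, not an Asteroid function.

```python
def expected_item_shapes(sample_rate=8000, segment=4.0, n_src=2):
    """Hypothetical helper: expected (mixture_shape, sources_shape) for one item.

    Assumes a 1-D single-channel mixture and sources stacked as (n_src, time).
    Returns None when segment is None, because full utterances vary in length.
    """
    if segment is None:
        return None
    seg_len = int(segment * sample_rate)  # segment length in samples
    return (seg_len,), (n_src, seg_len)


print(expected_item_shapes())  # ((32000,), (2, 32000))
```

With the default 4-second segments at 8 kHz, each item would cover 32000 samples under these assumptions.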

get_infos()

Get dataset infos (for publishing models).

Returns: dict, dataset infos with keys dataset, task and licences.
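As a small illustration of the documented return value, the sketch below checks that an infos dict carries the three keys named above. `missing_info_keys` is a hypothetical helper, and `example_infos` is a hand-written stand-in with placeholder values, not real output from `get_infos()`.

```python
# Keys documented for the dict returned by get_infos().
EXPECTED_KEYS = {"dataset", "task", "licences"}


def missing_info_keys(infos):
    """Hypothetical helper: return the documented keys absent from `infos`, sorted."""
    return sorted(EXPECTED_KEYS - set(infos))


# Stand-in for illustration only; a real dict would come from dataset.get_infos().
example_infos = {"dataset": "WHAMR", "task": "sep_clean", "licences": []}

print(missing_info_keys(example_infos))  # []
```

A check like this could run before publishing a model, so that incomplete dataset metadata is caught early.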