Shortcuts

asteroid.data.kinect_wsj module

asteroid.data.kinect_wsj.make_dataloaders(train_dir, valid_dir, n_src=2, sample_rate=16000, segment=4.0, batch_size=4, num_workers=None, **kwargs)[source]
class asteroid.data.kinect_wsj.KinectWsjMixDataset(json_dir, n_src=2, sample_rate=16000, segment=4.0)[source]

Bases: asteroid.data.wsj0_mix.Wsj0mixDataset

Dataset class for the KinectWSJ-mix source separation dataset.

Parameters:
  • json_dir (str) – The path to the directory containing the json files.
  • sample_rate (int, optional) – The sampling rate of the wav files.
  • segment (float, optional) – Length of the segments used for training, in seconds. If None, use full utterances (e.g. for test).
  • n_src (int, optional) – Number of sources in the training targets.
References
“Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition”, Sunit Sivasankaran et al. 2020.
dataset_name = 'Kinect-WSJ'[source]
__getitem__(idx)[source]

Gets a mixture/sources pair. :returns: mixture, stack([source_arrays]), noise

mixture is of dimension [samples, channels] sources are of dimension [n_src, samples, channels]
get_infos()[source]

Get dataset infos (for publishing models).

Returns:dict, dataset infos with keys dataset, task and licences.
Read the Docs v: v0.4.4
Versions
latest
stable
v0.4.4
v0.4.3
v0.4.2
v0.4.1
v0.4.0
v0.3.5_b
v0.3.4
v0.3.3
v0.3.2
v0.3.1
Downloads
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.