Chime4 #423
Conversation
change model
remove clipping before ASR
I'm undecided about this. But to make it more modular, we should attach the data prep to the dataloader (needs refactoring of the data part of Asteroid, which we want to do), and we should enable evaluating any model on any dataset (also needs refactoring). I'd vote in favour of merging it, because it diversifies the examples that we have (both for the users and as things to keep in mind when we refactor).
Yes, I'd say merge it, as it's not introducing any new way of doing things, so it doesn't add much work to our refactoring plate. Did you forget to rename the "chime3" variables and CLI flag or is that on purpose?
That's on purpose: CHiME 4 uses data from CHiME 3, and when you download the data you actually get a directory named CHiME3. But maybe it's confusing.
I guess it will be clear once people have the data from CHiME3 downloaded.
Mainly good to do, but it needs a final polish.
I feel the create_metadata.py file is not so readable, mainly because of the function flow. I'd have to think about what exactly I'd criticize about the design.
asteroid/data/chime4_dataset.py
Outdated
import os

class CHiME4(Dataset):
Let's call this CHiME4Dataset.
asteroid/data/chime4_dataset.py
Outdated
mixture = torch.from_numpy(mixture)
mock_source = torch.vstack([mixture])
if self.return_id:
    id1 = row.ID
    return mixture, mock_source, [id1]
return mixture, mock_source
Let's not return the mock_source.
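For illustration, the item could then boil down to something like this. This is only a rough sketch: the sf.read call, the mixture_path/ID columns and the return_id flag are assumptions based on the snippet above, not the actual implementation.

import soundfile as sf
import torch

def getitem_sketch(row, return_id=True):
    # Hypothetical stand-in for CHiME4Dataset.__getitem__ without the mock source.
    # `row.mixture_path` and `row.ID` are assumed metadata columns.
    mixture, _ = sf.read(row.mixture_path, dtype="float32")
    mixture = torch.from_numpy(mixture)
    if return_id:
        return mixture, [row.ID]
    return mixture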
egs/chime4/ConvTasNet/eval.py
Outdated
mix, sources, ids = test_set[idx]
mix, sources = tensors_to_device([mix, sources], device=model_device)
est_sources = model(mix.unsqueeze(0))
mix_np = mix.cpu().data.numpy()
sources_np = sources.cpu().data.numpy()
est_sources_np = est_sources.squeeze(0).cpu().data.numpy()
est_sources_np *= np.max(np.abs(mix_np)) / np.max(np.abs(est_sources_np))
Remove the code related to sources.
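For illustration, the loop body might then reduce to something like this. It is a sketch mirroring the snippet above, assuming test_set[idx] would then return only the mixture and its id.

# Sketch only: evaluation without the clean sources.
mix, ids = test_set[idx]
mix = tensors_to_device(mix, device=model_device)
est_sources = model(mix.unsqueeze(0))
mix_np = mix.cpu().data.numpy()
est_sources_np = est_sources.squeeze(0).cpu().data.numpy()
# Rescale the estimate to the mixture's peak level, as in the current code.
est_sources_np *= np.max(np.abs(mix_np)) / np.max(np.abs(est_sources_np))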
egs/chime4/ConvTasNet/eval.py
Outdated
utt_metrics.update(
    **wer_tracker(
        mix=mix_np,
        clean=sources_np,
Extend WerTracker to work without a clean reference, and remove clean.
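One possible way to do that, purely as a hypothetical sketch (the method and attribute names below are illustrative, not asteroid's actual WerTracker code), would be to make the clean argument optional:

def __call__(self, *, mix, estimate, clean=None, **kwargs):
    # Illustrative only: score the mixture and the estimate, and only score
    # the clean reference when it is available (i.e. on simulated data).
    out = {
        "wer_mix": self.compute_wer(mix),
        "wer_est": self.compute_wer(estimate),
    }
    if clean is not None:
        out["wer_clean"] = self.compute_wer(clean)
    return out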
egs/chime4/ConvTasNet/run.sh
Outdated
# Choice for the ASR model whether trained on clean or noisy data. One of clean or noisy
asr_type=noisy

test_dir=data/test
Does this need to be overwritten?
If it doesn't, then add it after utils/parse_options.sh
egs/chime4/ConvTasNet/run.sh
Outdated
if [[ $stage -le 0 ]]; then
  echo "Stage 0: Generating CHiME-4 dataset"
  $python_path local/create_metadata.py --chime3_dir $storage_dir/CHiME3/
fi

if [[ $stage -le 1 ]]; then
  echo "Stage 2 : Evaluation"
  echo "Results from the following experiment will be stored in $exp_dir/chime4/$asr_type"

  if [[ $compute_wer -eq 1 ]]; then

    # Install espnet if not instaled
    if ! python -c "import espnet" &> /dev/null; then
      echo 'This recipe requires espnet. Installing requirements.'
      $python_path -m pip install espnet_model_zoo
      $python_path -m pip install jiwer
      $python_path -m pip install tabulate
    fi
  fi

  $python_path eval.py \
    --exp_dir $exp_dir \
    --test_dir $test_dir \
    --use_gpu $eval_use_gpu \
    --compute_wer $compute_wer \
    --asr_type $asr_type
fi
Fix indents, but otherwise OK
egs/chime4/README.md
Outdated
As the channel to use for the training set wasn't defined by
the challenge's rules, we will set it randomly.

NOTE :
Suggested change: replace "NOTE :" with "**Note :**".
egs/chime4/README.md
Outdated
NOTE :
This dataset uses real noisy data. This means the clean speech from the noisy
utterances is not available. This makes it not suitable for the usual training
Suggested change: replace "This makes it not suitable for the usual training" with "This makes it unsuitable for the usual training".
Manuel asked for some review of create_metadata.py, so here we go :)
    f for f in glob(os.path.join(chime3_dir, "data", "annotations", "*real*.list"))
]
for c3_annot_file_path in c3_annot_files:
    # Read CHiME-3 annotation file
I think it could be helpful to move the entire for loop body into a separate function, so as to make it obvious that all loop iterations are entirely independent.
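Something along these lines, for instance (the function name below is only illustrative):

def process_annotation_file(c3_annot_file_path, chime3_dir):
    # Everything that currently lives in the loop body goes here, making it
    # explicit that each annotation file is processed independently.
    ...

for c3_annot_file_path in c3_annot_files:
    process_annotation_file(c3_annot_file_path, chime3_dir)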
    c3_annot_file_path.replace(".json", "_1ch_track.list"), header=None, names=["path"]
)
else:
    c4_annot_file = None
This case isn't handled in the create_dataframe function.
# Read CHiME-3 annotation file
c3_annot_file = pd.read_json(c3_annot_file_path)
# subsets : "tr" "dt" "et" origin "real" or "simu"
subset, origin = os.path.split(c3_annot_file_path)[1].replace(".json", "").split("_")
I think the entire code could benefit from using the pathlib API. E.g. here, c3_annot_file_path.with_suffix("").name.split("_").
Also, I think it could be helpful to add a comment with a file name example everywhere filenames are parsed or constructed, for example:
# Extract subset and origin from /foo/bar/<subset>_<origin>.json
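Putting both suggestions together, a rough illustration (the example path below is hypothetical):

from pathlib import Path

# Hypothetical annotation path of the form /foo/bar/<subset>_<origin>.json
c3_annot_file_path = Path("/foo/bar/tr_real.json")

# pathlib equivalent of os.path.split(...)[1].replace(".json", "").split("_")
subset, origin = c3_annot_file_path.with_suffix("").name.split("_")
print(subset, origin)  # -> tr real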
)
else:
    c4_annot_file = None
df, df_2 = create_dataframe(chime3_dir, c3_annot_file, c4_annot_file, subset, origin)
Could there be better names than df and df_2?
def create_dataframe(chime3_dir, c3_annot_file, c4_annot_file, subset, origin):
    # Empty list for DataFrame creation
    row_list = []
Again, is there a more helpful name than "row" and "row 2" here?
env = row.environment
# if we are not dealing with tr subset
if "tr" not in subset:
    mixture_path = c4_annot_file[c4_annot_file["path"].str.contains(ID + "_" + env)].values[
Here, and also in the else case below, it could be helpful to add a comment with an example of what we're looking for in the annotation data frame, like what I suggested with the path splitting above.
reformat create_metadata.py fix typo indent
fix eval
About this PR
This PR introduces a recipe for evaluation purposes only, on real noisy data.
It uses the data from the CHiME 4 challenge (an adaptation of the CHiME 3 challenge).
Design
I am unsure about the design of the dataset and the recipe: