Skip to content

SXKA/conv-tasnet-visualization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

conv-tasnet-visualization

A implementation of Conv-TasNet visualization.

conv-tasnet-visualization use Asteroid model.

The images of visualization are produced with JorisCos/ConvTasNet_Libri2Mix_sepclean_8k.

Usage

python main.py \
  --model_path path/to/model \
  --mixture_path path/to/mixture_audio \
  --spk1_path path/to/spk1_audio \
  --spk2_path path/to/spk2_audio
  • --model_path: Conv-TasNet model path.
  • --mixture_path: mixture audio file path.
  • --spk1_path: speaker 1 audio file path.
  • --spk2_path: speaker 2 audio file path.

Visualization

The sample audio isn't provided because it is generated from WSJ0.

Waveform

mixture_waveform spk1_waveform spk2_waveform

Encoder and decoder basis functions

encoder_weights  encoder_weights_fft decoder_weights  decoder_weights_fft

Encoder representation

encoder_output

Source masks

spk1_mask spk2_mask

About

A implementation of Conv-TasNet visualization.

Topics

Resources

License

Stars

Watchers

Forks

Languages