March 2012
The authors train a 4-layer MLP. The last layer outputs phoneme probabilities (the phoneme list is taken from the IPA). The first 3 layers are shared across languages and the last layer is language-specific.
Dataset: 3 languages are used (English, German, and Spanish), from the Callhome corpora.
Their method yields better features rather than training the acoustic model itself.
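
The shared-layer setup described in these notes can be sketched roughly as below. This is only an illustration of the idea, not the authors' implementation: the class name SharedMLP, the sigmoid hidden units, the input size, the hidden size, and the per-language phoneme counts are all assumptions made up for the example, and the code covers the forward pass only (no training). Treating the output of the last shared layer as the "features" is also an assumption.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer):
    """Affine transform: one output value per row of the weight matrix."""
    weights, biases = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]


def sigmoid(v):
    return [1.0 / (1.0 + math.exp(-z)) for z in v]


def softmax(v):
    m = max(v)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in v]
    total = sum(exps)
    return [e / total for e in exps]


class SharedMLP:
    """Hypothetical 4-layer MLP: 3 shared layers plus one output layer per language."""

    def __init__(self, n_input, n_hidden, phonemes_per_language, seed=0):
        rng = random.Random(seed)
        sizes = [n_input, n_hidden, n_hidden, n_hidden]
        # Three layers whose weights are shared by every language.
        self.shared = [init_layer(sizes[i], sizes[i + 1], rng) for i in range(3)]
        # One language-specific output layer per language.
        self.heads = {
            lang: init_layer(n_hidden, n_out, rng)
            for lang, n_out in phonemes_per_language.items()
        }

    def features(self, x):
        """Output of the last shared layer (assumed here to be the features)."""
        for layer in self.shared:
            x = sigmoid(dense(x, layer))
        return x

    def phoneme_probs(self, x, language):
        """Phoneme probabilities from the output layer of the given language."""
        return softmax(dense(self.features(x), self.heads[language]))


# Made-up sizes, for illustration only.
model = SharedMLP(
    n_input=39,
    n_hidden=16,
    phonemes_per_language={"english": 40, "german": 42, "spanish": 30},
)
frame = [0.1] * 39  # one placeholder acoustic frame
probs = model.phoneme_probs(frame, "german")
```

The same input frame goes through identical shared layers whichever language is selected; only the final layer, and therefore the size of the phoneme distribution, changes.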