In your README I see that you are using the following classes:

> one, two, three, four, five, front, back, left, right, stop, none

But on [the dataset page](https://ai.googleblog.com/2017/08/launching-speech-commands-dataset.html) they are using these classes:

> yes, no, up, down, left, right, on, off, stop, go

Why this difference?