what’s up with tensorflow.js MNIST example nextbatch implementation?

Question

While taking inspiration from the tensorflow.js Handwritten digit recognition with CNNs tutorial, I stumbled upon the following implementation of the nextBatch function in mnist_data.js: I understood the point of this function was selecting the images and the corresponding label. The problem with the provided…

Accepted Answer

The issue is related to the shape of the label.const labels = tf.tensor2d(batchLabelsArray, [batchSize, 1]);The labels are created with the most right axis having the shape 1. It should rather be equal to the number of classes there are (ie: 0, 1 &#8230;, 9) which should therefore be 10.The error is straightforward indicating that the shape should be [, 10].create tensor with the shape [batchSize, 10]Obviously if the tensor is created with the shape [batchSize, 10] whereas batchLabelsArray has the length batchSize, it will throw a shape error. It should rather have the length batchSize * NUMBER_OF_CLASSES.The codelab usesconst batchLabelsArray = new Uint8Array(batchSize * NUM_CLASSES);An then to set the class of a certain batchSize it uses the following:for (let i = 0; i < batchSize; i++) {      const idx = index();      const image =          data[0].slice(idx * IMAGE_SIZE, idx * IMAGE_SIZE + IMAGE_SIZE);      batchImagesArray.set(image, i * IMAGE_SIZE);      const label =          data[1].slice(idx * NUM_CLASSES, idx * NUM_CLASSES + NUM_CLASSES);      batchLabelsArray.set(label, i * NUM_CLASSES);    }The other option is to use tf.oneHot:const labels = tf.oneHot(batchLabelsArray, 10) // batchLabelsArray is an array of batchSize length

Advertisement

Answer