Coins classifier Neural Network: Coin Value Classifier

Classifying Coins by Value

In our first tutorial we used an object detection tool (YOLO) to find the coin (or coins) in an image and crop them. In our second tutorial, we created a classifier that tells whether a particular cropped image shows the "head" or the "tail" of a coin. Now it is time to classify images by their value. As I am using Russian coins, the values are 1, 2, 5 and 10 roubles.

A reminder: any coins can be classified. I chose Russian coins because Russia had a monetary reform in 1997, so there is a relatively small number of coin types, which makes it easier for me to create a training dataset.

Just to clarify: as we already have coin pairs and we know which side is the head and which is the tail, we don't, technically, need to further divide our dataset by value, year and so on. Why not write a final classifier that figures out the exact mint type of a coin right away?
The reason is: we do not have a training set for it yet. We have 20-40 thousand images of coins, and we are going to rescan the net at least once a year to get new ones. So we need to automate the job of the human who labels images for the final classifier. We are not (yet) writing a "production coins classifier"; instead, we are creating tools that help us with the data mining!

Same as before, we are going to use the Google Colab environment, taking advantage of the free GPU it grants us access to. We will store data on Google Drive, so the first thing we need is to allow Colab to access the Drive:


from google.colab import drive
drive.mount("/content/drive/", force_remount=True)

Just as we did in the second tutorial, we are going to use the transfer learning approach: we will load a pretrained EfficientNet and modify it to work on our task.


!pip install -q efficientnet 
import efficientnet.tfkeras as efn

Next, I usually have a large "include" section. Note that some of the imports below may not actually be used: feel free to delete them:

import numpy as np
from sklearn.utils import shuffle
import pandas as pd
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers
import sys
import random
import matplotlib.pyplot as plt
import os
from os import listdir
from os.path import isfile, join
import json
from tensorflow.keras import regularizers
from tensorflow.keras.optimizers import Adamax
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras.preprocessing.image import array_to_img, img_to_array
from tensorflow.keras import backend as K
from tensorflow.keras.applications.vgg16 import VGG16,preprocess_input
from tensorflow.keras.applications import InceptionResNetV2
from tensorflow.keras.applications import Xception, NASNetLarge
from mpl_toolkits.mplot3d import Axes3D
from sklearn.manifold import TSNE
from tensorflow.keras.layers import Input, Conv2D, MaxPooling2D, Dense
from tensorflow.keras.layers import  Activation, Dropout, Flatten
from tensorflow.keras.layers import Lambda, concatenate, BatchNormalization
from tensorflow.keras.layers import GlobalAveragePooling2D
from tensorflow.keras.callbacks import LambdaCallback
from tensorflow.keras.callbacks import ModelCheckpoint
from tensorflow.keras.models import Sequential
from sklearn.neighbors import NearestNeighbors
import seaborn as sns
import cv2
from tensorflow.keras.utils import Sequence

import re	

Let's see which version of TensorFlow is used. This step is important, as Google is known for suddenly changing (upgrading) versions:

import tensorflow as tf
print(tf.__version__)
tf.test.gpu_device_name()			

The output in my case was:
2.4.0
'/device:GPU:0'
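
If Google has bumped the version and something in this notebook stops working, one possible hedge (not something this tutorial depends on) is to pin TensorFlow explicitly; the exact version to pin is whatever your code was tested against:

!pip install -q tensorflow==2.4.0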

Here are the folders we are going to use in our project:

working_path = "/content/drive/My Drive/03_coin_value/"

best_weights_filepath = working_path + "models/03_coin_value_best.h5"
last_weights_filepath = working_path + "models/03_coin_value_last.h5"

As we save our last training state, we don't have to redo the training every time we start the notebook: we can simply reload it from disk. The following flag determines whether we train the network (True) or only load previously saved results (False):

bDoTraining = True

We are going to scale images down to 256x256, use batch size 8 during training, and so on: here are the constants we will need; the names are self-explanatory. We are also going to split our data into training images (used to tune the network's weights), validation images (used to measure performance on data the net has never seen) and the rest (testing data, used to test the result).

# Constants for training and image preprocessing
IMAGE_SIZE = 256
input_shape = (IMAGE_SIZE, IMAGE_SIZE, 3)

BATCH_SIZE = 8

alpha = 0.4

# Fractions of the dataset: 60% training, 20% validation, the rest is testing
TRAINING_IMAGES_PERCENT = 0.6
VALIDATION_IMAGES_PERCENT = 0.2

# Max. rotation (in degrees) used by image augmentation below
IMAGE_ROTATION_ANGLE = 180

These are the classes our classifier should distinguish. Note that, unlike in step 2, this is not a binary classifier, so we use one-hot encoding, provided by the Pandas library:

arrClasses = ["1R", "2R", "5R", "10R"]
pdClassLabels = pd.get_dummies(arrClasses)
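
To see what this encoding produces, we can print a couple of label vectors; the rows of the DataFrame follow the order of arrClasses, so each column is a one-hot vector:

print(pdClassLabels["1R"].to_list())   # [1, 0, 0, 0]
print(pdClassLabels["10R"].to_list())  # [0, 0, 0, 1]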

In the first two tutorials we had no choice but to physically move images around. We had downloaded images from the net, and there was a lot of junk among them. We did cropping, which could easily make ten images out of one if that one contained ten coins. We grouped images into pairs (head/tail) and removed images that had no pair. For our current task, however, we already have the images we need, and all we have to do is put labels on them, so to speak. As a result, instead of copying images from a "source" location to a "target" one (for example, to "1Rub", "2Rub" etc. folders), we will create a file with labels describing the existing images. This approach saves a lot of disk space.

# We are going to store labeled data for training in the following format:
# arrLabeledData = [ { 'id':'169860023_00_00_head.png', 'class':'1R' }, ... ]

def scanCoinsDir(strDirName):
  path = working_path + "images/" + strDirName
  arrImageNames = [f for f in os.listdir(path)]
  for file_name in arrImageNames:
    arrLabeledData.append({ 'id': file_name, 'class': strDirName })

# ---

np.random.seed(7)

if(bDoTraining):
  arrLabeledData = []

  strDirName = "1R"
  scanCoinsDir(strDirName)

  strDirName = "2R"
  scanCoinsDir(strDirName)

  strDirName = "5R"
  scanCoinsDir(strDirName)

  strDirName = "10R"
  scanCoinsDir(strDirName)

  np.random.shuffle(arrLabeledData)
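
The scanning above only keeps the labels in memory. Since the whole point was a labels file, here is a minimal sketch of saving them to disk and reloading them in a later session (the labels.json file name is my own choice, not something fixed elsewhere in the project):

labels_path = working_path + "labels.json"

# Save the labeled data after scanning
with open(labels_path, 'w') as outfile:
  json.dump(arrLabeledData, outfile)

# Later: reload instead of re-scanning the folders
with open(labels_path, 'r') as infile:
  arrLabeledData = json.load(infile)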

A function to load images:

def loadImage(path):
    img=cv2.imread(str(path))
    img = cv2.resize(img, (IMAGE_SIZE, IMAGE_SIZE))
    img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    img = img.astype(np.float32)/255.
    img = img.reshape(input_shape)
    return img

It is always a good idea to make sure everything works as intended, so let's test image loading:

	
if(bDoTraining):
  # Note: random.randint is inclusive on both ends
  nImageIdx = random.randint(0, len(arrLabeledData) - 1)
  print(arrLabeledData[nImageIdx])
  img = loadImage(join(working_path, "images/", 
	arrLabeledData[nImageIdx]['class'], arrLabeledData[nImageIdx]['id']))
  plt.imshow(img)
  plt.show()

When doing classification on images, we have a choice. Our coins are randomly rotated. On the one hand, this is good: we can even rotate them further, to make it harder for the network to overfit. On the other hand, we could align the images, arguing that it is easier to learn a classification task on images with the same orientation (size, color palette etc.). We are going to choose the first approach and do so-called image augmentation: size, rotation and so on are randomly altered, so that the network learns to generalize rather than just memorize the images of the training set (and therefore perform poorly on images it has never seen). In particular, we are going to try adding random noise to images:

def add_noise(img):
    VARIABILITY = 60
    deviation = VARIABILITY*random.random() / 255.
    noise = np.random.normal(0, deviation, img.shape)
    img += noise
    # np.clip returns the result; it does not work in place
    img = np.clip(img, 0., 1.)
    return img

Additionally, as we NEED diverse images, here is some more functionality to make images different:

# Shift the brightness (V channel) of the whole image.
# Note: getImage below only uses shiftChannelColors; this one is a variant
def shiftColors(img):
    hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)
    h, s, v = cv2.split(hsv)

    value = (0.5 - np.random.randint(255) / 255.)

    lim = 1. - value
    v[v > lim] = 1.
    v[v <= lim] += value

    final_hsv = cv2.merge((h, s, v))
    img = cv2.cvtColor(final_hsv, cv2.COLOR_HSV2BGR)

    return img
	
def shiftChannelColors(img):
    # Shift each color channel by its own random amount,
    # saturating pixels that would overflow. The masks compare
    # pixel values (not the shift itself) against the limit
    for c in range(3):
        value = 0.5 - np.random.randint(255) / 255.
        lim = 1. - value

        channel = img[..., c]
        channel[channel > lim] = 1.
        channel[channel <= lim] += value

    # A negative shift can push pixels below zero, so clip
    return np.clip(img, 0., 1.)

To perform the above-mentioned image augmentation, let's use the image data generator provided with TF:

if(bDoTraining):
  datagen = ImageDataGenerator(
    samplewise_center=True,
    rotation_range=IMAGE_ROTATION_ANGLE,
    width_shift_range=0.1,
    height_shift_range=0.1,
    zoom_range=[0.5, 2]
  )

The following function gets an image by index from the data we loaded earlier, using the image data generator we just created:

def getImage(nImageIdx, datagen):
  img = loadImage(join(working_path, "images/", 
	arrLabeledData[nImageIdx]['class'], arrLabeledData[nImageIdx]['id']))
  
  arrImg = img_to_array(img)
  arrImg = datagen.random_transform(arrImg) # augmentation
  arrImg = add_noise(arrImg)
  arrImg = shiftChannelColors(arrImg)
  
  return np.array(arrImg, dtype="float32")

Again, we need to make sure everything works, so let's see what this function returns:

		
if(bDoTraining):
  nImageIdx = 0  # or: np.random.randint(len(arrLabeledData))
  
  img = getImage(nImageIdx, datagen)

  plt.imshow(img)
  plt.show()

Most likely, we will not be able to make our code perfect the first time we run it. So we need to be able to delete networks saved on previous attempts: ones with non-optimal parameters and so on.

def deleteSavedNet(weights_filepath):
    if(os.path.isfile(weights_filepath)):
        os.remove(weights_filepath)
        print("deleteSavedNet():File removed")
    else:
        print("deleteSavedNet():No file to remove") 

Generally, we should either use a Keras-provided callback to plot charts as we train the network, or (better) use TensorBoard. However, the current task is rather simple, so we'll simply wait for the training to finish, and only then display the charts:

def plotHistory(history, strParam1, strParam2):
	plt.plot(history.history[strParam1], label=strParam1)
	plt.plot(history.history[strParam2], label=strParam2)
	plt.legend(loc="best")
	plt.show()
    
def plotFullHistory(history):
    arrHistory = []
    for i,his in enumerate(history.history):
        arrHistory.append(his)
    plotHistory(history, arrHistory[0], arrHistory[2])    
    plotHistory(history, arrHistory[1], arrHistory[3]) 

Now, a function that creates the model. It loads EfficientNet, removes its last layers (the classifier) and attaches our own classifier, the one we are going to train:

def createModel(nL2, dDrop, optimizer):
  inputs = keras.Input(shape=(IMAGE_SIZE, IMAGE_SIZE, 3))
 
  # Freeze the pretrained model itself (not the tensor it returns)
  model_b0 = efn.EfficientNetB0(weights='imagenet', include_top=False)
  model_b0.trainable = False

  model_features = model_b0(inputs, training=False)
  model_classifier = layers.Flatten(name="Flatten")(model_features)
  
  # No activation here: the LeakyReLU below is the activation
  model_classifier = layers.Dense(32, 
	kernel_regularizer=regularizers.l2(nL2), 
	name="Dense32")(model_classifier)
  model_classifier = layers.LeakyReLU(alpha=0.1, 
	name="LeakyReLU")(model_classifier)
  model_classifier = layers.Dropout(dDrop, name="Dropout")(model_classifier)

  outputs = layers.Dense(len(arrClasses), activation="softmax", 
	kernel_regularizer=regularizers.l2(nL2), name="DenseHead")(model_classifier)
  model = keras.Model(inputs=inputs, outputs=outputs, name="coin_value_model")
  model.compile(loss=keras.losses.CategoricalCrossentropy(), 
	optimizer=optimizer, metrics=["accuracy"])
 
  return model	
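
As with everything else, it doesn't hurt to check that the model assembles correctly before we commit to training; a quick sanity check (the parameter values below are arbitrary):

if(bDoTraining):
  model_test = createModel(0.4, 0.4, tf.keras.optimizers.Adam(0.0002))
  model_test.summary()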

TF 2 has some problems with old-style image data generators. Instead, it offers a new approach that uses Sequence-derived classes. On the one hand, this approach takes more effort than just providing generators to a training function. On the other, it handles parallel tasks (or so they tell us) and is, therefore, faster. Here is the Sequence-derived generator we are going to use in our code:

def getStepSizes():
    nNumOfSamples = len(arrLabeledData)
    nNumOfTrainSamples = int(nNumOfSamples * TRAINING_IMAGES_PERCENT)
    nNumOfValidSamples = int(nNumOfSamples * VALIDATION_IMAGES_PERCENT)

    step_train = nNumOfTrainSamples // BATCH_SIZE
    step_valid = nNumOfValidSamples // BATCH_SIZE
    
    # Make sure an epoch is never too short
    if(step_train < 100):
      step_train = 100

    if(step_valid < 100):
      step_valid = 100      

    return (step_train, step_valid)
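
A quick way to verify the resulting step sizes (remember the floor of 100 steps: on a small dataset, batches will resample the same images):

if(bDoTraining):
  step_train, step_valid = getStepSizes()
  print("Steps per epoch: train =", step_train, ", valid =", step_valid)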

The following class is used to produce batches of images (and labels) during training. The Sequence class used as a parent is the new Keras standard (if you don't want to use tf.data); it is highly parallelizable and convenient:

class MyImageDataGenerator(Sequence):    
  def __init__(self, bIsTrain):
    self.batch_size = BATCH_SIZE
    self.bIsTrain = bIsTrain

    nNumOfSamples = len(arrLabeledData)

    step_train, step_valid = getStepSizes()
    if(bIsTrain):
      self.nStartIdx = 0
      self.nEndIdx = int(nNumOfSamples * TRAINING_IMAGES_PERCENT) - 1
      self.STEP_SIZE = step_train
    else:
      self.nStartIdx = int(nNumOfSamples * TRAINING_IMAGES_PERCENT)
      self.nEndIdx = int(nNumOfSamples 
        * (TRAINING_IMAGES_PERCENT + VALIDATION_IMAGES_PERCENT)) - 1
      self.STEP_SIZE = step_valid

    print("STEP_SIZE: ", self.STEP_SIZE, " (bIsTrain: ", bIsTrain, ")")

  def __len__(self):
    return self.STEP_SIZE

  def __getitem__(self, idx):
    arrBatchImages = []
    arrBatchLabels = []

    for i in range(self.batch_size):
      nImageIdx = np.random.randint(self.nStartIdx, self.nEndIdx)
      sample = arrLabeledData[nImageIdx]

      img = getImage(nImageIdx, datagen)
      strLabel = sample['class']

      arrBatchImages.append(img)
      arrBatchLabels.append(pdClassLabels[strLabel].to_list())
  
    return np.array(arrBatchImages), np.array(arrBatchLabels)	

Now we create generators for training and validation:

	
if(bDoTraining):
  gen_train = MyImageDataGenerator(True)
  gen_valid = MyImageDataGenerator(False)

As with all the other image processing routines, let's make sure everything works:

def ShowImg(img, label):
  
  print(label)
  
  fig = plt.figure()
  fig.add_subplot(1, 1, 1)
  plt.imshow(img) #, cmap='gray')
  plt.show()
  plt.close()
	
if(bDoTraining):
  (images, labels) = gen_valid[0]

  for i, img in enumerate(images):
    ShowImg(img, labels[i])
    break

As we train, we need to perform certain actions at certain moments. For example, we want to save the current (last) network and the best one (with the highest validation accuracy) at the end of each epoch.

def getCallbacks(monitor, mode, model):
	checkpoint = ModelCheckpoint(best_weights_filepath, 
		monitor=monitor, save_best_only=True, save_weights_only=True, 
		mode=mode, verbose=1)

	save_model_at_epoch_end_callback = LambdaCallback(
		on_epoch_end=lambda epoch, 
		logs: model.save_weights(last_weights_filepath))  

	callbacks_list = [checkpoint, save_model_at_epoch_end_callback]  # , early]

	return callbacks_list

The following function loads the previously saved model, if it was saved, of course. Depending on the bBest flag we pass, it loads either the best or the last model. The best one is used after the training is complete, to actually USE the resulting network. The last one is used in case our training was interrupted, so we can resume from a checkpoint rather than starting from the very beginning:

def loadModel(model, bBest):
  if(bBest):
    path = best_weights_filepath
    strMessage = "load best model"
  else:
    path = last_weights_filepath
    strMessage = "load last model"

  if(os.path.isfile(path)):
    model.load_weights(path)
    print(strMessage, ": File loaded")
  else:
    print(strMessage, ": No file to load")

  return model

The following function does the training. Depending on the parameters we pass, it deletes (or keeps) the previously saved network, creates a model, and loads the previously saved weights (if we haven't deleted them earlier). After the training is complete, it plots the charts of training and validation errors.

def trainNetwork(EPOCHS, nL2, nDrop, optimizer, bCumulativeLearning = False):

  if(bCumulativeLearning == False):
    deleteSavedNet(best_weights_filepath)

  random.seed(7)
  
  model = createModel(nL2, nDrop, optimizer)
  print("Model created")
  
  callbacks_list = getCallbacks("val_accuracy", 'max', model)  
      
  if(bCumulativeLearning == True):
    loadModel(model, False)

  STEP_SIZE_TRAIN, STEP_SIZE_VALID = getStepSizes()

  print(STEP_SIZE_TRAIN, STEP_SIZE_VALID)
  print("Available metrics: ", model.metrics_names)

  history = model.fit(gen_train, 
    validation_data=gen_valid, verbose=1,
    epochs=EPOCHS, steps_per_epoch=STEP_SIZE_TRAIN, 
    validation_steps=STEP_SIZE_VALID, callbacks=callbacks_list)
    #workers=4, 
    #use_multiprocessing=True)

  print(nL2)
  plotFullHistory(history)
  
  return model, history
  

As you can see, it does some initializations, and then calls Keras's "fit" function.

The accuracy of our classification is calculated using a simple formula: we compare the ground truth with our prediction. We are also going to display the images that produced wrong predictions; this might help us figure out what the problem was:


def getAccuracy(test_preds, test_file_names, test_class_names):

  nTotalSuccess = 0

  for i, arrPredictedProbabilities in enumerate(test_preds):
    nPredictedClassIdx = arrPredictedProbabilities.argmax()
    gt_class = test_class_names[i]
    predicted_class = arrClasses[nPredictedClassIdx]
    if(predicted_class == gt_class):
      nTotalSuccess += 1
    else:
      print("GT: ", gt_class, "; Pred: ", predicted_class, 
		"; Probabilities: ", arrPredictedProbabilities)
      # Images live in per-class subfolders; file names already 
      # include the extension
      img = loadImage(join(working_path, "images/", 
		gt_class, test_file_names[i]))
      plt.imshow(img)
      plt.show()

  nSuccess = nTotalSuccess / len(test_preds)

  return nSuccess
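
Note that getAccuracy is not called anywhere else in this notebook; here is a sketch of how it could be applied to the test part of the split (everything past the training and validation fractions) once the model has been trained below. Loading all test images at once is fine for a few thousand images; for larger sets you would batch this:

if(bDoTraining):
  nTestStart = int(len(arrLabeledData) 
    * (TRAINING_IMAGES_PERCENT + VALIDATION_IMAGES_PERCENT))
  arrTestSamples = arrLabeledData[nTestStart:]

  arrTestImages = np.array([loadImage(join(working_path, "images/", 
    s['class'], s['id'])) for s in arrTestSamples])
  test_preds = model.predict(arrTestImages, batch_size=BATCH_SIZE)

  test_file_names = [s['id'] for s in arrTestSamples]
  test_class_names = [s['class'] for s in arrTestSamples]
  print("Test accuracy: ", 
    getAccuracy(test_preds, test_file_names, test_class_names))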

This function performs the actual training and then loads the best saved weights:

def train_and_test(EPOCHS, nL2, nDrop, optimizer, 
	learning_rate, bCumulativeLearning):
  model, history = trainNetwork(EPOCHS, nL2, nDrop, 
	optimizer, bCumulativeLearning)
  print("loading best model")
  model = loadModel(model, True)

  return model

As train_and_test is just a function, it will not call itself, so we have to do it:

opt = tf.keras.optimizers.Adam(0.0002)  # or: Adamax(lr=0.0001, beta_1=0.9, beta_2=0.999, epsilon=None, decay=0.0)
nL2 = 0.4
nDrop = 0.4

if(bDoTraining):
  EPOCHS = 50
  learning_rate=0.001

  np.random.seed(7)
  model = train_and_test(EPOCHS, nL2, nDrop, opt, 
	learning_rate, bCumulativeLearning=False)

  model = loadModel(model, True)
  model.save(best_weights_filepath)    # save a full model, not just the weights
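
Since we saved a full model (architecture plus weights), a later session can, in theory, restore it without calling createModel first. Note that the EfficientNet package uses a couple of custom layers, so deserialization relies on them being registered (as far as I can tell, importing efficientnet.tfkeras, as we did above, does this). A sketch, not something this tutorial relies on:

# In a fresh session, with the drive mounted and efficientnet.tfkeras imported:
# model = keras.models.load_model(best_weights_filepath)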

A helper function to figure out where the training and test sets begin and end:

def getClassMinMax(bIsTrain):
  nLen = len(arrLabeledData)
  if(bIsTrain):
    nMinIdx = 0
    nMaxIdx = nLen * TRAINING_IMAGES_PERCENT - 1
  else:
    nMinIdx = nLen * TRAINING_IMAGES_PERCENT
    nMaxIdx = nLen * (TRAINING_IMAGES_PERCENT + VALIDATION_IMAGES_PERCENT) - 1 
  
  return int(nMinIdx), int(nMaxIdx)	

An additional test, which can be very useful when something is not right and we want to "see" the problem. Here we display test images along with their predictions IF the prediction and the ground truth are different:

if(bDoTraining):
  nMinIdx, nMaxIdx = getClassMinMax(False)

  for i, nImageIdx in enumerate(range(nMinIdx, nMaxIdx)):
    print(i+1, "of", nMaxIdx - nMinIdx)
    img = loadImage(join(working_path, "images/", 
		arrLabeledData[nImageIdx]['class'], arrLabeledData[nImageIdx]['id']))

    arrImg = img_to_array(img)
    arrImg = np.array(arrImg, dtype="float32")  

    # ---

    test_preds = model.predict(arrImg.reshape(1, IMAGE_SIZE, IMAGE_SIZE, 3))

    nIdx = test_preds.argmax()
    if(arrLabeledData[nImageIdx]['class'] != arrClasses[nIdx]):
      print("GT: ", arrLabeledData[nImageIdx]['class'], "; Pred: ", arrClasses[nIdx])
      plt.imshow(img)
      plt.show()

Finally, our network is production-ready, and we can process ALL of our images. This is the same as the "test" section above, but this time we process images from the output folder of step 2.
Note that we don't have that much space on Colab, and our dataset is 26,000+ images. So instead of copying the images of stage 1, we are going to index them.

		
# Index images from the output folder of step 2

images_source_path = working_path + "../02_avers_or_revers/images_processed/"
result_path = working_path + "output.txt"

# Temporary: only index the first 100 images, for a quick test.
# To process the full dataset, use this line instead:
#arrSourceImageNames = [f for f in listdir(images_source_path)]
arrSourceImageNames = []
for i, f in enumerate(listdir(images_source_path)):
  if(i >= 100):
    break
  arrSourceImageNames.append(f)

model = createModel(nL2, nDrop, opt)
model = loadModel(model, True)

arrOutput = []

nTotal = len(arrSourceImageNames)

print(">>> Processing heads <<<")
for i, file_name in enumerate(arrSourceImageNames):

  if(i > 100):
    break
  if(i % 100 == 0):
    print(i, " of ", nTotal)    

  # 169860023_00_00_tail(.png)
  # Here 169860023 is number of image set 
  # (groups of photos of a set of coins from meshok.ru)
  # First 00 is number of image (from meshok.ru, so they have 
  # 169860023_00, 169860023_01...)
  # Second 00 is coin number within coin set (image with 3 coins: 
  # 169860023_00_00, 169860023_00_01, 169860023_00_02, 
  # 169860023_01_00, 169860023_01_01, 169860023_01_02) 
  word_list = file_name.split(".")  # ['169860023_00_00_tail', 'png']
  image_name = word_list[0]
  image_ext = word_list[1]

  # Russian coins have value on "head" side
  if("tail" in image_name):
    continue

  # ["169860023", "00", "00", "tail"]
  image_name_parts = image_name.split("_")

  image_path = join(images_source_path, file_name)
  img = loadImage(image_path)

  arrImg = img_to_array(img)
  arrImg = np.array(arrImg, dtype="float32")  

  # ---

  test_preds = model.predict(arrImg.reshape(1, IMAGE_SIZE, IMAGE_SIZE, 3))

  nIdx = test_preds.argmax()
  strClass = arrClasses[nIdx]
  
  # Find out if info for this coin (but for other image of it) is in arrOutput

  # 'coin_id': coin set / image number within set
  # 'id': num. of image version (like we have 3 photos where the 
  # same coin is coin number 7: xxxx_00_07, xxxx_01_07, xxxx_02_07). 
  # Note: this is part of a sub-array
  # 'side': head/tail

  coin_id = image_name_parts[0] + "_" + image_name_parts[2]
  coin_info = [info for info in arrOutput if info['coin_id'] == coin_id] 
  if coin_info: 
    coin_info = coin_info[0]
    coin_info['data'].append({'id': image_name_parts[1], 
		'side':image_name_parts[3], 'class':strClass})

    # If classes predicted for different photos of the 
	# same coin are different, show that coin
    if(strClass != coin_info['data'][0]['class']):
      print("================\n", coin_info['data'][0]['class'], 
		" vs ", strClass)
      
      plt.imshow(img)
      plt.show()    

  else:
    coin_info = { 'coin_id':coin_id, 
		'data': [{'id': image_name_parts[1], 
			'side':image_name_parts[3], 'class':strClass}], 
			'ext': image_ext }
    arrOutput.append(coin_info)

Let's take a closer look at our code. First of all, we have a BIG problem organizing our files. See, we may have more than one photo of the head and/or tail of a coin. And we might want to use those images together, to increase the accuracy of whatever we do with them in the future.

It means that, instead of having a separate record for each photo (of the same coin), we'd better group them together. The way we stored data until now didn't support this approach, so we have rewritten it: we index our data by "coin_id", a unique combination of the image name and the number of the particular coin inside this image.
To explain the phrase above: note that the original image could contain more than one coin. Then (during step 1) we extracted coins from that image, but kept the original name (169860023) and added a coin number (169860023_YY_XX_tail). Here XX is the coin number, and YY is the number of the photo (say we had 3 photos of a 5-coin set, we look at photo number one and our coin is number two, so we get 169860023_01_02_tail).

Note that we do not add the photo number to the coin index, as it differs between images of the same coin.
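
A toy illustration of this indexing, independent of the rest of the code:

file_name = "169860023_01_02_tail.png"              # photo 01, coin 02
image_name_parts = file_name.split(".")[0].split("_")
# ['169860023', '01', '02', 'tail']
coin_id = image_name_parts[0] + "_" + image_name_parts[2]
print(coin_id)  # 169860023_02: the photo number is dropped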

Sounds complex, but as a Data Scientist you will do this sort of thing a lot, and at some point it will become automatic.

Note that during that pass we ignored the "tail" images. This is because Russian coins have the value printed on the "head" side. But now we need to add the "tail" info to the same list.

# Add tail images
print(">>> Processing tails <<<")
for i, file_name in enumerate(arrSourceImageNames):

  if(i > 100):
    break
  if(i % 100 == 0):
    print(i, " of ", nTotal)    

  # 169860023_00_00_tail(.png)
  # Here 169860023 is number of image set 
  # (groups of photos of a set of coins from meshok.ru)
  # First 00 is number of image (from meshok.ru, so they 
  # have 169860023_00, 169860023_01...)
  # Second 00 is coin number within coin set 
  # (image with 3 coins: 169860023_00_00, 169860023_00_01, 
  # 169860023_00_02, 169860023_01_00, 169860023_01_01, 169860023_01_02) 
  word_list = file_name.split(".")  # ['169860023_00_00_tail', 'png']
  image_name = word_list[0]
  image_ext = word_list[1]

  # Russian coins have value on "head" side
  if("head" in image_name):
    continue

  # ["169860023", "00", "00", "tail"]
  image_name_parts = image_name.split("_")

  # ---
  
  # Find out if info for this coin (but for other image of it) 
  # is in arrOutput. As we might have more than one "head", 
  # we'll take 0th head class for this tail image

  coin_id = image_name_parts[0] + "_" + image_name_parts[2]
  coin_info = [info for info in arrOutput if info['coin_id'] == coin_id] 
  if coin_info: 
    coin_info = coin_info[0]
    coin_info['data'].append({'id': image_name_parts[1], 
	'side':image_name_parts[3], 'class':coin_info['data'][0]['class']})
    
# Save arrOutput to a JSON file
with open(result_path, 'w') as outfile:
    json.dump(arrOutput, outfile)


At this point we have a JSON file with info about our coins, including value (1, 2, 5, 10 roubles) and side (head/tail).

As part of fine-tuning, we may want to look through the list, finding coins that, for the same coin id, have different predicted classes, and create a new training set emphasizing those coins.
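
A sketch of that pass: load output.txt back and list the coin ids whose images disagree on the predicted class (the file and field names are the ones produced by the code above):

with open(result_path) as infile:
  arrOutput = json.load(infile)

for coin_info in arrOutput:
  setCoinClasses = set(d['class'] for d in coin_info['data'])
  if(len(setCoinClasses) > 1):
    print(coin_info['coin_id'], ": ", setCoinClasses)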