evaranks.com👋

A web application where a CNN-based AI classifies your fashion style

Demo

Eva, fashion-classifier AI, is able to analyze your picture to figure out what your fashion is and classify it into 7~8 classes with a fine-tuend VGG-16 model. You can also share the result with others on leaderboard. The service process:

AI Stack

Eva was created based on a nice paper regarding vision AI : M. Takagi, E. Simo-Serra, S. Iizuka, and H. Ishikawa. WhatMakes a Style: Experimental Analysis of Fashion Prediction.InProceedings of the International Conference on ComputerVision Workshops (ICCVW), 2017.

Among several models, we chose to use VGG-16 model after examining the validation result, which was a little bit higher than other candidates such as ResNet or VGG-50.

Feature Extraction Map

One of the noticeable outcome is we extracted the weight shape of last convolutional layer and visualized them on the original picture we used to predict its class. we got a result like the below.

Here's the sample code we wrote :

from matplotlib.pyplot import imshow
test_image = Image.open('test_img.jpg', 'r')
test_image = test_image.resize((256, 256))
test_image = np.asarray(test_image2)

# load model
test_image = np.array(test_image.reshape(-1, 256, 256, 3))
last_conv_output, pred = new_model.predict(test_image)

last_conv_output = np.squeeze(last_conv_output)
feature_activation_map = scipy.ndimage.zoom(last_conv_output, (32, 32, 0.5), order=1)


pred_class = np.argmax(pred)
predicted_class_weights= last_weight[:, pred_class]
print(predicted_class_weights.shape)

# visualize output
final_output = np.dot(feature_activation_map.reshape((256*256, 256)), predicted_class_weights).reshape((256, 256))
fig, ax = plt.subplots(nrows=1, ncols=2)
fig.set_size_inches(16, 20)

Front-end Stack

Vanila JS & CSS3(HTML5)

We tried to use as many as up-to-date ES6 grammers and implement visual effects using them.

Interactive interface

How to design the whole layout for better UI/UX was a core issue in this project. Through several arguments and research, we were agreed to make interactive interface using typing effect and speech bubbles.

Here's the sample code :

let typing =  setInterval(function(){
  content_element.innerHTML += content[text_idx]
  text_idx += 1
  if(text_idx===content_length&&content_idx<num_contents-1)
  {
    content_element.classList.remove('typing-on')
    content_idx += 1
    text_idx = 0
    content = texts_arrs[content_idx]
    content_length = content.length
    content_element = elements[content_idx]
    content_element.classList.add('typing-on')

  }
  else if(text_idx===content_length&&content_idx===num_contents-1){
    content_element.classList.remove('typing-on')
    clearInterval(typing)
  }
},100)
}

Back-end Stack

Flask

Flask is python based micro-framework. It gives a lot of flexibility to developers compared to other framework like django or spring. Since we needed to deploy AI model in our service, we needed flexibity in our backend server, and therefore, we chose to use flask

Some Issues

Importing Tensorflow

Importing tensorflow takes time when server is initialized. While developing, when we change and test the code, the server restarts and imports the library again. This takes quite a lot of time and lowers the productivity. To solve this problem, we set a flag variable named USE_TF in the config file and didn't import tensorlflow while development.

#config.py
class DevelopmentConfig(BaseConfig):
    DEBUG = True
    USE_TF = False

#ranks.py
if USE_TF:
    from tensorflow.keras import applications, optimizers
    from tensorflow.keras.utils import to_categorical
    from tensorflow.keras.models import Sequential, Model, load_model
    from tensorflow.keras.layers import Dropout, Flatten, Dense
    from tensorflow.keras.preprocessing.image import ImageDataGenerator
    from tensorflow.keras.callbacks import ModelCheckpoint

def predict():
	  if USE_TF:
        print("#")    
	  else:
        result = np.random.randn(1,len(style_name))
        pred_class=np.argmax(result)
        ai_style =style_name[pred_class] #Street
        img_str = request.json['image']
    return pil_random_img

Big size Image

We encoded the image and encoded into base64 . When big size image was uploaded, it is sent to the server after it is chunked into small size data. However flask didn't support handling chunked datas, therefore we had to resize the image beforehand.

//rank..js

function imageToDataUri() {
  // create an off-screen canvas
  var canvas = document.createElement('canvas'),
      ctx = canvas.getContext('2d');

  width = 256
  height = 256

  // set its dimension to target size
  canvas.width = width;
  canvas.height = height;
  // draw source image into the off-screen canvas:
  ctx.drawImage(this, 0, 0, width, height);

  // encode image to data-uri with base64 version of compressed image
  compressed_base64 = canvas.toDataURL();

  return compressed_base64
}

PostgreSQL

Storing analyzed fashion style and features like following or scraping required many joining operations. Therefore we used RDBMS instead of NoSQL services

Deploy

We used AWS Elastic Beanstalk and RDS to deploy our application. There are pros and cons you should check out before you enter into the AWS World!

AWS

Pros:

Easy to deploy through CLI.
Easy to roll back a deployment with checking S3 logs and app version.
Able to do both direct control and auto-deployment
Able to automate load-balancing and capacity-provision

Cons:

Temporary server error occurs in updating app version.
A bit complicate for beginners to figure out settings and configuration like ebextensions.

Tips

You can execute the command shell script when you deplay as below. You don't need to connect to the ec2 directly which makes very convenient. Since tensorflow takes up quite a lot of memory while server is runing, I would recommend useing container commands which is executed before server runs.

container_commands:
  python_create:
    command: source /opt/python/run/venv/bin/activate && python db_create.py

RDS

Pros

Handy to use since it isn't necessary to do OS and DB setting directly.
Easy scaling and auto-scaling for storage.

Cons Nothing!! at least while we have used it.

UI/UX

Facebook Login 2.Sharing Results in KakKoTalk

wkdalsgh192/Evaranks