Nissan Project

Overview

Background

Currently Nissan uses a survey to measure opinion/brand preference, brand awareness, and attribute association for automotive brands and models.

Attributes currently tracked in the survey

Functional Attributes: Dependable, Lasts long, Value for money, Quality fit and finish, Attractive styling, Safe, Retains resale value, Driver comfort, Fun to drive, Advanced features, Responsive handling, Prestigious, Dealerships, Fuel efficient, Quick acceleration, Environmentally friendly, Affordable
Personal Attributes: Trusted, Leader, Responsible, Confident, Innovative, Exciting, Practical, Adventurous, Passionate, Distinctive, Youthful, Aggressive

We use transformer models, combining actual customer and expert commentary found online, to determine how automotive brands and models are naturally talked about, instead of relying on the traditional attributes that survey takers are pushed through.

Problem

Understand organically the conversations around the Nissan Rogue and its competitors as a first use case. Which attributes are naturally associated with which models, both within and beyond the attributes we already track? And what is the preference/opinion for each model in comparison to the others?

Solution

To collect the data, we scraped car reviews from the web and merged them. Sources include Edmunds.com, KBB.com, cars.com, and YouTube reviews, covering the Nissan Rogue, Chevy Equinox, Ford Escape, Ford Bronco Sport, Honda CR-V, Hyundai Tucson, Kia Sportage, Mazda CX-5, Subaru Forester, and Toyota RAV4.

Then, we apply zero-shot classification directly to all attributes as our baseline model.
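As a rough illustration of the baseline, the sketch below scores one review against a few attributes with a Hugging Face zero-shot pipeline. It assumes the facebook/bart-large-mnli checkpoint listed under Resource Links. The helper names (`load_zero_shot`, `score_review`), the attribute subset, and the "attribute" vs. "not attribute" label pairing are illustrative assumptions, not the project's actual code.

```python
from typing import Callable, Dict, List

# Illustrative subset of the survey attributes (assumption: any subset works).
ATTRIBUTES = ["dependable", "fuel efficient", "fun to drive"]


def load_zero_shot():
    """Build a zero-shot pipeline (downloads the model on first use)."""
    from transformers import pipeline  # imported lazily; requires `transformers`

    return pipeline("zero-shot-classification", model="facebook/bart-large-mnli")


def score_review(
    review: str,
    classifier: Callable,
    attributes: List[str] = ATTRIBUTES,
) -> Dict[str, float]:
    """Return, per attribute, the score of the positive label for this review."""
    scores = {}
    for attr in attributes:
        # Assumed label pairing: the attribute against its negation.
        result = classifier(review, candidate_labels=[attr, f"not {attr}"])
        by_label = dict(zip(result["labels"], result["scores"]))
        scores[attr] = by_label[attr]
    return scores
```

A call such as `score_review("The Rogue sips fuel on the highway.", load_zero_shot())` would return a dictionary mapping each attribute to a score between 0 and 1.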

Since the baseline has some potential issues, we designed a two-stage model:

Stage 1: Judge whether an attribute is mentioned in each car review by applying transformer question answering to the review.
Stage 2: If yes (the attribute is mentioned in the review), apply zero-shot classification for that attribute (e.g., dependable vs. not dependable); if no, skip the attribute.
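The two stages above can be sketched as follows. This is a minimal sketch under stated assumptions: the question wording, the `min_score` threshold, and the function names are hypothetical, and the project may decide "mentioned" differently. The checkpoints are the ones listed under Resource Links, and `handle_impossible_answer=True` lets the SQuAD 2.0 model return an empty answer when the review says nothing about the attribute.

```python
from typing import Callable, Optional


def load_models():
    """Build the QA and zero-shot pipelines (downloads models on first use)."""
    from transformers import pipeline  # imported lazily; requires `transformers`

    qa = pipeline("question-answering", model="deepset/roberta-base-squad2")
    clf = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
    return qa, clf


def two_stage(
    review: str,
    attribute: str,
    qa: Callable,
    classifier: Callable,
    min_score: float = 0.3,  # hypothetical cutoff, would need tuning
) -> Optional[str]:
    """Return the winning label, or None if the attribute is not mentioned."""
    # Stage 1: ask the QA model whether the review talks about the attribute.
    answer = qa(
        question=f"What does the review say about {attribute}?",
        context=review,
        handle_impossible_answer=True,
    )
    if not answer["answer"].strip() or answer["score"] < min_score:
        return None  # attribute not mentioned: skip it

    # Stage 2: classify the review as the attribute or its negation.
    result = classifier(review, candidate_labels=[attribute, f"not {attribute}"])
    return result["labels"][0]  # labels are sorted by descending score
```

Passing the pipelines in as arguments keeps the model loading separate from the decision logic, so the gating rule can be checked with simple stand-in functions.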

Result

Improved results:

Correlation between attributes:
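One way such correlations could be computed is sketched below. It assumes each attribute has a list of per-review scores of equal length and uses the Pearson coefficient; both the data layout and the choice of Pearson are assumptions made for illustration, not a description of how the project's figure was produced.

```python
from math import sqrt
from typing import Dict, List


def pearson(xs: List[float], ys: List[float]) -> float:
    """Pearson correlation of two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when one list is constant
    return cov / sqrt(var_x * var_y)


def correlation_matrix(scores: Dict[str, List[float]]) -> Dict[str, Dict[str, float]]:
    """Pairwise correlations between attributes (hypothetical data layout)."""
    return {a: {b: pearson(scores[a], scores[b]) for b in scores} for a in scores}
```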

Comparing each model:

GPT-3

Generative Pre-trained Transformer - Generation 3

Result table

Compare Values:

Use OpenAI (GPT-3): a larger, more accurate, paid model
Use Hugging Face: an open-source, free, but relatively less accurate model

Solution:

Apply domain adaptation to the Hugging Face model to reduce the model size and make the open-source model's predictions more accurate. Mask fill: https://huggingface.co/bert-base-uncased
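Mask-fill domain adaptation usually means continuing to train the model on in-domain text (here, car reviews) with some tokens hidden, so that it learns to predict them. The sketch below illustrates only the masking step in plain Python. It is a simplification: the function name and the whitespace tokenization are assumptions, and BERT's actual recipe also replaces some selected tokens with random or unchanged tokens. In practice, the `transformers` class `DataCollatorForLanguageModeling` handles this step on tokenized batches.

```python
import random
from typing import List, Optional, Tuple

MASK = "[MASK]"  # mask token used by bert-base-uncased


def mask_tokens(
    tokens: List[str],
    prob: float = 0.15,
    seed: Optional[int] = None,
) -> Tuple[List[str], List[Optional[str]]]:
    """Hide each token with probability `prob`.

    Returns the masked sequence and, per position, the original token the
    model should predict (None where nothing was masked).
    """
    rng = random.Random(seed)
    masked: List[str] = []
    labels: List[Optional[str]] = []
    for tok in tokens:
        if rng.random() < prob:
            masked.append(MASK)
            labels.append(tok)
        else:
            masked.append(tok)
            labels.append(None)
    return masked, labels


# Example input: a car-review sentence split on whitespace (assumed tokenization).
review_tokens = "the rogue has a smooth ride and a quiet cabin".split()
masked_tokens, targets = mask_tokens(review_tokens, seed=0)
```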

Critical Analysis

The Rogue's overall rating is slightly below average, especially for quick acceleration and fun to drive. We checked the specifications: the Rogue has 180 hp, while the other models have around 200 hp. We think this is the main reason.
We think the Rogue can offer more at its price than the other models, which means better value for money. We think the potential customers for these models are price-sensitive, so the Rogue should improve while maintaining its value for money.
Some attributes overlap or are unclearly defined, and we may refine them later. For example, Value for money and Retains resale value are similar, and it is hard to interpret what Lasts long means.
Since the zero-shot model has limitations, we will try fine-tuning another model, such as GPT-3. This is our future work.

Huggingface Space

Huggingface space is here.

Huggingface Model Card

Huggingface model card is here.

Resource Links

Huggingface tutorial

facebook/bart-large-mnli

deepset/roberta-base-squad2

Code Demo

Code is inside this repo

Video Recording

Coming Soon

vanderbilt-data-science/nissan