Adding support for MobileViTV2 model

Question

Adding support for MobileViTV2 model

Closed this issue a month ago · 3 comments

laszlokiss-szelena commented a month ago

Model description

Hi,

I would love to use MobileViTV2 in my application. I am definitely not an expert, but it seems that its architecture is pretty similar to MobileViT, so adding it seems fairly straightforward to me.

Laszlo

Prerequisites

The model is supported in Transformers (i.e., listed here)
The model can be exported to ONNX with Optimum (i.e., listed here)

Additional information

No response

Your contribution

I experimented with this model on my fork here: KLaci@e1e02b1

I can submit a PR too if needed.

Answer 1 · 2024-04-22T14:57:53.000Z

Hi there 👋 Looks like the ONNX export isn't as simple as I originally thought (see here). Is this something you'd be able to look into? :)

Answer 2 · 2024-04-22T15:56:09.000Z

Okay I might have got it working.

Answer 3 · 2024-04-22T17:00:31.000Z

Example code (requires #721):

import { pipeline } from '@xenova/transformers';

const url = 'https://huggingface.co/datasets/Xenova/transformers.js-docs/resolve/main/tiger.jpg';
const classifier = await pipeline('image-classification', 'Xenova/mobilevitv2-1.0-imagenet1k-256', {
    quantized: false,
});
const output = await classifier(url);
// [{ label: 'tiger, Panthera tigris', score: 0.6491137742996216 }]