georgian-io/Multimodal-Toolkit

[Question] Are numerical and categorical features used for fine-tuning BERT (or LLMs)?


Hi Developers,

I have a naive question; could you please help me understand:

  • During model training, do the weights and biases of the BERT model (or any other supported model) change? If yes, does the language model itself take the numerical and categorical features into account, or are those features only used (alongside the vector embeddings from the language model) by the MLP that does the classification?

Hi @anirbandey303,

Yes, the pre-trained language model (such as BERT) weights do change.

The numerical/categorical features are not used by the language model (BERT) itself. They are passed as features to the MLP alongside the output of the language model. The architecture diagram here might help!
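
To make the two points above concrete, here is a minimal sketch of that kind of architecture in PyTorch. This is an illustration of the general pattern, not the toolkit's actual implementation; the class name, layer sizes, and the use of the `[CLS]` embedding are all assumptions for the example.

```python
# Sketch only: shows a transformer fine-tuned end to end, with tabular
# features concatenated to its output before an MLP classifier.
# Names, dimensions, and pooling choice here are illustrative.
import torch
import torch.nn as nn
from transformers import AutoModel


class TextWithTabularClassifier(nn.Module):
    def __init__(self, model_name="bert-base-uncased",
                 n_numerical=5, n_categorical=8, n_classes=2):
        super().__init__()
        # Pre-trained language model; its weights receive gradients
        # during training, so they are fine-tuned.
        self.text_encoder = AutoModel.from_pretrained(model_name)
        hidden = self.text_encoder.config.hidden_size
        # The MLP head sees the text embedding plus the tabular features.
        self.classifier = nn.Sequential(
            nn.Linear(hidden + n_numerical + n_categorical, 256),
            nn.ReLU(),
            nn.Linear(256, n_classes),
        )

    def forward(self, input_ids, attention_mask, numerical, categorical):
        # The tabular features never enter the transformer itself...
        out = self.text_encoder(input_ids=input_ids,
                                attention_mask=attention_mask)
        pooled = out.last_hidden_state[:, 0]  # [CLS] token embedding
        # ...they are concatenated with its output for the MLP head.
        combined = torch.cat([pooled, numerical, categorical], dim=-1)
        return self.classifier(combined)
```

Since `self.text_encoder` is a regular submodule, its parameters are updated by the optimizer along with the MLP's, which is why the BERT weights change during training (unless you explicitly freeze them).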

Perfect, that clears up my confusion. Thanks a lot for the prompt response. 👍

Happy to help :)