/Chinese-Text-Classification

medical text classification

Primary LanguagePythonOtherNOASSERTION

1.Topic:Medical Text Classification

contest and data link:https://tianchi.aliyun.com/competition/entrance/532153?spm=a2c22.12281957.0.0.4c885d9bzbNNxD

2.Paper Title:《A Model Ensemble Approach With LLM for Chinese Text Classification》

3.Architecture

image

4.Model Selection

Qwen-7b-Chat,ChatGLM2-6b,Macbert

5.Train methods

Qlora ,lora,FGM adversarial train

6.data format

{ "id": "id_0", "conversations": [ { "from": "user", "value": "糖尿病人可不可以服用伟哥? 请对上述的句子划分类别,从下面提供的类别中选出一个正确的选项:A.诊断 B.治疗 C.医疗常识 D.健康生活方式 E.流行病学 F.其他" }, { "from": "assistant", "value": "B.治疗" } ] }

7.label transfer

image

8.Run

1.run data_process.py

2.run sh lora.sh or train.py

3.run batch.py

4.run data_postprocess.py

9.rank(top2)

image

10.Citation

{email:18819893186wcy@gmail.com}