nlpjoe/daguan-classify-2018

key_words_train_features.df

dabaitudiu opened this issue · 2 comments

博主你好,请问在train_predict.py这个文件中

def static_data_prepare():
    train_y = pd.read_csv(config.TRAIN_X, usecols=['label_c_numeric']).values
    kw_train_df = pd.read_csv('../data/feature/key_words_train_feature.df')
    kw_test_df = pd.read_csv('../data/feature/key_words_test_feature.df')

里面的key_words_train_features.df是怎么得来的?是过滤掉低频词之后直接save的df吗?

okok 谢谢博主