FinProLex: A Professional Lexicon for Finance
Introduction
FinProLex provides 5,162 tokens in professional analysts' reports and the financial social media platform posts with expert-like scores. The expert-like scores are calculated based on the pointwise mutual information (PMI).
Format
The FinProLex is consisted of "token" and "expertise_score" in json format.
Example
{
'token': '空下去'
'expertise_score': -1.7505470585119092
},
{
'token': '考量'
'expertise_score': 2.049518947959047
}
Download
Download FinProLex.zip
for the data.
How to Cite the Corpus
Please cite the following paper when referring to the FinProLex in academic publications and papers.
Chung-Chi Chen, Hen-Hsen Huang, and Hsin-Hsi Chen. 2021. Evaluating the Rationales of Retail Investors. In Proceedings of The Web Conference 2021 (WWW 2021).
License
FinProLex is licensed under the Creative Commons Attribution-Non-Commercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.