will4906/PatentCrawler

abstract解析出来有html代码

fallleave001 opened this issue · 1 comments

将patent.py第145行,改为如下即可:
sipocrawler['abstract'] = BeautifulSoup(detail.get('abstractInfoDTO').get('abIndexList')[0].get('value')).text.replace('\n','')

好的,我修改一下