qghckkjkkkk's Stars
Werneror/Poetry
非常全的古诗词数据,收录了从先秦到现代的共计85万余首古诗词。
WuLC/ThesaurusSpider
下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库
17621192638/JiebaLexicon
构建**百科词库,作为jieba分词的自定义词库。爬取百度拼音输入法词库,将.bdict文件解析为txt文件.python3.
StuPeter/Sougou_dict_spider
搜狗词库爬虫,全类目下载,自动分类,scel转txt
iMyon/charts4bmoe
charts for Bilibili Moe http://bmoe.uuzsama.me/
yxcs/poems-db
比较全的中华古诗古词古文库,包括21万首古诗词,以及注释、赏析等信息,包含10000多名诗人以及诗人的介绍、生平等,同时包含,1600多个词牌介绍,**70多个朝代解析,和古诗文的近200个分类标签
AI-YULU/baike_triples
爬取百度百科词条,抽取三元组,构建知识图谱
Warkeeper/BaiduBaikeCrawler
This project is a crawler which trying to get all lemmas of Baidu Baike.The author has downloaded about 100,000 lemmas in one and a half hour.This project uses https://github.com/qq1367212627/XDX03065 for reference and improves its performance.
BIT-ENGD/baidu_baike
Pelhans/Z_knowledge_graph
Bulding kg from 0
thunlp/THUOCL
THUOCL(THU Open Chinese Lexicon)中文词库
wainshine/Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
chinese-poetry/chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
ryanhoo/StylishMusicPlayer
A stylish music player for android device 16+
elpwc/anime_crawler
从萌娘百科和Bangumi上爬取动画信息的爬虫
HCLonely/pcr-dict
公主连结Re:Dive 中文输入法词库。
liuhuanyong/DomainWordsDict
DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task。涵盖68个领域、共计916万词的专业词典知识库,可用于文本分类、知识增强、领域词汇库扩充等自然语言处理应用。
outloudvi/mw2fcitx
Fcitx 5 pinyin dictionary generator for MediaWiki instances. (Releases for dict of zh.moegirl.org.cn / Check release list for latest releases)
felixonmars/fcitx5-pinyin-zhwiki
Fcitx 5 Pinyin Dictionary from zh.wikipedia.org
DiexMi/MoegirlMenuDictionary-For-Gboard-MSPinyinIME
为Gboard与微软拼音而作的,取自于萌娘百科目录页的二次元词库
yuhui-zh15/SogouWord
jilelab/acg-chinese-words
二次元中文特征词库。
Naigang/py-sogouciku
python搜狗词库下载并转换为txt文档
danimahardhika/candybar-library
Android icon pack material dashboard
Monster2848/sougou_dic_spider
pyunits/pyunit-sogou
搜狗词库下载模块
justjavac/weibo-trending-hot-search
微博热搜榜,记录从 2020-11-24 日开始的微博热门搜索。每小时抓取一次数据,按天归档。
request/request
🏊🏾 Simplified HTTP request client.
Tang-Li-Jen/GooglePlay_Crawler
Python crawler for Apps informations on Google Play
shuangma/GooglePlayCrawler
Google Play Crawler is able to automatically and efficiently crawl the detail information of Android apps on Google Play.