HuaBao-OK's Stars
facebookresearch/fastText
Library for fast text representation and classification.
facebook/zstd
Zstandard - Fast real-time compression algorithm
google/brotli
Brotli compression format
NLPchina/ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
tesseract-ocr/tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
rmtheis/tess-two
Fork of Tesseract Tools for Android
huaban/jieba-analysis
结巴分词(java版)
Micro-sheep/efinance
efinance 是一个可以快速获取基金、股票、债券、期货数据的 Python 库,回测以及量化交易的好帮手!🚀🚀🚀
mpquant/Ashare
股票行情实时数据接口-A股,完全免费的沪深证券股票数据-**股市,python最简封装的API接口,包含日线,历史K线,分时线,分钟线,全部实时采集,系统包括新浪腾讯双数据核心采集获取,自动故障切换,STOCK数据格式成DataFrame格式,可用来查询研究量化分析,股票程序自动化交易系统.为量化研究者在数据获取方面极大地减轻工作量,更加专注于策略和模型的研究与实现。
naxiaoduo/1000UserGuide
1000UserGuide:对独立开发者和创业者来说,找到前1000个早期用户太关键了。这里精心整理了300多个国内外渠道,适合独立开发者和创业者推广产品的渠道。
apache/opennlp
Apache OpenNLP
florianv/swap
:currency_exchange: Currency exchange rates library
uname-yang/pysnowball
雪球股票数据接口 python edition
maanavshah/stock-market-india
API for Indian Stock Market's NSE and BSE.
luben/zstd-jni
JNI binding for Zstd
robbiehanson/KissXML
A replacement for Cocoa's NSXML cluster of classes. Based on libxml. Works on iOS.
adaptech-cz/Tesseract4Android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
zhangxiangliang/stock-api
股票接口 | 韭菜小猪 | A股 | 美股 | 港股 | 股票 | 基金 | JavaScript
fawazahmed0/exchange-api
Free Currency Exchange Rates API with 200+ Currencies & No Rate Limits
mayabot/mynlp
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
joestelmach/natty
Java natural language date parser
florianv/exchanger
:office: Currency exchange rates framework for PHP
mayabot/fastText4j
Implementing Facebook's FastText with java
EugenePig/ik-analyzer-solr5
IKAnalyzer for Solr5
Waikato/weka-trunk
No longer updated mirror of the Weka trunk branch.
ellisonchan/SplashScreen
Splash screen demo that used with Jetpack ‘SplashScreen‘ library and Android 12's Splash Screen API.
alicelmx/SVM-Chinese-Classification
利用支持向量机实现中文文本分类
rahulnadella/FuzzySearch
Utility class to find an approximate match for specific String
geek-jiaxu/CnnTextClassifyByDl4j
采用卷积神经网络(cnn)进行文本分类,依赖dl4j
chenergy1991/Chinese-Text-Classification-Based-on-Naive-Bayes
The development of computer and communications technology has resulted in huge amount of data. The automatic text classification technique has become very significant. Naive Bayes algorithm is based on probabilistic model. It is an effective way to deal with automatic text classification. The main task of this paper is to discuss the theoretical basis of Naive Bayes text classifier and describe the process of using Java language to accomplish the classifier. We can divide the classifier into two parts: the feature extraction and the calculation according to the feature. In the feature extraction part, I use the Chinese word segmentation method and the stop words filtering. In the classification part, I calculate the prior probability, the likelihood function value and the maximum a posterior estimation. During the simple test, the author uses the Sogou laboratory’s text classification corpus as the training set and the test set. During the test, the accuracy is between 39% to 56 %. The results show that there is still room for improvement. The paper also includes the discussion of its improvement methods and wider application.