Pinned Repositories
bigdata-sql-parser
基于antlr4 解析器,支持spark sql, tidb sql, flink sql, Spark/flink jar 运行命令解析器
datawork
开源技术打造数据开发平台
DF
招商银行信用卡中心金融数据大赛
DRDCDeviceMonitor
DRDCDeviceMonitor Flink+Kafka+redis+rabbitmq实现的实时智能运维系统。
ETL-1
数据基本清洗包括日期、时间、数值、字符串、字符、金钱、数据库(mysql、postgresql、mongodb、hbase、hdfsmemcached)、加解密(md5、sha、base64、aes、rsa)、文件、http服务、正则表达式等,后期会不断更新。
flink-learning
flink learning demo
GBTLRTuniu
基于Spark streaming+Kafka+Redis/HBase的GBDT+LR推荐排序模型
lighthouse
离线调度, hive, 任务依赖, 任务调度, 大数据开发平台
s_PublicSecurityBigDataPlatform
公安大数据平台
Semantic-search
基于知识图谱的语义搜索模块,底层为neo4j图数据库
0xqq's Repositories
0xqq/albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
0xqq/alchemy
给flink开发的web系统。支持页面上定义udf,进行sql和jar任务的提交;支持source、sink、job的管理;可以管理openshift上的flink集群
0xqq/AlinkLearning
📖 Alink Learning with scala
0xqq/allennlp
An open-source NLP research library, built on PyTorch.
0xqq/angel-graph
The graph computing package for Angel.
0xqq/Coronavirus-Epidemic-2019-nCov
👩🏻⚕️2019-nCoV estimation and forecast using statistical model; 新型冠状病毒武汉肺炎统计模型预测
0xqq/deeplearning4j
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
0xqq/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
0xqq/tidb-in-action
TiDB In Action: based on 4.0
0xqq/zeppelin
Mirror of Apache Zeppelin
0xqq/12306
12306智能刷票,订票
0xqq/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
0xqq/Chinese-medical-dialogue-data
Chinese medical dialogue data 中文医疗对话数据集
0xqq/Chinese-PreTrained-XLNet
Pre-Trained Chinese XLNet(中文XLNet预训练模型)
0xqq/clause
:horse_racing: Chatopera语义理解系统
0xqq/CLUE
中文任务基准测评 datasets, baselines, pre-trained models, corpus and leaderboard
0xqq/Event-Extraction
基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体识别、事件要素抽取和判决结果预测等内容
0xqq/GPT2-chitchat
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI**)
0xqq/incubator-doris
Apache Doris (Incubating)
0xqq/K-BERT
Source code of K-BERT (AAAI2020)
0xqq/lineflow
:zap:A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python
0xqq/ludwig
Ludwig is a toolbox built on top of TensorFlow that allows to train and test deep learning models without the need to write code.
0xqq/nussknacker
Process authoring tool for Apache Flink
0xqq/power
电力相关代码及文档数据
0xqq/pytext
A natural language modeling framework based on PyTorch
0xqq/spark-binlog
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
0xqq/study-hard
图计算平台,子图匹配算法
0xqq/tf-encrypted
A Framework for Machine Learning on Encrypted Data
0xqq/tokenizers
💥Fast State-of-the-Art Tokenizers optimized for Research and Production
0xqq/xf_tag
大数据应用分类标注挑战赛(NLP),亚军🥈