sophychen79's Stars
daattali/addinslist
📜 Discover and install useful RStudio addins
MIT-LCP/mimic-code
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
piskvorky/gensim-data
Data repository for pretrained NLP models and NLP corpora.
dunovank/jupyter-themes
Custom Jupyter Notebook Themes
riejohnson/ConText
ConText v4: Neural networks for text categorization
Wangpeiyi9979/IE-Bert-CNN
一个关于百度2019语言与智能技术竞赛信息抽取 (http://lic2019.ccf.org.cn/kg) 模型, 模型采用BERT+CNN。DEMO地址 https://github.com/Wangpeiyi9979/InformationExtractionDemo
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
codelucas/newspaper
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
Shuilongyin/semi_ML
tmadl/semisup-learn
Semi-supervised learning frameworks for python, which allow fitting scikit-learn classifiers to partially labeled data
stevengj/nlopt
library for nonlinear optimization, wrapping many algorithms for global and local, constrained or unconstrained, optimization
ageron/handson-ml
⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
dvanoni/notero
A Zotero plugin for syncing items and notes into Notion
yuzhimanhua/MotifClass
MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information (WSDM'22)
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
apachecn/sklearn-doc-zh
:book: [译] scikit-learn(sklearn) 中文文档
autonlab/weasel
Weakly Supervised End-to-End Learning (NeurIPS 2021)
HazyResearch/flyingsquid
More interactive weak supervision with FlyingSquid
SALT-NLP/MixText
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
PouringRain/blog_code
存放知乎,博客发表文章中的代码
fengdu78/machine_learning_beginner
机器学习初学者公众号作品
shenweichen/GraphEmbedding
Implementation and experiments of graph embedding algorithms.
eshwarkoka/Medical-document-classification
Text classification on the medical abstracts in OHSUMED dataset
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
adableau/Reuters_datasets
Reuters _datasets
horcham/TSVM
ICDI0906/MachineLearning
该仓库包含了机器学习,数据挖掘中的理论知识和相关实践代码
barebell/DA
Unsupervised Domain Adaptation Papers and Code
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation