heeyngpark's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
codelucas/newspaper
newspaper3k is a news, full-text, and article metadata extraction library for Python 3.
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
victoresque/pytorch-template
PyTorch deep learning projects made easy.
facebookresearch/LASER
Language-Agnostic SEntence Representations
kakaobrain/pororo
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
MilaNLProc/contextualized-topic-models
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
lucidrains/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in PyTorch
mimno/Mallet
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
songys/AwesomeKorean_Data
Links to Korean datasets
datanada/Awesome-Korean-NLP
A curated list of resources for NLP (Natural Language Processing) for Korean
orcasgit/python-fitbit
Fitbit API Python Client Implementation
vi3k6i5/GuidedLDA
Semi-supervised guided topic model with custom GuidedLDA
hyunwoongko/kss
KSS: Korean String processing Suite
xiaohuiyan/BTM
Code for Biterm Topic Model (published in WWW 2013)
FinanceData/OpenDartReader
Open DART Reader
Beomi/KcELECTRA
🤗 Korean Comments ELECTRA: an ELECTRA model trained on Korean comments
changdoc/naver-vaccine-macro
Macro that automatically attempts Naver neighborhood vaccine reservations
bab2min/corpus
A collection of personally gathered corpora for Korean NLP
fingeredman/teanaps
An open-source Python library for natural language processing and text analysis.
mariru/exponential_family_embeddings
Bernoulli Embeddings for Text
entelecheia/eKoNLPy
Korean NLP Python Library for Economic Analysis
mariru/structured_embeddings
nyu-mll/pretraining-learning-curves
The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"
johnliu55tw/Pomodoro-Ionic
The Pomodoro clock on Fitbit Ionic.
edinbb/fitbit-air-quality-app
See air quality reports from 10k monitoring stations around the world.
uclanlp/clusters
Code for the EMNLP 2020 LOGAN paper
KaitakuShiba/fitbit