Pinned Repositories
CLIP-self-attention-visualization
Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.
korean-spacing-model
한국어 문장 띄어쓰기(삭제/추가) 모델입니다. 데이터 준비 후 직접 학습이 가능하도록 작성하였습니다.
korean-wikipedia-corpus
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
KR-BERT-SimCSE
Implementing SimCSE using KR-BERT
namuwiki-corpus
문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.
nori-clone
Standalone Nori (Korean Morphological Analyzer)
python-mecab
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
pytorch-bert
An implementation of BERT using PyTorch's TransformerEncoder
smaller-labse
Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE
tfds-korean
A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.
jeongukjae's Repositories
jeongukjae/nori-clone
Standalone Nori (Korean Morphological Analyzer)
jeongukjae/korean-wikipedia-corpus
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
jeongukjae/jeongukjae.github.io
My engineering blog
jeongukjae/tensorflow-serving-apis-proto
Protobuf files for TensorFlow Serving apis
jeongukjae/vscode-protobuf
protobuf extension for VSCode
jeongukjae/about
about page for myself
jeongukjae/image-cropping-using-attention
jeongukjae/tensorflow-io-issue-1828
jeongukjae/zero-to-production-in-rust
Playground repository while reading "Zero To Production In Rust"
jeongukjae/aws-sdk-cpp-1.11-bazel-test
jeongukjae/bazel
a fast, scalable, multi-language and extensible build system
jeongukjae/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
jeongukjae/code-search
Demo to search codes in org with natural languages
jeongukjae/community-plugins
Community plugins for Backstage
jeongukjae/daac
jeongukjae/darts-ac
Extending darts-clone for Aho-Corasick
jeongukjae/data-validation
Library for exploring and validating machine learning data
jeongukjae/dockerfiles
Personal docker images
jeongukjae/flb-plugin-sample
jeongukjae/hera
Hera is an Argo Python SDK. Hera aims to make construction and submission of various Argo Project resources easy and accessible to everyone! Hera abstracts away low-level setup details while still maintaining a consistent vocabulary with Argo. ⭐️ Remember to star!
jeongukjae/hera-5.16-name-conflict-error
jeongukjae/io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
jeongukjae/metacontroller
Writing kubernetes controllers can be simple
jeongukjae/model-analysis
Model analysis tools for TensorFlow
jeongukjae/model-card-toolkit
A toolkit that streamlines and automates the generation of model cards
jeongukjae/redpanda-data-connect
Fancy stream processing made operationally mundane
jeongukjae/tensorflow-serving
A flexible, high-performance serving system for machine learning models
jeongukjae/tfx
TFX is an end-to-end platform for deploying production ML pipelines
jeongukjae/tfx-addons
Developers helping developers. TFX-Addons is a collection of community projects to build new components, examples, libraries, and tools for TFX. The projects are organized under the auspices of the special interest group, SIG TFX-Addons. Join the group at http://goo.gle/tfx-addons-group
jeongukjae/twitter-the-algorithm
Source code for Twitter's Recommendation Algorithm