judy-pp's Stars
tunib-ai/large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
tunib-ai/tunib-electra
Korean-English Bilingual Electra Models
tunib-ai/artwork_captions
Machine Generated Captions for Best Artworks
tunib-ai/DKTC
Dataset of Korean Threatening Conversations
tunib-ai/KMWP
Korean Math Word Problems
tunib-ai/oslo
OSLO: Open Source framework for Large-scale model Optimization
sooftware/k-startups
List of tech startups in South Korea. (Republic of Korea)
tunib-ai/transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
tunib-ai/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
kocohub/korean-hate-speech
Korean HateSpeech Dataset
alexgreene/WikiQuiz
Generates a quiz for a Wikipedia page using parts of speech and text chunking.
LexPredict/lexpredict-lexnlp
LexNLP by LexPredict
captainnemo9292/hate-speech-language-modeling
Recurrent Neural Network based Hate Speech Language Model for Korean Hate Speech Detection
deepset-ai/FARM
:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
makcedward/nlpaug
Data augmentation for NLP
MrBananaHuman/KoGPT2ForParaphrasing
TEMP
tesseract-ocr/langdata
Source training data for Tesseract for lots of languages
ICLRandD/Blackstone
:black_circle: A spaCy pipeline and model for NLP on unstructured legal text.
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
clovaai/ClovaCall
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
jungyeul/korean-parallel-corpora
Korean Parallel Corpus
facebookresearch/TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
sooftware/kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
krasserm/fairseq-image-captioning
Transformer-based image captioning extension for pytorch/fairseq
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
rakshithShetty/py-faster-rcnn-featextract
This is a clone of py faster rcnn -- With additional scripts to extract features for image/video captioning
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
orionw/RedditHumorDetection
Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"