judy-pp

judy-pp's Stars

tunib-ai/large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
Language:Jupyter Notebook28754
tunib-ai/tunib-electra
Korean-English Bilingual Electra Models
1098
tunib-ai/artwork_captions
Machine Generated Captions for Best Artworks
22
tunib-ai/DKTC
Dataset of Korean Threatening Conversations
703
tunib-ai/KMWP
Korean Math Word Problems
571
tunib-ai/oslo
OSLO: Open Source framework for Large-scale model Optimization
Language:Python30630
sooftware/k-startups
List of tech startups in South Korea. (Republic of Korea)
2169
tunib-ai/transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
Language:Python312
tunib-ai/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
Language:Python77961
kocohub/korean-hate-speech
Korean HateSpeech Dataset
37538
alexgreene/WikiQuiz
Generates a quiz for a Wikipedia page using parts of speech and text chunking.
Language:JavaScript80358
LexPredict/lexpredict-lexnlp
LexNLP by LexPredict
Language:Jupyter Notebook703179
captainnemo9292/hate-speech-language-modeling
Recurrent Neural Network based Hate Speech Language Model for Korean Hate Speech Detection
Language:Jupyter Notebook245
deepset-ai/FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Language:Python 1.7k 247
makcedward/nlpaug
Data augmentation for NLP
Language:Jupyter Notebook 4.5k 463
MrBananaHuman/KoGPT2ForParaphrasing
TEMP
Language:Python355
tesseract-ocr/langdata
Source training data for Tesseract for lots of languages
837888
ICLRandD/Blackstone
⚫ A spaCy pipeline and model for NLP on unstructured legal text.
Language:Python637101
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Language:Python 24.6k 3.2k
clovaai/ClovaCall
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Language:Python21856
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language:Jupyter Notebook 3.8k 1.1k
jungyeul/korean-parallel-corpora
Korean Parallel Corpus
14132
facebookresearch/TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parser's original encoder to compute representations for utterances and table schemas (columns).
Language:Python58666
sooftware/kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Language:Python604191
krasserm/fairseq-image-captioning
Transformer-based image captioning extension for pytorch/fairseq
Language:Python31456
Tencent/TurboTransformers
A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc.) on CPU and GPU.
Language:C++ 1.5k 198
yunjey/pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Language:Python 30.3k 8.1k
rakshithShetty/py-faster-rcnn-featextract
This is a clone of py faster rcnn -- With additional scripts to extract features for image/video captioning
Language:Python81
peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Language:Jupyter Notebook 1.4k 378
orionw/RedditHumorDetection
Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"
Language:Python7615