Pinned Repositories
deep-knn
Code for the 2018 EMNLP Interpretability Workshop Paper "Interpreting Neural Networks with Nearest Neighbors"
es-korean
exobrain-api
Exobrain API
Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
grokking-pytorch
The Hitchiker's Guide to PyTorch
verb-attributes
Contains code for the EMNLP paper `Learning Linguistic Attributes for Zero-Shot Verb Classification'
Wikidata_relation_extractor
Code to extract KB triples from given contexts using Wikidata API
kyoungrok0517's Repositories
kyoungrok0517/Wikidata_relation_extractor
Code to extract KB triples from given contexts using Wikidata API
kyoungrok0517/ARENA_3.0
kyoungrok0517/art
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
kyoungrok0517/bashidioms-examples
Example code from O'Reilly's bash Idioms
kyoungrok0517/BERT-QPP
BERT-QPP: Contextualized Pre-trained transformers for Query Performance Prediction
kyoungrok0517/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
kyoungrok0517/big-ann-benchmarks
Framework for evaluating ANNS algorithms on billion scale datasets.
kyoungrok0517/charset_normalizer
Truly universal encoding detector in pure Python
kyoungrok0517/CircuitsVis
Mechanistic Interpretability Visualizations using React
kyoungrok0517/dictionary_learning
kyoungrok0517/dropout
Code release for "Dropout Reduces Underfitting"
kyoungrok0517/einops
Deep learning operations reinvented (for pytorch, tensorflow, jax and others)
kyoungrok0517/entropix
Entropy Based Sampling and Parallel CoT Decoding
kyoungrok0517/gpt3-blog-title-optimizer
Python code for building a GPT-3 based technical blog post optimizer.
kyoungrok0517/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
kyoungrok0517/ingredients
kyoungrok0517/jailbreak_llms
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
kyoungrok0517/lightning-transformers
Flexible interface for high-performance research using SOTA Transformers leveraging Pytorch Lightning, Transformers, and Hydra.
kyoungrok0517/nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
kyoungrok0517/natural-questions
Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
kyoungrok0517/NC-BERT
BERT capable of numerical extrapolation over text (solving arithmetic problems over textual reasoning dataset)
kyoungrok0517/pmi-masking
This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper
kyoungrok0517/SAELens
Training Sparse Autoencoders on Language Models
kyoungrok0517/semantic-python-overview
(subjective) overview of projects which are related both to python and semantic technologies (RDF, OWL, Reasoning, ...)
kyoungrok0517/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
kyoungrok0517/stat_rethinking_2024
kyoungrok0517/Streamlit-Authenticator
A secure authentication module to validate user credentials in a Streamlit application.
kyoungrok0517/Style-Transformer-for-MSD
Expert-laymen style transfer model based on the Style Transformer (Dai. et al., 2019). This code is based on the MSD dataset (Cao et al., 2020), which is a task for style transfer between an expert-level language to a laymen language for the ease of communication.
kyoungrok0517/subspace-clustering
Toolbox for large scale subspace clustering
kyoungrok0517/the-art-of-command-line
Master the command line, in one page