Pinned Repositories
acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
ahocorapy
Pure python Aho-Corasick library.
ai-hackathon
네이버 AI Hackathon_AI Vision!
alexa-with-dstc9-track1-dataset
DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access
allennlp-reading-comprehension
Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
awesome-nlg
A curated list of resources dedicated to Natural Language Generation (NLG)
awesome-phd-advice
Collection of advice for prospective and current PhD students
baby-jarvis
jongwon-jay-lee's Repositories
jongwon-jay-lee/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
jongwon-jay-lee/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
jongwon-jay-lee/bert
TensorFlow code and pre-trained models for BERT
jongwon-jay-lee/bigcode-dataset
jongwon-jay-lee/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
jongwon-jay-lee/c4-dataset-script
Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.
jongwon-jay-lee/data-engineering
jongwon-jay-lee/deduplicate-text-datasets
jongwon-jay-lee/DeepLearningExamples
Deep Learning Examples
jongwon-jay-lee/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
jongwon-jay-lee/dps
Data processing system for polyglot
jongwon-jay-lee/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
jongwon-jay-lee/finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
jongwon-jay-lee/google-research
Google Research
jongwon-jay-lee/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
jongwon-jay-lee/kenlm
KenLM: Faster and Smaller Language Model Queries
jongwon-jay-lee/KoAlpaca
KoAlpaca: Korean Alpaca Model based on Stanford Alpaca (feat. LLAMA and Polyglot-ko)
jongwon-jay-lee/KoBERTScore
BERTScore for Korean
jongwon-jay-lee/llama
Inference code for LLaMA models
jongwon-jay-lee/LLMs-Planning
An extensible benchmark for evaluating large language models on planning
jongwon-jay-lee/llmss
LLM simple serving (tensor model parallel, pubsub, grpc)
jongwon-jay-lee/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
jongwon-jay-lee/Megatron-LM
Ongoing research training transformer models at scale
jongwon-jay-lee/openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial
jongwon-jay-lee/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
jongwon-jay-lee/text-dedup
All-in-one text de-duplication
jongwon-jay-lee/text-generation-inference
Large Language Model Text Generation Inference
jongwon-jay-lee/tppys
Text processing by pyspark (just sample project)
jongwon-jay-lee/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
jongwon-jay-lee/triton-server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.