Pinned Repositories
cogcomp-nlp
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
PyMarlin
Lightweight Deep Learning Model Training library based on PyTorch
adapter-transformers
Huggingface Transformers + Adapters = ❤️
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
Annotated-WikiExtractor
Simple Wikipedia plain text extractor with article link annotations and Hadoop support.
appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL 2024
DialoGPT
Large-scale pretraining for dialogue
Docker-Containers
Joint-NER-RelEx-Coref
Code for Joint Modeling of NER, Relation Extraction and Coreference Resolution using Constrained Conditional Models
shatu's Repositories
shatu/adapter-transformers
Huggingface Transformers + Adapters = ❤️
shatu/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
shatu/alpaca-lora
Instruct-tune LLaMA on consumer hardware
shatu/appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL 2024
shatu/DialoGPT
Large-scale pretraining for dialogue
shatu/Docker-Containers
shatu/awesome-system-design-resources
This repository contains system design resources useful for interview preparation and for learning distributed systems.
shatu/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
shatu/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
shatu/Generating_Text_Summary_With_GPT2
A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training.
shatu/gorilla
Gorilla: An API store for LLMs
shatu/langchain
⚡ Building applications with LLMs through composability ⚡
shatu/NeuralDialog-CVAE
TensorFlow implementation of Knowledge-Guided CVAE for dialog generation (ACL 2017). Released by Tiancheng Zhao (Tony) from the Dialog Research Center, LTI, CMU.
shatu/OpenHands
🙌 OpenHands: Code Less, Make More
shatu/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
shatu/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
shatu/PyMarlin
Lightweight Deep Learning Model Training library based on PyTorch
shatu/pytorch-pretrained-BERT
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer-XL.
shatu/reasoning-on-cots
shatu/SelfEval-Guided-Decoding
shatu/shatu.github.io
Code for the personal website
shatu/SimCSE
SimCSE: Simple Contrastive Learning of Sentence Embeddings (EMNLP 2021). https://arxiv.org/abs/2104.08821
shatu/SpaceFusion
An implementation for the SpaceFusion model, https://arxiv.org/abs/1902.11205
shatu/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
shatu/t5x
shatu/TheoremQA
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
shatu/ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
shatu/tree-of-thought-llm
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
shatu/unify-parameter-efficient-tuning
Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)
shatu/xLAM