Pinned Repositories
AnaloBench
This project focus on curating a robust analogical reasoning dataset for research and development.
CLSP-grid-onboarding
Exercise to teach a newcomer to the CLSP grid to set up their environment and run jobs
Confidence-Estimation-TrustNLP2023
Repo for the "Strength in Numbers: Estimating Confidence of Large Language Models by Prompt Agreement" paper in TrustNLP 2023 by Wightman et al.
Cost-Effective-Experiment
Scripts and docs that help us run cost effective experiment with OpenAI APIs
CS-601-471-671-Sp24-Public
Repo of the programming homework for the course "CS 601.471/671 NLP: Self-supervised Models" - Spring 2024
NeoCoder
Official implementation of our paper "Benchmarking Language Model Creativity: A Case Study on Code Generation"
RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
rockfish-tutorial
Select-Topics-In-Multilingual-NLP
This is a wiki for keeping track of papers being presented during the Reading Group Select Topics in Multilingual Natural Language Processing
turking-bench
Web-grounded natural language instructions
JHU Center for Language and Speech Processing's Repositories
JHU-CLSP/RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
JHU-CLSP/turking-bench
Web-grounded natural language instructions
JHU-CLSP/CS-601-471-671-Sp24-Public
Repo of the programming homework for the course "CS 601.471/671 NLP: Self-supervised Models" - Spring 2024
JHU-CLSP/NeoCoder
Official implementation of our paper "Benchmarking Language Model Creativity: A Case Study on Code Generation"
JHU-CLSP/rockfish-tutorial
JHU-CLSP/Select-Topics-In-Multilingual-NLP
This is a wiki for keeping track of papers being presented during the Reading Group Select Topics in Multilingual Natural Language Processing
JHU-CLSP/Confidence-Estimation-TrustNLP2023
Repo for the "Strength in Numbers: Estimating Confidence of Large Language Models by Prompt Agreement" paper in TrustNLP 2023 by Wightman et al.
JHU-CLSP/Cost-Effective-Experiment
Scripts and docs that help us run cost effective experiment with OpenAI APIs
JHU-CLSP/AnaloBench
This project focus on curating a robust analogical reasoning dataset for research and development.
JHU-CLSP/civil-unrest-case-study
Code and data from "Study of Manifestation of Civil Unrest on Twitter" W-NUT @ EMNLP 2021
JHU-CLSP/CLSP-grid-onboarding
Exercise to teach a newcomer to the CLSP grid to set up their environment and run jobs
JHU-CLSP/gpt2-narrative-decoding
Code for "Decoding Methods for Neural Narrative Generation"
JHU-CLSP/JHU-CUT
Code, data, and models from "Civil Unrest on Twitter (CUT): A Dataset of Tweets to Support Research on Civil Unrest" EMNLP 2020 W-NUT
JHU-CLSP/Kreyol-MT
JHU-CLSP/according-to
JHU-CLSP/Bernice-Twitter-encoder
JHU-CLSP/carmen-wnut22-submission
JHU-CLSP/clsp-pubs
The code used for crawling CLSP faculty publication from Semantic Scholar
JHU-CLSP/Geo-Seq2seq-Twitter
Code for training and evaluating Geo-Seq2seq, a seq2seq approach to Twitter user geolocation. Published at ACL 2023.
JHU-CLSP/csci-601-771-self-supervised-models
JHU-CLSP/docker-http-api-example
Example code for creating an HTTP API for an ML model (or similar) in Docker
JHU-CLSP/slack_lm
Connect our internal LLM to Slack
JHU-CLSP/wikicite
JHU-CLSP/NELLIE
Repository for NELLIE: A Neuro-Symbolic Inference Engine for Grounded, Compositional, and Explainable Reasoning
JHU-CLSP/prompt-format-grounding