Pinned Repositories
babylm.github.io
deep_exploration_with_E_network
for eecs 598 class final project
DORA
Supplementary code for "DORA The Explorer: Directed Outreaching Reinforcement Action-Selection"
EoE
Automatic Metric Validation for Grammatical Error Correction
GEC_UD_divergences
ordert
Code for The Grammar-Learning Trajectories of Neural Language Models
tutEval
Tutorial on LLM Evaluation in LREC
USim
monolingual sentence similarity measure
comsum
model-recycling
Ranking of fine-tuned HF models as base models.
borgr's Repositories
borgr/GEC_UD_divergences
borgr/ordert
Code for The Grammar-Learning Trajectories of Neural Language Models
borgr/tutEval
Tutorial on LLM Evaluation in LREC
borgr/deep_exploration_with_E_network
for eecs 598 class final project
borgr/EoE
Automatic Metric Validation for Grammatical Error Correction
borgr/paper_updated
How to get updated with all the new papers? List of ways
borgr/auto_challenge_sets
Automatically Extracting Challenge Sets for Non-local Phenomena in Neural Machine Translation git repo
borgr/nematus
Open-Source Neural Machine Translation in Tensorflow
borgr/autograd-hacks
borgr/awesome-early-exiting
A curated list of Early Exiting papers, benchmarks, and misc.
borgr/blimp_ngram
borgr/ColPret
Efficient Scaling laws and collaborative pretraining.
borgr/datablations
Scaling Data-Constrained Language Models
borgr/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
borgr/eval-arena
borgr/ewok-paper
Elements of World Knowledge! This repository houses data and code needed to replicate our first paper and the EWoK-core-1.0 dataset
borgr/GEC_BOTHER
borgr/just-the-docs
A modern, high customizable, responsive Jekyll theme for documention with built-in search.
borgr/low-resource-text-classification-framework
Research framework for low resource text classification that allows the user to experiment with classification models and active learning strategies on a large number of sentence classification datasets, and to simulate real-world scenarios. The framework is easily expandable to new classification models, active learning strategies and datasets.
borgr/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
borgr/post
Postdoc tips a post doctoral guide to how to look for a good post
borgr/pythiarch
The hub for EleutherAI's work on interpretability and learning dynamics
borgr/q-squared
borgr/silo-lm
SILO Language Models code repository
borgr/sleuth
An open source no-code system for text annotation and building text classifiers
borgr/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
borgr/training_trajectory_analysis
ACL 2023: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
borgr/tweeteval
Repository for TweetEval
borgr/wanli
code associated with WANLI dataset in Liu et al., 2022
borgr/zipnn
A lossless and near-lossless compression method optimized for numbers/tensors in the Foundation Models environment