imperialite
Working on NLP/NLG for Education. Doctoral Researcher at UKRI CDT in Accountable, Responsible, and Transparent AI at the University of Bath, UK.
University of BathUnited Kingdom
Pinned Repositories
ara-close-lang
Code for Automatic Readability Assessment for Closely Related Philippine Languages (ACL2023)
BasahaCorpus-HierarchicalCrosslingualARA
This repository contains the code and data for BasahaCorpus paper accepted for EMNLP 2023 (Main).
BERT-Embeddings-For-ARA
cebuano-readability
cmcl2022-unified-eye-tracking-ipa
filipino-linguistic-extractors
This repository contains Python scripts for extracting linguistic features from Filipino texts.
FilWordNetExtractor
This project contains a Python notebook for extracting sense from the FilWordNet by Borra et. al.
nlp-research-primer-ph
This repository contains the main primer file for kickstarting NLP research intended for a Filipino student's use. The primer contains short discussions on basic NLP processes, example published NLP papers by Filipino students and researchers, open-source codes and repositories, and links to online tools.
Philippine-Languages-Online-Corpora
This repository contains the Philippine Languages Online Corpora (PLOC)
uniform-complexity-textgen
imperialite's Repositories
imperialite/Philippine-Languages-Online-Corpora
This repository contains the Philippine Languages Online Corpora (PLOC)
imperialite/uniform-complexity-textgen
imperialite/BasahaCorpus-HierarchicalCrosslingualARA
This repository contains the code and data for BasahaCorpus paper accepted for EMNLP 2023 (Main).
imperialite/cebuano-readability
imperialite/cmcl2022-unified-eye-tracking-ipa
imperialite/filipino-tiktok-hatespeech
A dataset containing hate speech in text form transcribed from Filipino Tiktok videos related to politics.
imperialite/getting-started-with-the-twitter-api-v2-for-academic-research
A course on getting started with the Twitter API v2 for academic research
imperialite/readability-standard-alignment
Code and data repository for Readability Standard Alignment paper by Joseph Imperial and Harish Tayyar Madabushi at GEM 2023.
imperialite/ara-close-lang
Code for Automatic Readability Assessment for Closely Related Philippine Languages (ACL2023)
imperialite/ACL2023-Retrieval-LM.github.io
https://acl2023-retrieval-lm.github.io/
imperialite/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
imperialite/CEFR-SP
Repository for CEFR-SP corpus and sentence level assessment
imperialite/definition-complexity
imperialite/drawio-diagrams
imperialite/egyptians-in-ai
A website dedicated to showcasing the profiles of prominent Egyptian researchers in the field of AI.
imperialite/evaluation
Code and Data for Evaluation WG
imperialite/gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
imperialite/imperialite
My personal repository
imperialite/llama
Inference code for LLaMA models
imperialite/mteb
MTEB: Massive Text Embedding Benchmark
imperialite/nerfies.github.io
imperialite/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
imperialite/Scweet
A simple and unlimited twitter scraper : scape tweets, likes, retweets, following, followers, user info, images...
imperialite/seacrowd-datahub
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
imperialite/sgnlp
Machine learning models from Singapore's NLP research community
imperialite/specialex
imperialite/standardize
This repository contains the code, data, and website assets for the Standardize paper.
imperialite/standardize-ctg
Code for Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation (EMNLP 2024)
imperialite/StoryPlot-RewardShaping
Code from the IJCAI 2019 paper "Controllable Neural Story Plot Generation via Reward Shaping"
imperialite/TSAR-2022-Shared-Task
TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts