Pinned Repositories
COCA-WordFrequency
COCA, Top 5000 Word Frequency List
h-test
[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language
lftk
[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
nutcracker
Large Model Evaluation Experiments
nutcracker-db
prompt-learning-readability
[EACL 2023] Uses text-to-text models (BART, T5) for readability assessment
pushingonreadability_traditional_ML
wiki-text-summarizer-keyword-extractor
Uses Beautiful Soup to read Wiki pages, Gensim to summarize, and NLTK to process text, then extracts keywords based on entropy, all in one script. A simple but effective solution to extractive text summarization.
activation-steering
General-purpose activation steering library
brucewlee's Repositories
brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
brucewlee/nutcracker
Large Model Evaluation Experiments
brucewlee/nutcracker-db
brucewlee/prompt-learning-readability
[EACL 2023] Uses text-to-text models (BART, T5) for readability assessment
brucewlee/nlpPandas
Basic preprocessing for NLP datasets in Pandas dataframe.
brucewlee/h-test
[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language
brucewlee/textreader
Readability Formulas and Reading Time Statistics
brucewlee/conference_cheatsheet
publication-related stuff, largely for myself
brucewlee/moral-value-bias
brucewlee/accuracy
There is more than one way to skin a cat
brucewlee/activation-steering-
General-purpose activation steering library
brucewlee/ARENA_3.0
brucewlee/brucewlee
brucewlee/brucewlee.github.io
brucewlee/conditional-activation-steering
brucewlee/DiscSense
Automated Semantic Analysis of Discourse Markers
brucewlee/easy-lens
brucewlee/HaluScan
brucewlee/mathematics-of-ml
brucewlee/nltk
NLTK Source
brucewlee/pdtb3
Preprocessing code and BERT/XLNet baselines for PDTB 2.0 and 3.0
brucewlee/research-exercises
brucewlee/sampl
brucewlee/sample
brucewlee/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
brucewlee/streamlit-example
Example Streamlit app that you can fork to test out share.streamlit.io
brucewlee/transformers-tutorials
GitHub repo with tutorials to fine-tune transformers for different NLP tasks
brucewlee/tutorial-readthedocs
brucewlee/values