Pinned Repositories
COCA-WordFrequency
COCA, Top 5000 Word Frequency List
h-test
LAMA-Music-Genre-Dataset
.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, Waveforms) from Latin America, Asia, MiddleEast, and Africa
lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
prompt-learning-readability
[EACL 2023] use text-to-text models (BART, T5) for readability assessment
pushingonreadability_traditional_ML
wiki-text-summarizer-keyword-extractor
Uses Beautiful Soup to read Wiki pages, Gensim to summarize, NLTK to process, and extracts keywords based on entropy: everything in one beautiful code. A simple but effective solution to extractive text summarization.
nutcracker
Large Model Evaluation Experiments
nutcracker-db
brucewlee's Repositories
brucewlee/lingfeat
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.
brucewlee/pushingonreadability_traditional_ML
brucewlee/prompt-learning-readability
[EACL 2023] use text-to-text models (BART, T5) for readability assessment
brucewlee/textreader
Readability Formulas and Reading Time Statistics
brucewlee/h-test
brucewlee/nlpPandas
Basic preprocessing for NLP datasets in Pandas dataframe.
brucewlee/conference_cheatsheet
publication-related stuff, largely for myself
brucewlee/accuracy
There are more than one way to skin a cat
brucewlee/brucewlee
brucewlee/brucewlee.github.io
brucewlee/DiscSense
Automated Semantic Analysis of Discourse Markers
brucewlee/GeoDataViz-Toolkit
The GeoDataViz Toolkit is a set of resources that will help you communicate your data effectively through the design of compelling visuals. In this repository we are sharing resources, assets and other useful links.
brucewlee/HaluScan
brucewlee/moral-value-bias
brucewlee/neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks
brucewlee/nltk
NLTK Source
brucewlee/oldweb
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
brucewlee/pdtb3
Preprocessing code and BERT/XLNet baselines for PDTB 2.0 and 3.0
brucewlee/readability-transformers
brucewlee/research-exercises
brucewlee/sampl
brucewlee/sample
brucewlee/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
brucewlee/streamlit-example
Example Streamlit app that you can fork to test out share.streamlit.io
brucewlee/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks
brucewlee/TRUNAJOD2.0
Este repositorio es para mantener el código de TRUNAJOD2.0
brucewlee/tutorial-readthedocs
brucewlee/values
brucewlee/website-academic
Source code for my personal website https://mutschler.eu