KennethEnevoldsen
Researcher, scholar, teacher, and developer. Python/R
Center for Humanities Computing Aarhus, Aarhus
Pinned Repositories
DaCy
DaCy: The state-of-the-art Danish NLP pipeline using spaCy
mteb
MTEB: Massive Text Embedding Benchmark
TextDescriptives
A Python library for calculating a large variety of metrics from text
asent
Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy.
augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
Exp-Meth-III-Tutorials
A series of tutorials for the course Experimental Methods III
pimp-my-github
A checklist for creating pleasing GitHub repos
scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
spacy-wrap
spaCy-wrap is a wrapper library for including fine-tuned transformers from Hugging Face in your spaCy pipeline, so that existing fine-tuned models can be used within your spaCy workflow.
tomsup
tomsup 👍 Theory of Mind Simulation using Python. A package that allows for easy agent-based modelling of recursive Theory of Mind
KennethEnevoldsen's Repositories
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
KennethEnevoldsen/asent
Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy.
KennethEnevoldsen/spacy-wrap
spaCy-wrap is a wrapper library for including fine-tuned transformers from Hugging Face in your spaCy pipeline, so that existing fine-tuned models can be used within your spaCy workflow.
KennethEnevoldsen/scandinavian-embedding-benchmark
A Scandinavian Benchmark for sentence embeddings
KennethEnevoldsen/swift-python-cookiecutter
A Python package template intended for low maintenance and quick package development.
KennethEnevoldsen/danske-sprogteknognologi-termer
This GitHub repository contains recommended Danish terms for language technology.
KennethEnevoldsen/dna2vec
KennethEnevoldsen/OpenJournal
A discussion forum for alternative approaches to scientific publishing.
KennethEnevoldsen/snp-transformer
KennethEnevoldsen/confection
🍬 Confection: the sweetest config system for Python
KennethEnevoldsen/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
KennethEnevoldsen/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
KennethEnevoldsen/dolma
Data and tools for generating and inspecting OLMo pre-training data.
KennethEnevoldsen/dummy_repo
KennethEnevoldsen/genda-lens
GenDa Lens: Python package for quantifying gender bias in Danish language models.
KennethEnevoldsen/genq_dansk
Repo for automatically generating a retrieval (question-text) benchmark dataset
KennethEnevoldsen/jury
Comprehensive NLP Evaluation System
KennethEnevoldsen/KennethEnevoldsen
Personal repository
KennethEnevoldsen/KennethLM
A test repository for testing out variations on generative language models
KennethEnevoldsen/LDAK
KennethEnevoldsen/MissingDataChallenge
Scripts for the missing data challenge 2023
KennethEnevoldsen/NLP-AU-23
Primary repository for the NLP course as part of the CogSci master's program at Aarhus University.
KennethEnevoldsen/NorQuAD
Norwegian question answering dataset
KennethEnevoldsen/ollama-r
R library to run Ollama language models
KennethEnevoldsen/permetrics
Artificial intelligence (AI, ML, DL) performance metrics implemented in Python
KennethEnevoldsen/scandeval.github.io
KennethEnevoldsen/scikit-llm
Seamlessly integrate LLMs into scikit-learn.
KennethEnevoldsen/snip
A utility package for handling single nucleotide polymorphism (SNP) data in Python
KennethEnevoldsen/spacy-curated-transformers
spaCy entry points for Curated Transformers
KennethEnevoldsen/spacy-lookups-data
📂 Additional lookup tables and data resources for spaCy