npedrazzini
Turing Research Fellow @ The Alan Turing Institute
The Alan Turing Institute (@alan-turing-institute), University of Oxford
Pinned Repositories
intro-text-mining
Material for a practical lecture on Intro to Text Mining for the Humanities (with Python)
dhoxss-text2tech
Materials for the Text to Tech workshop at the Digital Humanities Oxford Summer School
DiachronicEmb-BigHistData
Tools to train and explore diachronic word embeddings from Big Historical Data
ancientgreek-syntactic-embeddings
Ancient Greek Syntactic Word Embeddings
averageReducedFrequency
R script to calculate the Average Reduced Frequency (ARF) of all words in a corpus
OldSlavNet
Bi-LSTM Parser for Early Slavic
oxford-text-mining
Materials for Introduction to Text Mining (MSc Digital Scholarship, University of Oxford)
parallelbibles
Word-alignment models for Bible translations in 100+ historical and contemporary languages
PreModernSlavic-NLP
Mixed drafts, scripts or data useful for NLP tasks on Pre-Modern Slavic
subsample_news
npedrazzini's Repositories
npedrazzini/word2vec-tutorial
npedrazzini/academic
Jekyll theme with a focus on simplicity, typography and flexibility
npedrazzini/ADA-DHOxSS
Teaching materials for the Applied Data Analysis course at DHOxSS. Data science methods to analyse humanities data.
npedrazzini/best-practices-for-coding-in-dh
Turing RSE-DH Summer School practical
npedrazzini/DataPapersAnalysis
Scripts to scrape the JOHD and RDJHSS websites for metrics on data papers and their corresponding datasets, and to carry out analyses on them.
npedrazzini/DeezyMatch
A Flexible Deep Learning Approach to Fuzzy String Matching
npedrazzini/DH-RSE-Summer-School
npedrazzini/gramtypix
npedrazzini/histLM
Neural Language Models for Historical Research
npedrazzini/KERMIT
🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings
npedrazzini/lancaster-newspaper-workshop
npedrazzini/lstmtagger
npedrazzini/MapReader
A computer vision pipeline for exploring and analyzing images at scale
npedrazzini/my-first-binder
npedrazzini/nilo-cultural-web
Blog for my students at 7AAVDM14 The Cultural Web: Building a Humanities Website (King's College London 2021-2022)
npedrazzini/node2vec
npedrazzini/npedrazzini.github.io
npedrazzini/OCSharmonizeOES
Python script to harmonize Church Slavonic and Old East Slavic (Old Russian) orthographic variants
npedrazzini/spacy-lookups-data
📂 Additional lookup tables and data resources for spaCy
npedrazzini/spec
Test spec
npedrazzini/subsamplr
A tool for representative subsampling
npedrazzini/test_to_delete
npedrazzini/text
Data loaders and abstractions for text and NLP
npedrazzini/zooniverse-analysis-workshop