JamesMTucker
A polymath with interests in AI, ML, LLMs, NLP, Data Science, Linguistics, Neuroscience, Mathematics, and Statistics. Hobbies: 🎾📚🥃🎮 Favorite Season: 🍂
William and Mary, Williamsburg, VA
Pinned Repositories
AskAttia
This repo contains machine learning code that analyzes Peter Attia's public-facing scientific education materials
AskHuberman
This repo contains machine learning code that analyzes Andrew Huberman's public-facing scientific education materials
awesome-hebrew-nlp
📖 A curated list of resources for NLP (Natural Language Processing) for Hebrew
course-nlp
A Code-First Introduction to NLP course
DATA_340_NLP
Natural Language Processing Course Syllabus for DATA 340
dotfiles
Default dotfiles and configuration options to set up a new machine
jamesmtucker.github.io
PAM
JamesMTucker's Repositories
JamesMTucker/DATA_340_NLP
Natural Language Processing Course Syllabus for DATA 340
JamesMTucker/PAM
JamesMTucker/jamesmtucker.github.io
JamesMTucker/AskAttia
This repo contains machine learning code that analyzes Peter Attia's public-facing scientific education materials
JamesMTucker/AskHuberman
This repo contains machine learning code that analyzes Andrew Huberman's public-facing scientific education materials
JamesMTucker/awesome-hebrew-nlp
📖 A curated list of resources for NLP (Natural Language Processing) for Hebrew
JamesMTucker/course-nlp
A Code-First Introduction to NLP course
JamesMTucker/dotfiles
Default dotfiles and configuration options to set up a new machine
JamesMTucker/english_words
📝 A text file containing 479k English words for all your dictionary/word-based projects, e.g., auto-completion and auto-suggestion
JamesMTucker/huggingface_notebooks
Notebooks using the Hugging Face libraries 🤗
JamesMTucker/math
🧮 Path to a free self-taught education in Mathematics!
JamesMTucker/geoBoundaries
geoBoundaries: A Political Administrative Boundaries Dataset (www.geoboundaries.org)
JamesMTucker/NetCare
JamesMTucker/NSF_Grants
Data and code to analyze National Science Foundation Grants
JamesMTucker/ORCID
Parsers for the ORCID data
JamesMTucker/PDF_to_Text
An API to convert PDF to TXT source files
JamesMTucker/plagiarism_publisher
A suite of AI tools to detect plagiarism and direct researchers to proper sources for attribution
JamesMTucker/Preprints
Scrape the preprints archive and preprocess articles
JamesMTucker/pubmed-rct-expanded
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
JamesMTucker/Samba
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
JamesMTucker/SemanticClustering
JamesMTucker/tabula
Tabula is a tool for liberating data tables trapped inside PDF files
JamesMTucker/TopicExplorer
NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)
JamesMTucker/user-dotbrev
Setup script for each Brev user. Used to add custom user settings in each new project (ex. terminal profiles, vscode settings).
JamesMTucker/Word2Vec
Notes on the Word2Vec CBOW and skip-gram (SG) algorithms