kspicer80
College Professor in the Humanities; caught the Digital Humanities bug a while ago and haven't looked back since ...
kspicer80's Stars
zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
jgm/pandoc
Universal markup converter
p0deje/Maccy
Lightweight clipboard manager for macOS
Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
gruns/icecream
🍦 Never use print() to debug again.
drivendataorg/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
NicolasHug/Surprise
A Python scikit for building and analyzing recommender systems
opentoonz/opentoonz
OpenToonz - An open-source full-featured 2D animation creation software
camelot-dev/camelot
A Python library to extract tabular data from PDFs
microsoft/live-share
Real-time collaborative development from the comfort of your favorite tools
approximatelabs/sketch
AI code-writing assistant that understands data content
Paxa/postbird
Open source PostgreSQL GUI client for macOS, Linux and Windows
HendrikStrobelt/detecting-fake-text
Giant Language Model Test Room
mokeyish/obsidian-enhancing-export
This is an enhancing export plugin base on Pandoc for Obsidian (https://obsidian.md/ ). It's allow you to export to formats like Markdown、Markdown (Hugo https://gohugo.io/ )、Html、docx、Latex etc.
ozntel/obsidian-link-converter
Obsidian Plugin to scan all your links in your vault and convert them to your desired format.
jgm/citeproc
CSL citation processing library in Haskell
jez/pandoc-sidenote
Convert Pandoc Markdown-style footnotes into sidenotes
data-is-plural/newsletter-archive
Markdown archive & RSS/Atom feeds for Data Is Plural
Priya22/project-dialogism-novel-corpus
The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.
IBM/RADAR
Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vicuna and LLaMA. The results show that RADAR can attain good detection performance on LLM-generated AI-text while being robust against paraphrasing.
Codecademy-Curriculum/Learn-Tableau-for-Data-Viz
Repository for relevant datasets.
lucharo/raceplotly
High level package to make a chart bar plot using plotly.
pandoc-ext/info
General info on pandoc extensions
DHRI-Curriculum/databases
@DHRI-Curriculum Session on databases, including concepts such as structured data, SQL, and exploring data.
Codecademy-Curriculum/Data-Engineering-Career-Path-Portfolio-Projects
Repository containing example solutions for the Data Engineering Career Path Portfolio Projects
rfeers/Medium
Repository to post all codes attached to my stories in Medium. Feel free to use whatever you need or contact me for further questions!! :D
Codecademy-Curriculum/Data-Science-Project-Solutions
relikd/md2tufte
A custom markdown to html + pdf compiler; based on Tufte CSS.