jsoma
baby data journo wrangler @ledeprogram + @littlecolumns, cat wrangler @cat-republic
Brooklyn BraineryBrooklyn, NY
Pinned Repositories
autoscraper-changes
Tiny template that uses GitHub Actions and Python to automatically scrape the BBC's homepage every four hours.
flatware
S3 caching of Google Spreadsheets for Tabletop.js
mapsheet
Easily create interactive maps from data in Google Spreadsheets. Supports Google Maps, Leaflet, Mapbox, MapQuest, etc.
NICAR20-geocoding-apis
Geocoding APIs repo for NICAR20 session
NICAR20-intro-d3
Base for an introductory walkthrough of D3's features for NICAR 2020
nicar23-building-ai-editors
Conceptual framework for building AI editors to help with journalism workflows
quarto-tutorial
Quarto framework tutorial for C+J/European Data Journalism Conf 2023
selenium-github-actions
spotify-langchain-gpt
Building Spotify playlists based on vibes using LangChain and GPT
tabletop
Tabletop.js gives spreadsheets legs
jsoma's Repositories
jsoma/nicar25-ai-images-video
jsoma/py-reform
A Python library for dewarping/straightening/reformatting document images and PDFs
jsoma/nicar25-ai-newsroom
jsoma/nicar25-passive-scraping
Passive scraping for social media (HAR and WACZ to the rescue)
jsoma/data-studio-projects-2024
jsoma/lede-2023-auto-scraper
jsoma/natural-pdf
A friendly library for working with PDFs
jsoma/scraping-ai-flavor
jsoma/2024-ds-dojo
jsoma/flask-energy
jsoma/flask-tutorial
jsoma/japanese-dialect-quiz
Japanese dialect quiz built using Claude artifacts
jsoma/nbcompare
Tool for comparison Jupyter notebooks. Great for simple plagiarism detection/code search in submitted work.
jsoma/pandas-question-factory
a question factory. for pandas.
jsoma/2024-birn
jsoma/2025-economist
jsoma/ai-tricks
jsoma/data-studio-projects-2025
jsoma/dataharvest25-ai-images-video
jsoma/dataharvest25-playwright-scraping
jsoma/el-pais-autoscraper
jsoma/el-pais-scraper
jsoma/flask-demo
jsoma/german-dialect-quiz
jsoma/har2data
HAR data extractor, pull out arrays from JSON API requests into a nice convenient CSV file
jsoma/important-classwork
jsoma/lede-2024-presentations
jsoma/scraper-sample
A sample automatic scraper that pulls in El Pais English headlines and updates a CSV
jsoma/state-warehouse-deals
jsoma/warc2data
WARC data extractor, pull out arrays from JSON API requests into a nice convenient CSV file