Pinned Repositories
azure-table-extractor
DocumentCloud Add-On that uses Azure Document Intelligence to extract tables from documents
bulk-add-to-project-add-on
DocumentCloud Add-On that adds a set of queried documents to a project.
bulk-delete-annotations
Runs through a query or selection of documents and deletes all annotations for the documents.
bulk-delete-documents
DocumentCloud Add-On to mass delete comments from a query
bulk-delete-tags
DocumentCloud Add-On to bulk remove specific tag or key/value pairs from documents.
bulk-reprocress-addon
DocumentCloud Add-On that allows you to bulk re-process documents
Bulk-Tag-AddOn
A DocumentCloud Add-On that allows you to add tags and/or key value pairs to more than 25 documents at a time.
change-note-visibility
DocumentCloud Add-On that changes the access level (public, private, organization) of all notes on documents in a selection or query.
clouddl
Python library to download Google Drive & Dropbox content.
Klaxon
This repository contains a DocumentCloud Add-On that replicates the behavior of Klaxon, which allows you to monitor web pages for changes on sections of the site that might be newsworthy.
duckduckgrayduck's Repositories
duckduckgrayduck/Klaxon
This repository contains a DocumentCloud Add-On that replicates the behavior of Klaxon, which allows you to monitor web pages for changes on sections of the site that might be newsworthy.
duckduckgrayduck/clouddl
Python library to download Google Drive & Dropbox content.
duckduckgrayduck/azure-table-extractor
DocumentCloud Add-On that uses Azure Document Intelligence to extract tables from documents
duckduckgrayduck/cloud-upload-addon
DocumentCloud Add-On that allows users to import documents from Google Drive and Dropbox.
duckduckgrayduck/doccloud-uploads-graph
duckduckgrayduck/documentcloud-cloud-vision-ocr
OCR Add-On that uses Google Cloud Vision API to OCR a document.
duckduckgrayduck/documentcloud-frontend
DocumentCloud's front end source code - Please report bugs, issues and feature requests to info@documentcloud.org
duckduckgrayduck/documentcloud-hello-world-addon
duckduckgrayduck/documentcloud-scraper-addon
duckduckgrayduck/documentcloud-sentiment-analysis-addon
duckduckgrayduck/documentcloud-whisper-addon
DocumentCloud Add-On that uses OpenAI's Whisper library to transcribe vidoes and upload the transcription to DocumentCloud
duckduckgrayduck/duplicate-remover
DocumentCloud Add-On that removes duplicate documents identified by hash value.
duckduckgrayduck/ea-pdf
Code for processing and archiving emails
duckduckgrayduck/email-archiver-addon
duckduckgrayduck/Empty-Page-Deletion
DocumentCloud Add-On that detects empty pages in a document and optionally deletes them.
duckduckgrayduck/fomb-scraper
duckduckgrayduck/google-drive-api-script
A simple Google Drive API Script that iterates through a list of Google Drive links stored in a txt file and downloads them using the API
duckduckgrayduck/googledrivedl
Google Drive Download Python Script
duckduckgrayduck/gpt4-vision-addon
DocumentCloud Add-On that uses GPT-4 Vision to pull tabular data from documents in CSV or JSON format
duckduckgrayduck/kroll-scraper
duckduckgrayduck/metadata-extractor-addon
Add-On that uses exiftool to extract PDF metadata and stores it as key/value pairs on DocumentCloud
duckduckgrayduck/ocr-scheduler
duckduckgrayduck/OCR-Tagger
DocumentCloud Add-On that tags documents with the OCR engine used on them, if any
duckduckgrayduck/page-count
DocumentCloud Add-On that coutns the number of pages in a query or selection of documents.
duckduckgrayduck/pdf-splitter-add-on
DocumentCloud Add-On that splits a DocumentCloud document on a designated page and creates two new documents
duckduckgrayduck/pii-detector-add-on
duckduckgrayduck/python-course-2023
Source code & materials for a Python course taught in Fall 2023.
duckduckgrayduck/python-documentcloud
A simple Python wrapper for the DocumentCloud API
duckduckgrayduck/schedulable-gpt-35
duckduckgrayduck/textract-table-extractor-add-on
DocumentCloud Add-On that uses Amazon Textract to extract tables from documents