Pinned Repositories
azure-table-extractor
DocumentCloud Add-On that uses Azure Document Intelligence to extract tables from documents
bulk-add-to-project-add-on
DocumentCloud Add-On that adds a set of queried documents to a project.
bulk-delete-annotations
Runs through a query or selection of documents and deletes all annotations for the documents.
bulk-delete-documents
DocumentCloud Add-On to mass delete comments from a query
bulk-delete-tags
DocumentCloud Add-On to bulk remove specific tag or key/value pairs from documents.
bulk-reprocress-addon
DocumentCloud Add-On that allows you to bulk re-process documents
Bulk-Tag-AddOn
A DocumentCloud Add-On that allows you to add tags and/or key value pairs to more than 25 documents at a time.
change-note-visibility
DocumentCloud Add-On that changes the access level (public, private, organization) of all notes on documents in a selection or query.
clouddl
Python library to download Google Drive & Dropbox content.
Klaxon
This repository contains a DocumentCloud Add-On that replicates the behavior of Klaxon, which allows you to monitor web pages for changes on sections of the site that might be newsworthy.
duckduckgrayduck's Repositories
duckduckgrayduck/bulk-add-to-project-add-on
DocumentCloud Add-On that adds a set of queried documents to a project.
duckduckgrayduck/bulk-delete-annotations
Runs through a query or selection of documents and deletes all annotations for the documents.
duckduckgrayduck/bulk-delete-documents
DocumentCloud Add-On to mass delete comments from a query
duckduckgrayduck/bulk-delete-tags
DocumentCloud Add-On to bulk remove specific tag or key/value pairs from documents.
duckduckgrayduck/bulk-reprocress-addon
DocumentCloud Add-On that allows you to bulk re-process documents
duckduckgrayduck/Bulk-Tag-AddOn
A DocumentCloud Add-On that allows you to add tags and/or key value pairs to more than 25 documents at a time.
duckduckgrayduck/change-note-visibility
DocumentCloud Add-On that changes the access level (public, private, organization) of all notes on documents in a selection or query.
duckduckgrayduck/clear-failed-uploads
Add-On that runs through a sets of documents you own on DocumentCloud and deletes the documents with errors.
duckduckgrayduck/compress-pdf-add-on
Given a public Google Drive or Dropbox link to a file or set of files, it will download the file(s), attempt to compress each file, and upload the document(s) to DocumentCloud if the resulting compressed file <500MB
duckduckgrayduck/convert-email-add-on
duckduckgrayduck/dc_batch_upload
Upload large amounts of documents to DocumentCloud
duckduckgrayduck/doccloud-n-gram-addon
duckduckgrayduck/doctr-ocr-add-on
DocumentCloud Add-On that uses the docTR OCR system
duckduckgrayduck/document-hash-add-on
DocumentCloud Add-On that calculates the SHA-1 algorithm of a document and adds it as key/value pair on the document.
duckduckgrayduck/document-rotator-addon
DocumentCloud Add-On that allows you to detect pages that need to be rotated in a document and auto-rotate them automatically.
duckduckgrayduck/documentcloud
DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org
duckduckgrayduck/documentcloud-azure-document-intelligence-ocr-addon
DocumentCloud Add-On that uses Azure's Document Intelligence API to OCR documents
duckduckgrayduck/documentcloud-custom-metadata-scraper-addon
The custom metadata output addon for documentcloud
duckduckgrayduck/documentcloud-gpt4-addon
An experiment with using GPT-3 to help analyze, categorize and structure documents.
duckduckgrayduck/documentcloud-legal-citation-identifcation-addon
duckduckgrayduck/documentcloud-metadata-grabber
duckduckgrayduck/documentcloud-multiple-regex-pattern-addon
duckduckgrayduck/documentcloud-regex-addon
duckduckgrayduck/google-translate-addon
duckduckgrayduck/muckrock
MuckRock's source code - Please report bugs, issues and feature requests to info@muckrock.com
duckduckgrayduck/pdf-auto-rotator
Experimental Python script that uses OpenCV, fitz, and numpy to calculate a skew offset for PDFs that have rotated pages and rotate them.
duckduckgrayduck/reflow-add-on
A DocumentCloud Add-On that uses K2pdfopt to optimize documents for mobile eReaders and smartphones
duckduckgrayduck/savepagenow
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
duckduckgrayduck/Site-Snapshot
DocumentCloud Add-On that uses pdfkit to take a snapshot of a site and upload the PDF to DocumentCloud
duckduckgrayduck/useful-universal-scraping-code