Pinned Repositories
aws
Barebones AWS Setup
cc-pyspark
Process Common Crawl data with Python and Spark
common_crawl_index
Index URLs in Common Crawl
data-engineering-ecosystem
Repo to migrate old wiki to, esp for devs and code examples
dataplicity-agent
Dataplicity Agent
insight_coding_challange
Insight coding challenge - Instacart dataset
linkrun
LinkRun - Data Engineering project done in 3 weeks during the Insight fellowship
SequenceSearch
Sequence search and feature highlight
trendsci's Repositories
trendsci/linkrun
LinkRun - Data Engineering project done in 3 weeks during the Insight fellowship
trendsci/aws
Barebones AWS Setup
trendsci/data-engineering-ecosystem
Repo to migrate old wiki to, esp for devs and code examples
trendsci/cc-pyspark
Process Common Crawl data with Python and Spark
trendsci/common_crawl_index
Index URLs in Common Crawl
trendsci/dataplicity-agent
Dataplicity Agent
trendsci/insight_coding_challange
Insight coding challenge - Instacart dataset
trendsci/SequenceSearch
Sequence search and feature highlight
trendsci/Insight_DE_GUS
trendsci/pegasus
VM based deployment for prototyping Big Data tools on Amazon Web Services
trendsci/tldextract
Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List.