Fluquid

We empower you to make data-driven decisions using Machine Learning, Analytics and Python Coding

Cork, Ireland

Pinned Repositories

browser_fingerprint
Language: Python 2 2 01
builtwith
fork of https://bitbucket.org/richardpenman/builtwith
Language: Python 0 2 00
cryptocurrency
altcoin market and project analyses
3 2 01
extract-social-media
Extract social media links and account names from websites.
Language: Python 36 6 416
find_job_titles
find any kind of occupation or job title in a text or file
Language: Python 82 4 728
html-to-etree
convenience method for parsing html to lxml elementtree using sane character decoding
Language: Python 0 2 00
ilen-tech
Language: HTML 0 2 00
kafka-docker
Dockerfile for Apache Kafka
Language: Shell 0 2 00
sde
Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignment (DEPTA) method. (UPDATE: I implemented a newer algorithm: https://github.com/seagatesoft/webdext)
Language: Java 0 2 00
yandex-search
Search library for yandex.ru search engine.
Language: Python 15 2 25

Fluquid's Repositories

fluquid/find_job_titles
find any kind of occupation or job title in a text or file
Language: Python 82 4 728
fluquid/extract-social-media
Extract social media links and account names from websites.
Language: Python 36 6 416
fluquid/yandex-search
Search library for yandex.ru search engine.
Language: Python 15 2 25
fluquid/cryptocurrency
altcoin market and project analyses
3 2 01
fluquid/browser_fingerprint
Language: Python 2 2 01
fluquid/builtwith
fork of https://bitbucket.org/richardpenman/builtwith
Language: Python 0 2 00
fluquid/capcoin
Gets data from coincap.io into the CLI
Language: JavaScript 0 2 00
fluquid/cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Language: Python 0 2 00
fluquid/cookiecutter-pypackage
Cookiecutter template for a Python package.
Language: Python 0 2 00
fluquid/dragnet
Just the facts -- web page content extraction
Language: Python 0 2 00
fluquid/email-audit
Audit which email spam bots can collect from your sites.
Language: Python 0 3 01
fluquid/html-to-etree
convenience method for parsing html to lxml elementtree using sane character decoding
Language: Python 0 2 00
fluquid/ilen-tech
Language: HTML 0 2 00
fluquid/kafka-docker
Dockerfile for Apache Kafka
Language: Shell 0 2 00
fluquid/sde
Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignment (DEPTA) method. (UPDATE: I implemented a newer algorithm: https://github.com/seagatesoft/webdext)
Language: Java 0 2 00
fluquid/cookiecutter-pypackage-minimal
A minimal template for python packages
Language: Python 2 0
fluquid/cookiecutter-scrapycloud
A bare minimum Scrapy project template ready for Scrapinghub's Scrapy Cloud service.
Language: Python 2 0
fluquid/fluquid-lib
utility library
Language: Python 2 0
fluquid/githubarchive.org
GitHub Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis.
Language: Ruby 2 0
fluquid/html-text
Extract text from HTML
Language: Python 2 01
