Pinned Repositories
eir-calculator
A program to calculate effective interest rate (EIR) from given present value and a series of cash flow.
HAWK
HTML is All We Know
kutut
A Twitter bot that will tweet incoming direct messages.
nangendi
Scrapy spiders to scrape location data.
pwcahyo
sde
Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignment (DEPTA) method. (UPDATE: I implemented a newer algorithm: https://github.com/seagatesoft/webdext)
simba
Sistem Informasi Manajemen Bantuan
webdext
Intelligent Web Data Extractor
webdext-dataset
Dataset to test Webdext.
seagatesoft's Repositories
seagatesoft/webdext
Intelligent Web Data Extractor
seagatesoft/pwcahyo
seagatesoft/webdext-dataset
Dataset to test Webdext.
seagatesoft/HAWK
HTML is All We Know
seagatesoft/sil2ah
Aplikasi berbasis web untuk membuat pohon keluarga.
seagatesoft/42-tips-sukses-kerja-remote
Source dari buku 42 Tips Sukses Kerja Remote
seagatesoft/aile
Automatic Item List Extraction
seagatesoft/artoo
artoo.js - the client-side scraping companion.
seagatesoft/awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing.
seagatesoft/data-terbuka-id
Data terbuka di Indonesia
seagatesoft/dateparser
python parser for human readable dates
seagatesoft/established-remote
A list of established remote companies
seagatesoft/fs2
File Structures 2 - Memory Mapped File Structures for Go
seagatesoft/incapsula-cracker
Use to bypass sites which use incapsula to block access to webscraping bots.
seagatesoft/jsgaf
Automatically exported from code.google.com/p/jsgaf
seagatesoft/machine-learning-programming-assignments-coursera-andrew-ng
Solutions to Andrew NG's machine learning course on Coursera
seagatesoft/pagelyzer
Suite of tools for detecting changes in web pages and their rendering
seagatesoft/public-amazon-crawler
seagatesoft/pyconid2020
Landing Page for Pycon ID 2020
seagatesoft/scrapy-hcf
Scrapy spider middleware to use Scrapinghub's Hub Crawl Frontier as a backend for URLs
seagatesoft/scrapy-workshop
Scrapy workshop code for PyCon APAC 2024
seagatesoft/scrapydemo
seagatesoft/seagatesoft.github.io
seagatesoft/sqrape
Simple Query Scraping with CSS and Go Reflection
seagatesoft/undercrawler
A generic crawler
seagatesoft/vips_java
Implementation of Vision Based Page Segmentation algorithm in Java
seagatesoft/webkit-crawler
Simple crawler based on PyQt4 for javascript powered websites.
seagatesoft/webpoet
seagatesoft/WhatWeb
Website Fingerprinter
seagatesoft/You-Dont-Know-JS
A book series on JavaScript. @YDKJS on twitter.