Pinned Repositories
abbyyR
R Client for the Abbyy Cloud OCR
actions-calver
CoAnSys
COntent ANalysis SYStem is a framework for mining scientific publications using Apache Hadoop.
Crawler
Universal Crawler for web or local file system crawlering. The module enables you to extend the functionality of any location-related searches in really easy way - by overriding the appropriate default functions in the subclass. For example you can change default web filters or add some statistics. In "thread-version" branch you can also find multi-threaded version. In "icm" branch you cand find extended module for collecting statistics from logs of Apache Hadoop jobs' tasks. In master you can find Demo1 and Demo2 - extensions of crawler. Let me know if you will use my code, have some suggestions or just find this helpfull :)
gobblin
Universal data ingestion framework for Hadoop.
Hadoop-map-reduce-relational-algebra
Mappery i reducery w JAVA do podstawowych operacji algebraicznych na zbiorach.
kafka
ML-workshop
Machine Learning Seminar 2017 - Workshop #Anaconda #Scikit-learn #Pandas #Keras #Jupyter
WebScraper
JAVA based web scraper - collects results from given sources using implemented selectors for each website template. You can define proxy servers or user agent for each selector to act as a specific user. Goal is to allow easy extending modules for traversing through many websites by implementing proper Selector. HTML navigating managed by JSoup. Package includes also ProxyFinder which download active proxies adresses from defined proxy selectors, so you needn't search by yourself.
wosiu's Repositories
wosiu/kafka
wosiu/abbyyR
R Client for the Abbyy Cloud OCR
wosiu/actions-calver
wosiu/AlexaShopList
Amazon Hackathon 2016
wosiu/Crawler
Universal Crawler for web or local file system crawlering. The module enables you to extend the functionality of any location-related searches in really easy way - by overriding the appropriate default functions in the subclass. For example you can change default web filters or add some statistics. In "thread-version" branch you can also find multi-threaded version. In "icm" branch you cand find extended module for collecting statistics from logs of Apache Hadoop jobs' tasks. In master you can find Demo1 and Demo2 - extensions of crawler. Let me know if you will use my code, have some suggestions or just find this helpfull :)
wosiu/gobblin
Universal data ingestion framework for Hadoop.
wosiu/ML-workshop
Machine Learning Seminar 2017 - Workshop #Anaconda #Scikit-learn #Pandas #Keras #Jupyter
wosiu/WebScraper
JAVA based web scraper - collects results from given sources using implemented selectors for each website template. You can define proxy servers or user agent for each selector to act as a specific user. Goal is to allow easy extending modules for traversing through many websites by implementing proper Selector. HTML navigating managed by JSoup. Package includes also ProxyFinder which download active proxies adresses from defined proxy selectors, so you needn't search by yourself.
wosiu/AlexaShopListLambda
wosiu/AsterixDB-EndToEnd
AsterixDB end to end example for finging similar tweets using shingles and Jaccard. Includes scripts for installing asterix, starting using ansible, feeding with sample data and running a query. Ensure you can ssh to localhost before running.
wosiu/awesome-ocr
A curated list of promising OCR resources
wosiu/C-like-interpreter-haskell
wosiu/calamari
OCR Engine based on OCRopy and Kraken
wosiu/CameraView
A well documented, high-level Android interface that makes capturing pictures and videos easy, addressing all of the common issues and needs.
wosiu/Compiler
wosiu/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
wosiu/hackathon_mini
wosiu/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
wosiu/MachineLearningProject3
3rd project for Machine Learning course. Simple character recognition.
wosiu/matrixMultiplication-MPI
wosiu/migramCountHadoop
wosiu/mikrokontroler
wosiu/Pregel-Giraph-Conected-Components
Toy example, with Single Pivot heuristic. (Not optimized)
wosiu/pwir-lab
wosiu/ShopListGUI
Review products in your online bucket.
wosiu/Spark-finding-triangles
Simple code in Scala
wosiu/testy-latte
Testy do kompilatora latte
wosiu/tfenv
Terraform version manager
wosiu/website
Source for the main Harbor website
wosiu/whyrhack