lazycrazyowl's Repositories
lazycrazyowl/ResumeParser
Resume parser using UIMA Ruta
lazycrazyowl/Artificial-Intelligence-and-Machine-Learning
Algorithm implementations and homework solutions for Stanford's online courses
lazycrazyowl/BarcelonaMedia-ViL.github.io
lazycrazyowl/BlueBook
This IPython Notebook contains a quantitative pricing model created for Fast Iron in the Kaggle competition 'Blue Book for Bulldozers'. The model predicts the sale price of a particular piece of heavy equipment so that Fast Iron can create a 'Blue Book' that lets customers value their heavy equipment fleet at auction. Python is used to apply supervised and unsupervised machine learning techniques that explain 88.90% of the variance observed in the training set and score an RMSLE of 0.745 when predicting values on the test set. In this competition, 590 data scientists built predictive models from a 'training dataset' provided by Fast Iron, then used those models to predict sale prices on a 'test set', competing for a $10,000 award for the team or individual with the most accurate model. The model and methods used for my entry, which scored in the top 20%, are shown in BlueBook.ipynb.
lazycrazyowl/data-portal-treemap
Chicago Data Portal (data.cityofchicago.org) tree map
lazycrazyowl/Django-IPython-Tutorial
An interactive tutorial that guides you through creating your first Django project. This notebook accompanies the official guide on the Django project's website and takes you through the process of creating a basic poll application.
lazycrazyowl/dodhackathon
Data files for DoD Hackathon
lazycrazyowl/fermi
A WGS de novo assembler based on the FMD-index for large genomes
lazycrazyowl/gimli
Annotation of biomedical entity names.
lazycrazyowl/grobid
grobid
lazycrazyowl/ir_tutorial_lucene
information retrieval tutorial using Apache Lucene
lazycrazyowl/json2s3
Upload and partition a JSON stream to AWS S3
lazycrazyowl/myrrix-recommender
Official mirror of the Myrrix open source recommender's Subversion repository
lazycrazyowl/neji
Flexible, easy and powerful framework for faster biomedical concept recognition.
lazycrazyowl/NERO
Named Entity Recognition Optimizer
lazycrazyowl/oaqa-tutorial
A group of examples based on the CSE pipeline.
lazycrazyowl/pattern
Web mining module for Python
lazycrazyowl/PigEditor
Eclipse plugin for Apache Pig
lazycrazyowl/rCharts_nyt_home_price
lazycrazyowl/sublime-sort-numerically
Sublime Text 2 package that adds a command for sorting lines numerically rather than alphabetically.
lazycrazyowl/trading
This repository contains real trading examples explained and modeled in IPython Notebooks to generate discussion, feasible trading examples, and potential profit for the common man.
lazycrazyowl/turtle-in-html
Bookmark to visualize RDF embedded in HTML as Turtle
lazycrazyowl/txtfnnl
a UIMA-based text mining pipeline
lazycrazyowl/uima-development
Internal development repository, contains components that are not (yet) publicly released and are mostly not usable outside BM without additional adaptation.
lazycrazyowl/uima_prolog
Prolog interface for UIMA perception pipeline
lazycrazyowl/uima_sql
lazycrazyowl/uimaScala
A toolkit to write UIMA components and applications
lazycrazyowl/v3nlp
Natural Language Processing of Clinical & Medical Text with an automated tool for configuring and running UIMA-AS pipelines.
lazycrazyowl/vac
A language detection library named after Vāc, the Hindu goddess of communication and words.
lazycrazyowl/yelp-dataset-challenge
Information extraction over restaurant reviews for the Yelp Dataset Challenge