Pinned Repositories
aminerdbformat
annotator
Highlight, share, add notes and tags to any selected text on a page
balanced-employee-ip-agreement
GitHub's employee IP agreement, open sourced and reusable
dtm
This implements topics that change over time (Dynamic Topic Models) and a model of how individual documents predict that change.
elasticsearch-custom-similarity-example
An example how to implement a custom similarity (overlap similarity) for elasticsearch
medium-api-docs
Documentation for Medium's OAuth2 API
pdffigures
Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.
qa-datasets
Collection of Question Answering (QA) datasets from various source. The goal of this project is to provide a comprehensive collection of questions and answers to accelerate the development of future QA systems.
scimago_crawler
Crawl journal information from scimago
text-data-book-comments
Comments, errata, suggestions, and issues for the book "Text Data Analysis and Management: A Practical Introduction to Text Mining and Information Retrieval"
Crimson Inteltech Pvt. Ltd.'s Repositories
Rygbee/annotator
Highlight, share, add notes and tags to any selected text on a page
Rygbee/dtm
This implements topics that change over time (Dynamic Topic Models) and a model of how individual documents predict that change.
Rygbee/text-data-book-comments
Comments, errata, suggestions, and issues for the book "Text Data Analysis and Management: A Practical Introduction to Text Mining and Information Retrieval"
Rygbee/acm-crawler
A web-crawler for ACM publications (http://dl.acm.org)
Rygbee/AutoSPARQL
AutoSPARQL allows to create SPARQL queries over RDF knowledge bases from natural language with low effort.
Rygbee/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
Rygbee/boilerpipe
Work in progress transmit from Google Code
Rygbee/citations-gadget
Automatically exported from code.google.com/p/citations-gadget
Rygbee/d3-cloud
Create word clouds in JavaScript.
Rygbee/dashing
The exceptionally handsome dashboard framework in Ruby and Coffeescript.
Rygbee/dataverse
A data repository framework to share and publish research data.
Rygbee/developer.github.com
GitHub Developer site
Rygbee/dissect
Rygbee/docs
Documentation for Polymer
Rygbee/elasticsearch-river-web
Web Crawler for Elasticsearch
Rygbee/elasticsearch-taste
Mahout Taste-based recommendation on Elasticsearch
Rygbee/essential-js-design-patterns
Repo for 'Learning JavaScript Design Patterns' - creative-commons book on JavaScript design patterns.
Rygbee/Ext-RESCAL
Scalable tensor factorization
Rygbee/gensim
Topic Modelling for Humans
Rygbee/Mallet
Rygbee/meta
A Modern C++ Data Sciences Toolkit
Rygbee/nltk-examples
Worked examples from the NLTK Book
Rygbee/online-hdp
Online inference for the Hierarchical Dirichlet Process. Fits hierarchical Dirichlet process topic models to massive data. The algorithm determines the number of topics.
Rygbee/OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Rygbee/orientdb
OrientDB is the first Multi-Model DBMS with Document & Graph engine. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing, Reactive Queries and has a small memory footprint. OrientDB is licensed with Apache 2 license and the development is driven by Orient Technologies and a wide Open Source community.
Rygbee/protobuf
Protocol Buffers - Google's data interchange format
Rygbee/vega
A visualization grammar.
Rygbee/VIVO
VIVO is an extensible semantic web application that builds on Vitro for the purpose of enabling research discovery.
Rygbee/weka
weka mirror with git — http://www.cs.waikato.ac.nz/ml/weka/
Rygbee/wordcloud2.js
Tag cloud/Wordle presentation on 2D canvas or HTML