USC Information Retrieval & Data Science
USC Information Retrieval and Data Science Group
Los Angeles, CA
Pinned Repositories
AgePredictor
Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum
autoextractor
A toolkit for clustering web pages based on various similarity measures.
dl4j-kerasimport-examples
This repository contains deeplearning4j examples for importing and making use of models trained in keras
Image-Similarity-Deep-Ranking
Deep Ranking based ImageSimilarity will be developed as plugin on ImageSpace. https://users.eecs.northwestern.edu/~jwa368/pdfs/deep_ranking.pdf
NLTKRest
This is a REST Server endpoint built using Flask and Python.
polar.usc.edu
Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California
SentimentAnalysisParser
Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.
sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
supervising-ui
Web UI for labelling dataset for supervised learning.
tika-dockers
A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video
USC Information Retrieval & Data Science's Repositories
USCDataScience/dl4j-kerasimport-examples
This repository contains deeplearning4j examples for importing and making use of models trained in keras
USCDataScience/hadoop-pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
USCDataScience/video-recognition
USCDataScience/TextREST.jl
Language Detection REST Server using MIT Lincoln Lab’s Text.jl library
USCDataScience/counterfeit-electronics-tesseract
Training Tesseract to better extract serial numbers from images of electronic items
USCDataScience/imagecat2
Imagecat Version 2
USCDataScience/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
USCDataScience/nutch-analytics
Nutch Crawl Analysis - Spark based project
USCDataScience/PersonaExtraction
USCDataScience/filetypeDetection
File Byte Histogram Machine learnig Classification
USCDataScience/marve
For extracting measurements and related entities from text
USCDataScience/memex-cca-esindex
USCDataScience/counterfeit-crawling
Focused Crawling and Evaluation of Counterfeit Electronics Sites
USCDataScience/NN-fileTypeDetection
This repository contains files of generating tika neural network model using Theano. Ir provides a way for you to build Deep Neural Network and increase Tika's detection capability.
USCDataScience/tika-dl4j-spark-imgrec
Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
USCDataScience/Tika-NER-Libraries
This is a d3 visualization benchmarking named entities extracted by NLTK and Standford Core NLP
USCDataScience/TrojanFootball
Analyses athletes past performance and workload for a better training
USCDataScience/Annotated-Semantic-Relationships-Datasets
Public and free annotated datasets of relationships between entities/nominals
USCDataScience/loaded-language-linter
A small Node.JS library to detect loaded language.
USCDataScience/parser-indexer
Metadata Parser and Solr Indexer. For Python equivalent, checkout https://github.com/USCDataScience/parser-indexer-py
USCDataScience/PlanetaryIR
Information Retrieval for Planetary Science using DeepDive
USCDataScience/sparkler-jsdriver
USCDataScience/TattDL
Tattoo detection and localization
USCDataScience/BFA
USCDataScience/ColumbiaImageSearch
Columbia Image Search tool for MEMEX
USCDataScience/d3kit-timeline
A simple timeline component that labels do not overlap.
USCDataScience/DUCC-cTAKES-AWS
USCDataScience/polar-domain-discovery
Domain Discovery on Polar Domain
USCDataScience/scala-json-doclet
Scala Doclet that produces JSON output
USCDataScience/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.