prenastro

Software Developer

Los Angeles

Pinned Repositories

Crawler-for-News-Website
Developed a simple web crawler to measure aspects of a crawl, study the characteristics of the crawl, download web pages from the crawl and gather webpage metadata of C-Span website
Language:Java00
d3-cloud
Create word clouds in JavaScript.
Language:JavaScript00
Database-Systems-Assignments
CSCI 585 Assignments. 1. EER Diagram for E-Learn 2. SQL 3. KML - Nearest Neighbors and Convex Hull code 4. Tinkerpop Gremlin 5. Weka, Rapid Miner, Knime tools execution.
00
deepsentirank
Using AlexNet CNN to classify images into one of the classes defined in caffe_classes.py. Images with similar classes can be grouped together and used for Image Similarity Search. To test the model please run testModel.py
Language:Python00
FaceBook-Search
Language:PHP00
Facebook-Search-Android-App
Language:Java00
Heart-Disease-Prediction-System
project related
Language:Java02
PollApp
Polling App on WindowsPhone OS. Used for Survey purposes. Allows users to post their own questions and also vote for their favourite option for questions posted by others.
Language:C#00
Workshop_Management
Allows Faculty to set their own slots and manage their assigned slots for the workshops for various courses in college.
Language:HTML00
Xml-Parser
XML_Parser
Language:JavaScript0 2 00

prenastro's Repositories

prenastro/deepsentirank-1
Deep Learning based Sentiment Ranking for Multimedia
Language:Python
prenastro/deepsentirank
Using AlexNet CNN to classify images into one of the classes defined in caffe_classes.py. Images with similar classes can be grouped together and used for Image Similarity Search. To test the model please run testModel.py
Language:Python
prenastro/PolarPostProcessing
This code gets connected to Solr DB created for Sparkler Crawled Data to do further data extraction, classification, filtering and insights generation using various Machine Learning models. The ML models are capable of using keywords list from user, extract features from URL content, and classify (score) output and update Solr parameter accordingly. Apache Sparkler Link: https://github.com/USCDataScience/sparkler
Language:Jupyter Notebook
prenastro/polar-deep-insights
Conceptual - Temporal - Spatial analysis of the trec polar dataset
Language:JavaScript
prenastro/polar.usc.edu
Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California
Language:HTML
prenastro/Database-Systems-Assignments
CSCI 585 Assignments. 1. EER Diagram for E-Learn 2. SQL 3. KML - Nearest Neighbors and Convex Hull code 4. Tinkerpop Gremlin 5. Weka, Rapid Miner, Knime tools execution.
prenastro/Crawler-for-News-Website
Developed a simple web crawler to measure aspects of a crawl, study the characteristics of the crawl, download web pages from the crawl and gather webpage metadata of C-Span website
Language:Java
prenastro/Search-Engine-Enhancement
Adding Spell Checking, AutoComplete and Snippets functionality to Solr Search Engine. Enhanced Solr program with spelling correction and an autocomplete (suggest) function. Also used an external spelling correction program called Norvig’s spell correction program in conjunction with Solr, to enhance the autocomplete functionality of Solr. Norvig’s spell correction program uses a text file(‘’big.txt”) to get set of words to calculate edit distance. Here I am using Apache Tika for this purpose.
Language:PHP
prenastro/Solr-Ranking-Algos-Comparison
Imported a set of pages on Apache Solr and analyzed different ranking Algorithms like Lucene and PageRank. Using Solr to index documents, Tika and TagSoup library to extract text from any kind of HTML found on web. Developed a PHP client which accepts input from the user in HTML form, and sends request to the Solr server. Solr server processes the query and returns results which are parsed by the PHP program and displayed. Changing the ranking algorithm in Solr to PageRank. The app loops through each fetched webpage and extracts outgoing links. Using a mapping file which has web pages mapping to actual urls, filter out the urls not present in the file. Create a network graph with web pages as vertices and links representing an edge between two files using NetworkX Library. Search for a list of keywords and compare the two Algorithms.
Language:PHP
prenastro/Inverted-Index-Using-GCP-and-Hadoop-Cluster
Created an Inverted Index of words occurring in a set of web pages using a subset of 74 files from a total of 408 files (text extracted from HTML tags) derived from the Stanford WebBase project (https://ebiquity.umbc.edu/resource/html/id/351). Placed these files in a bucket on Google cloud storage and ran a Hadoop job to read inputs from this bucket.
Language:Java
prenastro/sparkler
Spark-Crawler : Evolving Apache Nutch to run on Spark.
Language:Java
prenastro/image_space
Image similarity and search application
Language:Python
prenastro/uscdatascience.github.io
USC Information Retrieval and Data Science Group
Language:HTML1
prenastro/pdi-topics
LDA Topic Modeling for Polar Data Insights
Language:Jupyter Notebook
prenastro/imagecat
ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.
Language:Java
prenastro/img2text
Models, and associated helper code for GSOC 2017 project Tensorflow Image to Text in Apache Tika
Language:Python
prenastro/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Language:Python
prenastro/Xml-Parser
XML_Parser
Language:JavaScript
prenastro/Heart-Disease-Prediction-System
project related
Language:Java2
prenastro/Workshop_Management
Allows Faculty to set their own slots and manage their assigned slots for the workshops for various courses in college.
Language:HTML
prenastro/PollApp
Polling App on WindowsPhone OS. Used for Survey purposes. Allows users to post their own questions and also vote for their favourite option for questions posted by others.
Language:C#
prenastro/Machine-Learning-Data-Analysis
prenastro/Facebook-Search-Android-App
Language:Java
prenastro/FaceBook-Search
Language:PHP
prenastro/tika
Fork of APACHE TIKA - Specific Customizations for textual content extraction and enrichment
Language:Java
prenastro/npm-bower-yo-grunt
Docker container with node, npm, bower, yeoman and grunt packages.
Language:Shell
prenastro/d3-cloud
Create word clouds in JavaScript.
Language:JavaScript
prenastro/osgoculusviewer
An OsgViewer with support for the Oculus Rift
Language:C++