Pinned Repositories
FilterSeedList
Filter a list of URLs, removing those whose domain is listed in a file
FirmsDocTermMatrixGenerator
GUrlSearcher
Google URL Searcher
RootJuice
A Java web scraper
TDM_generator
Generate a term-document matrix from a Solr collection
Url_scorer
Assign a score to each document (scraped firm URL) contained in a Solr collection
UrlMatchTableGenerator
UrlMatchTableGenerator2
Prepare a raw training set for classifying official enterprise websites
UrlScorer
Assign a score to each document (scraped firm URL) contained in a Solr collection
UrlSearcher
Bing URL Searcher
SummaIstat's Repositories
SummaIstat/UrlSearcher
Bing URL Searcher
SummaIstat/UrlScorer
Assign a score to each document (scraped firm URL) contained in a Solr collection
SummaIstat/RootJuice
A Java web scraper
SummaIstat/UrlMatchTableGenerator
SummaIstat/FilterSeedList
Filter a list of URLs, removing those whose domain is listed in a file
SummaIstat/FirmsDocTermMatrixGenerator
SummaIstat/GUrlSearcher
Google URL Searcher
SummaIstat/TDM_generator
Generate a term-document matrix from a Solr collection
SummaIstat/Url_scorer
Assign a score to each document (scraped firm URL) contained in a Solr collection
SummaIstat/UrlMatchTableGenerator2
Prepare a raw training set for classifying official enterprise websites
SummaIstat/R-scripts
SummaIstat/RootJuice2
A Python web scraper
SummaIstat/SolrTSVImporter
Import TSV files into Solr
SummaIstat/SolrTSVImporter_6.6.0
SummaIstat/UrlScorer_6.6.0
Assign a score to each document (scraped firm URL) contained in a Solr collection