Pinned Repositories
wpsolr-search-engine
backup wpsolr
silvestrelosada's Repositories
silvestrelosada/wpsolr-search-engine
backup wpsolr
silvestrelosada/awesome-awesomeness
A curated list of awesome awesomeness
silvestrelosada/awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
silvestrelosada/BERTSimilar
Get Similar Words and Embeddings using BERT Models
silvestrelosada/book
Taming Text Book Source Code
silvestrelosada/box-java-sdk
The Box SDK for Java.
silvestrelosada/CoordinateAscent
Python implementation of the Coordinate Ascent algorithm
silvestrelosada/DAE_RNN_News_Recommendation
Refer to the papers "Embedding-based News Recommendation for Millions of Users" and "Article De-duplication Using Distributed Representations", published by Yahoo Japan
silvestrelosada/elasticsearch
Open Source, Distributed, RESTful Search Engine
silvestrelosada/embedded-elasticsearch
Tool that eases the creation of integration tests with Elasticsearch
silvestrelosada/fastrank
My most frequently used learning-to-rank algorithms ported to Rust for efficiency. Try it: "pip install fastrank".
silvestrelosada/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
silvestrelosada/GLiREL
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
silvestrelosada/jpa-neo4j-test
silvestrelosada/k-NN
🆕 A machine learning plugin which supports an approximate k-NN search algorithm for Open Distro for Elasticsearch
silvestrelosada/lapdftext
LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance where needed). The system is open-source and provides a simple baseline function for extracting text from primary research articles using rules that developers can customize. This means that the system works quite well for most applications (and might occasionally make mistakes and extract the wrong text), but it is always possible to 'hack' your own rules and improve performance.
silvestrelosada/lapdftext-original
Automatically exported from code.google.com/p/lapdftext
silvestrelosada/markup
The code we use to render README.your_favorite_markup
silvestrelosada/nlp-datasets
A list of datasets/corpora for NLP tasks, in reverse chronological order.
silvestrelosada/semantic-knowledge-graph
silvestrelosada/sentence_similarity_semantic_search
sentence_similarity_semantic_search
silvestrelosada/sentiment-analysis-spanish
silvestrelosada/siren-join
SIREn Plugin to add relational join capabilities to Elasticsearch
silvestrelosada/solr-ocrhighlighting
Highlighting various OCR formats directly in Solr
silvestrelosada/solr-ocrpayload-plugin
Efficient indexing and retrieval of OCR bounding boxes in Solr
silvestrelosada/solr-vector-scoring
Vector Plugin for Solr: calculate dot product / cosine similarity on documents
silvestrelosada/uima-components
silvestrelosada/uimafit-spring-experiments
Attempt to marry two object-lifecycle containers to bring benefits for all
silvestrelosada/wicked-charts
Beautiful and interactive JavaScript charts for Java-based web applications.
silvestrelosada/yodaqa
A Question Answering system built on top of the Apache UIMA framework.