apsaltis's Stars
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
antlr/grammars-v4
Grammars written for ANTLR v4; expectation that the grammars are free of actions.
nteract/nteract
📘 The interactive computing suite for you! ✨
nteract/papermill
📚 Parameterize, execute, and analyze notebooks
apache/hive
Apache Hive
euske/pdfminer
Python PDF Parser (Not actively maintained). Check out pdfminer.six.
NVIDIA/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
deeplearning4j/deeplearning4j-examples
Deeplearning4j Examples (DL4J, DL4J Spark, DataVec)
OpenSIPS/opensips
OpenSIPS is a GPL implementation of a multi-functional SIP server that aims to deliver a high-level technical solution (performance, security and quality) for use in professional SIP server platforms.
palantir/windows-event-forwarding
A repository for using Windows Event Forwarding for incident detection and response
NationalSecurityAgency/lemongraph
Log-based transactional graph engine
vlm/asn1c
The ASN.1 Compiler
CrossRef/pdfextract
MOVED TO https://gitlab.com/crossref/pdfextract
cgrates/cgrates
Real-time Charging System for Telecom & ISP environments
DiceTechJobs/ConceptualSearch
Train a Word2Vec model or LSA model, and implement Conceptual Search/Semantic Search in Solr/Lucene - Simon Hughes, Dice.com, Dice Tech Jobs
DiceTechJobs/SolrPlugins
Dice Solr Plugins from Simon Hughes, Dice.com
BMKEG/lapdftext
LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance where needed). The system is open-source and provides a simple baseline function for extracting text from primary research articles using rules that developers can customize. This means that the system works quite well for most applications (and might occasionally make mistakes and extract the wrong text), but it is always possible to 'hack' your own rules and improve performance.
ad-freiburg/pdfact
A basic tool that extracts the structure from the PDF files of scientific articles.
OpenSIPS/docker-opensips
Docker Image Repository for OpenSIPS
ckorzen/pdf-text-extraction-benchmark
A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF documents, especially from scientific articles.
OctoberChang/awesome-text-summarization
A curated list of resources dedicated to text summarization
ropensci/rtika
R Interface to Apache Tika
deshpandetanmay/cdr-data-generator
Utility to generate Telecom Call Detail/Data Records
oyvindberg/PDFExtract
My take on a PDF text extraction utility
Anghagaed/MISON
Implementing MISON by Microsoft in C++ as a test
AGProjects/cdrtool
CDR mediation and rating engine for Call Detail Records.
ckorzen/icecite
The repository of Icecite, a research paper management system.
dhwajraj/spark-streaming-topic-model
Scalable Latent Dirichlet Allocation (LDA) based topic modeling of news corpus in Apache Spark Streaming
growse/eventlog-to-syslog
This is a fork of the codebase over at http://code.google.com/p/eventlog-to-syslog/ at revision 42. I've made some changes to bring the timestamps into compliance with RFC 5424.