peterdm's Stars
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
microsoft/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
twitter/typeahead.js
typeahead.js is a fast and fully-featured autocomplete library
google/trax
Trax — Deep Learning with Clear Code and Speed
catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
tensorflow/rust
Rust language bindings for TensorFlow
sindresorhus/promise-fun
Promise packages, patterns, chat, and tutorials
sindresorhus/globby
User-friendly glob matching
AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
nlpyang/BertSum
Code for the paper "Fine-tune BERT for Extractive Summarization"
koursaros-ai/nboost
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (e.g. Elasticsearch)
ULTR-Community/ULTRA
Unbiased Learning To Rank Algorithms (ULTRA)
UselessPickles/ts-enum-util
Strictly typed utilities for working with TypeScript enums
kilianc/node-apiserver
A ready-to-go, modular, multi-transport, streaming-friendly JSON(P) API server.
crosscompute/analytical-tutorials
Tutorials for writing analytical scripts
js1010/cuhnsw
CUDA implementation of Hierarchical Navigable Small World Graph algorithm
ffizer/ffizer
ffizer is a files and folders initializer / generator. Create any kind (or part) of project from template.
yugui/jsonnetunit
Unit testing framework for Jsonnet
okfn/docker-ckan
Docker images and Docker Compose setup for CKAN [Not Maintained]
adamkucharski/2020-ncov
Accompanying code for: Kucharski AJ, Russell TW et al. Early dynamics of transmission and control of COVID-19: a mathematical modelling study. Lancet Infectious Diseases, 2020
iomentum/cargo-scaffold
cargo scaffold lets you scaffold and generate projects described in a simple TOML file
holochain/hcrs
Node.js scaffold generation for holochain-rust
stadt-karlsruhe/ckanext-extractor
A full text and metadata extractor for CKAN
smeznar/SNoRe
SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations
progirep/ParetoFrontEnumerationAlgorithm
An algorithm to enumerate all elements of a Pareto front for a multi-criteria optimization problem in which all optimization objectives have a finite range
peterdm/extractor
Wikipedia term extractor for SiteSimon
peterdm/stackmatch
Suggest stellar SO members for job postings
peterdm/stacksolr
Solr config for Stack Exchange extensions