lijinf2's Stars
NVIDIA/spark-rapids-ml
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
NVIDIA/spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
lijinf2/losha
Distributed similarity search
lijinf2/gqr
lijinf2/singa
a distributed deep learning platform
megagonlabs/ditto
Code for the paper "Deep Entity Matching with Pre-trained Language Models"
rit-git/tagging
An extremely simple framework for tagging useful sentences from raw texts (PVLDB 2020).
weaviate/weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
lijinf2/launchpad
jqlang/jq
Command-line JSON processor
dbohdan/structured-text-tools
A list of command-line tools for manipulating structured text data
zemirco/json2csv
Convert json to csv with column titles
dilshod/xlsx2csv
Convert xslx to csv, it is fast, and works for huge xlsx files
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
jiangqy/DCMH-CVPR2017
source code for paper "Deep Cross-Modal Hashing"
megagonlabs/sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
didi/ChineseNLP
Datasets, SOTA results of every fields of Chinese NLP
nytimes/covid-19-data
A repository of data on coronavirus cases and deaths in the U.S.
microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
DBAIWangGroup/nns_benchmark
Benchmark of Nearest Neighbor Search on High Dimensional Data
husky-team/husky
A more expressive and most importantly, more efficient system for distributed data analytics.
Yuzhen11/flexps
erikbern/ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
rajendrashinde/MapReduce
lijinf2/BCC-GraphChi