dzqoo's Stars
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
kdn251/interviews
Everything you need to know to get the job.
nuster1128/LLM_Agent_Memory_Survey
langchain4j/langchain4j
Java version of LangChain
nocobase/nocobase
NocoBase is an extensibility-first, open-source no-code/low-code platform for building business applications and enterprise solutions.
younader/Vesuvius-Grandprize-Winner
meta-llama/llama3
The official Meta Llama 3 GitHub site
iflytek/spark-ai-python
星火大模型 python sdk库
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
scalaj/scalaj-collection
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
nmslib/hnswlib
Header-only C++/python library for fast approximate nearest neighbors
zilliztech/feder
Visualize hnsw, faiss and other anns index
bytedance/CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
milvus-io/milvus
Milvus is a high-performance, cloud-native vector database designed to scale vector search.
taowen/awesome-lowcode
国内低代码平台从业者交流
feast-dev/feast
The Open Source Feature Store for Machine Learning
jelmerk/hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
circe/circe
Yet another JSON library for Scala
apache/hbase-connectors
Apache HBase Connectors
Vonng/ddia
《Designing Data-Intensive Application》DDIA中文翻译
gingerredjade/flink-userportrait-main
基于Flink流处理的动态实时亿级全端用户画像系统
timgent/data-flare
Data quality control tool built on spark and deequ
iflytek/aiges
AI Serving framework loader
okkam-it/flink-examples
Flink jobs collection
ververica/flink-training-exercises
lihaolixuewei112612/scala-opentsdb
deshpandetanmay/flume-opentsdb-sink
This is a Sink which reads data from Kafka Topic and Writes it to OpenTSDB.
google/eng-practices
Google's Engineering Practices documentation
taosdata/TDengine
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios