tongwei's Stars
infinilabs/analysis-pinyin
🛵 This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.
mapstruct/mapstruct
An annotation processor for generating type-safe bean mappers
oap-project/raydp
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
apache/gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
apache/incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
cloudreve/Cloudreve
🌩支持多家云存储的云盘系统 (Self-hosted file management and sharing system, supports multiple storage providers)
timqian/chinese-independent-blogs
中文独立博客列表
apache/fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
mit-pdos/noria
Fast web applications through dynamic, partially-stateful dataflow
risingwavelabs/risingwave
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.
xtreme1-io/xtreme1
Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.
JanusGraph/janusgraph
JanusGraph: an open-source, distributed graph database
neo4j/neo4j
Graphs for Everyone
nightscape/spark-excel
A Spark plugin for reading and writing Excel files
shredder47/Nonspaced-Sentence-Tokenizer
Tokenizes words there are not seperated by space or any delimiter. It is implemented based on Zipf's law.
hankcs/HanLP
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
promeG/TinyPinyin
适用于Java和Android的快速、低内存占用的汉字转拼音库。
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
ansible/ansible
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
LHRUN/paint-board
🎨 A powerful multi-end drawing board that brings together a lot of creative brushes to experience a whole new range of drawing effects!
koordinator-sh/koordinator
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
ytsaurus/ytsaurus
YTsaurus is a scalable and fault-tolerant open-source big data platform.
chineseocr/chineseocr
yolo3+ocr
apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
intel/intel-one-mono
Intel One Mono font repository
JetBrains/kotlin
The Kotlin Programming Language.
square/okhttp
Square’s meticulous HTTP client for the JVM, Android, and GraalVM.
square/retrofit
A type-safe HTTP client for Android and the JVM