Pinned Repositories
bitsail
BitSail is a distributed, high-performance data integration engine that supports batch, streaming, and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of records every day.
BioSeq-GFN-AL
Code for "Biological Sequence Design with GFlowNets", 2022
bitsail
BitSail is a distributed, high-performance data integration framework that supports both streaming and batch modes. At present, BitSail is mainly designed around the ELT model, handles EB-scale data volumes, and is used at ByteDance.
elasticsearch
Free and Open, Distributed, RESTful Search Engine
flink
Apache Flink
hbase
Apache HBase
iceberg
Apache Iceberg
incubator-paimon
Apache Paimon (incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking, and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
JavaGuide
"Java Learning + Interview Guide": covers the core knowledge that most Java programmers need to master. Preparing for a Java interview? Start with JavaGuide!
love-star's Repositories
love-star/hbase
Apache HBase
love-star/ByteYoungDB
love-star/date_trans_with_transformer
A PyTorch implementation of a machine translation model (Transformer) that translates human-readable dates ("25th of June, 2009") into machine-readable dates ("2009-06-25")
love-star/dragon-book-exercise-answers
Exercise answers for Compilers: Principles, Techniques, & Tools (the purple dragon book), second edition.
love-star/Falco
A quick and flexible single-cell RNA-seq processing framework on the cloud
love-star/hive
Notes on Hive Course
love-star/hudi-quickstart
Hudi 0.6.0 quick start
love-star/JavaBagu
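Fundamentals of Java backend development, compiled during spring and autumn campus recruitment (commonly known as "bagu", the standard rote interview questions)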
love-star/jvm-core-learning-example
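Examples accumulated while studying core Java Virtual Machine concepts; a best practice for beginners and for consolidating core JVM knowledge.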
love-star/MiniSpark
Java implementation of a mini Spark-like framework named MiniSpark that can run on top of an HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join, and Filter.
love-star/Nyspider
Assorted web crawlers: Dianping, Anjuke, 58, Renrendai, Paipaidai, ITjuzi, Lagou, Douban, SouFun, ASO100, weather data, Maoyan Movies, Lianjia, PM25.in...
love-star/P2P-Over-MiddleBoxes-Demo
A simple demo of P2P communication over middle boxes such as NAT
love-star/PepBCL
We propose PepBCL, a novel BERT (Bidirectional Encoder Representations from Transformers)-based contrastive learning framework to predict protein-peptide binding residues from protein sequences only.
love-star/QQrobot
QQ robot: provides QQ group management, intelligent chat, home-location lookup, and other features.
love-star/Spark-GATK
Spark-GATK is a genomics analysis framework based on Apache Spark and ADAM.
love-star/SparkBWA
SparkBWA is a new tool that exploits the capabilities of a Big Data technology such as Apache Spark to boost the performance of one of the most widely adopted sequence aligners, the Burrows-Wheeler Aligner (BWA).
love-star/tesseract-job-admin
Backend code for distributed scheduling
love-star/transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need