lidalei
Delivering business value with Data Engineering, Machine Learning and DevOps
FareHarbor, The Netherlands
Pinned Repositories
AcademicArticlesRetrieval
A web-based information retrieval system.
DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
DataMining
Various data mining algorithms implemented with sklearn and tensorflow.
flyway
Flyway by Redgate • Database Migrations Made Easy.
JFastText
Java interface for fastText
sentiment-analysis
Apply bag of words and Word2vec to sentiment analysis.
spark-clickhouse
VolumeRendering
Volume Rendering - Visualization Assignment One.
wechat-largefile
Split big files into parts and merge them back
youtube-8m
Algorithms and implementations to participate in Kaggle YouTube-8M Video Understanding Competition
lidalei's Repositories
lidalei/JFastText
Java interface for fastText
lidalei/AcademicArticlesRetrieval
A web-based information retrieval system.
lidalei/VolumeRendering
Volume Rendering - Visualization Assignment One.
lidalei/spark-clickhouse
lidalei/wechat-largefile
Split big files into parts and merge them back
lidalei/sentiment-analysis
Apply bag of words and Word2vec to sentiment analysis.
lidalei/TwitterStats
The HBase project at the Technical University of Madrid, supporting various top-N queries.
lidalei/ZooKeeperClusterMonitor
A basic cluster monitor supported by ZooKeeper
lidalei/DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
lidalei/flyway
Flyway by Redgate • Database Migrations Made Easy.
lidalei/LinearLogisticRegSpark
Project of Massively Parallel Machine Learning at the Technical University of Madrid.
lidalei/youtube-8m
Algorithms and implementations to participate in Kaggle YouTube-8M Video Understanding Competition
lidalei/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
lidalei/airflow-dashboards
Grafana dashboards and StatsD exporter config for Airflow monitoring
lidalei/analytics-go
Segment analytics client for Go
lidalei/beam
Apache Beam is a unified programming model for Batch and Streaming
lidalei/cdiscount
To participate in the Kaggle cdiscount competition
lidalei/clickhouse-driver
ClickHouse Python Driver with native interface support
lidalei/data-engineering-blogs
lidalei/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
lidalei/hub
The single source of truth for all Meltano plugins, including all available Singer Taps and Targets: https://hub.meltano.com
lidalei/lidalei
lidalei/pyinsight
Insight service in Python
lidalei/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
lidalei/Robot-Motion-Planning
An educational game to help understand robot motion planning algorithms better
lidalei/tap-airtable
Singer tap for Airtable
lidalei/tap-zendesk
lidalei/TextRank
Python implementation of TextRank algorithm (https://web.eecs.umich.edu/~mihalcea/papers/mihalcea.emnlp04.pdf) for automatic keyword extraction and summarization using Levenshtein distance as relation between text units.
lidalei/TwitterTopHashtags
Find the top 3 most frequent tweets using Kafka and Storm
lidalei/VideoMakerPro
Implements the SLIC superpixel algorithm in C++, along with image transitions.