Venkata09
13+ years of experience in backend development, data engineering. Machine Leaning Enthusiastic.
CapitaloneVirginia
Venkata09's Stars
iluwatar/java-design-patterns
Design patterns implemented in Java
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
dl4s/dl4s
source code accompanying "Deep Learning for Search" book
davidmoten/rxjava2-jdbc
RxJava2 integration with JDBC including Non-blocking Connection Pools
cherryljr/LeetCode
LeetCode各题解法分析~(Java and Python)
bephrem1/interviewpen
Code samples for Back to Back SWE lessons (archive).
cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
thelastpickle/cassandra-reaper
Automated Repair Awesomeness for Apache Cassandra
datasciencescoop/Data-Science--Cheat-Sheet
Cheat Sheets
microservices-patterns/ftgo-application
Example code for the book Microservice patterns
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
davidmoten/rxjava-jdbc
Efficient execution and functional composition of database calls using jdbc and RxJava Observables
zaratsian/Spark
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
alibaba/Sentinel
A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
apache/incubator-seata
:fire: Seata is an easy-to-use, high-performance, open source distributed transaction solution.
ageron/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
yamrcraft/etl-light
A light Kafka to HDFS/S3 ETL library based on Apache Spark
CoxAutomotiveDataSolutions/waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
microsoft/SynapseML
Simple and Distributed Machine Learning
sainathadapa/kaggle-freesound-audio-tagging
8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)
explosion/spacy-course
👩🏫 Advanced NLP with spaCy: A free online course
Hvass-Labs/TensorFlow-Tutorials
TensorFlow Tutorials with YouTube Videos
jayparks/quasi-rnn
A PyTorch Implementation of "Quasi-Recurrent Neural Networks"
rohithreddy024/Text-Summarizer-Pytorch
Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network
intel-analytics/BigDL-2.x
BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
MrPowers/spark-daria
Essential Spark extensions and helper methods ✨😲
kdn251/interviews
Everything you need to know to get the job.
japila-books/spark-sql-internals
The Internals of Spark SQL
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
cerndb/hdfs-metadata
Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks and nodes.