Pinned Repositories
AdminBSBMaterialDesign
AdminBSB - Free admin panel that is based on Bootstrap 3.x with Material Design
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
awesome-analytics
A curated list of analytics frameworks, software and other tools.
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
dataproc-initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
dwh-migration-tools
hive serde, i/o format extraction
HbaseRestExample
manasarovar
A simple search engine for the Datalake
yogeshtewari's Repositories
yogeshtewari/dataproc-initialization-actions
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
yogeshtewari/manasarovar
A simple search engine for the Datalake
yogeshtewari/AdminBSBMaterialDesign
AdminBSB - Free admin panel that is based on Bootstrap 3.x with Material Design
yogeshtewari/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
yogeshtewari/awesome-analytics
A curated list of analytics frameworks, software and other tools.
yogeshtewari/awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
yogeshtewari/awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
yogeshtewari/DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
yogeshtewari/dwh-migration-tools
hive serde, i/o format extraction
yogeshtewari/HbaseRestExample
yogeshtewari/incubator-atlas
Mirror of Apache Atlas (Incubating)
yogeshtewari/PythonFlaskRemoteApp
yogeshtewari/spark
Mirror of Apache Spark
yogeshtewari/SparkHbaseExample
yogeshtewari/storm
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
yogeshtewari/tensorflow
Computation using data flow graphs for scalable machine learning
yogeshtewari/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
yogeshtewari/yarn-book
Code samples for the book
yogeshtewari/yogeshtewari.github.io