vatsan
Engineering Manager, ML@Meta. Previously, ML at Salesforce Einstein, Pivotal, Sony, IBM Almaden Research, and UT Austin CS
Meta
San Francisco
Pinned Repositories
dspcfboilerplate
Boilerplate code for Flask apps on PCF that interact with a backend environment (e.g., Pivotal BDS or ElephantSQL).
gp-ark-tweet-nlp
A PL/Java wrapper for Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/), a Twitter part-of-speech tagger, for use in Postgres/Greenplum
gp-sql-snippets
Temporary home for data processing/machine learning SQL snippets on Greenplum/HAWQ
gp_jupyter_notebook_templates
Collection of Jupyter notebook templates to work with Greenplum/HAWQ/PostgreSQL
gp_xgboost_gridsearch
In-database parallel grid-search for XGBoost on Greenplum
pandas_via_psql
Invoke Pandas plotting by piping in SQL output via psql (can be used with Postgres, Greenplum, or any SQL engine).
postgresopen-2017
Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk
pymadlib
A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms
slam
An implementation of the particle filtering algorithm for simultaneous localization and mapping (SLAM) in autonomous robots.
text_analytics_on_mpp
Collection of tutorials on text analytics/NLP, including vector space models, neural language models and topic models on the Pivotal MPP platform (Greenplum/HAWQ).
vatsan's Repositories
vatsan/tasa
Topic and sentiment analysis of tweets (demo)
vatsan/xgboost
Large-scale and distributed gradient boosting (GBDT, GBRT or GBM) library, on a single node, Hadoop YARN, and more.
vatsan/conda-buildpack
Buildpack for Conda.
vatsan/fnc-1
vatsan/gpcorenlp
vatsan/gpdb
Pivotal Greenplum Database
vatsan/hrc_emails
vatsan/incubator-madlib
Mirror of Apache MADlib (Incubating)
vatsan/keras_practice
vatsan/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
vatsan/madlib
Open-source library for scalable in-database analytics.
vatsan/Meta
Python Meta Programming
vatsan/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
vatsan/PDLTools
PDL Tools is a library of reusable tools used and developed by the Pivotal Data Science and Data Engineering teams.
vatsan/PivotalR
vatsan/ppt
vatsan/scdf
vatsan/shap
A unified approach to explain the output of any machine learning model.
vatsan/spaCy
Industrial-strength Natural Language Processing with Python and Cython
vatsan/tf_samples
vatsan/vatsan.github.io
Personal website based on Jekyll Chirpy