Pinned Repositories
bidding_ad_optimization_module
Module to predict prob. of a browser's conversion based on PVSVR & CPA
Building-Custom-lemmatizer
Building a custom Lemmatizer to plug in to the TF-IDF and other Information Retrieval problems
Creating-Custom-job-feeds-for-Linkedin
Building relevant job feeds based on TF-IDF(custom written on uni-grams) , lemmatization and other NLTK constructs
Custom-Distance-function-for-typos-in-hand-generated-datasets-with-QWERY-Keyboard
Building a custom distance function for typographical errors - particularly with the QWERTY keyboard
Experiments-in-Data-mining
So how much is your Linkedin network worth ? Exploring useful data from Linkedin
MapReduce-with-PySpark
Repo to host code for scaling up native python code with Map reduce in Python
Performance-aggregation-platform---learning-in-near-real-time
Predicting_wine_quality
Quora-Challenges
Scraping-with-scrapy-in-python
A baby problem in scraping in python
ekta1007's Repositories
ekta1007/Experiments-in-Data-mining
So how much is your Linkedin network worth ? Exploring useful data from Linkedin
ekta1007/Custom-Distance-function-for-typos-in-hand-generated-datasets-with-QWERY-Keyboard
Building a custom distance function for typographical errors - particularly with the QWERTY keyboard
ekta1007/bidding_ad_optimization_module
Module to predict prob. of a browser's conversion based on PVSVR & CPA
ekta1007/Creating-Custom-job-feeds-for-Linkedin
Building relevant job feeds based on TF-IDF(custom written on uni-grams) , lemmatization and other NLTK constructs
ekta1007/Quora-Challenges
ekta1007/Building-Custom-lemmatizer
Building a custom Lemmatizer to plug in to the TF-IDF and other Information Retrieval problems
ekta1007/Predicting_wine_quality
ekta1007/Scraping-with-scrapy-in-python
A baby problem in scraping in python
ekta1007/MapReduce-with-PySpark
Repo to host code for scaling up native python code with Map reduce in Python
ekta1007/Performance-aggregation-platform---learning-in-near-real-time
ekta1007/redshift-udfs
SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs
ekta1007/Sampling-techniques
Initially developed for Kaggle's Expedia contest
ekta1007/scrapy-linkedin
Using Scrapy to get Linkedin's person public profile.
ekta1007/AB-testing-Framework
Performance Benchmarks for min sample size & tests of acceptance on OEC (overall evaluation criteria)
ekta1007/Data-mining-Pro
Creating scripts to automate all the near-boring part that comes between data preparation & start of modelling
ekta1007/finddupes
ekta1007/flask
A microframework based on Werkzeug, Jinja2 and good intentions
ekta1007/Generating-synthetic-weblogs-with-Selenium
ekta1007/Hello-world
ekta1007/kaggle-avazu
ekta1007/Kaggle_yandex
Initially written for Yandex competition hosted at kaggle
ekta1007/mincemeatpy
Lightweight MapReduce in python
ekta1007/multipolyfit
A multivariate polynomial regression function in python
ekta1007/play_store_scraper
Scraps the play store for app information.
ekta1007/predict_malfunction_components
ekta1007/python
scripts in python for some specific use, for instances measuring in learning to rank
ekta1007/pywFM
pywFM is a Python wrapper for Steffen Rendle's factorization machines library libFM
ekta1007/utilities