tf-idf-vectorizer
There are 116 repositories under tf-idf-vectorizer topic.
zjohn77/retrieval
Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to specific types of documents (e.g. tweets, scientifc journals); (3) is deployable on either the client-side or the server side.
esharma3/myers-briggs-personality-prediction
NLP based Classification Model that predicts a person's personality type as one of the 16 Myers Briggs personality types. Extremely challenging project dealing with correlation between human psychology and casual writing styles and handling heavily imbalanced classes. Check the app here - https://mb-predictor-motetuzs5q-uc.a.run.app/
kodiks/turkish-news-classification
Turkish News Category Classification Tutorial
jalajthanaki/Basic_Ecommerce_Recomendation_System
This repository contains the code for basic kind of E-commerce recommendation engine. By using the concept of TF-IDF and cosine similarity, we have built this recommendation engine.
Bill-Klay/Skincare-Recommendation-Android-Application
Skincare recommendation android application that uses dataset from Kaggle and scrapped data from cosmetics websites to work a Tf-IDF vectorizer for content based filtering, and KNN and Decision trees for collaborative based filtering. The notebook also contains other approaches for POC including SVD. Backend APIs are based on Flask, Android application is made using Java with Android Studio whereas Firebase acts as the database and the middleware for relaying login information as well to serve the data to the application.
agushendra7/twitter-sentiment-analysis-using-inset-and-random-forest
Twitter Sentiment Analysis Using InSet (Indonesia Sentiment Lexicon) and Random Forest Classifier
iAmKankan/Natural-Language-Processing-NLP-Tutorial
NLP tutorials and guidelines to learn efficiently
jalajthanaki/medical_notes_extractive_summarization
Extractive summarizationof medical transcriptions
opennlp/Large-Scale-Text-Classification
Large Scale benchmarking of state of the art text vectorizers
parvez86/Smart-Recruitment-System
A simple Django-based resume ranker website where recruiters post their jobs and candidates applies for their desired vacancies. The system gets the document similarity between the job description and the candidate resumes, generates similarity scores using the KNN model, and rank or shortlist the candidate resumes.
chlaudiah/Sentiment-Classification-FD-Reviews
Text Classification for Sentiment Analysis using Female Daily's Reviews Dataset
chunwangpro/textual-information-extraction-and-numeric-processing
Extract textual information from Amazon products reviews and draw correlations through regression and fluctuation analysis.
agushendra7/twitter-sentiment-analysis-using-vader-and-random-forest
Twitter Sentiment Analysis Using Vader Lexicon and Random Forest Classifier
rochitasundar/TwitterSentimentAnalysis-BigDataProject
Scrapped tweets using twitter API (for keyword ‘Netflix’) on an AWS EC2 instance, ingested data into S3 via kinesis firehose. Used Spark ML on databricks to build a pipeline for sentiment classification model and Athena & QuickSight to build a dashboard
rzninvo/Information-Retrieval-Course-Project
Course Project of Information Retrieval.
samimakhan/Spam-Classification-Project
Spam Classifier project for my end-of-semester project for Intro to AI class. We were a group of four people. I worked on all the Naive Bayes models.
shrebox/Information-Retrieval
Compilation of Information Retrieval codes.
Dutta-SD/AggDetectApp
A web application that detects aggression and misogyny in text using BERT augmentation, sentiment analysis, XGBoost, TF-IDF vectorization, LIME explainability. [Paper accepted at ICON 2021]
GNDEC-Minor-Project-KGJ/book-recommendation-system
A recommendation system for books. Built by following two filtering methods that are Collaborative Filtering and Content Based Filtering. Algorithms used are KNN, Pearson Correlation, and TF-IDF. Every dataset used can be easily found in the data folder of the respository.
jeyadosstimothy/ML-on-CrisisLex
Application of Machine Learning Techniques for Text Classification and Topic Modelling on CrisisLexT26 dataset.
Kaushalmam/Search-engine
Implementation of a search engine using a vector space model.
mhmdsab/Spam-Classifier
spam classifier with a dataset of 5000 mail
DanniRodrJ/Content-Based_Movie_Recommendation_System
Sistema de recomendación de películas basado en contenido. Utilizando TF-IDF y la similitud del coseno. La data fue extraída, transformada y analizada para el entrenamiento del modelo. Disponibilizandolo junto con la data limpia para futuras consultas, a través del despliegue con FastAPI y Render.
faizulislamfair/Mars-Marvel
This project is built using MERN Stack & Tailwind CSS! As the condition of earth is uncertain, we can't but look for alternative habitual planets to hold onto the survival of human race and so we've developed this project to encourage amateur astronomers, citizen scientists and anybody interested to carry out research on Mars!
Izu-33/scotus
This repo illustrates the use of NLP techniques in legal analytics. Herein, contributors used the Supreme Court of The United States facts and their corresponding issue areas to predict the outcome of a case. After training an LSTM neural network, contributors implemented the model in a streamlit app.
kendalldyke14/NYTCriticsChoicePrediction
New York Times Movie Critics Choice Designation Prediction
lovpatel93/Kaggle-Spam
Spam Detection – Cluster SMS messages to “Spam” and “Ham” (Kaggle Challenge)
mahyarsab/StackOverflow_Questions_Quality_Capstone_NLP_Project
The objective of this capstone project is to use Natural Language Processing (NLP) to create a machine-learning model that predicts the quality of questions posted on Stack Overflow, a popular question-and-answer platform for software developers.
Nisarg221B/Recommendation-System
Food Recommendation System based on three model of recommendations
nivesayee/recipe-genie
Recipe Genie is a recipe recommendation system that recommends recipes to users based on the ingredients they have at home.
shanuhalli/Project-Resume-Classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
LasithaAmarasinghe/Movie-Recommender-System
This ML model recommends movies that may align with the user's preferences based on TF-IDF matrix.
Sanjeev-Kumar78/Book-Recommendation-System
This is a book recommendation system based on the book rating data from GoodReads_100k dataset. The dataset contains 100k book.