Pinned Repositories
BigDataPatent
Text Mining Patents for Big Data Course Project
COS738-AutomaticPatentClassification
Machine learning system to classify a patent application based on its title and abstract, using INPI-Brazil RPI text data
data
Data and code behind the articles and graphics at FiveThirtyEight
ipc-classifier
IPC classifier for WIPO-alpha dataset
keyphrase_extraction
Implementation of a keyword extraction algorithm using tf-idf weighting
pandas-videos
Jupyter notebook and datasets from the pandas Q&A video series
pycon-2016-tutorial
Tutorial: Machine Learning with Text in scikit-learn
pypatent-1
Search for and retrieve US Patent and Trademark Office Patent Data
tensorflow
Computation using data flow graphs for scalable machine learning
twitter_nlp
Twitter NLP Tools
samiratzn's Repositories
samiratzn/data
Data and code behind the articles and graphics at FiveThirtyEight
samiratzn/bokeh
Interactive Web Plotting for Python
samiratzn/Data_Science_Fundamentals
samiratzn/DeepGraphCNNforTexts
samiratzn/Document-Classifier-LSTM
A bidirectional LSTM with attention for multiclass/multilabel text classification.
samiratzn/document_classification
samiratzn/DocumentSimilarity
A search engine for semantically similar questions
samiratzn/handson-ml
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn and TensorFlow.
samiratzn/HDLTex
HDLTex: Hierarchical Deep Learning for Text Classification
samiratzn/imbalanced-learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
samiratzn/LLM-Finetuning-Hub
Repository that contains LLM fine-tuning and deployment scripts along with our research findings.
samiratzn/Machine-Learning-with-Python
Python code for common machine learning algorithms
samiratzn/natural-language-processing
Resources for "Natural Language Processing" Coursera course.
samiratzn/NeelBlog
Contains the code and CSV files from my blog
samiratzn/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
samiratzn/NLP-with-Python
Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more
samiratzn/nltk-trainer
Train NLTK objects with zero code
samiratzn/patent_app_detect
Derives popular terminology included within a particular patent technology area (CPC classification), based on text analysis of patent abstract information
samiratzn/patents-public-data
Patent analysis using the Google Patents Public Datasets on BigQuery
samiratzn/pdfminer
Python PDF Parser
samiratzn/Quadflor
End-to-end multi-label classification in Python
samiratzn/scikit-learn
scikit-learn: machine learning in Python
samiratzn/scikit-learn-videos
Jupyter notebooks from the scikit-learn video series
samiratzn/scikit-multilearn
A scikit-learn based module for multi-label et al. classification
samiratzn/sklearn-hierarchical-classification
Hierarchical classification module based on scikit-learn's interfaces
samiratzn/Sklearn_Tutorial
A previous project based on scikit-learn for COMP 5212 at HKUST. The project includes a logistic regression model, a multi-layer perceptron, an SVM, and related cross-validation procedures.
samiratzn/solutions
Solutions for projects.
samiratzn/Text-Documents-classiffication
Patent document classification
samiratzn/text_classification
All kinds of text classification models and more with deep learning
samiratzn/ThinkStats2
Text and supporting code for Think Stats, 2nd Edition