mallika2011's Stars
SimplifyJobs/Summer2025-Internships
Collection of Summer 2025 tech internships!
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
jacobeisenstein/gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
tapaswenipathak/Open-Source-Programs
A list of open source programs.
thunlp/NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
jbmouret/matplotlib_for_papers
Handout for the tutorial "Creating publication-quality figures with matplotlib"
marcotcr/checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
graph4ai/graph4nlp
Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!
nlpyang/PreSumm
code for EMNLP 2019 paper Text Summarization with Pretrained Encoders
palash1992/GEM
kohlschutter/boilerpipe
Work in progress transmit from Google Code
yunjey/show-attend-and-tell
TensorFlow Implementation of "Show, Attend and Tell"
michiyasunaga/qagnn
[NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs 🤖
CornellNLP/ConvoKit
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the use of the toolkit on these datasets.
bohanli/BERT-flow
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
soulbliss/NLP-conference-compendium
Compendium of the resources available from top NLP conferences.
pratikratadiya/awesome-ml-internships
List of companies offering Machine learning and Data Science internships
facebookresearch/SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
AI4Bharat/Indic-BERT-v1
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
krishnap25/mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
aravindpai/How-to-build-own-text-summarizer-using-deep-learning
In this notebook, we will build an abstractive based text summarizer using deep learning from the scratch in python using keras
tshi04/SeaNMF
Short Text Topic Modeling
kharyal/jupyter-notebook-on-servers
ever wondered how to run jupyter notebook on servers like ada?
Adi2K/Precog-HASOC-2021
Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages 2021
ananyaarun/Structure-Based-Hate-Speech-Detection
Building a structure-based hate-speech detection system using NLP tools and ML models as part of IRE 2020 Final Project.
GaurangTandon/m20-ire-major-project
Question Generation Summarizer - done as a Major Project in IRE M20 course at IIIT-H
Tanvi141/WikiCats