token-classification

There are 51 repositories under token-classification topic.

KRLabsOrg/LettuceDetect
LettuceDetect is a hallucination detection framework for RAG applications.
Language:Python49216
modelscope/AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
Language:Python448 12 4642
4AI/LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
Language:Python155 2 2124
luozhouyang/transformers-keras
Transformer-based models implemented in tensorflow 2.x(using keras).
Language:Python76 4 413
satya77/Transformer_Temporal_Tagger
Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging
Language:Python66 3 125
Kardbord/hfapigo
Unofficial (Golang) Go bindings for the Hugging Face Inference API
Language:Go62 2 145
fido-ai/ua-datasets
A collection of datasets for Ukrainian language
Language:Python56 2 12
ufal/factgenie
Lightweight self-hosted span annotation tool
Language:Python34 3 1177
datnnt1997/VPhoBertTagger
Token classification using Phobert Models for Vietnamese
Language:Python12 1 03
Babelscape/cner
CNER: Concept and Named Entity Recognition
Language:Jupyter Notebook11 3 01
nachoDRT/MERIT-Dataset
The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, where we are actively working. This repository is actively maintained, and new features are continuously being added.
Language:Python10 2 01
pha123661/NTU-2022Fall-ADL
Applied Deep Learning 深度學習之應用 by Vivian Chen 陳縕儂 at NTU CSIE
Language:Python10 1 00
rafaelpierre/bullet
bullet: A Zero-Shot / Few-Shot Learning, LLM Based, text classification framework
Language:Jupyter Notebook10 2 53
aditeyabaral/maple
Implementation of the paper, MAPLE - MAsking words to generate blackout Poetry using sequence-to-sequence LEarning, ICNLSP 2021
Language:Python8 2 22
Ahwar/NER-NLP-with-ONNX-Java
A Java NLP application that identifies names, organizations, and locations in text by utilizing Hugging Face's RoBERTa NER model through the ONNX runtime and the Deep Java Library.
Language:Java8 2 12
Babelscape/ID10M
Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).
Language:Python7 3 11
Antarlekhaka/code
Multi-task NLP Annotation Framework
Language:JavaScript6 1 92
awsaf49/pii-data-detection
The Learning Agency Lab - PII Data Detection || Develop automated techniques to detect and remove PII from educational data.
Language:Jupyter Notebook6 1 01
VirtualRoyalty/gan-plus-nlp
Generative adversarial approach to most popular NLP tasks
Language:Jupyter Notebook4 1 11
AlexKly/Detailed-NER-Dataset-RU
Labeled Russian text token-by-token for training models for NER task based samples got from parsing different resources and generated by ChatGPT.
Language:Python3 1 00
AshutoshDongare/softskill-NER
Fine tuning 🤗 transformer model for softskill NER task
Language:Jupyter Notebook3 1 01
1024-m/NAACL-2024-SemEval-TASK-8C
Code for the paper : Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts
Language:Jupyter Notebook2 1 00
abmami/Fine-tuning-CamemBERT-for-Keyword-Extraction
Fine-tuning CamemBERT for French keywords extraction on custom dataset.
Language:Jupyter Notebook2 1 32
aditeyabaral/maple-v2
MAPLEv2 - Multi-task Approach for generating blackout Poetry with Linguistic Evaluation
Language:Python2 1 01
anudeepvanjavakam1/lit_or_not_on_reddit
This app searches reddit posts and comments to determine if a product or service has a positive or negative sentiment and predicts top product mentions using Named Entity Recognition
Language:Jupyter Notebook2 1 00
mahvash-siavashpour/BERT-Token-Classification-for-Persian-Kasr-e-Ezafeh
Identify if each of the words in a Persian sentence need a kasr-e-ezafeh tag or not.
Language:Jupyter Notebook2 1 00
MohammedAly22/Tasneef
A state-of-the-art Arabic part-of-speech tagger leveraging the XLMR transformer model With an impressive testing accuracy of 97.49% and a remarkable testing F1-score of 96.44% on the Arabic UD Treebank.
Language:Jupyter Notebook2 2 00
nlp4se/RE-Miner-Dashboard
NLP interactive dashboard for users to interact with the RE-Miner Ecosystem for data analysis, visualization, and NLP-based insights.
Language:SCSS2 1 01
Semihocakli/nlp-with-hugging-face
Language:Jupyter Notebook2 1 00
TirendazAcademy/Multilingual-NER-App
Building a multilingual NER app with HuggingFace, Gradio and Comet
Language:Jupyter Notebook2 2 0
Harito97/Reconstruct_Vietnamese_diacritics
Develop a deep learning model to accurately restore Vietnamese diacritics.
Language:Python1 1 00
MohammedAly22/ArabNizer
ArabiNizer is a state-of-the-art Arabic named entity recognizer (NER) leveraging the XLMR transformer model with an impressive testing accuracy of 95.00% and a remarkable testing F1-score of 88.00% on the PAN-X.AR subset from XTREME.
Language:Jupyter Notebook1 2 00
naivenlp/rapidnlp-datasets
Data pipelines for both TensorFlow and PyTorch!
Language:Python1 1 00
prasoonvarshney/scientific-entity-recognition
End-to-end pipeline for (1) automatic scraping and parsing of NLP research papers, (2) token-level entity annotations in Label Studio, and (3) BERT-based models for span identification and entity recognition
Language:Jupyter Notebook1 2 00
vedantMahangade/PII-Data-Detection
A reliable automated LLM based Model for detecting PII in Student Writing
Language:Jupyter Notebook1 1 00
vineetk1/sequence-tagging
Sequence-tagging using deep learning
Language:Python1 0 00

token-classification

KRLabsOrg/LettuceDetect

modelscope/AdaSeq

4AI/LS-LLaMA

luozhouyang/transformers-keras

satya77/Transformer_Temporal_Tagger

Kardbord/hfapigo

fido-ai/ua-datasets

ufal/factgenie

datnnt1997/VPhoBertTagger

Babelscape/cner

nachoDRT/MERIT-Dataset

pha123661/NTU-2022Fall-ADL

rafaelpierre/bullet

aditeyabaral/maple

Ahwar/NER-NLP-with-ONNX-Java

Babelscape/ID10M

Antarlekhaka/code

awsaf49/pii-data-detection

VirtualRoyalty/gan-plus-nlp

AlexKly/Detailed-NER-Dataset-RU

AshutoshDongare/softskill-NER

1024-m/NAACL-2024-SemEval-TASK-8C

abmami/Fine-tuning-CamemBERT-for-Keyword-Extraction

aditeyabaral/maple-v2

anudeepvanjavakam1/lit_or_not_on_reddit

mahvash-siavashpour/BERT-Token-Classification-for-Persian-Kasr-e-Ezafeh

MohammedAly22/Tasneef

nlp4se/RE-Miner-Dashboard

Semihocakli/nlp-with-hugging-face

TirendazAcademy/Multilingual-NER-App

Harito97/Reconstruct_Vietnamese_diacritics

MohammedAly22/ArabNizer

naivenlp/rapidnlp-datasets

prasoonvarshney/scientific-entity-recognition

vedantMahangade/PII-Data-Detection

vineetk1/sequence-tagging