nlg-dataset

There are 15 repositories under nlg-dataset topic.

  • shawnwun/RNNLG

    RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

    Language:Python491404126
  • INK-USC/CommonGen

    A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning

    Language:Python1408023
  • amazon-science/bold

    Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper

  • hadyelsahar/RE-NLG-Dataset

    T-Rex : A Large Scale Alignment of Natural Language with Knowledge Base Triples

    Language:Python647412
  • majumderb/recipe-personalization

    EMNLP 2019: Generating Personalized Recipes from Historical User Preferences

    Language:Python618618
  • harsh19/spot-the-diff

    EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.

    Language:Jupyter Notebook58468
  • harsh19/ChessCommentaryGeneration

    Harsh Jhamtani*, Varun Gangal*, Eduard Hovy, Graham Neubig, Taylor Berg-Kirkpatrick. Learning to Generate Move-by-Move Commentary for Chess Games from Large-Scale Social Forum Data. ACL 2018

    Language:OpenEdge ABL356711
  • Tixierae/OrangeSum

    The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".

    Language:Jupyter Notebook22312
  • maxent-ai/Datasets

    datasets with text data for use in NLP, Text analysis, information extraction, ML research.

    Language:Jupyter Notebook16503
  • rashad101/RoMe

    PyTorch code for ACL 2022 paper: RoMe: A Robust Metric for Evaluating Natural Language Generation https://aclanthology.org/2022.acl-long.387/

    Language:Python10225
  • marco-roberti/pytorch-e2e-dataset

    The E2E Dataset, packed as a PyTorch DataSet subclass

    Language:Python6300
  • pratikratadiya/Narendra_Modi_speeches

    Collection of text transcript of speeches delivered by the PM of India Mr. Narendra Modi.

    Language:Jupyter Notebook3003
  • Dia-Bete/PersonaBasedCorpus

    Repository for the LREC-COLING 2024 Paper: Persona-Based Corpus in the Diabetes Mellitus Domain – Applying a Human-Centered Approach to a Low-Resource Context

  • grig-guz/tree-content-structuring

    Content structuring for NLG with discourse dependency trees.

    Language:Python1100
  • asnota/conceptfr

    A repository for ConceptFR project files.

    Language:Jupyter Notebook10