Natural Language Generation

Generation of text used in data-to-text, conversational agents, and narrative generation applications.

74 resources12 categoriesView Original

Datasets(10 items)

A

Alex Context NLG Dataset

A dataset for NLG in dialogue systems in the public transport information domain.

Datasets
B

Box-score data

This dataset consists of (human-written) NBA basketball game summaries aligned with their corresponding box- and line-scores.

Datasets
E

E2E

This shared task focuses on recent end-to-end (E2E), data-driven NLG methods, which jointly learn sentence planning and surface realisation from non-aligned data.

Datasets
N

Neural-Wikipedian

The repository contains the code along with the required corpora that were used in order to build a system that "learns" how to generate English biographies for Semantic Web triples.

Datasets
T

The Schema-Guided Dialogue Dataset

The Schema-Guided Dialogue (SGD) dataset consists of over 20k annotated multi-domain, task-oriented conversations between a human and a virtual assistant.

Datasets
T

The Wikipedia company corpus

Company descriptions collected from Wikipedia. The dataset contains semantic representations, short, and long descriptions for 51K companies in English.

Datasets
W

WeatherGov

Computer-generated weather forecasts from weather.gov (US public forecast), along with corresponding weather data.

Datasets
W

WebNLG

The enriched version of the WebNLG - a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.

Datasets
W

WikiBio - wikipedia biography dataset

This dataset gathers 728,321 biographies from wikipedia. It aims at evaluating text generation algorithms.

Datasets
Y

YelpNLG

YelpNLG provides resources for natural language generation of restaurant reviews.

Datasets

Neural Natural Language Generation(12 items)

A

aitextgen

A robust Python tool for text-based AI training and generation using GPT-2.

Neural Natural Language Generation
G

graph-2-text

Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py.

Neural Natural Language Generation
I

Image Caption Generator

A Neural Network based generative model for captioning images using Tensorflow.

Neural Natural Language Generation
L

lightnlg

A minimalistic codebase for finetuning and interacting with NLG models using PyTorch Lightning.

Neural Natural Language Generation
P

PaperRobot: Incremental Draft Generation of Sci...

We present a PaperRobot who performs as an automatic research assistant.

Neural Natural Language Generation
P

PPLM

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

Neural Natural Language Generation
Q

Question Generation using hugstransformers

Question generation is the task of automatically generating questions from a text paragraph.

Neural Natural Language Generation
S

Summary Generation From Structured Data

For converting information present in the form of structured data into natural language text.

Neural Natural Language Generation
T

Texar

Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks.

Neural Natural Language Generation
T

textgenrnn

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Neural Natural Language Generation
T

This Word Does Not Exist

This is a project allows people to train a variant of GPT-2 that makes up words, definitions and examples from scratch.

Neural Natural Language Generation
T

Transformers

State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Neural Natural Language Generation