π CTRLsum (original) (raw)
π New model addition
Model description
Current summarization systems yield generic summaries that are disconnected from usersβ preferences and expectations. To address this limitation, we present CTRLsum, a novel framework for controllable summarization.
Our approach enables users to control multiple aspects of generated summaries by interacting with the summarization system through textual input in the form of a set of keywords or descriptive prompts.
Using a single unified model, CTRLsum is able to achieve a broad scope of summary manipulation at inference time without requiring additional human annotations or pre-defining a set of control aspects during training.
We quantitatively demonstrate the effectiveness of our approach on three domains of summarization datasets and five control aspects:
- entity-centric
- length-controllable summarization
- contribution summarization on scientific papers
- invention purpose summarization on patent filings
- question-guided summarization on news articles in a reading comprehension setting
Moreover, when used in a standard, uncontrolled summarization setting, CTRLsum achieves state-of-the-art results on the CNN/DailyMail dataset.
Open source status
- the model implementation is available: https://github.com/salesforce/ctrl-sum
- the model weights are available: Download link available in the README of the repo
- who are the authors: @jxhe @muggin