Abstract
Newspapers need to attract readers with headlines, anticipating their readers’ preferences. These preferences rely on topical, structural, and lexical factors. We model each of these factors in a multi-task GRU network to predict headline popularity. We find that pre-trained word embeddings provide significant improvements over untrained embeddings, as do the combination of two auxiliary tasks, newssection prediction and part-of-speech tagging. However, we also find that performance is very similar to that of a simple Logistic Regression model over character n-grams. Feature analysis reveals structural patterns of headline popularity, including the use of forward-looking deictic expressions and second person pronouns.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. EMNLP 2018 |
Editors | Ellen Riloff, David Chiang, Julia Hockenmaier, Junichi Tsujii |
Number of pages | 6 |
Place of Publication | Stroudsburg, PA |
Publisher | Association for Computational Linguistics |
Publication date | 2018 |
Pages | 659-664 |
ISBN (Electronic) | 9781948087841 |
Publication status | Published - 2018 |
Event | 2018 Conference on Empirical Methods in Natural Language Processing - Square Meeting Center, Brussels, Belgium Duration: 31 Oct 2018 → 4 Nov 2018 http://emnlp2018.org/ |
Conference
Conference | 2018 Conference on Empirical Methods in Natural Language Processing |
---|---|
Location | Square Meeting Center |
Country/Territory | Belgium |
City | Brussels |
Period | 31/10/2018 → 04/11/2018 |
Internet address |