The Complementary Nature of Different NLP Toolkits for Named Entity Recognition in Social Media

Batista,F; Álvaro Figueira

The Complementary Nature of Different NLP Toolkits for Named Entity Recognition in Social Media

dc.contributor.author	Batista,F	en
dc.contributor.author	Álvaro Figueira	en
dc.date.accessioned	2018-01-10T10:19:46Z
dc.date.available	2018-01-10T10:19:46Z
dc.date.issued	2017	en
dc.description.abstract	In this paper we study the combined use of four different NLP toolkits—Stanford CoreNLP, GATE, OpenNLP and Twitter NLP tools—in the context of social media posts. Previous studies have shown performance comparisons between these tools, both on news and social media corporas. In this paper, we go further by trying to understand how differently these toolkits predict Named Entities, in terms of their precision and recall for three different entity types, and how they can complement each other in this task in order to achieve a combined performance superior to each individual one. Experiments on two publicly available datasets from the workshops WNUT-2015 and #MSM2013 show that using an ensemble of toolkits can improve the recognition of specific entity types - up to 10.62% for the entity type Person, 1.97% for the type Location and 1.31% for the type Organization, depending on the dataset and the criteria used for the voting. Our results also showed improvements of 3.76% and 1.69%, in each dataset respectively, on the average performance of the three entity types. © Springer International Publishing AG 2017.	en
dc.identifier.uri	http://repositorio.inesctec.pt/handle/123456789/5825
dc.identifier.uri	http://dx.doi.org/10.1007/978-3-319-65340-2_65	en
dc.language	eng	en
dc.relation	5088	en
dc.rights	info:eu-repo/semantics/openAccess	en
dc.title	The Complementary Nature of Different NLP Toolkits for Named Entity Recognition in Social Media	en
dc.type	conferenceObject	en
dc.type	Publication	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: P-00M-YF0.pdf
Size:: 229.39 KB
Format:: Adobe Portable Document Format
Description:

Download

Collections

CRACS - Indexed Articles in Conferences