Unraveling Coreference Resolution in NLP!
Last Updated on February 25, 2021 by Editorial Team
Author(s): Ananya Banerjee
Natural Language Processing
Unraveling Coreference Resolution inΒ NLP!
Coreference Resolution is one of the most essential Natural Language Processing (NLP) tasks. But, before we begin to understand Coreference Resolution, it is essential to understand the definition of discourse.
Discourse in the context of NLP refers to a sequence of sentences occurring one after the other. There will obviously be entities that are being talked about and possible references to those entities in the discourse. We use the word βmentionβ to refer to these references.
An example of a discourse:
Ana is a Graduate Student at UT Dallas. She loves working in Natural Language Processing at the institute. Her hobbies include blogging, dancing andΒ singing.
Here, βAnaβ, βNatural Language Processingβ and βUT Dallasβ are possible entities.
βSheβ and βHerβ are references to the entity βAnaβ and βthe instituteβ is a reference to the entity βUTΒ Dallasβ.
Reference, in NLP, is a linguistic process where one word in a sentence or discourse may refer to another word or entity. The task of resolving such references is known as Reference Resolution. In the above example, βSheβ and βHerβ referring to the entity βAnaβ and βthe instituteβ referring to the entity βUT Dallasβ are two examples of Reference Resolution.
Letβs Summarize:
Discourse in the context of NLP refers to a sequence of sentences occurring one after theΒ other
Reference is a linguistic process where one word in a sentence or discourse refers to another word orΒ entity
The task of resolving such references is known as Reference Resolution.
Coreference Resolution in particular, is the process of resolving pronouns to identify which entities are they referring to. It is also a kind of Reference Resolution. The entities resolved may be a person, place, organization, orΒ event.
Referent is the object that is being referred to. For example, βAnaβ is the referent in the aboveΒ example.
Referring expression are the mentions or linguistic expressions given in the discourse.
Two or more referring expressions that refer to the same discourse entity are said to corefer[1].
Now, let us look at another example to understand thisΒ better.
Example Discourse:
βElon Musk was born on June 28, 1971. He is the founder, CEOΒ , chief engineer and designer of SpaceX. The 49 year old is widely known as the mind behind Neuralink.β
Referring Expressions: Elon Musk, He, The 49 yearΒ old
Referent: ElonΒ Musk
Corefering Expressions: {Elon Musk, He}, {Elon Musk, The 49 yearΒ old}
Now, that we understand the basics of coreference resolution, it is essential to understand what kind of references may exist in text. The knowledge about the kind of references helps us devise strategies to resolve them, if and whenΒ found.
References are usually of two kinds: Exaphor and Endophor. Endophor refers to an entity that appears in the discourse. While Exaphor refers to an entity that does not appear in the discourse.
Example ofΒ Endophor
Sentence: βAna loves to read. She recently read a wonderful story.β
Here βSheβ refers to βAnaβ which appears as a possible referrent that is mentioned explicitly in the discourse.
Example ofΒ Exaphor
Sentence: βPick that up.β(pointing to anΒ object)
Here βthatβ refers to a object which appears as a possible referrent for a object that it not mentioned explicitly in the discourse
There are primarily two kinds of Endophors: Anaphor and Cataphor. Anaphor refers to a situation wherein the referential entity or referent appears before its referencing pronoun in the discourse.
Example ofΒ Anaphor
Sentence: βAna bought a dress. She lovesΒ it.β
Here βSheβ refers to βAnaβ whose occurence precedes the occurence of its referencing pronoun βSheβ in the discourse.
While, Cataphor refers to a situation wherein the entity or referent occurs later than its referencing pronoun in the discourse.
Example ofΒ Cataphor
Sentence: βWhen she bought the dress, Ana didnβt know it wasΒ torn.β
Here βsheβ occurs before its referential entity or referent βAnaβ in the discourse. Thus, this is an example of cataphor.
The set of corefering expressions is also called a coreference chain or a cluster. Now, that we understand what kinds of references are prevalent in literature, it is essential to grasp its linguistic properties asΒ well.
Understanding these linguistic properties of the coreference relation helps us understand how to best perform coreference resolution and minimize the rate of error in the process[1]. The important thing to remember is that these properties may vary from one language to another depending upon its rules. So, please make sure that you are well versed in the grammatical rules of a language before attempting to perform coreference resolution. For the sake of lucidity in this article, we will consider English as our primary language.
Some of the linguistic properties which we will be talking aboutΒ are:
Number and Gender Agreement
Recency
Grammatical Role
Verb Semantics
Selectional Restrictions
Repeated Mention
Parallelism
Now, let me explain these properties one by one. Number Agreement basically means that the referencing expressions should agree in number. While Gender Agreement implies that the referencing expressions agree in gender. Letβs look at an example ofΒ each.
Example of Gender and Number Agreement:
βAnalisa works at Google. She loves herΒ work.β
Here, βAnalisaβ and βSheβ agree in gender Female and they agree in number i.e., only one person βAnalisaβ works at Google and thus we use βSheβ to reference her rather than using other pronouns like βheβ, βtheyβ, etc. This is gender and number agreement.
Here, βhorsesβ are plural. Hence, the referent used is βtheyβ in order to refer to the entity βhorsesβ.Thus, they agree on numbers. This is number agreement.
Another property to remember is the Grammatical role. This property takes advantage of the inherent grammatical nature of a sentence which gives more saliency value to subject entity as compared to an object entity. In other words, we assume that an entity which is a subject is usually more important than an objectΒ entity.
Example of Grammatical Role
βAna works at an MNC with Tia. She usually worksΒ harder.β
In this sentence, we have βAnaβ and βTiaβ as candidate referents for the word βSheβ. Here βAnaβ is the subject while βTiaβ is the object. Thus, keeping in mind the saliency, we deem βAnaβ to be coreferent to βSheβ rather than βTiaβΒ .
The next factor to consider is Verb Semantics. Some verbs tend to lend more meaning to one of their arguments as compared to others while performing semantic analysis.
Example of Verb Semantics
βAna helped Christa. She was the architect behind the project.β
βAna condemned Christa. She was the architect behind the project.β
Here, in the first sentence, the usage of verb βhelpedβ implies that the probability of Ana being the architect behind the project is higher than Christa. Thus, βSheβ refers to βAnaβ in the first sentence.
However, in the second sentence, the usage of verb βcondemnedβ implies that the probability of Christa being the architect behind the project is higher than Ana. Thus, βSheβ refers to βChristaβ in the second sentence.
The next thing to understand is Selectional Restrictions. This utilizes semantic knowledge about a sentence to determine the referentβs preference.
Example of Selectional Restrictions
βI ate the roasted chicken in my pajamas after roasting it for three hours in theΒ oven.β
Here, two possible referents for βitβ are βpajamasβ and βchickenβ. The usage of the verb βeatβ (βateβ is past tense for βeatβ) implies that the referent entity must be edible, thereby choosing βchickenβ as the referent forΒ βitβ.
Another essential feature to understand is Repeated Mention. This feature or property says that if an entity or a set of entities are referred to repeatedly in the discourse, then the probability of them being possible referents increases exponentially.
Example of Repeated Mention[2]
βJohn needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra.β
Here, the repeated mention of βJohnβ as compared to βBillβ as the focal point implies that βHeβ refers to βJohnβ and notΒ βBillβ.
Last but not the least, we talk about Parallelism. This property lends more importance to a referent if it can draw similar properties in terms of syntactic and semantic information from another sentence.
Example of Parallelism[2]
Mary went with Sue to the Acura dealership.
Sally went with her to the Mazda dealership.
Here, βherβ refers to Sue as both the sentences imply similar synatactic and semantic structure and we can draw parallels betweenΒ them.
Please do note that most of these factors are more valid while performing anaphoric reference resolution than other forms of reference resolution. In particular, the Exaphoric resolution of coreference is typically much harder to deal with and requires quite a different tactic.
Now, that you understand what is coreference resolution and how to resolve the references in case of ambiguity, let me also briefly mention two well-known libraries that can be used for coreference resolution. The first is StanfordCoreNLP and neuralcoref by Huggingface. You can use any of them to help ease your journey towards coreference resolution. You can also test these coreference systems online without any downloading necessary, here andΒ here.
A few of the most well-known applications of coreference resolution can be found in Machine Translation, text, or natural language understanding tasks such as information extraction, question answering, summarization, etc.
I hope this article helped you better grasp the underlying concepts behind Coreference Resolution.
Thank you forΒ reading!
References:
- Speech and Language Processing, 3rd Edition by Dan Jurafsky and James H.Β Martin
- Coreference Resolution and Discourse Coherence by Dr Mithun Balakrishnan
πππ£ππ§ππππππ βπ π£ππππ£ππππ βππ€π ππ¦π₯ππ π! was originally published in Towards AI on Medium, where people are continuing the conversation by highlighting and responding to this story.
Published via Towards AI