
Last Updated on February 25, 2021 by Editorial Team

Author(s): Ananya Banerjee

Natural Language Processing

Unraveling Coreference Resolution in NLP!

Coreference Resolution

Coreference resolution is one of the most essential Natural Language Processing (NLP) tasks. But before we dive into coreference resolution, we first need to understand what a discourse is.

Discourse in the context of NLP refers to a sequence of sentences occurring one after the other. A discourse naturally contains entities that are being talked about, as well as references to those entities. We use the word “mention” to refer to these references.

An example of a discourse:

Ana is a Graduate Student at UT Dallas. She loves working in Natural Language Processing at the institute. Her hobbies include blogging, dancing and singing.

Here, “Ana”, “Natural Language Processing” and “UT Dallas” are possible entities.

“She” and “Her” are references to the entity “Ana”, and “the institute” is a reference to the entity “UT Dallas”.

Reference, in NLP, is a linguistic process where one word in a sentence or discourse may refer to another word or entity. The task of resolving such references is known as Reference Resolution. In the above example, “She” and “Her” referring to the entity “Ana” and “the institute” referring to the entity “UT Dallas” are two examples of Reference Resolution.

Let's Summarize:

Discourse in the context of NLP refers to a sequence of sentences occurring one after the other.

Reference is a linguistic process where one word in a sentence or discourse refers to another word or entity.

The task of resolving such references is known as Reference Resolution.

Coreference Resolution, in particular, is the process of resolving pronouns to identify which entities they refer to; it is a specific kind of Reference Resolution. The entity resolved may be a person, place, organization, or event.

The referent is the object that is being referred to. For example, “Ana” is the referent in the above example.

Referring expressions are the mentions, or linguistic expressions, given in the discourse that point to a referent.

Two or more referring expressions that refer to the same discourse entity are said to corefer[1].

Now, let us look at another example to understand this better.

Example Discourse:

“Elon Musk was born on June 28, 1971. He is the founder, CEO, chief engineer and designer of SpaceX. The 49-year-old is widely known as the mind behind Neuralink.”

Referring Expressions: Elon Musk, He, The 49-year-old

Referent: Elon Musk

Coreferring Expressions: {Elon Musk, He}, {Elon Musk, The 49-year-old}
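
To make these definitions concrete, here is a minimal Python sketch (the data structures and variable names are my own, purely illustrative) that records the referring expressions from the Elon Musk example and groups the coreferring ones into a single chain:

```python
# Toy representation of mentions and a coreference chain for the example above.
# Nothing here comes from a library; it is just illustrative data wrangling.

discourse = (
    "Elon Musk was born on June 28, 1971. He is the founder, CEO, "
    "chief engineer and designer of SpaceX. The 49-year-old is widely "
    "known as the mind behind Neuralink."
)

# Referring expressions (mentions) found in the discourse.
referring_expressions = ["Elon Musk", "He", "The 49-year-old"]

# A coreference chain (cluster): all expressions that refer to the same referent.
coreference_chain = {
    "referent": "Elon Musk",
    "mentions": ["Elon Musk", "He", "The 49-year-old"],
}

for mention in coreference_chain["mentions"]:
    print(f"{mention!r} -> {coreference_chain['referent']!r}")
```

Coreference resolution amounts to recovering exactly this kind of chain automatically from raw text.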

Now that we understand the basics of coreference resolution, it is essential to understand what kinds of references may exist in text. Knowing the kind of reference helps us devise strategies to resolve it when we encounter one.

References are usually of two kinds: Exophor and Endophor. An endophor refers to an entity that appears in the discourse, while an exophor refers to an entity that does not appear in the discourse.

Example of Endophor

Sentence: “Ana loves to read. She recently read a wonderful story.”

Here “She” refers to “Ana”, a possible referent that is mentioned explicitly in the discourse.

Example of Exophor

Sentence: “Pick that up.” (pointing to an object)

Here “that” refers to an object that is not mentioned explicitly in the discourse, so its referent must be found outside the text.

There are primarily two kinds of Endophors: Anaphor and Cataphor. Anaphor refers to a situation wherein the referential entity or referent appears before its referencing pronoun in the discourse.

Example of Anaphor

Sentence: “Ana bought a dress. She loves it.”

Here “She” refers to “Ana”, whose occurrence precedes that of the referencing pronoun “She” in the discourse.

Cataphor, on the other hand, refers to a situation wherein the entity or referent occurs later than its referencing pronoun in the discourse.

Example of Cataphor

Sentence: “When she bought the dress, Ana didn’t know it was torn.”

Here “she” occurs before its referential entity or referent “Ana” in the discourse. Thus, this is an example of cataphor.
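
As a quick illustration of the positional difference, the toy function below (entirely hypothetical, working on hand-supplied token positions rather than real parsing) labels a reference as anaphoric or cataphoric depending on whether the referent precedes the pronoun:

```python
# Toy check: anaphor vs. cataphor, based on token positions in the discourse.
# Positions are supplied by hand here; a real system would get them from a parser.

def reference_type(pronoun_index: int, referent_index: int) -> str:
    """Anaphor if the referent appears before the pronoun, cataphor otherwise."""
    return "anaphor" if referent_index < pronoun_index else "cataphor"

# "Ana bought a dress . She loves it ."         -> "Ana" at 0, "She" at 5
# "When she bought the dress , Ana didn't ..."  -> "she" at 1, "Ana" at 6
print(reference_type(pronoun_index=5, referent_index=0))  # anaphor
print(reference_type(pronoun_index=1, referent_index=6))  # cataphor
```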

The set of coreferring expressions is also called a coreference chain or a cluster. Now that we understand what kinds of references are prevalent in text, it is essential to grasp the linguistic properties of coreference as well.

Understanding these linguistic properties of the coreference relation helps us understand how to best perform coreference resolution and minimize errors in the process [1]. The important thing to remember is that these properties may vary from one language to another depending on that language's rules, so please make sure you are well versed in the grammatical rules of a language before attempting coreference resolution in it. For the sake of lucidity, in this article we will consider English as our primary language.

Some of the linguistic properties which we will be talking about are:

Number and Gender Agreement

Recency

Grammatical Role

Verb Semantics

Selectional Restrictions

Repeated Mention

Parallelism

Now, let me explain these properties one by one. Number Agreement means that the coreferring expressions should agree in number, while Gender Agreement means that they should agree in gender. Let's look at an example of each.

Example of Gender and Number Agreement:

“Analisa works at Google. She loves her work.”

Gender Agreement

Here, “Analisa” and “She” agree in gender (female), and they agree in number, i.e., only one person, “Analisa”, works at Google, so we use “She” to refer to her rather than other pronouns like “he” or “they”. This is gender and number agreement.

Number Agreement

Now consider a sentence like: “The horses ran across the field. They looked tired.” Here, “horses” is plural, so the pronoun “they” is used to refer to the entity “horses”; the two expressions agree in number. This is number agreement.
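
As a rough illustration of how an agreement check might look in code, the sketch below is a hand-rolled filter (not taken from any library; the feature annotations are assumptions made up for this example) that keeps only the candidate antecedents whose number and gender are compatible with the pronoun:

```python
# Minimal number/gender agreement filter over candidate antecedents.
# Feature values are toy annotations written by hand for illustration.

PRONOUN_FEATURES = {
    "she":  {"number": "singular", "gender": "female"},
    "he":   {"number": "singular", "gender": "male"},
    "they": {"number": "plural",   "gender": "any"},
}

def agrees(pronoun_feats, candidate_feats):
    """A candidate agrees if number matches and gender matches (or is unconstrained)."""
    if pronoun_feats["number"] != candidate_feats["number"]:
        return False
    return pronoun_feats["gender"] in ("any", candidate_feats["gender"])

candidates = [
    {"text": "Analisa", "number": "singular", "gender": "female"},
    {"text": "Google",  "number": "singular", "gender": "neuter"},
]

compatible = [c["text"] for c in candidates if agrees(PRONOUN_FEATURES["she"], c)]
print(compatible)  # ['Analisa'] -- "Google" is ruled out by gender agreement
```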

A further property is Recency: other things being equal, entities mentioned more recently in the discourse are preferred as referents over entities mentioned earlier. Another property to remember is the Grammatical Role. This property takes advantage of the inherent grammatical structure of a sentence, which gives more saliency to a subject entity than to an object entity. In other words, we assume that an entity in subject position is usually more important than one in object position.

Example of Grammatical Role

“Ana works at an MNC with Tia. She usually works harder.”

Grammatical Role

In this sentence, we have “Ana” and “Tia” as candidate referents for the word “She”. Here “Ana” is the subject while “Tia” is the object. Thus, keeping saliency in mind, we deem “Ana”, rather than “Tia”, to be coreferent with “She”.
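
One simple way to encode this preference, loosely in the spirit of classic salience-based approaches (the role weights below are arbitrary values chosen only for illustration), is to score each candidate by its grammatical role and pick the highest-scoring one:

```python
# Toy salience scoring: subjects outrank objects when resolving a pronoun.
# The weights are illustrative; real salience-based systems weigh many more factors.

ROLE_WEIGHTS = {"subject": 80, "object": 50, "other": 40}

candidates = [
    {"text": "Ana", "role": "subject"},
    {"text": "Tia", "role": "object"},
]

def salience(candidate):
    return ROLE_WEIGHTS.get(candidate["role"], ROLE_WEIGHTS["other"])

best = max(candidates, key=salience)
print(best["text"])  # 'Ana' -- the subject is preferred as the antecedent of "She"
```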

The next factor to consider is Verb Semantics. Some verbs emphasize one of their arguments more than the others, and this bias influences which argument a following pronoun is taken to refer to.

Example of Verb Semantics

“Ana helped Christa. She was the architect behind the project.”

“Ana condemned Christa. She was the architect behind the project.”

Here, in the first sentence, the use of the verb “helped” implies that Ana is more likely than Christa to be the architect behind the project. Thus, “She” refers to “Ana” in the first sentence.

In the second sentence, however, the use of the verb “condemned” implies that Christa is more likely than Ana to be the architect behind the project. Thus, “She” refers to “Christa” in the second sentence.

The next thing to understand is Selectional Restrictions. This property uses semantic knowledge about the sentence to determine which candidate referent is preferred.

Example of Selectional Restrictions

“I ate the roasted chicken in my pajamas after roasting it for three hours in the oven.”

Selectional Restrictions

Here, the two possible referents for “it” are “pajamas” and “chicken”. The use of the verb “eat” (“ate” is the past tense of “eat”) implies that the referent must be edible, so “chicken” is chosen as the referent for “it”.
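
A crude way to mimic selectional restrictions in code, sketched below with a tiny hand-written semantic lexicon (the features here are assumptions for this example; a real system would consult a resource such as WordNet or learned representations), is to require that a candidate referent carry the feature the verb demands of its argument:

```python
# Toy selectional-restriction check: the object of "eat" must be edible.
# The semantic lexicon and restriction table are hand-written for illustration.

SEMANTIC_FEATURES = {
    "chicken": {"edible", "animal"},
    "pajamas": {"wearable"},
}

VERB_OBJECT_RESTRICTIONS = {
    "eat": "edible",
}

def satisfies(verb, candidate):
    required = VERB_OBJECT_RESTRICTIONS.get(verb)
    return required is None or required in SEMANTIC_FEATURES.get(candidate, set())

candidates = ["pajamas", "chicken"]
viable = [c for c in candidates if satisfies("eat", c)]
print(viable)  # ['chicken'] -- "pajamas" fails the edibility restriction on "eat"
```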

Another essential property to understand is Repeated Mention. This property says that if an entity is referred to repeatedly in the discourse, the probability that it is the referent of a subsequent pronoun increases.

Example of Repeated Mention[2]

“John needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra.”

Repeated Mention

Here, the repeated mention of “John”, rather than “Bill”, as the focal point of the discourse implies that the final “He” refers to “John” and not “Bill”.
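
Continuing the toy salience idea from the grammatical-role sketch above, repeated mention can be approximated by giving each entity a bonus for every time it has already been mentioned before the ambiguous pronoun (the counts and weights here are, again, illustrative assumptions):

```python
# Toy repeated-mention bonus: entities mentioned more often score higher.
# Counts are tallied by hand for the mentions preceding the final "He".

mention_counts = {
    "John": 4,   # "John", "He decided", "he wanted", "with him"
    "Bill": 1,   # "Bill went"
}

def repeated_mention_score(entity, base=10, bonus_per_mention=5):
    return base + bonus_per_mention * mention_counts.get(entity, 0)

scores = {entity: repeated_mention_score(entity) for entity in mention_counts}
print(max(scores, key=scores.get))  # 'John' -- the repeatedly mentioned entity wins
```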

Last but not least, we talk about Parallelism. This property lends more importance to a candidate referent that occupied a parallel syntactic and semantic position in a preceding sentence.

Example of Parallelism[2]

Mary went with Sue to the Acura dealership.

Sally went with her to the Mazda dealership.

Parallelism

Here, “her” refers to Sue, as both sentences have a similar syntactic and semantic structure and we can draw parallels between them.

Please note that most of these factors apply mainly to anaphoric reference resolution rather than to other forms of reference resolution. Exophoric reference, in particular, is typically much harder to resolve and requires quite a different approach.

Now that you understand what coreference resolution is and how to resolve references in case of ambiguity, let me also briefly mention two well-known libraries that can be used for coreference resolution: Stanford CoreNLP and neuralcoref by Hugging Face. You can use either of them to ease your journey towards coreference resolution, and both also offer online demos, so you can try these coreference systems without downloading anything.
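
For example, a quick way to try coreference resolution in Python is neuralcoref on top of spaCy, roughly as sketched below. Note that neuralcoref targets the older spaCy 2.x line, so the exact package versions are an assumption here and may need adjusting; treat this as a sketch rather than a guaranteed setup:

```python
# Sketch of coreference resolution with Hugging Face's neuralcoref on spaCy 2.x.
# pip install spacy==2.1.0 neuralcoref   (versions may need adjusting)
# python -m spacy download en_core_web_sm

import spacy
import neuralcoref

nlp = spacy.load("en_core_web_sm")   # small English pipeline
neuralcoref.add_to_pipe(nlp)         # adds coreference annotations to Doc objects

doc = nlp("Ana is a graduate student at UT Dallas. "
          "She loves working in NLP at the institute.")

print(doc._.has_coref)        # True if any coreference clusters were found
print(doc._.coref_clusters)   # e.g. [Ana: [Ana, She]]
print(doc._.coref_resolved)   # the text with pronouns replaced by their main mentions
```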

A few of the most well-known applications of coreference resolution can be found in machine translation and in text or natural language understanding tasks such as information extraction, question answering, and summarization.

I hope this article helped you better grasp the underlying concepts behind Coreference Resolution.

Thank you for reading!

References:

  1. Speech and Language Processing, 3rd Edition by Dan Jurafsky and James H. Martin
  2. Coreference Resolution and Discourse Coherence by Dr. Mithun Balakrishnan


π•Œπ•Ÿπ•£π•’π•§π•–π•π•π•šπ•Ÿπ•˜ β„‚π• π•£π•–π•—π•–π•£π•–π•Ÿπ•”π•– ℝ𝕖𝕀𝕠𝕝𝕦π•₯π•šπ• π•Ÿ! was originally published in Towards AI on Medium, where people are continuing the conversation by highlighting and responding to this story.

Published via Towards AI

Feedback ↓