Identification of Discourse Boundaries Using Anaphorically Annotated Text

Authors

  • Muhammad Aatif Department of Computer Science, University of Peshawar, KP Pakistan
  • Mohammad Abid Khan Department of Computer Science, University of Peshawar, KP Pakistan

Keywords:

Discourse, Discourse unit, Discourse Boundaries, boundaries identification, Discourse Boundaries Identification, Anaphorically Annotated Corpus, Algorithm

Abstract

For effective and efficient natural language processing (NLP) systems, it is of great significance that a discourse unit must be made a unit of processing. The major obstacle to achieving that goal is the identification of discourse boundaries (DBs). This paper presents an algorithm about the identification of DBs in English text using anaphorically annotated text. The proposed algorithm is based on the structure of anaphoric annotations carried out in Phrase Detectives Corpus 2.1.4 (PD2). It has been tested on anaphorically annotated documents selected at random from PD2 and showed an accuracy of 97.66%.

Downloads

Published

2020-06-30

Issue

Section

Original Articles

How to Cite

[1]
M. Aatif and M. A. Khan, “Identification of Discourse Boundaries Using Anaphorically Annotated Text”, jictra, pp. 47–53, Jun. 2020, Accessed: Mar. 23, 2025. [Online]. Available: https://jictra.com.pk/index.php/jictra/article/view/207