PropBank

Database of verbal propositions

PropBank is a corpus that is annotated with verbal propositions and their arguments—a "proposition bank". Although "PropBank" refers to a specific corpus produced by Martha Palmer et al.,[1] the term propbank is also coming to be used as a common noun referring to any corpus that has been annotated with propositions and their arguments.

The PropBank project has played a role in recent[when?] research in natural language processing, and has been used in semantic role labelling.[2]

Comparison

PropBank differs from FrameNet, the resource to which it is most frequently compared, in several ways.

PropBank is a verb-oriented resource, while FrameNet is centered on the more abstract notion of frames, which generalizes descriptions across similar verbs (e.g. "describe" and "characterize") as well as nouns and other words (e.g. "description").[2] PropBank does not annotate events or states of affairs described using nouns. PropBank commits to annotating all verbs in a corpus, whereas the FrameNet project chooses sets of example sentences from a large corpus and only in a few cases has annotated longer continuous stretches of text.

PropBank-style annotations often remain close to the syntactic level, while FrameNet-style annotations are sometimes more semantically motivated. From the start, PropBank was developed with the idea of serving as training data for machine learning-based semantic role labeling systems in mind. It requires that all arguments to a verb be syntactic constituents and different senses of a word are only distinguished if the differences bear on the arguments.[3] Due to such differences, semantic role labeling with respect to PropBank is often a somewhat easier task than producing FrameNet-style annotations.

See also

  • VerbNet
  • FrameNet

References

  1. ^ Palmer M, Kingsbury P, Gildea D (2005). "The Proposition Bank: An Annotated Corpus of Semantic Roles". Computational Linguistics. 31 (1): 71–106. CiteSeerX 10.1.1.136.8985. doi:10.1162/0891201053630264. S2CID 2486369.
  2. ^ a b Palmer, Martha; Kingsbury, Paul; Gildea, Daniel. "The Proposition Bank: An Annotated Corpus of Semantic Roles" (PDF). University of Rochester. Archived from the original (PDF) on November 28, 2022. Retrieved March 19, 2023.
  3. ^ Edward Loper; Szu-ting Yi & Martha Palmer (2007). "Combining Lexical Resources: Mapping Between PropBank and VerbNet" (PDF). Proceedings of the 7th International Workshop on Computational Linguistics.

External links

  • PropBank website
  • NomBank website
  • SALSA website
  • v
  • t
  • e
Corpus linguistics
Text corpora,
EnglishText corpora,
non-EnglishOrganizations
  • v
  • t
  • e
General terms
Text analysis
Text segmentation
Automatic summarization
Machine translation
Distributional semantics models
Language resources,
datasets and corpora
Types and
standards
Data
Automatic identification
and data capture
Topic model
Computer-assisted
reviewing
Natural language
user interface
Related


Stub icon

This computational linguistics-related article is a stub. You can help Wikipedia by expanding it.

  • v
  • t
  • e
Stub icon

This article about a digital library is a stub. You can help Wikipedia by expanding it.

  • v
  • t
  • e