Skip to main navigation Skip to search Skip to main content

Sentence boundary detection in adjudicatory decisions in the United States

  • University of Pittsburgh
  • Hofstra University
  • Carnegie Mellon University

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

We report results of an effort to enable computers to segment US adjudicatory decisions into sentences. We created a data set of 80 court decisions from four different domains. We show that legal decisions are more challenging for existing sentence boundary detection systems than for non-legal texts. Existing sentence boundary detection systems are based on a number of assumptions that do not hold for legal texts, hence their performance is impaired. We show that a general statistical sequence labeling model is capable of learning the definition more efficiently. We have trained a number of conditional random fields models that outperform the traditional sentence boundary detection systems when applied to adjudicatory decisions.

Original languageEnglish
Pages (from-to)21-45
Number of pages25
JournalTAL Traitement Automatique des Langues
Volume58
Issue number2
StatePublished - 2017
Externally publishedYes

Keywords

  • Adjudicatory decisions
  • Artificial intelligence
  • Conditional random fields
  • Law
  • Sentence boundary detection
  • Text annotation

Fingerprint

Dive into the research topics of 'Sentence boundary detection in adjudicatory decisions in the United States'. Together they form a unique fingerprint.

Cite this