TY - GEN
T1 - Uncovering Trauma in Genocide Tribunals
T2 - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023
AU - Schirmer, Miriam
AU - Olguín Nolasco, Isaac Misael
AU - Mosca, Edoardo
AU - Xu, Shanshan
AU - Pfeffer, Jürgen
N1 - Publisher Copyright:
© ICAIL 2023. All rights reserved.
PY - 2023/6/19
Y1 - 2023/6/19
N2 - This paper applies Natural Language Processing (NLP) methods to analyze the exposure to trauma experienced by witnesses in international criminal tribunals when testifying in court. One major contribution of this study is the creation of a substantially extended version of the Genocide Transcript Corpus (GTC) that includes 52,845 text segments of transcripts from three different genocide tribunals. Based on this data, we first examine the prevalence of trauma-related content in witness statements. Second, we are implementing a binary classification algorithm to automatically detect potentially traumatic content. Therefore, in a preparatory step, an Active Learning (AL) approach is applied to establish the ideal size for the training data set. Subsequently, this data is used to train a transformer model. In this case, the two models BERTbase and HateBERT are used for both steps, allowing for a comparison of a base-level model with a model that has already been pre-trained on data more relevant in the context of harmful vocabulary. In a third step, the study employs an Explainable Artificial Intelligence (XAI) model to gain a deeper understanding of the reasoning behind the model’s classifications. Our results suggest that both BERTbase and HateBERT perform comparatively well on this classification task, with no model clearly outperforming the other. The classification outcomes further suggest that a reduced data set size can achieve equally high performance metrics and might be a preferable choice in certain use cases. The results can be used to establish more trauma-informed legal procedures in genocide-related tribunals, including the identification of potentially re-traumatizing examination approaches at an early stage.
AB - This paper applies Natural Language Processing (NLP) methods to analyze the exposure to trauma experienced by witnesses in international criminal tribunals when testifying in court. One major contribution of this study is the creation of a substantially extended version of the Genocide Transcript Corpus (GTC) that includes 52,845 text segments of transcripts from three different genocide tribunals. Based on this data, we first examine the prevalence of trauma-related content in witness statements. Second, we are implementing a binary classification algorithm to automatically detect potentially traumatic content. Therefore, in a preparatory step, an Active Learning (AL) approach is applied to establish the ideal size for the training data set. Subsequently, this data is used to train a transformer model. In this case, the two models BERTbase and HateBERT are used for both steps, allowing for a comparison of a base-level model with a model that has already been pre-trained on data more relevant in the context of harmful vocabulary. In a third step, the study employs an Explainable Artificial Intelligence (XAI) model to gain a deeper understanding of the reasoning behind the model’s classifications. Our results suggest that both BERTbase and HateBERT perform comparatively well on this classification task, with no model clearly outperforming the other. The classification outcomes further suggest that a reduced data set size can achieve equally high performance metrics and might be a preferable choice in certain use cases. The results can be used to establish more trauma-informed legal procedures in genocide-related tribunals, including the identification of potentially re-traumatizing examination approaches at an early stage.
KW - BERT
KW - XAI
KW - classification
KW - genocide
KW - trauma
UR - http://www.scopus.com/inward/record.url?scp=85177839300&partnerID=8YFLogxK
U2 - 10.1145/3594536.3595147
DO - 10.1145/3594536.3595147
M3 - Conference contribution
AN - SCOPUS:85177839300
T3 - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference
SP - 257
EP - 266
BT - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference
PB - Association for Computing Machinery, Inc
Y2 - 19 June 2023 through 23 June 2023
ER -