Time Period Categorization in Fiction: A Comparative Analysis of Machine Learning Techniques

Westin, Fereshta

In: Cataloging & Classification Quarterly, 2024, pp. 1-30

This study investigates the automatic categorization of time period metadata in fiction, a critical but often overlooked aspect of cataloging. In a comparative analysis, the performance of three machine learning techniques, namely Latent Dirichlet Allocation (LDA), Sentence-BERT (SBERT), and Term Frequency-Inverse Document Frequency (TF-IDF), was assessed by examining their precision, recall, F1 scores, and confusion matrix results. LDA identifies underlying topics within the text, TF-IDF measures word importance, and SBERT measures sentence semantic similarity. Based on F1-score analysis and confusion matrix outcomes, TF-IDF and LDA categorized text data by time period effectively, while SBERT performed poorly across all time period categories.
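The evaluation described in the abstract can be illustrated with a minimal sketch. It is not the author's pipeline: the toy texts, the period labels, and the choice to assign each test text the label of its most similar reference text by cosine similarity are all assumptions made for illustration, and the output says nothing about the study's actual results. The sketch computes TF-IDF weights by hand, predicts a time period for each test text, and then derives a confusion matrix with per-category precision, recall, and F1.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))  # document frequency
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()})
    return vectors

def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either is all zeros."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def classify(vec, ref_vecs, ref_labels):
    """Assumed decision rule: label of the most similar reference text."""
    best = max(range(len(ref_vecs)), key=lambda i: cosine(vec, ref_vecs[i]))
    return ref_labels[best]

def per_class_scores(y_true, y_pred, labels):
    """Confusion matrix (rows = true, columns = predicted) and (P, R, F1) per label."""
    matrix = {t: {p: 0 for p in labels} for t in labels}
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    scores = {}
    for label in labels:
        tp = matrix[label][label]
        predicted = sum(matrix[t][label] for t in labels)
        actual = sum(matrix[label].values())
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        total = precision + recall
        f1 = 2 * precision * recall / total if total else 0.0
        scores[label] = (precision, recall, f1)
    return matrix, scores

# Invented toy data: one reference text per (hypothetical) time period category.
references = [
    ("medieval", "the knight rode to the castle with his sword"),
    ("victorian", "the governess took the carriage through foggy london"),
    ("future", "the starship left orbit as the android spoke"),
]
# Invented test texts; the last one is deliberately misleading.
tests = [
    ("medieval", "a knight guarded the castle gate"),
    ("victorian", "a carriage waited in the london fog"),
    ("future", "the android repaired the starship"),
    ("victorian", "his sword hung in the castle hall"),
]

labels = [label for label, _ in references]
tokens = [text.lower().split() for _, text in references + tests]
vectors = tfidf_vectors(tokens)
ref_vecs, test_vecs = vectors[:len(references)], vectors[len(references):]

y_true = [label for label, _ in tests]
y_pred = [classify(v, ref_vecs, labels) for v in test_vecs]
matrix, scores = per_class_scores(y_true, y_pred, labels)

for label in labels:
    p, r, f = scores[label]
    print(f"{label:10s} precision={p:.2f} recall={r:.2f} F1={f:.2f}")
```

The word "the" occurs in every toy text, so its inverse document frequency is zero and it contributes nothing to the similarity, which is the sense in which TF-IDF "measures word importance". The deliberately misleading last test text shows how a single error appears off the diagonal of the confusion matrix and lowers precision for one category and recall for another.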

Title:	Time Period Categorization in Fiction: A Comparative Analysis of Machine Learning Techniques
Author / contributor:	Westin, Fereshta
Link:	View record in SwePub (full text) https://doi.org/10.1080/01639374.2024.2315548
Journal:	Cataloging & Classification Quarterly, 2024, pp. 1-30
Published:	2024
Media type:	unknown
ISSN:	0163-9374 (print) ; 1544-4554 (online)
DOI:	10.1080/01639374.2024.2315548
Keywords:	Cataloging for digital resources; fiction; LDA; machine learning; SBERT; text analysis; TF-IDF; time period categorization; Library and Information Science; Biblioteks- och informationsvetenskap
Other:	Indexed in: SwePub; Language: English; File description: electronic
