Journal of Theoretical & Computational Science

Journal of Theoretical & Computational Science
Open Access

ISSN: 2376-130X

Abstract

Automatic Extraction and Analysis of Direct Speech from Texts Using Large Language Model

Viktor Kuzminskyi* and Dmytro Lande

This paper proposes an approach for the automatic extraction of direct quotes from texts using Large Language Models (LLMs) and their analysis to build semantic networks of authors and concepts. After retrieving relevant documents, LLMs are employed to extract quotes, their authors and metadata, which are stored in a structured JavaScript Object Notation (JSON) format. Based on this data, a semantic network is constructed, which is then clustered using LLMs. The concept of a "swarm of virtual experts" is introduced for more precise extraction of key concepts. The model illustrates how authors form groups based on shared interests and discussion topics. One of the innovative aspects of the approach is the automatic generation of cluster names.

Published Date: 2024-12-04; Received Date: 2024-11-04

Top