HIN-MELM-AE AND DePori-BASED AUTOMATIC TEXT SUMMARIZATION FOR MULTI-TEXT DOCUMENTS AND MULTI-LINGUAL SUMMARIES VIA ENSEMBLE LEARNING
DOI: https://doi.org/10.6977/IJoSI.202512_9(6).0003
Keywords:
Hyperfan-IN Multilayer Extreme Learning Machine Auto Encoder (HIN-MLELM-AE), Sentence Bidirectional Encoder Representations from Transformers (SBERT), Info-Squared Fuzzy C Means Clustering (InS-FCM), Latent Dirichlet Allocation (LDA), Sememe Similarity induced Hidden Markov Model (SemSim-HMM), Parts Of Speech (POS), Term Frequency-Inverse Document Frequency (TF-IDF), and Variational Auto Encoder (VAE).
Abstract
Automatic Text Summarization (ATS) emerged from the need to manage the growing volume of textual information. ATS is the process of creating a short and accurate summary of a longer text document. Prevailing studies did not perform ATS for multi-document and multi-lingual summaries. This paper presents an improved ensemble learning-based ATS approach with slang filtering, using HIN-MELM-AE and the Dehghani Poor and rich optimization algorithm (DePori). Initially, the text document is taken and pre-processed. Slang identification and filtering are then performed on the pre-processed text using DePori. Next, the slang-filtered text is transformed by InS-FCM-based clustering, LDA-based topic modeling, TF-IDF analysis, and frequent term selection. POS tagging is then performed on the transformed data using SemSim-HMM. Significant entities are extracted from the transformed data and the POS-tagged text, and SBERT is employed to vectorize them. ATS is then carried out by the ensemble models, which include HIN-MELM-AE, AE, VAE, and SBERT. Cosine similarity is evaluated on the outputs of the ensemble models, followed by voting-based fusion, re-ranking, and optimal sentence selection to obtain the summarized text. The results show that the proposed model achieved a high accuracy of 98.72%, outperforming conventional methods.
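The last stages of the pipeline described in the abstract (cosine similarity evaluation, voting-based fusion, and sentence selection) can be illustrated with a minimal sketch. This is not the authors' implementation: the three scorers below are simple stand-ins chosen for illustration in place of HIN-MELM-AE, AE, VAE, and SBERT, Borda-count rank voting is assumed as the fusion rule, and all function names (`tfidf_vectors`, `borda_fusion`, `summarize`) are hypothetical.

```python
import math
from collections import Counter

PUNCT = ".,;:!?()\"'"

def tokenize(sentence):
    """Lowercase whitespace tokens with surrounding punctuation stripped."""
    tokens = (w.strip(PUNCT).lower() for w in sentence.split())
    return [t for t in tokens if t]

def tfidf_vectors(sentences):
    """Treat each sentence as a document and return sparse TF-IDF dicts."""
    docs = [Counter(tokenize(s)) for s in sentences]
    n = len(docs)
    df = Counter(term for d in docs for term in d)
    # Smoothed IDF (an assumption) so ubiquitous terms keep a nonzero weight.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in d.items()} for d in docs]

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def centroid(vectors):
    """Mean of the sparse vectors."""
    total = Counter()
    for v in vectors:
        for t, w in v.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}

def borda_fusion(score_lists):
    """Fuse per-sentence score lists by rank voting (Borda count)."""
    n = len(score_lists[0])
    points = [0] * n
    for scores in score_lists:
        # Highest score ranks first; ties go to the earlier sentence.
        order = sorted(range(n), key=lambda i: (-scores[i], i))
        for rank, idx in enumerate(order):
            points[idx] += n - 1 - rank
    return points

def summarize(sentences, k=2):
    """Pick the k sentences with the most votes, kept in original order."""
    vecs = tfidf_vectors(sentences)
    n = len(vecs)
    c = centroid(vecs)
    # Stand-in scorer 1: similarity of each sentence to the centroid.
    centroid_scores = [cosine(v, c) for v in vecs]
    # Stand-in scorer 2: average similarity to every other sentence.
    pairwise_scores = [
        sum(cosine(vecs[i], vecs[j]) for j in range(n) if j != i) / max(n - 1, 1)
        for i in range(n)
    ]
    # Stand-in scorer 3: earlier sentences score higher.
    position_scores = [1.0 / (i + 1) for i in range(n)]
    points = borda_fusion([centroid_scores, pairwise_scores, position_scores])
    top = sorted(range(n), key=lambda i: (-points[i], i))[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(list_of_sentences, k=3)` returns three of the input sentences in their original order. In a closer approximation of the described system, each stand-in scorer would presumably be replaced by similarity scores derived from one ensemble member's sentence representations, while the fusion step could stay unchanged.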
License
Copyright in a work is a bundle of rights. IJoSI's copyright covers what may be done with the work in terms of making copies, making derivative works, abstracting parts of it for citation or quotation elsewhere, and so on. IJoSI requires authors to sign over rights when their article is ready for publication so that the publisher from then on owns the work. Until that point, all rights belong to the creator(s) of the work. The format of the IJoSI copyright form can be found on the IJoSI website.
The issues of the International Journal of Systematic Innovation (IJoSI) are published in electronic format and in print. Our website, journal papers, manuscripts, etc. are stored on one server. Readers can have free online access to our journal papers. Authors transfer copyright to the publisher as part of a journal publishing agreement, but have the right to:
1. Share their article for personal use, internal institutional use and scholarly sharing purposes, with a DOI link to the version of record on our server.
2. Retain patent, trademark and other intellectual property rights (including research data).
3. Receive proper attribution and credit for the published work.