Authorship attribution based on Life-Like Network Automata
- PMID: 29566100
- PMCID: PMC5863954
- DOI: 10.1371/journal.pone.0193703
Authorship attribution based on Life-Like Network Automata
Abstract
The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based solely on word counts and related measurements have provided a simple, yet effective solution in particular cases; they are prone to manipulation. Recently, texts have been successfully modeled as networks, where words are represented by nodes linked according to textual similarity measurements. Such models are useful to identify informative topological patterns for the authorship recognition task. However, there is no consensus on which measurements should be used. Thus, we proposed a novel method to characterize text networks, by considering both topological and dynamical aspects of networks. Using concepts and methods from cellular automata theory, we devised a strategy to grasp informative spatio-temporal patterns from this model. Our experiments revealed an outperformance over structural analysis relying only on topological measurements, such as clustering coefficient, betweenness and shortest paths. The optimized results obtained here pave the way for a better characterization of textual networks.
Conflict of interest statement
Figures







Similar articles
-
Probing the topological properties of complex networks modeling short written texts.PLoS One. 2015 Feb 26;10(2):e0118394. doi: 10.1371/journal.pone.0118394. eCollection 2015. PLoS One. 2015. PMID: 25719799 Free PMC article.
-
Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks.PLoS One. 2017 Jan 26;12(1):e0170527. doi: 10.1371/journal.pone.0170527. eCollection 2017. PLoS One. 2017. PMID: 28125703 Free PMC article.
-
Automatic Authorship Detection Using Textual Patterns Extracted from Integrated Syntactic Graphs.Sensors (Basel). 2016 Aug 29;16(9):1374. doi: 10.3390/s16091374. Sensors (Basel). 2016. PMID: 27589740 Free PMC article.
-
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion.BMC Med Inform Decis Mak. 2021 Nov 29;21(Suppl 9):335. doi: 10.1186/s12911-021-01622-7. BMC Med Inform Decis Mak. 2021. PMID: 34844576 Free PMC article. Review.
-
NeuCube: a spiking neural network architecture for mapping, learning and understanding of spatio-temporal brain data.Neural Netw. 2014 Apr;52:62-76. doi: 10.1016/j.neunet.2014.01.006. Epub 2014 Jan 20. Neural Netw. 2014. PMID: 24508754 Review.
Cited by
-
Classification of Literary Works: Fractality and Complexity of the Narrative, Essay, and Research Article.Entropy (Basel). 2020 Aug 17;22(8):904. doi: 10.3390/e22080904. Entropy (Basel). 2020. PMID: 33286673 Free PMC article.
-
Using citation networks to evaluate the impact of text length on keyword extraction.PLoS One. 2023 Nov 27;18(11):e0294500. doi: 10.1371/journal.pone.0294500. eCollection 2023. PLoS One. 2023. PMID: 38011182 Free PMC article.
-
Identifying the perceived local properties of networks reconstructed from biased random walks.PLoS One. 2024 Jan 19;19(1):e0296088. doi: 10.1371/journal.pone.0296088. eCollection 2024. PLoS One. 2024. PMID: 38241390 Free PMC article.
-
Comparing random walks in graph embedding and link prediction.PLoS One. 2024 Nov 6;19(11):e0312863. doi: 10.1371/journal.pone.0312863. eCollection 2024. PLoS One. 2024. PMID: 39504339 Free PMC article.
-
A Hybrid Model with New Word Weighting for Fast Filtering Spam Short Texts.Sensors (Basel). 2023 Nov 4;23(21):8975. doi: 10.3390/s23218975. Sensors (Basel). 2023. PMID: 37960672 Free PMC article.
References
-
- Franco-Salvador M, Rosso P, Montes-y-Gómez M. A systematic study of knowledge graph analysis for cross-language plagiarism detection. Information Processing & Management. 2016;52(4):550–570. doi: 10.1016/j.ipm.2015.12.004 - DOI
-
- Labbé C, Labbé D. Duplicate and fake publications in the scientific literature: how many SCIgen papers in computer science? Scientometrics. 2013;94(1):379–396. doi: 10.1007/s11192-012-0781-y - DOI
-
- Vacca JR. Computer Forensics: Computer Crime Scene Investigation (Networking Series) (Networking Series). Rockland, MA, USA: Charles River Media, Inc; 2005.
-
- Stamatatos E. A survey of modern authorship attribution methods. Journal of the American Society for Information Science and Technology. 2009;60(3):538–556. doi: 10.1002/asi.21001 - DOI
-
- Amancio DR. Authorship recognition via fluctuation analysis of network topology and word intermittency. Journal of Statistical Mechanics: Theory and Experiment. 2015;2015(3):P03005 doi: 10.1088/1742-5468/2015/03/P03005 - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources