Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 29;20(5):e0322791.
doi: 10.1371/journal.pone.0322791. eCollection 2025.

What are we learning with Yoga? Mapping the scientific literature on Yoga using a vector-text-mining approach

Affiliations

What are we learning with Yoga? Mapping the scientific literature on Yoga using a vector-text-mining approach

Rosangela Ieger-Raittz et al. PLoS One. .

Abstract

The techniques used in yoga have roots in traditions that precede modern science. Research shows that yoga enhances quality of life and well-being, positively impacting physical and mental health. As yoga gains acceptance in Western countries, scientific studies on the subject increase exponentially. However, many of these studies are considered inconsistent due to the diverse methodologies and focuses in the field, which creates challenges for researchers and hampers progress. This study aims to develop a comprehensive framework for existing literature on yoga, facilitating multidisciplinary collaboration and bringing new light to relevant aspects. Given the complexity of the subject, advanced modeling techniques are necessary. Contemporary artificial intelligence methods have advanced Bioinformatics, including text mining (TM), allowing us to employ vector representations of texts to derive semantic insights and organize literature effectively. Based on TM resources, we provided a better general understanding of yoga and highlighted the relationships between yoga practice and various domains, including biochemical parameters and neuroscience. It also reveals that practitioners can learn to engage with their bodies and environments actively, enhancing their quality of life. However, there is a lack of research exploring the mechanisms behind this learning and its potential for further enhancement. Vector TM has made it possible to bolster and improve human analysis. The set of resources developed allowed us to determine the mapping of the literature, the analysis of which revealed 4 dimensions (exercise, physiology, theory and therapeutic) divided into 9 cohesive groups, representing the trends in the literature. The resulting platforms are available to Yoga researchers to evaluate our findings and make their forays into the existing literature.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. A simple representation of a phylogenetic tree.
Each letter (A-E) represents a leaf, while the lines depict branches. The points where branches meet are called nodes, and the most basal node in the tree is referred to as the root.
Fig 2
Fig 2. Schematic of the methodology.
The first stage consisted of surveying the literature on PubMed. Only texts with ‘YOGA’ in the title were filtered for the subsequent stages. The texts (title and abstract) were processed, and a vector representing each word, document, and logical expression was obtained. This representation permitted phylogenies, clustering, visualizations, the HTML-Text Mining (HTML-TM), and subsequent analyses.
Fig 3
Fig 3. Mapping of Yoga literature with all 113 logical expressions (LOGEXP) studied.
The phylogenetic tree presents the relationship of the 113 LOGEXP and divides the findings into nine groups with related topics. The tree characterizes the field of yoga as a reflection of the literature. Each branch presents a LOGEXP and its identification number. It also indicates the number of studies/documents (nDcs) and the number of titles (nTit) hit by the logical expression. Group II.BRAIN-RESPIRATORY-HEART is subdivided into four groups: A. Respiratory, B. Metabolic and Cardiovascular, C. Biomarkers, D. Neurological.
Fig 4
Fig 4. The circles correspond to the groups in Table 1. We vectorized each group by the mean of logical expressions (LOGEXP) vectors inside it.
To define the centers of the circles, a t-SNE dimension reduction is applied, and the area is the total of different papers accessed by the corresponding group. The observed overlapping is illustrative and represents possible interconnections between them.
Fig 5
Fig 5. Dimensional Scatterplot Visualization of Word Proximity.
It is possible to obtain the closest words or studies within the same vector space from a vectorized word. From the word ‘PARASYMPATHETIC’ as an example, we can receive: A. phylogenetic tree associating the word with the 40 closest words according to Yoga literature and B. list of the closest papers to the word. Note that the tree shows words related to the subject, such as ‘NERVOUS’, ‘AUTONOMIC’, and physiological characteristics modulated by the PARASYMPATHETIC system, such as ‘HEART’, ‘RATE’, ‘CARDIAC’, ‘PRESSURE’, ‘HEMODYNAMIC’, related to the cardiovascular system, and ‘RESPIRATORY’. Similarly, the listed papers present the cardio-respiratory and autonomic nervous systems as the central subject of the study. The references for articles are: [,,,,,–76].
Fig 6
Fig 6. t-SNE visualization of vectorized documents.
Each dot represents the ‘TITLE+ABSTRACT’ for each document. A. Semantic searches for ‘HEART’ (green), ‘CARDIOVASCULAR’ (magenta), and ‘BLOOD’ (yellow). The search considers 40 hits closest to each query. B. Distribution of 300 nearest hits for six semantics searches listed in the legend. Note that in B, the overlap between the topics of ‘NEURO’ (black) and ‘LEARNING’ (green) is visible, as well as between ‘BREATH’ (blue) and ‘CARDIO’ (yellow), and more subtly between ‘MOOD’ and ‘BALANCE’.

References

    1. Falkenberg RI, Eising C, Peters ML. Yoga and immune system functioning: a systematic review of randomized controlled trials. J Behav Med. 2018;41(4):467–82. doi: 10.1007/s10865-018-9914-y - DOI - PubMed
    1. Gururaja C, Rangaprakash D, Deshpande G. Research Trends in the Application of Yoga to Human Health: A Data Science Approach. Int J Public Ment Health Neurosci. 2020;7(1):8–13. - PMC - PubMed
    1. Varambally S, George S, Gangadhar BN. Yoga for psychiatric disorders: from fad to evidence-based intervention?. Br J Psychiatry. 2020;216(6):291–3. doi: 10.1192/bjp.2019.249 - DOI - PubMed
    1. Karri RR, Bhavanani AB, Ramanathan M, Mopidevi VG. Methodological issues in yoga therapy research among psychiatric patients. Indian J Psychiatry. 2023;65(1):12–7. doi: 10.4103/indianjpsychiatry.indianjpsychiatry_464_22 - DOI - PMC - PubMed
    1. Schmalzl L, Powers C, Henje Blom E. Neurophysiological and neurocognitive mechanisms underlying the effects of yoga-based practices: towards a comprehensive theoretical framework. Front Hum Neurosci. 2015;9:235. doi: 10.3389/fnhum.2015.00235 - DOI - PMC - PubMed

LinkOut - more resources