Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Nov;42(5):2353-2362.
doi: 10.3892/ijmm.2018.3845. Epub 2018 Aug 28.

Comprehensive literature data-mining analysis reveals a broad genetic network functionally associated with autism spectrum disorder

Affiliations

Comprehensive literature data-mining analysis reveals a broad genetic network functionally associated with autism spectrum disorder

Cheng Xu et al. Int J Mol Med. 2018 Nov.

Abstract

Previous studies have indicated that genetic factors are the predominate cause of Autism spectrum disorder (ASD). Nevertheless, to the best of our knowledge, to date no systematic study has summarized these data and provided an objective, complete list of genes with demonstrated associations with ASD. The present study included a literature data mining analysis of >2,064 articles including publications from January 2000 to April 2016, which identified 488 ASD target genes. Gene set enrichment analysis (GSEA), sub‑network enrichment analysis (SNEA) and network connectivity analysis (NCA) were conducted to assess the functional profile and pathogenic significance of these genes. A total of 2 literature metrics were proposed to prioritize the curated ASD genes with specific significance. This approach resulted in the development of an ASD genetic database. Subsequent analysis indicated that 391 of the 488 genes were enriched in 97 biological pathways (P<1x10‑8), demonstrating significant functional associations with each other. The majority of these curated ASD genes also serve significant roles in the pathogenesis of other neuropsychiatric disorders. These results suggest that the genetic causes of ASD are within a large network composed of functionally‑associated genes. The genetic database, together with the metric scores developed in the present study, provides a basis for future biological/genetic modeling in the field.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of 488 genes among different association types. The X-axis represents the gene index (gene 1-488). The Y-axis represents the types of gene-ASD associations. The color at a coordinate (x, y) represents the number of articles reporting a 'y' association type for the gene 'x'. The legend on the right denotes the number-color correspondence; the continuous change of color from blue to red represents a change of number from 0 to 10, where 0 signifies that there are no articles supporting the association. ASD, autism spectrum disorder.
Figure 2
Figure 2
Histogram of the publications describing gene-disease associations between 488 unique genes and autism spectrum disorder. (A) Histogram of article publication date. (B) Histogram of the number of novel genes identified in each year.
Figure 3
Figure 3
All autism spectrum disorder-gene single reference associations rank-ordered by QScore. A higher QScore for a given association was derived from a combination of a recent publication date (blue bar) and high citation rate (red bar). QScore, quality score.
Figure 4
Figure 4
Pathways for (A) behavior, (B) neural function and (C) brain function/development. In all networks, the weight of the line (number attached to the line) between two nodes is the number of genes shared by those two pathways. The size of the nodes represents the P-value (−log10 transformed) of the enriched pathway. The brighter the color of a node, the higher the number of the pathways connected to the node.
Figure 5
Figure 5
Disease-disease interaction network between autism spectrum disorder and the top 10 mental disorders from the sub-network enrichment analysis. CNTNAP2, contactin associated protein like 2; SLC6A4, solute carrier family 6 member 4; MECP2, methyl CpG-binding protein 2; OXT, oxytocin neurophysin 1 prepropeptide; IL6, interleukin 6; FMR1, fragile × mental retardation 1.
Figure 6
Figure 6
Gene network of top 14 genes by NScore and QScore. (A) QScore group; the brighter the color (red), the higher the QScore. (B) NScore group; the brighter the color (green), the higher the NScore. The size of each node represents the number of pathways (from the pathway enrichment analysis) a gene was identified in. The weight of the line between any two nodes is the number of pathways shared by those two genes. NScore, novelty score; QScore, quality score. Due to high number of edges, the weights on the edge may not differentiable. Please refer to ASD_GD→Related Genes, where the related pathways of each gene have been provided.

References

    1. Chaste P, Leboyer M. Autism risk factors: Genes, environment, and gene-environment interactions. Dialogues Clin Neurosci. 2012;14:281–292. - PMC - PubMed
    1. Rutter M. Genetic studies of autism: From the 1970s into the millennium. J Abnorm Child Psychol. 2000;28:3–14. doi: 10.1023/A:1005113900068. - DOI - PubMed
    1. Aronson M, Hagberg B, Gillberg C. Attention deficits and autistic spectrum problems in children exposed to alcohol during gestation: A follow-up study. Dev Med Child Neurol. 1997;39:583–587. doi: 10.1111/j.1469-8749.1997.tb07493.x. - DOI - PubMed
    1. Kolevzon A, Gross R, Reichenberg A. Prenatal and perinatal risk factors for autism: A review and integration of findings. Arch Pediatr Adolesc Med. 2007;161:326–333. doi: 10.1001/archpedi.161.4.326. - DOI - PubMed
    1. Minshew NJ. Brief report: Brain mechanisms in autism: Functional and structural abnormalities. J Autism Dev Disord. 1996;26:205–209. doi: 10.1007/BF02172013. - DOI - PubMed