Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar 26;6(3):e1000734.
doi: 10.1371/journal.pcbi.1000734.

A human-specific de novo protein-coding gene associated with human brain functions

Affiliations

A human-specific de novo protein-coding gene associated with human brain functions

Chuan-Yun Li et al. PLoS Comput Biol. .

Abstract

To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203). Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer's brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Gene structure of FLJ33706, a human-specific de novo protein-coding gene.
Data for the tracks ‘Spliced Human EST’ and ‘Human mRNA’ was extracted and assembled from UCSC Genome Browser. We re-sequenced all available mRNAs and spliced ESTs, shown in the track ‘Re-sequenced ESTs/mRNAs’. On the basis of these data, we inferred gene structure for this novel gene, with six exons marked as ‘1∼6’ in the track ‘FLJ33706 Gene Structure’. The exons partially derived from re-sequenced data were highlighted in green. An ORF with two short coding exons located at exon 3 and exon 4 was identified to encode a 194-amino-acid-long peptide (track ‘Open Reading Frame (ORF)’). Newly inserted transposable elements, especially Alu sequences, contributed substantially to the formation of the first coding exon and six standard splicing junctions on the branch leading to human and chimpanzee, marked as ‘a∼f’ in the track ‘FLJ33706 Gene Structure’. All repeat elements in this region were shown in track ‘RepeatMasker’, extracted from UCSC Genome Browser. Coding exons in tracks ‘Spliced Human EST’, ‘Human mRNA’ and ‘FLJ33706 Gene Structure’ were represented by higher vertical bars, while UTR regions and intronic regions were represented by lower vertical bars. Size scales were added in the figure to give benchmarks for gene sizes. Tracks with different size scales were separated by horizon lines.
Figure 2
Figure 2. Syntenic alignments of five flanking splicing junctions of FLJ33706.
(Top) The syntenic alignments of five flanking splicing junctions of FLJ33706 among 13 species were shown, in which intron regions were highlighted in blue boxes and splicing sites were highlighted in red. Lineage-specific insertions, unalignable bases in the gap region and uncertain regions were were marked as ‘-’, ‘ = ’ and ‘N’, respectively. (Bottom) The splicing sites from 8 species were shown in the context of the phylogeny. Red ‘Y’ represents presence of the splicing signal and blue ‘N’ represents absence. Exons and introns are not drawn to scale.
Figure 3
Figure 3. Alignment of human FLJ33706 ORF with orthologous genomic sequences in four other primates.
For each position in human FLJ33706 ORF, the corresponding orthologous genomic sequences in chimpanzee, gorilla, orangutan and rhesus monkey were aligned to human reference to identify the types of variations. Only amino acid sites with at least one variation among “Human-Human” (SNP), “Human-Chimpanzee”, “Human-Gorilla”, “Human-Orangutan” or “Human-Rhesus” were shown. Identical sites were shown as black dots while divergent sites were shown in red (non-synonymous mutations), green (synonymous mutations) and blue (SNP). Two human-specific mutations that escaped stop codons were highlighted by black frames. Amino acids with non-synonymous variations were highlighted in red while synonymous variations in blue. All sequencing data in this study were traced and manually checked to ensure reliability.
Figure 4
Figure 4. FLJ33706 mRNA expression in peripheral tissues and brain regions.
FLJ33706 mRNA levels were measured in eight peripheral tissues and eight brain regions using TaqMan-based Real-Time PCR system. Relative quantity was calculated using expression means of human leukocyte as references (Fold = 1.0). FLJ33706 had relatively higher expression levels in brain regions (highlighted in red) than in peripheral tissues (shown in white). The expression levels of human BDNF1 and BDNF4 (shown in blue bars) in cortex (BDNF1_CTX, BDNF4_CTX) were also compared with those of FLJ33706 using also leukocyte FLJ33706 expression as a reference.
Figure 5
Figure 5. Western-blot assays to determine the protein expression levels of FLJ33706.
(A) A specific band with molecular mass of about 22 kDa in SDS-PAGE, which was consistent with the predicted molecular weight of FLJ33706 putative proteins, was detected in the Western-blot assay. The band could not be detected in pre-immune control and peptide competition control. Pre: pre-immune reaction assay; Ab: FLJ33706 antibody assay; Pep: peptide competition assay. (B) E. coli samples before and after the transformation of FLJ33706 recombination plasmids were assayed by Western blot using (a) His-tag specific antibody and (b) anti-FLJ33706. FLJ33706 expression in human cortex was shown in (c) as the control. Before: E. coli samples before the plasmid transformation; After: E. coli samples after the plasmid transformation. (C) The specific band can be detected in all human brain regions, but not in mouse brain regions used as controls. H-CTX, M-CTX: human/mouse cortex; H-MID, M-MID: human/mouse midbrain; H-CER, M-CER: human/mouse cerebellum; (D) FLJ33706 expression can be detected in different human individuals. 384, 994, 275, 995, 705, 710, 1277: individual IDs. 22KD: theoretical molecular weight of FLJ33706 protein; 42KD: molecular weight of beta-actin protein as endogenous control. In A and C, antibodies for FLJ33706 and endogenous control were mixed in Western assays.
Figure 6
Figure 6. Immunohistochemistry studies of FLJ33706 protein in human brain.
Results from confocal immunofluorescence imaging to visualize FLJ33706 protein (red) in normal human cortex were shown. Neurons were marked with beta-tubulin-III (green). The nucleus was also stained with DAPI (blue). The optimal dilution of FLJ33706 antibody was optimised based on the detection of cytoplasmic signal in brain cells. The three rows showed results from three independent experiments.

Similar articles

Cited by

References

    1. Ohno S. Berlin: Springer-Verlag; 1970. Evolutionby gene duplication.
    1. Leister D. Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance gene. Trends Genet. 2004;20:116–122. - PubMed
    1. Marques AC, Dupanloup I, Vinckenbosch N, Reymond A, Kaessmann H. Emergence of young human genes after a burst of retroposition in primates. PLoS Biol. 2005;3:e357. - PMC - PubMed
    1. Long M, Langley CH. Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. Science. 1993;260:91–95. - PubMed
    1. Wang W, Zhang J, Alvarez C, Llopart A, Long M. The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster. Mol Biol Evol. 2000;17:1294–1301. - PubMed

Publication types

Substances