Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 23;17(3):e0264776.
doi: 10.1371/journal.pone.0264776. eCollection 2022.

Analysis of transcribed sequences from young and mature zebrafish thrombocytes

Affiliations

Analysis of transcribed sequences from young and mature zebrafish thrombocytes

Weam Fallatah et al. PLoS One. .

Abstract

The zebrafish is an excellent model system to study thrombocyte function and development. Due to the difficulties in separating young and mature thrombocytes, comparative transcriptomics between these two cell types has not been performed. It is important to study these differences in order to understand the mechanism of thrombocyte maturation. Here, we performed single-cell RNA sequencing of the young and mature zebrafish thrombocytes and compared the two datasets for young and mature thrombocyte transcripts. We found a total of 9143 genes expressed cumulatively in both young and mature thrombocytes, and among these, 72% of zebrafish thrombocyte-expressed genes have human orthologs according to the Ensembl human genome annotation. We also found 397 uniquely expressed genes in young and 2153 uniquely expressed genes in mature thrombocytes. Of these 397 and 2153 genes, 272 and 1620 corresponded to human orthologous genes, respectively. Of all genes expressed in both young and mature thrombocytes, 4224 have been reported to be expressed in human megakaryocytes, and 1603 were found in platelets. Among these orthologs, 156 transcription factor transcripts in thrombocytes were found in megakaryocytes and 60 transcription factor transcripts were found in platelets including a few already known factors such as Nfe2 and Nfe212a (related to Nfe2) that are present in both megakaryocytes, and platelets. These results indicate that thrombocytes have more megakaryocyte features and since platelets are megakaryocyte fragments, platelets also appear to be thrombocyte equivalents. In conclusion, our study delineates the differential gene expression patterns of young and mature thrombocytes, highlighting the processes regulating thrombocyte maturation. Future knockdown studies of these young and mature thrombocyte-specific genes are feasible and will provide the basis for understanding megakaryocyte maturation.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. PANTHER analysis and classification of GFP+ and RFP+ thrombocyte transcripts.
A. When 8,746 genes expressed in GFP+ and 6990 genes expressed in RFP+ thrombocytes thrombocytes were uploaded into the PANTHER program, and Danio rerio species was selected, GFP+ 7452 genes and RFP+ 5978 genes, respectively were detected by this program. The program yielded a total number of protein hits of 4704 and 3811 for these GFP+ and RFP+genes, respectively. B. When 2153 unique genes expressed in GFP+ thrombocytes and 397 unique genes expressed in RFP+ thrombocytes were uploaded into the PANTHER program, and Danio rerio species was selected, GFP+ 1778 genes and RFP+ 300 genes, respectively, were detected by this program. The program yielded a total number of protein hits of 1104 and 210 for these unique GFP+ and RFP+unique genes, respectively. In both A and B, the percentage of each of the protein classes for GFP+ and RFP+ gene lists are shown by different colors in the pie chart, and their classification is provided by its side with similarly coded colors.
Fig 2
Fig 2. Heat map of differentially expressed genes between GFP+ and RFP+ thrombocytes.
Genes were arbitrarily clustered via k-means clustering ranging from k = 2 to k = 10. The expression of genes is based on the average mean counts of cells falling into the generated clusters. The cellRanger t-SNE graphs were used in choosing the best clustering and 4 clusters were found. A spreadsheet with a sheet for each cluster type was generated. The counts covered a large range (0–120+), and row-wise scaling on the values was performed, which provided a range of -2 to 2. A. Expression of all genes, 8746 genes in GFP+ thrombocytes and 6990 genes in RFP+ thrombocytes. B. Expression of chromatin/chromatin-binding or regulatory protein genes, 120 genes in GFP+ thrombocytes and 106 genes in RFP+ thrombocytes. C. Expression of cytoskeleton protein genes, 217 genes in GFP+ thrombocytes and 158 genes in RFP+ thrombocytes. D. Expression of nucleic acid-binding protein genes, 535 genes in GFP+ thrombocytes, and 478 genes in RFP+ thrombocytes. E. Expression of gene-specific transcriptional regulator genes, 449 genes in GFP+ thrombocytes, and 329 genes in RFP+ thrombocytes. Purple indicates GFP+ thrombocytes, and pink indicates RFP+ thrombocytes followed by clusters from 1 to 4 shown in light green to dark green. For each gene, red is upregulated, and blue is downregulated in the corresponding sample.
Fig 3
Fig 3. Transcripts per million (TPM) distribution for each of the two samples GFP_L001 and RFP_L001 representing GFP+ and RFP+ thrombocytes.
Transcripts from 24 scRNASeq files, 12 for GFP+ and 12 for RFP+ thrombocytes were quantified. Among these files, 4 GFP+, and 4 RFP+ files (in fastq format) that showed a mapping percentage of almost 45–46% to the zebrafish genome were chosen. The 4 GFP fastq files were merged to one fastq file, and similarly 4 RFP fastq files were also merged to one fastq file and used in TPM analysis. Bar graphs show TPM distribution obtained in this analysis and are divided into five intervals for each sample representing the number of genes with TPM values from 0 to >30, and these TPM values are coded by different colors with a key shown below.
Fig 4
Fig 4. Quantile mapping of GFP+ thrombocyte and RFP+ thrombocyte datasets.
We have divided the expressed genes in each dataset into 10 quantiles. We have compared the genes in each quantile across the pair of datasets. In each quantile, the quantile map shows the number of common genes between the two datasets. The bars on the right of the quantile map indicate the color codes and the number of genes. The data value bars on the rightmost indicate mean TPM values; green color, the highest TPM values for GFP+ thrombocytes, and purple color, the highest TPM values for RFP+ thrombocytes.
Fig 5
Fig 5. Gene ontology analysis of GFP+ and RFP+ thrombocyte transcripts.
The highly expressed gene lists for GFP+ and RFP+ thrombocytes were analyzed separately using DAVID to identify gene ontology (GO) categories. The top 15 GO enrichment analysis of A. GFP+ thrombocytes and B. RFP+ thrombocytes by DAVID functional analysis. Bars represent the number (x-axis) of highly expressed genes in pathways (y-axis) for each dataset.
Fig 6
Fig 6. Significantly enriched Kyoto encyclopedia of genes and genomes.
The highly expressed gene lists for GFP+ and RFP+ thrombocytes were analyzed separately using the DAVID to identify enriched KEGG pathways. Bars represent the number of highly expressed genes in pathways for GFP+ and RFP+ thrombocyte datasets.
Fig 7
Fig 7. Different categories of human ortholog gene transcripts found in GFP+ and RFP+ thrombocytes.
The raw data was used to obtain the list of zebrafish genes expressed in GFP+ thrombocytes and RFP+ thrombocytes using Ensembl and ZFIN databases. The list of human orthologs corresponding to zebrafish genes was also obtained using Ensembl and ZFIN databases. The list of these human orthologs were uploaded into the PANTHER program to obtain protein categories as well as to find human orthologs corresponding to unique and commonly expressed genes in GFP+ and RFP+ thrombocytes. The bar graph represents the corresponding human orthologs of the unique RFP+ genes, unique GFP+ genes, and both RFP+ and GFP+ common genes shown in red, green, and purple colors, respectively. The number of human ortholog genes is shown in each bar, including their functional protein categories. The protein categories are gene-specific transcriptional regulators, nucleic acid-binding proteins, cytoskeleton proteins, and chromatin/chromatin-binding or regulatory proteins.
Fig 8
Fig 8. PANTHER analysis and classification of human orthologs of total thrombocyte gene transcripts found in human megakaryocytes and platelets.
A. The 9143 total thrombocyte gene transcripts and 7708 human megakaryocyte gene transcripts were combined in a spreadsheet and color-coded. Then duplicate genes were selected first and the duplicates were eliminated to give a list of 4224 human orthologs which were uploaded into the PANTHER program, and Danio rerio species was selected. The program detected 4119 genes and yielded a total number of 2633 protein hits for these genes. B. The 9143 total thrombocyte gene transcripts and 2819 human platelet gene transcripts were combined in a spreadsheet and color-coded. Then duplicate gene transcripts were selected first, and the duplicates were eliminated to give a list of 1603 gene transcripts for human orthologs, which were uploaded into the PANTHER program, and Danio rerio species was selected. The program detected 1553 genes and yielded a total number of 987 protein hits for these genes. In both A and B, the percentage of each of the protein classes for RFP+ and GFP+ gene lists are shown by different colors in the pie chart, and their classification is provided by its side with similarly coded colors.

References

    1. Thattaliyath B, Cykowski M, Jagadeeswaran P. Young thrombocytes initiate the formation of arterial thrombi in zebrafish. Blood. 2005;106(1):118–24. Epub 2005/03/17. doi: 10.1182/blood-2004-10-4118 ; PubMed Central PMCID: PMC1895120. - DOI - PMC - PubMed
    1. Kulkarni V, Kim S, Zafreen L, Jagadeeswaran P. Separation of young and mature thrombocytes by a novel immuno-selection method. Blood Cells Mol Dis. 2012;48(3):183–7. Epub 2012/02/03. doi: 10.1016/j.bcmd.2011.12.006 ; PubMed Central PMCID: PMC6462260. - DOI - PMC - PubMed
    1. Gregory M, Jagadeeswaran P. Selective labeling of zebrafish thrombocytes: quantitation of thrombocyte function and detection during development. Blood Cells Mol Dis. 2002;28(3):418–27. Epub 2002/10/09. doi: 10.1006/bcmd.2002.0527 . - DOI - PubMed
    1. Fallatah W, De Silva IW, Verbeck GF, Jagadeeswaran P. Generation of transgenic zebrafish with 2 populations of RFP- and GFP-labeled thrombocytes: analysis of their lipids. Blood Adv. 2019;3(9):1406–15. Epub 2019/05/06. doi: 10.1182/bloodadvances.2018023960 ; PubMed Central PMCID: PMC6517667. - DOI - PMC - PubMed
    1. Martin JF, Wagner GP. The origin of platelets enabled the evolution of eutherian placentation. Biol Lett. 2019;15(7):20190374. Epub 2019/07/11. doi: 10.1098/rsbl.2019.0374 ; PubMed Central PMCID: PMC6684973. - DOI - PMC - PubMed

Publication types

LinkOut - more resources