Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jul;33(7):1870-4.
doi: 10.1093/molbev/msw054. Epub 2016 Mar 22.

MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets

Affiliations

MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets

Sudhir Kumar et al. Mol Biol Evol. 2016 Jul.

Abstract

We present the latest version of the Molecular Evolutionary Genetics Analysis (Mega) software, which contains many sophisticated methods and tools for phylogenomics and phylomedicine. In this major upgrade, Mega has been optimized for use on 64-bit computing systems for analyzing larger datasets. Researchers can now explore and analyze tens of thousands of sequences in Mega The new version also provides an advanced wizard for building timetrees and includes a new functionality to automatically predict gene duplication events in gene family trees. The 64-bit Mega is made available in two interfaces: graphical and command line. The graphical user interface (GUI) is a native Microsoft Windows application that can also be used on Mac OS X. The command line Mega is available as native applications for Windows, Linux, and Mac OS X. They are intended for use in high-throughput and scripted analysis. Both versions are available from www.megasoftware.net free of charge.

Keywords: evolution.; gene families; software; timetree.

PubMed Disclaimer

Figures

F <sc>ig</sc> . 1.
F ig . 1.
Time and memory requirements for phylogenetic analyses using the NJ method ( A , B ) and the ML analysis ( C , D ). For NJ analysis, we used the Tamura–Nei (1993) model, uniform rates of evolution among sites, and pairwise deletion option to deal with the missing data. Time usage increases polynomially with the number of sequences (third degree polynomial, R  2  = 1), as does the peak memory used ( R  2  = 1) ( A , B ). The same model and parameters were used for ML tree inference, where the time taken and the memory needs increased linearly with the number of sequences. For ML analysis, the SPR (Subtree–Pruning–Regrafting) heuristic was used for tree searching and all 5,287 sites in the sequence alignment were included. All the analyses were performed on a Dell Optiplex 9010 computer with an Intel Core-i7-3770 3.4 GHz processor, 20 GB of RAM, NVidia GeForce GT 640 graphics card, and a 64-bit Windows 7 Enterprise operating system.
F <sc>ig</sc> . 2.
F ig . 2.
The Gene Duplication Wizard ( A ) to guide users through the process of searching gene duplication events in a gene family tree. In the first step, the user loads a gene tree from a Newick formatted text file. Second, species associated with sequences are specified using a graphical interface. In the third step, the user has the option to load a trusted species tree, in which case it will be possible to identify all duplication events in the gene tree, from a Newick file. Fourth, the user has the option to specify the root of the gene tree in a graphical interface. If the user provides a trusted species tree, then they must designate the root of that tree. Finally, the user launches the analysis and the results are displayed in the Tree Explorer window (see fig. 3 ).
F <sc>ig</sc> . 3.
F ig . 3.
Tree Explorer window with gene duplications marked with closed blue diamonds and speciation events, if a trusted species tree is provided, are identified by open red diamonds (see fig. 2 legend for more information).

Comment in

References

    1. Edgar RC. 2004. . Muscle: a multiple sequence alignment method with reduced time and space complexity . BMC Bioinformatics 5 : 113.. - PMC - PubMed
    1. Filipski A Murillo O Freydenzon A Tamura K Kumar S. 2014. . Prospects for building large timetrees using molecular data with incomplete gene coverage among species . Mol Biol Evol 31 : 2542 – 2550 . - PMC - PubMed
    1. Hedges SB Marin J Suleski M Paymer M Kumar S. 2015. . Tree of life reveals clock-like speciation and diversification . Mol Biol Evol 32 : 835 – 845 . - PMC - PubMed
    1. Kumar S Stecher G Peterson D Tamura K. 2012. . M ega-cc : computing core of molecular evolutionary genetics analysis program for automated and iterative data analysis . Bioinformatics 28 : 2685 – 2686 . - PMC - PubMed
    1. Kumar S Tamura K Nei M. 1994. . M ega : molecular evolutionary genetics analysis software for microcomputers . Comput Appl Biosci . 10 : 189 – 191 . - PubMed

Publication types