Treffer: MyESL: A Software for Evolutionary Sparse Learning in Molecular Phylogenetics and Genomics.

Title:
MyESL: A Software for Evolutionary Sparse Learning in Molecular Phylogenetics and Genomics.
Authors:
Sanderford M; Institute for Genomics and Evolutionary Medicine, Temple University, 1925 N. 12th Street, Philadelphia, PA 19122, USA., Sharma S; Institute for Genomics and Evolutionary Medicine, Temple University, 1925 N. 12th Street, Philadelphia, PA 19122, USA.; Department of Biology, Temple University, 1900 N. 12th Street, Philadelphia, PA 19122, USA., Stecher G; Institute for Genomics and Evolutionary Medicine, Temple University, 1925 N. 12th Street, Philadelphia, PA 19122, USA., Suleski M; Institute for Genomics and Evolutionary Medicine, Temple University, 1925 N. 12th Street, Philadelphia, PA 19122, USA., Liu J; Infinia ML Inc., 4309 Emperor Blvd, Durham, NC 27703, USA., Ye J; Zhejiang Lab, 2880 Wenyi West Road, Hangzhou, Zhejlang 311100, P.R. China., Kumar S; Institute for Genomics and Evolutionary Medicine, Temple University, 1925 N. 12th Street, Philadelphia, PA 19122, USA.; Department of Biology, Temple University, 1900 N. 12th Street, Philadelphia, PA 19122, USA.
Source:
Molecular biology and evolution [Mol Biol Evol] 2025 Oct 01; Vol. 42 (10).
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Oxford University Press Country of Publication: United States NLM ID: 8501455 Publication Model: Print Cited Medium: Internet ISSN: 1537-1719 (Electronic) Linking ISSN: 07374038 NLM ISO Abbreviation: Mol Biol Evol Subsets: MEDLINE
Imprint Name(s):
Publication: 2003- : New York, NY : Oxford University Press
Original Publication: [Chicago, Ill.] : University of Chicago Press, [c1983-
Comments:
Update of: ArXiv. 2025 Jan 9:arXiv:2501.04941v1.. (PMID: 39867426)
References:
Mol Biol Evol. 2024 Jul 3;41(7):. (PMID: 38916040)
Mol Biol Evol. 2024 Dec 6;41(12):. (PMID: 39708372)
Nature. 2024 May;629(8013):851-860. (PMID: 38560995)
BMC Bioinformatics. 2020 Sep 15;21(1):407. (PMID: 32933477)
G3 (Bethesda). 2016 Dec 7;6(12):3927-3939. (PMID: 27672114)
Nat Commun. 2025 Apr 04;16(1):3217. (PMID: 40185716)
Mol Biol Evol. 2021 Oct 27;38(11):4674-4682. (PMID: 34343318)
Nat Ecol Evol. 2017 Apr 10;1(5):126. (PMID: 28812701)
Grant Information:
R35 GM139540 United States GM NIGMS NIH HHS; R35GM139540-05 United States NH NIH HHS
Contributed Indexing:
Keywords: machine learning; molecular evolution; phylogenomics; sparse learning
Entry Date(s):
Date Created: 20250919 Date Completed: 20251006 Latest Revision: 20251014
Update Code:
20251014
PubMed Central ID:
PMC12498521
DOI:
10.1093/molbev/msaf224
PMID:
40971735
Database:
MEDLINE

Weitere Informationen

Evolutionary sparse learning uses supervised machine learning to build evolutionary models where genomic sites loci are parameters. It uses the Least Absolute Shrinkage and Selection Operator with bi-level sparsity to connect a specific phylogenetic hypothesis with sequence variation across genomic loci. The MyESL software addresses the need for open-source tools to perform evolutionary sparse learning analyses, offering features to preprocess input phylogenomic alignments, post-process output models to generate molecular evolutionary metrics, and make Least Absolute Shrinkage and Selection Operator regression adaptable and efficient for phylogenetic trees and alignments. The core of MyESL, which constructs models with logistic regressions using bi-level sparsity, is written in C++. Its input data preprocessing and result post-processing tools are developed in Python. Compared to other tools, MyESL is more computationally efficient and provides evolution-friendly inputs and outputs. These features have already enabled the use of MyESL in two phylogenomic applications, one to identify outlier sequences and fragile clades in inferred phylogenies and another to build genetic models of convergent traits. In addition to the use in a Python environment, MyESL is available as a standalone executable compatible across multiple platforms, which can be directly integrated into scripts and third-party software. The source code, executable, and documentation for MyESL are openly accessible at https://github.com/kumarlabgit/MyESL.
(© The Author(s) 2025. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.)

Conflict of Interest: None declared.