Treffer: Semblans: automated assembly and processing of RNA-seq data.
BMC Genomics. 2013 May 14;14:328. (PMID: 23672450)
Am J Bot. 2018 Mar;105(3):446-462. (PMID: 29738076)
PLoS Comput Biol. 2011 Oct;7(10):e1002195. (PMID: 22039361)
PeerJ. 2018 Aug 3;6:e5428. (PMID: 30083482)
Bioinformatics. 2019 Jul 1;35(13):2199-2207. (PMID: 30452539)
Nat Methods. 2015 Jan;12(1):59-60. (PMID: 25402007)
BMC Bioinformatics. 2023 Apr 4;24(1):133. (PMID: 37016291)
Nature. 2019 Oct;574(7780):679-685. (PMID: 31645766)
Bioinformatics. 2014 Aug 1;30(15):2114-20. (PMID: 24695404)
Genome Biol. 2014 Jul 26;15(7):410. (PMID: 25063469)
Genome Res. 2016 Aug;26(8):1134-44. (PMID: 27252236)
PeerJ. 2023 Nov 27;11:e16456. (PMID: 38034874)
Mol Ecol Resour. 2022 Jul;22(5):2070-2086. (PMID: 35119207)
Nat Methods. 2017 Apr;14(4):417-419. (PMID: 28263959)
Genome Biol. 2019 Nov 28;20(1):257. (PMID: 31779668)
Genome Res. 2003 Sep;13(9):2129-41. (PMID: 12952881)
Mol Biol Evol. 2014 Nov;31(11):3081-92. (PMID: 25158799)
Gigascience. 2015 Oct 19;4:48. (PMID: 26500767)
Weitere Informationen
Motivation: Recent advancements in parallel sequencing methods have precipitated a surge in publicly available short-read sequence data. This has encouraged the development of novel computational tools for the de novo assembly of transcriptomes from RNA-seq data. Despite the availability of these tools, performing an end-to-end transcriptome assembly remains a programmatically involved task necessitating familiarity with best practices. Aside from quality control steps, including error correction, adapter trimming, and chimera filtration needing to be correctly used, moving data between programs often requires manual reformatting or restructuring, which can further impede throughput. Here, we introduce Semblans, a tool for streamlining the assembly process that efficiently and consistently produces high-quality transcriptome assemblies.
Results: Semblans abstracts the key quality control, reconstitution, and postprocessing steps of transcriptome assembly from raw short-read sequences to annotated coding sequences. Evaluating its performance against previously assembled transcriptomes on the basis of assembly quality, we find that Semblans produced higher quality assemblies for 98 of the 101 short-read runs tested.
Availability and Implementation: Semblans is written in C++ and runs on Unix-compliant operating systems. Source code, documentation, and compiled binaries are hosted under the GNU General Public License at https://github.com/gladshire/Semblans.
(© The Author(s) 2025. Published by Oxford University Press.)