Title:
DockerBIO: web application for efficient use of bioinformatics Docker images.
Authors:
Kwon C; Department of Computer Science and Engineering, Incheon National University, Incheon, The Republic of Korea.; MyGenomeBox, Co, Incheon, The Republic of Korea., Kim J; MyGenomeBox, Co, Incheon, The Republic of Korea., Ahn J; Department of Computer Science and Engineering, Incheon National University, Incheon, The Republic of Korea.
Source:
PeerJ [PeerJ] 2018 Nov 27; Vol. 6, pp. e5954. Date of Electronic Publication: 2018 Nov 27 (Print Publication: 2018).
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: PeerJ Inc; Country of Publication: United States; NLM ID: 101603425; Publication Model: eCollection; Cited Medium: Print; ISSN: 2167-8359 (Print); Linking ISSN: 21678359; NLM ISO Abbreviation: PeerJ; Subsets: PubMed not MEDLINE
Imprint Name(s):
Original Publication: Corte Madera, CA : PeerJ Inc.
Contributed Indexing:
Keywords: Bioinformatics; DNA pipeline; DNA-Seq; Docker; Dockerbio; Mygenomebox; NGS pipeline; RNA pipeline; RNA-Seq
Entry Date(s):
Date Created: 20181206; Latest Revision: 20220330
Update Code:
20250114
PubMed Central ID:
PMC6266945
DOI:
10.7717/peerj.5954
PMID:
30515360
Database:
MEDLINE

Further Information

Background and Objective: Docker is a lightweight containerization platform whose performance is nearly identical to that of a native local environment. Recently, many bioinformatics tools have been distributed as Docker images that bundle not only the tools themselves but also their complex setup: libraries, configurations, and, where needed, data. Users can simply download and run these images without having to compile or configure anything, and they obtain reproducible results. Despite these advantages, several problems remain. First, there are no clear standards for distributing Docker images, and Docker Hub often hosts multiple images that serve the same purpose but are used differently. As a result, users can find it difficult to choose an image and learn how to use it. Second, Docker images are often unsuitable as pipeline components because many of them embed large datasets. Moreover, a group of users can run into difficulties when sharing a pipeline composed of Docker images: group members may modify scripts or use different versions of the data, which leads to inconsistent results.
Methods and Results: To address these problems, we developed DockerBIO, a Java web application that provides reliable, verified, lightweight Docker images for a variety of bioinformatics tools and reference data. With DockerBIO, users can easily build a pipeline from the registered tools and data and, if necessary, register new tools or data. Completed pipelines are themselves registered in DockerBIO, which provides an efficient environment for running them. This enables user groups to run their pipelines without spending much effort copying and modifying them.

ChangHyuk Kwon and Jason Kim are employed by MyGenomeBox, Co.