Serviceeinschränkungen vom 12.-22.02.2026 - weitere Infos auf der UB-Homepage

Treffer: An integrated machine learning approach for predicting DosR-regulated genes in Mycobacterium tuberculosis.

Title:
An integrated machine learning approach for predicting DosR-regulated genes in Mycobacterium tuberculosis.
Authors:
Zhang Y; School of Crystallography, Birkbeck College, University of London, Malet Street, London, WC1E 7HX, UK., Hatch KA, Bacon J, Wernisch L
Source:
BMC systems biology [BMC Syst Biol] 2010 Mar 31; Vol. 4, pp. 37. Date of Electronic Publication: 2010 Mar 31.
Publication Type:
Journal Article; Research Support, Non-U.S. Gov't
Language:
English
Journal Info:
Publisher: BioMed Central Country of Publication: England NLM ID: 101301827 Publication Model: Electronic Cited Medium: Internet ISSN: 1752-0509 (Electronic) Linking ISSN: 17520509 NLM ISO Abbreviation: BMC Syst Biol Subsets: MEDLINE
Imprint Name(s):
Original Publication: London : BioMed Central
References:
Bioinformatics. 2008 Jun 1;24(11):1349-58. (PMID: 18400771)
Bioinformatics. 2006 Mar 15;22(6):739-46. (PMID: 16368767)
BMC Genomics. 2008 Feb 22;9:87. (PMID: 18294384)
Tuberculosis (Edinb). 2004;84(3-4):247-55. (PMID: 15207494)
BMC Bioinformatics. 2007 Feb 23;8:61. (PMID: 17319944)
Metab Eng. 2005 Mar;7(2):128-41. (PMID: 15781421)
Bioinformatics. 2004 Aug 4;20 Suppl 1:i248-56. (PMID: 15262806)
Genome Biol. 2006;7(3):R25. (PMID: 16584535)
Tuberculosis (Edinb). 2004;84(3-4):205-17. (PMID: 15207490)
Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15522-7. (PMID: 14673099)
Microbiology (Reading). 2005 Dec;151(Pt 12):4045-4053. (PMID: 16339949)
Mol Microbiol. 2003 May;48(3):833-43. (PMID: 12694625)
Theor Biol Med Model. 2005 Jun 24;2:23. (PMID: 15978125)
Microbiology (Reading). 2007 May;153(Pt 5):1435-1444. (PMID: 17464057)
IEEE Trans Pattern Anal Mach Intell. 2008 Feb;30(2):283-98. (PMID: 18084059)
Proc Natl Acad Sci U S A. 2001 Jun 19;98(13):7534-9. (PMID: 11416222)
Bioinformatics. 2006 Nov 15;22(22):2775-81. (PMID: 16966362)
Grant Information:
G0802079 United Kingdom MRC_ Medical Research Council; MC_U105260799 United Kingdom MRC_ Medical Research Council; United Kingdom WT_ Wellcome Trust; United Kingdom DH_ Department of Health
Substance Nomenclature:
0 (Bacterial Proteins)
0 (DNA-Binding Proteins)
0 (DosR protein, Mycobacterium tuberculosis)
0 (RNA, Messenger)
0 (Tumor Suppressor Protein p53)
EC 2.7.- (Protein Kinases)
S88TT14065 (Oxygen)
Entry Date(s):
Date Created: 20100402 Date Completed: 20100728 Latest Revision: 20250529
Update Code:
20250530
PubMed Central ID:
PMC2867773
DOI:
10.1186/1752-0509-4-37
PMID:
20356371
Database:
MEDLINE

Weitere Informationen

Background: DosR is an important regulator of the response to stress such as limited oxygen availability in Mycobacterium tuberculosis. Time course gene expression data enable us to dissect this response on the gene regulatory level. The mRNA expression profile of a regulator, however, is not necessarily a direct reflection of its activity. Knowing the transcription factor activity (TFA) can be exploited to predict novel target genes regulated by the same transcription factor. Various approaches have been proposed to reconstruct TFAs from gene expression data. Most of them capture only a first-order approximation to the complex transcriptional processes by assuming linear gene responses and linear dynamics in TFA, or ignore the temporal information in data from such systems.
Results: In this paper, we approach the problem of inferring dynamic hidden TFAs using Gaussian processes (GP). We are able to model dynamic TFAs and to account for both linear and nonlinear gene responses. To test the validity of the proposed approach, we reconstruct the hidden TFA of p53, a tumour suppressor activated by DNA damage, using published time course gene expression data. Our reconstructed TFA is closer to the experimentally determined profile of p53 concentration than that from the original study. We then apply the model to time course gene expression data obtained from chemostat cultures of M. tuberculosis under reduced oxygen availability. After estimation of the TFA of DosR based on a number of known target genes using the GP model, we predict novel DosR-regulated genes: the parameters of the model are interpreted as relevance parameters indicating an existing functional relationship between TFA and gene expression. We further improve the prediction by integrating promoter sequence information in a logistic regression model. Apart from the documented DosR-regulated genes, our prediction yields ten novel genes under direct control of DosR.
Conclusions: Chemostat cultures are an ideal experimental system for controlling noise and variability when monitoring the response of bacterial organisms such as M. tuberculosis to finely controlled changes in culture conditions and available metabolites. Nonlinear hidden TFA dynamics of regulators can be reconstructed remarkably well with Gaussian processes from such data. Moreover, estimated parameters of the GP can be used to assess whether a gene is controlled by the reconstructed TFA or not. It is straightforward to combine these parameters with further information, such as the presence of binding motifs, to increase prediction accuracy.