Microbiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Microbiology 153 (2007), 3631-3644; DOI  10.1099/mic.0.2007/006205-0
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplementary data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ibrahim, M.
Right arrow Articles by Gardan, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ibrahim, M.
Right arrow Articles by Gardan, R.
Agricola
Right arrow Articles by Ibrahim, M.
Right arrow Articles by Gardan, R.
Microbiology 153 (2007), 3631-3644; DOI  10.1099/mic.0.2007/006205-0
© 2007 Society for General Microbiology

A genome-wide survey of short coding sequences in streptococci

Mariam Ibrahim1,{dagger}, Pierre Nicolas2,{dagger}, Philippe Bessières2, Alexander Bolotin3, Véronique Monnet1 and Rozenn Gardan1

1 Unité de Biochimie Bactérienne, UR477, INRA, 78350 Jouy-en-Josas, France
2 Unité Mathématique Informatique et Génome, UR1077, INRA, 78350 Jouy-en-Josas, France
3 Unité de Génétique Microbienne, UR895, INRA, 78350 Jouy-en-Josas, France

Correspondence
Rozenn Gardan
rozenn.gardan{at}jouy.inra.fr

Identification of short genes that encode peptides of fewer than 60 aa is challenging, both experimentally and in silico. As a consequence, the universe of these short coding sequences (CDSs) remains largely unknown, although some are acknowledged to play important roles in cell–cell communication, particularly in Gram-positive bacteria. This paper reports a thorough search for short CDSs across streptococcal genomes. Our bioinformatic approach relied on a combination of advanced intrinsic and extrinsic methods. In the first step, intrinsic sequence information (nucleotide composition and presence of RBSs) served to identify new short putative CDSs (spCDSs) and to eliminate the differences between annotation policies. In the second step, pseudogene fragments and false predictions were filtered out. The last step consisted of screening the remaining spCDSs for lines of extrinsic evidence involving sequence and gene-context comparisons. A total of 789 spCDSs across 20 complete genomes (19 Streptococcus and one Enterococcus) received the support of at least one line of extrinsic evidence, which corresponds to an average of 20 short CDSs per million base pairs. Most of these had no known function, and a significant fraction (31 %) are not even annotated as hypothetical genes in GenBank records. As an illustration of the value of this list, we describe a new family of CDSs, encoding very short hydrophobic peptides (20–23 aa) situated just upstream of some of the positive transcriptional regulators of the Rgg family. The expression of seven other short CDSs from Streptococcus thermophilus CNRZ1066 that encode peptides ranging in length from 41 to 56 aa was confirmed by real-time quantitative RT-PCR and revealed a variety of expression patterns. Finally, one peptide from this list, encoded by a gene that is not annotated in GenBank, was identified in a cell-envelope-enriched fraction of S. thermophilus CNRZ1066.


Abbreviations: CDS, coding sequence; CT, threshold cycle; HMM, hidden Markov model; LC-MS/MS, liquid chromatography–tandem MS; pHMM, probability of a ‘true positive’ prediction after decoding with HMM; SHP, short hydrophobic peptide; spCDS, short putative CDS

{dagger}These authors contributed equally to this work.

Two supplementary tables listing spCDSs of the genomes studied that passed the filtration of pseudogenes and false predictions, and predicted and annotated short CDSs supported by extrinsic evidence in one Enterococcus faecalis and 18 streptococcal genomes, and a supplementary figure showing the chromosomal context of CDSs encoding the SHPs associated with rgg genes, are available with the online version of this paper.




This article has been cited by other articles:


Home page
J. Bacteriol.Home page
M. Ibrahim, A. Guillot, F. Wessner, F. Algaron, C. Besset, P. Courtin, R. Gardan, and V. Monnet
Control of the Transcription of a Short Gene Encoding a Cyclic Peptide in Streptococcus thermophilus: a New Quorum-Sensing System?
J. Bacteriol., December 15, 2007; 189(24): 8844 - 8854.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
INT J SYST EVOL MICROBIOL MICROBIOLOGY J GEN VIROL
J MED MICROBIOL ALL SGM JOURNALS
Copyright © 2007 Society for General Microbiology.