Title
A set of Alu-free frequent decamers from mammalian genomes enriched in transcription factor signals.
Abstract
We have recently reported that the statistical analysis of the frequency distribution of short oligonucleotides within mammalian and viral genomes allows the pr oduction of sets of DNA sequences enriched in signals for transcription factors. Such statistical approaches could facilitate the identification of new promoter regions playing a role in the transcriptional regulation of gene expression. In the case of mammalian oligonucleotides, we found that the published set of frequent decamers enriched in transcriptional motifs is not suitable for studies on genes of Home sapiens and evolutionarily related genomes, because it contains decameric sequences belonging to genomic repeats. We report here that most of the decameric sequences of DNA repeats belong to Alu repeats. Accordingly, we produced a subset of Alu-free frequent decamers. In addition, we eliminated from the subset of Alu-free frequent decamers those that are frequently present within other common human repeats, including (GT)(n), (AT)(n), (CA)(n), (ATT)(n), (CAA)(n), and (GTT)(n). The Alu-free (repeats-free) subset of frequent mammalian decamers is enriched in signals for transcription factors and allows the identification of putative signals in genes, such as those coding for plasminogen activator, adenosine deaminase and p53, that contain a large number of Alu-like repeats interspersed within our genomic sequences. The newly generated compilation of frequent decamers described here might be used to locate genomic regions playing functional roles in the expression of genes of Home sapiens and related primates.
Year
DOI
Venue
1994
10.1093/bioinformatics/10.5.501
COMPUTER APPLICATIONS IN THE BIOSCIENCES
Keywords
Field
DocType
transcription factor
Genome,Sequence alignment,Alu element,Transcription (biology),Gene,Biology,Nucleic acid sequence,DNA,DNA sequencing,Bioinformatics,Genetics
Journal
Volume
Issue
ISSN
10
5
0266-7061
Citations 
PageRank 
References 
0
0.34
0
Authors
5
Name
Order
Citations
PageRank
Roberto Gambari193.02
Stefano Volinia29418.64
C Nesti300.34
C Scapoli401.69
I Barrai559.96