Information theory is a powerful tool for understanding the DNA andRNA patterns that define genetic control systems. My theoretical workis divided into several levels. Level 0 is the study of geneticsequences bound by proteins or other macromolecules, briefly describedbelow. The success of this theory suggested that other aspects ofinformation theory should also apply to molecular biology. Level 1theory introduces the more general concept of the molecular machine,and the concept of a machine capacity equivalent to Shannon's channelcapacity. In Level 2, the Second Law of Thermodynamics is connected tothe capacity theorem. This defines the limits of Maxwell's Demon andmolecular computers. The project also has several interrelatedactivities: developing theory, doing computer analysis, runninggenetic engineering experiments and building nanotechnologies. Inlevel 0 I showed that binding sites on nucleic acids usually containjust about the amount of information needed for molecules to find thesites in the genome. Apparent exceptions to this """"""""working hypothesis""""""""have revealed many new phenomena. The first major anomaly was found atbacteriophage T7 promoters, which conserve twice as much informationas the polymerase requires to locate them. One explanation is that asecond protein binds to the DNA; a second one is that the phage usethe excess to take over the cell. In another case, we discovered thatthe F incD region has a three-fold excess conservation, which impliesthat three proteins bind there. We are investigating these and otheranomalies experimentally. An anomaly in the binding sites for the P1RepA protein led to the hypothesis that the initial step of DNAreplication and RNA transcription is a base flipped out from the DNA.The experimental evidence supports this hypothesis. Two graphicalmethods have been invented to display the structure of binding sites.A sequence logo shows the average patterns in a set of binding sites.The patented sequence walker shows individual binding sites.Displaying many walkers simultaneously has become such a powerful toolfor investigating genetic structure that it can replace consensussequences. Walkers can be used to distinguish mutations frompolymorphisms, and this has clinical applications, especially foranalysis of splice junctions. Molecular information theory givesclear concepts of how molecules use energy. Turning this around tellsus how to build molecular devices. A number of nanotechnologyprojects are in progress, including: a molecular computer (EuropeanPatent 1057118, United States Patent 6,774,222), a method for molecular sequencing (patent pending), and a molecular engine (patentpending).

Agency
National Institute of Health (NIH)
Institute
Division of Basic Sciences - NCI (NCI)
Type
Intramural Research (Z01)
Project #
1Z01BC008396-17
Application #
7291845
Study Section
(CCRN)
Project Start
Project End
Budget Start
Budget End
Support Year
17
Fiscal Year
2005
Total Cost
Indirect Cost
Name
Basic Sciences
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Jeong, Jae-Ho; Kim, Hyun-Ju; Kim, Kun-Hee et al. (2012) An unusual feature associated with LEE1 P1 promoters in enteropathogenic Escherichia coli (EPEC). Mol Microbiol 83:612-22
Shultzaberger, Ryan K; Chen, Zehua; Lewis, Karen A et al. (2007) Anatomy of Escherichia coli sigma70 promoters. Nucleic Acids Res 35:771-88
Bindewald, Eckart; Schneider, Thomas D; Shapiro, Bruce A (2006) CorreLogo: an online server for 3D sequence logos of RNA and DNA alignments. Nucleic Acids Res 34:W405-11
Schneider, Thomas D (2006) Claude Shannon: biologist. The founder of information theory used biology to formulate the channel capacity. IEEE Eng Med Biol Mag 25:30-3
Schneider, Thomas D (2006) Twenty Years of Delila and Molecular Information Theory: The Altenberg-Austin Workshop in Theoretical Biology Biological Information, Beyond Metaphor: Causality, Explanation, and Unification Altenberg, Austria, 11-14 July 2002. Biol Theory 1:250-260
Chen, Zehua; Schneider, Thomas D (2006) Comparative analysis of tandem T7-like promoter containing regions in enterobacterial genomes reveals a novel group of genetic islands. Nucleic Acids Res 34:1133-47
Khan, Sikandar G; Metin, Ahmet; Gozukara, Engin et al. (2004) Two essential splice lariat branchpoint sequences in one intron in a xeroderma pigmentosum DNA repair gene: mutations result in reduced XPC mRNA levels that correlate with cancer risk. Hum Mol Genet 13:343-52
Hengen, Paul N; Lyakhov, Ilya G; Stewart, Lisa E et al. (2003) Molecular flip-flops formed by overlapping Fis sites. Nucleic Acids Res 31:6663-73
Schneider, Thomas D (2002) Consensus sequence Zen. Appl Bioinformatics 1:111-9
Emmert, S; Schneider, T D; Khan, S G et al. (2001) The human XPG gene: gene architecture, alternative splicing and single nucleotide polymorphisms. Nucleic Acids Res 29:1443-52

Showing the most recent 10 out of 20 publications