BiologyDept.

 

DePauw University
 Fall 2002
 

Instructor: Fornari

BIO 290

Genomics
&
Bioinformatics

Lecture:10:30-11:20 MW in Olin 205

Instructor: Chet Fornari
offices: Olin 232 & 233
phones:
658-4781; x4775 (Biology Office)
e-mails: cfornari

click here for all registration information


Texts: A Primer of Genome Science by Greg Gibson & Spencer V. Muse. 2002. Sinauer Associates, Inc., Publishers   

Applied Molecular Genetics by Roger L. Miesfeld. 1999. Wiley-Liss publishers

Recommended Supplements: (1) Introduction to bioinformatics by T.K. Attwood & D.J. Parry-Smith. 1999. Addison Wesley Longman Publishers. (2) "Trends Guide to Bioinformatics: Database Searching, Sequence alignment, Gene finding, Functional Genomics, Protein Classification, Phylogenies." Trends Supplement, 1998. Elsevier Publishers. (3) An excellent text for learning the programming language, Perl (Practical extraction and report language), an ideal language for biological data analysis: oreilly.com -- Online Catalog Beginning Perl for Bioinformatics

What is BIO 290, "Genomics & Bioinformatics"?  

Bio 290 Genomics and Bioinformatics: An introduction to the structural properties, functional dynamics, and genetic architectures of genomes from a diverse selection of model organisms, ranging from microbes to humans. Scientists analyze these genomes by both molecular genetic techniques and bioinformatics, including proteomics. A major topic is the Human Genome Project and its implications for treating human diseases; a central theme of the course is the development and analysis of evolutionary relationships among diverse organisms, including humans. Prerequisites: BIO 120 and BIO 140; recommended: BIO 220 and CHEM 120.

All students enrolled in this course have heard my many references to "structure, function, organization, and evolution." Now add "sequence" to the start of the list, and insert "genomics" before evolution to generate: "sequence, structure, function, organization, genomics, evolution." In this course, we will sort out, discuss, and analyze the relationships and implications of the terms in this new list. Adding "sequence" and "genomics" to the standard list creates a modern, state-of-the-art, Molecular Biological focus.

One of the "Primary Directives" of modern Genetics remains the same: to determine the relationship of the genotype to the phenotype. Now with the advent of Genomics and Bioinformatics, the experimental approaches to fulfilling the Primary Directive are different from those used in the past decades of genetic research. Instead of dissecting a biological process displaying a known phenotype with a battery of mutants paired in all possible cross combinations to determine the number and types of genes responsible for the process, and eventually to isolate the genes in question by recombinant DNA technologies (to get the final genotype, i.e. the gene sequences), modern and future geneticists will start with a gene sequence of unknown function and then proceed to identify and characterize its function and roles in biological processes. In other words, modern geneticists start with the genotype and work towards elucidating the phenotype, and how that phenotype is produced by local and global gene expression interactions and patterns. The new DNA and protein technologies include "knock-out" mutants, microarrays on DNA chips, protein interaction assays, sophisticated microscopic methods to locate expressed proteins. One common experimental approach underlies all these different, biochemical and genetic techniques: Bioinformatics!

The Goal is not only to know all the gene sequences in many organisms, but also to understand all the genes' functions in all these organisms, and how all the genes interact locally to produce a phenotype, and how they interact globally to explain the similarities and differences observed in the great, wonderful diversity of life. Genomics (structural, functional, and comparative -- see below) with Bioinformatics will lead the way to answering these genotype/phenotype questions.

Pre-requisites: BIO 120 and BIO 140 half courses; recommended: one semester of chemistry, and BIO 220; or permission of instructor. click here for all course registration information.

What is "Genomics"?

Genomics is rapidly becoming a multifaceted sub-discipline of biology, and aims to understand: (1) the molecular organization and (2) information content of the entire genome (= the collection of all the genes in a gamete, or the haploid set of chromosomes) and its gene products (the  transcriptome and proteome -- see below).Genomics includes the development and application of new mapping, sequencing, and computational procedures for the (molecular) analysis of the entire genome.2 Genomics has 3 distinct subfields: (a) Structural genomics = the genetic and physical mapping, and sequencing of entire genomes, (b) Functional genomics = the analysis of gene (and non-gene) sequences, particularly all those genes that make RNA (mRNA, tRNA, rRNA, snRNA, etc.), or the transcriptome, and all those genes that make only mRNA for translation into protein, or the proteome, (c) comparative genomics = the comparison of entire genomes from different organisms to understand functional and evolutionary relationships.2 

1. An Introduction to Genetic Analysis by A.J.F. Griffiths, J.H. Miller, D.T. Suzuki, R.C. Lewontin, and W.M. Gelbart. 2000. Ch. 14, p. 436. W.H. Freeman and Co., Publishers.
2. Genetics by P.J. Russell. 2002. Ch. 9, p. 220. Benjamin Cummings, Publihsers.
see also: What is Genomics  (Genomics is operationally defined as investigations into the structure and function of very large numbers of genes undertaken in a simultaneous fashion.) and
Genomics glossaries and taxonomies

What is "Bioinformatics"?

Bioinformatics* fuses biology with mathematics (especially statistics) and computer science (algorithms and their implementations in a scripting language like Perl above) to:

  1. find genes within a genomic sequence.
  2. align sequences in databases to determine the degree of matching.
  3. predict the structure and function of gene products.
  4. describe the interactions between genes and gene products at a global level within the cell and between organisms.
  5. postulate phylogenetic relationships for sequences.

 "Bioinformatics** is the application of computational tools and techniques to the management and analysis of biological data."

*Definition (except for parenthetical phrases) taken from: Genetics by P.J. Russell. 2002. Glossary,  p. 729. Benjamin Cummings, Publihsers.
**Tisdall, J. D. (2001) Beginning Perl for Bioinformatics, p. vii. O'Reilly & Associates, Inc., Sebastopol, CA.

Description of Course Contents

The lecture portion of the course serves two purposes: 

(1) to introduce you to the basic, core concepts of Genomics as provided by both history and key experiments. How are genomes studied and characterized molecularly? What key experiments led to greater understanding of the structure to function relationships of whole genomes? Which bioinformatic methods reveal and conceptualize the raw information provided by genomics research?

(2) to provide you with a solid theoretical basis for not only methodology but also more importantly for hypothesis construction and testing by proper experimental design (i.e., the scientific method used in genomics and bioinformatics).*

Every attempt will be made to integrate major concepts to show the unity of the various sub-disciplines comprising molecular biology by applying the knowledge and techniques of genomics and bioinformatics. Every attempt will be made to collect a seemingly overwhelming amount of details into regular, concept-based patterns forming the over-arching themes and principles of modern biology. Yes, these patterns and themes exist! Reductionism will lead to Holism, especially by way of Bioinformatics, and to an increased awareness of how sets of regularly repeating themes and patterns, first observed in macromolecular sequences, combine in myriad ways to generate the wonderful, rich diversity of living organisms. 

*In other words, together we will try to get at least a glimpse of the subtle variations in relatively simple biological structures, and how these variations combine in numerous ways to contribute to a wonderful and exciting biological complexity and hence diversity. As always with all my courses, the pedagogy rests on two main themes: (1) Knowing, understanding, and analyzing major concepts, and how these concepts relate to each other, to the major concepts of all biology, and to disciplines and ideas outside the normal realm of biological science; (2) Understanding the theory and practice of the scientific method, and then how it transforms questions, observations, problems, etc., into concept and ultimately theory by way of well designed experiments coupled with informed interpretation of the experimental data.


A note about your text for this course: a constant, repeating theme throughout the entire book, in each chapter, almost like a fractal image, is the fundamental relationship of structural genomics to functional genomics to comparative genomics, which is interlaced always with Bioinformatic analyses.

Outline of Course Contents: Lecture Topics and Reading Assignments

(PGS = A Primer of Genome Science text;  AMG = Applied Molecular Genetics text

Review of the Basics I: Molecular Biology
Review of the Basics II: Recombinant DNA Technology

Chap.

Topic

Questions & Problems


AMG-1
Biochemical Basis of Applied Molecular Genetics:
p.3-7, Central Dogma; DNA structure (fig.1.3)
p. 12-17, DNA metabolizing enzymes
p. 17-22, Biochemical methods to study DNA and RNA

 

AMG-4 Characterization of Genomic DNA:
p. 83-88, Overview of Genome organization in bacteria, yeasts, & humans
p. 88-94, Genomic Mapping
p. 97-99, Cosmid Vectors, fig. 4.11
p. 99 and 101, BAC vectors, fig. 4.13
AMG-9 Contemporary Applied Molecular Genetics:
p. 244-250, Accessing Molecular Genetic Information through the Internet

 Appendix B,C,D, p.275-280;
App. G

Properties of Nucleic Acids, Properties of Amino Acids, Properties of Common Restriction Enzymes

Useful Internet Resources

I. Introduction and Overview: Genomics and Model Organisms;
Methods Survey, basic Bioinformatics.

Chap. 1

Topic

Questions & Problems

PGS-1

p. 1-4, Core Aims of Genome Science.
from PowerPoint slides: Polymorphisms & Gene mapping;
p.241-244, nature of SNP's and polymorhisms.
p. 4-12, Mapping Genomes: Genetic Maps, Physical Maps, Cytological Maps, Comparative Maps (see also AMG-4, p.88-94)
p. 21-26, Internet Resources
p. 42, Box 1.3: Managing and Distributing Genome Data
(Selected Model Organismal Genomes and Databases; refer back to these pages: 26-37, 46-55 for your Projects)

 

 

II. Structural Genomics & Bioinformatics

Chap.

Topic

Questions & Problems

PGS-2

p. 63-76, Automated DNA sequencing (includes Box. 2.1, p. 72-75)
(see also AMG, p.19-20, fig.1.13, p.21; p. 237-238, fig. 9.1, p.239)
p. 78-84, Genome Sequencing: Hierarchical
p. 84-90, Genome Sequencing: Shotgun (includes Box 2.2, 86-87)
p. 90, Sequence Verification
p. 91-101, Genome Annotation: EST sequencing, Ab initio gene dis-
covery, non-protein coding genes
p. 101-105, Structural features of Genome Sequences
p. 105-117, Functional Annotation and Gene Family Clusters; note
COG's, p. 109-111 with Orthologs and Paralogs

 

III. Functional Genomics & Bioinformatics: the Transcriptome.

Chap.

Topic

Questions & Problems

PGS-3

p. 123-126: cDNA Microarray technology
p. 137-141: oligonucleotide Microarray technology
(see also AMG, p.238, 241-244 & figs 9.3, 9.4)
p. 141-148: Microarray data mining (without the Boxes)
p. 148-152: SAGE
(see also AMG, p.240, fig 9.2)
p. 156-157: Differential Display
p. 166-170: Properties of Transcriptomes; Cancer

 

IV. Functional Genomics & Bioinformatics: the Proteome.

Chap.

Topic

Questions & Problems

PGS-4

p. 183-188 (without Box 4.1): Web and Internet Sites for comparing and identifying protein domains
p. 188-193: 2D-PAGE, fig. 4.4
p. 202-204: Yeast two-hybrid screens, fig. 4.12 (see also AMG, p. 128-130, and fig. 5.9)
p. 204-207: Structural Proteomics and defining a Protein Domain
p. 207-209: Protein Structure Determination, fig. 4.14
 

V. The Importance of SNP's: Theory and Practice.

Chap.

Topic

Questions & Problems

PGS-5

(to be included in full course)  

VI. Comparative Genomics & Bioinformatics

Chap.

Topic

Questions & Problems

PGS-6

(to be included in full course)  

Projects and Presentations


Grade categories, distributions, scaling, and Exam dates:
3 to 4 quizzes = 75% of final grade
Project and Presentation = 25% of final grade


Links to Web Sites

(A) Bioinformatics:

  1. Bioinformatics
  2. RSBS Bioinformatics Group homepage
  3. The Brutlag Bioinformatics Group
  4. Bioinformatics.org Welcome
  5. UCLA Bioinformatics
  6. Bioinformatics Technology Conference Coverage
  7. Bioinformatics.org Welcome
  8. emboss.list

(B) Model Organisms:

  1. MGI 2.7 - Mouse Genome Informatics (MGI) Main Menu

  2. ZFIN (Zebrafish Genome)

  3. WormBase-HomePage(c.elegans)

  4. BDGP Home (drosophila)

  5. FlyBase @ flybase.bio.indiana.edu

  6. GOLD (prokaryotes)

  7. The Institute for Genomic Research (Microbes)

  8. TIGR - CMR (Bacteria/Archaea)

  9. Saccharomyces Genome Database (yeast)

  10. Welcome to MIPS (yeast)

  11. Inside Genomics Reports for Drosophila Microarray

  12. BioMedNet Research Tools

(C) Genomics & Proteomics:

  1. What is Genomics

  2. Genomics-Proteomics Education

  3. Genomics glossaries and taxonomies

  4. Education and Genetics Human Genome Project 

  5. InformationThe Institute for Genomic Research

  6. NCBI HomePage

  7. EMBL - European Molecular Biology Laboratory

  8. Ensembl Genome Server

  9. Eukaryotic Animal Mammal Human

  10. Karyotyping Activity

  11. Human Cytogenetics - Chromosomes and Karyotypes

  12. Human Genome Working Draft

  13. The Human Genome Project at UC Santa Cruz

(D) Programs:

  1. bio.perl.org - Main page
  2. ActiveState - Products - Language Distributions - ActivePerl - Home
  3. Parsing Protein Domains with Perl [Nov. 16, 2001]
  4. PipMaker and MultiPipMaker
  5. Oligonucleotide Calculator
  6. OMG Home
  7. OneNet Communications Inc W3 Personal Web Server
  8. ORF finder
  9. Ian Korf

(E) Methods:

  1. QIAGEN - DNA Sequencing & Genomics WWW Links
  2. Genome Sequencing
  3. GENETIC ENGINEERING HOME PAGE
  4. LGMmicroarray links
  5. Welcome to Affymetrix
  6. DNA Microarray (Genome Chip) (by Leming Shi, PhD)
  7. BayGenomics Welcome
  8. papers on microarray data analysis
  9. Web resources on Gene Expression and DNA Microarray Technologies
  10. Listing of DNA microarray links
  11. Pat Brown's Lab Homepage
  12. An Introduction to DNA Microarray Technology
  13. TIGR Microarray Resources
  14. Microarray Home Pages
  15. DNA microarray and expression bioinformatics
  16. Genotyping Techniques
  17.  

(F) General/Miscellaneous:

  1. The Biology Project Molecular Biology

  2. GeneLetter - Current Issue

  3. ExPASy Molecular Biology Server

  4. The DNA Files

  5. GeneMap'98

  6. WU Libraries Genetics, Genomics, Molecular Biology, Computational Biology

Links Collections for Specific Tasks in Genomics & Bioinformatics:

Protein Motif and Domain Analyses

DNA Sequence and Gene Analyses


top of page