Module 1 : Basics of proteomics
Lecture 1 : Genomics to Proteomics
 
  Box for Terminologies
  • Genome: The entire sequence of an organism’s hereditary information, including both coding and non-coding regions, encoded in its DNA is known as the genome.
  • Contigs: A set of overlapping DNA fragments that are obtained from a single genetic source. These contigs are used to deduce the original DNA sequence.
  • Sequencing: DNA fragments that have been amplified using the BAC vectors are sequenced to obtain the nitrogenous bases of each fragment. These are then used to deduce the original sequence of the intact DNA by aligning the fragments having overlapping end sequences.
  • Transcriptome: The set of all RNA molecules, including mRNA, rRNA and tRNA, present in an organism is referred to as the transcriptome.
  • Proteome: The entire complement of proteins expressed by the genome of an organism under specific defined conditions is known as the proteome. Like the transcriptome, the proteome of an organism also varies with external factors and conditions.
  • Genomic DNA: The deoxyribonucleic acid polymeric sequence that acts as the store for genetic information and is essential for the synthesis of RNA and protein molecules, which are necessary for cellular functioning in all organisms.
  • Transcription: The process by which the genomic DNA is converted into a chemically related molecule, the messenger RNA or mRNA. Several enzymes and other factors are involved in this process. All regions of the DNA, coding and non-coding, get transcribed into the corresponding mRNA.
  • Pre-mRNA: The mRNA transcript that is produced from DNA as soon as transcription ends is known as the pre-mRNA. This contains both coding and non-coding sequences, is short lived and is further processed before translation.
  • BLAST: Basic Local Allignment Search Tool for Gene matching as provided online by NCBI.
  • HMM: Hidden Markov Model, which is a modified Bioinformatic based algorithm for sequence alignment.