DNA SEQUENCING AND COMPARISON - PowerPoint PPT Presentation

1 / 14
About This Presentation
Title:

DNA SEQUENCING AND COMPARISON

Description:

Represents one of the hardest problems that a computer can solve: NP ... 2) NCBI (National center for Biotechnology information) http://www.ncbi.nlm.nih.gov ... – PowerPoint PPT presentation

Number of Views:26
Avg rating:3.0/5.0
Slides: 15
Provided by: keithho9
Category:

less

Transcript and Presenter's Notes

Title: DNA SEQUENCING AND COMPARISON


1
DNA SEQUENCING AND COMPARISON
  • CS 731
  • Professor Heng Ji
  • Presented by Keith Hockaday

2
INTRODUCTION
  • THERE ARE TWO TYPES OF CELLS
  • Prokaryotic
  • single celled organism
  • Eukaryotic
  • multi-cellular organism
  • My Presentation will focus on the eukaryotic cell

3
DNA/RNA
  • They are nucleic acids found in the nucleus
  • of a cell
  • They have double helix structure

4
GENE EXPRESSION
  • The sequence of A, C, T, and Gs are converted
    into a
  • sequence of amino acids
  • Isolating a gene from a DNA molecule
  • Transcription (mRNA)
  • Translation (3D structure protein)

5
EXTRACTING DNA
  • http//learn.genetics.utah.edu/content/labs/extrac
    tion/
  • The process will yield chromosomes

6
GENOME
  • What is a genome
  • The human genome project and BIOINFORMATICS

7
COMPARING DNA SEQUENCES
Represents one of the hardest problems that a
computer can solve NP Hard exponential The
Algorithm LCS problem Two
sequences 10 characters long
require 1,048,576 comparisons To see animation
of the comparisons http//69.123.166.15782/DNA_S
EQUENCER.php
8
WEB TOOLS
  • 1) PROTOGENE
  • Web Server http//www.tcoffee.o
    rg/
  • It accepts a protein sequence
    and returns the
  • associated nucleotide
    sequence.

9
WEB TOOLS
  • 2) NCBI (National center for Biotechnology
    information)
  • http//www.ncbi.nlm.nih.g
    ov/
  • software and tools are
    downloadable
  • 3) BLASTP (Basic Local Assignment Search
    Tool-Protein)
  • A web tool hosted by NCBI

10
A SIMPLE EXAMPLE
  • Search NCBI for insulin protein sequence
  • I received this hit
  • mapwmhlltvallalwgpnsvqayssqhlcgsnlvealymtcgrsgf
    yr phdrreledqveqaelgleagglqpsalemilqkrgivdqccnnict
    fn qlqnycnvp
  • Enter the sequence in PROTOGENE
  • The results are listed below in the text
    area.

11
AN EXAMPLE USING BLASTP
  • BLASTP can be accessed at this location
  • http//blast.ncbi.nlm.n
    ih.gov/Blast.cgi

12
(No Transcript)
13
RESULTS
  • Each line corresponds to a bar on the previous
    slide

LOCUS P12497 1435 aa linear VRL 14-OCT-2008
DEFINITION ACCESSION P12497 VERSION
P12497.3 GI77416886 DBSOURCE swissprot locus
POL_HV1N5, accession P12497 created Oct 1,
1989. sequence
updated Jan 23, 2007. ORGANISM Human
immunodeficiency virus type 1 (NEW YORK-5
ISOLATE)
14
Perl Application
  • Main.pl is program that automates the entire
  • procedure.
  • My plan is to convert the Perl Application
  • into in architecture consisting of
  • PHP
  • MySql
  • Apache
  • To be hosted at
  • http//69.123.166.15782/DNA_SEQUENCER.php
Write a Comment
User Comments (0)
About PowerShow.com