Creating BioMoby Workflows In Taverna - PowerPoint PPT Presentation

1 / 85
About This Presentation
Title:

Creating BioMoby Workflows In Taverna

Description:

iCAPTURE Centre, St. Paul's Hospital. Vancouver, BC. Downloading Taverna ... Between releases, the Moby functionality may be updated and you can find ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 86
Provided by: eddie89
Category:

less

Transcript and Presenter's Notes

Title: Creating BioMoby Workflows In Taverna


1
Creating BioMoby WorkflowsIn Taverna
  • Mark Wilkinson
  • Edward (Eddie) Kawas
  • iCAPTURE Centre, St. Pauls Hospital
  • Vancouver, BC

2
Downloading Taverna
3
  • Taverna can be obtained fromhttp//taverna.sourc
    eforge.net

4
  • Once at the site, click on Download

5
  • Download the version appropriate for your
    operating system.

6
  • Between releases, the Moby functionality may be
    updated and you can find instructions on how to
    acquire those updates fromhttp//biomoby.open-bi
    o.org/index.php/moby-clients/taverna_plugin

7
Running
  • For a more comprehensive guide on Running and
    using Taverna, please refer to http//taverna.sour
    ceforge.net/usermanual/manual.html

8
  • Assuming that you have downloaded and unzipped
    Taverna, you can run it by double-clicking on
    runme.bat (windows) or executing runme.sh
    (Unix/Linux/OS X)

9
  • Tavernas splash screen

10
  • Once Taverna has loaded, you will see 3 windows
  • Advanced Model Explorer
  • Workflow Diagram
  • Available Services

11
  • The Advanced model explorer is Tavernas primary
    editor and allows you to load, save and edit any
    property of a workflow.

12
  • Workflow diagram contains a read only graphical
    representation of your workflow.

13
  • The Available services window lists all of the
    services available to a workflow designer.

14
  • Under the node Biomoby _at_ Moby services and
    Moby data types are represented.
  • The Object ontology is available as children of
    MOBY Objects node
  • Services are sorted by service provider authority

15
  • If you wish to use registries other than the
    default one, you can add a new Moby Scavenger
    by choosing to Add new Biomoby scavenger

16
  • Enter the registrys location and click okay.

17
Creating Workflows
  • I have the workflow saved and would like to offer
    it for download.

18
  • We will start by adding the Object ontology node
    Object to our workflow.

19
  • The Advanced model explorer now shows that we
    have a processor called Object
  • Object has 3 input ports id, namespace and
    article name
  • Object has 1 output port mobyData
  • The Workflow diagram illustrates our processor

20
  • We can discover services that consume our data
    type, context click on Object and choose Moby
    Object Details

21
  • A window will pop up that tells you what services
    Object feeds into and is produced by

22
  • Expanding the Feeds into node results in a list
    of service provider authorities
  • Expanding an authority, for example,
    bioinfo.icapture.ubc.ca, reveals a list of
    services

23
  • We will choose to add the service called
    MOBYSHoundGetGenBankFasta to our workflow.

24
  • A look at the state of our current workflow.

25
  • And graphically.
  • The service consumes Object, with article name
    identifier, and produces FASTA, with article name
    fasta.

26
  • To discover more services, context click on the
    service that outputs the data type that you would
    like to discover consuming services for and
    choose Moby Service Details.

27
  • The resultant window displays the services inputs
    and outputs.
  • There are also tool tips that show up when your
    mouse hovers over any particular input or output
    that tells you what namespaces the data type is
    valid in

28
  • Context clicking on an output reveals a menu with
    3 options.
  • A brief search for services that consume our
    datatype
  • A semantic search for services that consume our
    datatype
  • Adding a parser to the workflow that understands
    our datatype

29
  • The result of choosing to add a parser for FASTA
    to our workflow.
  • The parser allows us to extract
  • The namespace and id from FASTA
  • The namespace and id from the child String
  • The textual content from the child String

30
  • The result of choosing to conduct a brief search
    for services that consume FASTA

31
  • We will add the service getDragonBlastText to our
    workflow by choosing Add service - from the
    context menu

32
  • The current state of our workflow shown
    graphically.

33
  • A more complex view of our workflow

34
  • Finding services that consume NCBI_BLAST_Text
    starts by viewing the details of the service
    getDragonBlastText

35
  • Conduct a brief search

36
  • Add the service parseBlastText to our workflow

37
  • Our current workflow

38
  • Workflow inputs are added by context clicking on
    Workflow inputs in the Advanced model explorer
    and choosing Create New Input

39
  • The result from adding 2 inputs
  • Id
  • namespace

40
  • The workflow input id will be connected to
    Objects input port id

41
  • Workflow after connecting the workflow input id

42
  • The workflow input namespace will connect to
    Objects input port namespace

43
  • Workflow after connection the workflow inputs.

44
  • Workflow outputs are added by context clicking on
    Workflow outputs in the Advanced model explorer
    and choosing Create New Output

45
  • The result from adding 2 workflow outputs
  • moby_blast_ids
  • fasta_out

46
  • The output moby_blast_ids will be connected to
    parseBlastTexts output port Object(Collection
    hit_ids)

47
  • The output fasta_out will be connected to
    Parse_Moby_Data_FASTAs output port
    fasta_content

48
(No Transcript)
49
  • To run the workflow, click on Tools and Workflow
    Invocation
  • Choose Run workflow

50
  • A prompt to add values to our 2 workflow inputs

51
  • To add a value to the input id click on id from
    the left pane and choose New Input

52
  • Enter 656461 as the id

53
  • Choose namespace from the left and click on New
    Input

54
  • Enter NCBI_gi as the value for namespace
  • Once you are done, click on Run Workflow

55
  • Our workflow in action

56
  • Once the workflow is complete, we can examine the
    results of our workflow.

57
  • A detailed report is available outlining what
    happened when and in what order.

58
  • We can examine the intermediate inputs and
    output, as well as visualize our workflow.

59
  • If we choose the Graph tab, our workflow is
    illustrated.

60
  • Intermediate inputs allow us to examine what a
    service has accepted as input

61
  • Similarly, Intermediate outputs allows us to
    examine the output from any particular service.

62
  • Without the parser, FASTA is represented as a
    Moby message, fully enclosed in its wrapper.
  • Non-moby services do not expect this kind of
    message

63
  • Non-moby services expect the just the sequence
    and using the Parse_Moby_Data processor, we can
    extract just that

64
  • Moby services can interact with the other
    services in Taverna.
  • Lets add a Soaplab service.

65
  • We will choose a nucleic_restriction soaplab
    service called restrict

66
  • Choose the restrict service and add it to the
    workflow.

67
  • We will connect the output port fasta_content
    from the service Parse_Moby_Data_FASTA to the
    input port sequence_direct_data from the
    service restrict

68
  • The result of our actions so far.
  • We will need to add another workflow output to
    capture the output of restrict.

69
  • Create an output called restrict_out

70
  • Connect the output port outfile from the
    service restrict to the workflow output
    restrict_out

71
  • Once the connections have been made, run the
    workflow again using the same inputs.

72
  • The workflow on the left has some extra services
    added to it.
  • FASTA2HighestGenericSequenceObject from the
    authority bioinfo.icapture.ubc.ca
  • runRepeatMasker from the authority genome.imim.es
  • A Moby parser for the output DNASequence from
    runRepeatMasker.
  • A workflow output Masked_Sequence
  • Add them to your workflow

73
  • The service runRepeatMasker is configurable, i.e.
    it consumes Secondary parameters.
  • To edit these parameters, context click on the
    service and choose Configure Moby Service

74
  • The name of the parameter is on the left and the
    value is on the right.
  • Clicking on the Value will bring up a drop down
    menu, an input text field, or any other
    appropriate field depending on the parameter.

75
  • The parameter species contains an enumerated list
    of possibilities.
  • Select human.
  • When you have made your selection, you may close
    the window.

76
  • Lets run the workflow

77
  • We will run our workflow with a list
  • Click on id in the left pane and then click on
    New Input twice

78
  • Enter 656461 and 654321 as the ids
  • Enter NCBI_gi as the value for namespace
  • Our workflow will now run using each id with the
    single namespace

79
  • Notice how the workflow is running with
    iterations. This is happening because the Enacter
    is performing a cross-product on the input

80
  • You can still view intermediate inputs and
    outputs.
  • Using the queryIDs, you can track each invocation
    of a moby service through the whole workflow

81
  • Imagine now that you want to run the workflow
    using a FASTA sequence that you input yourself
    (without the gi identifier)
  • To do this, context click on getDragonBlastText
    and choose Moby Service Details
  • Expand the Inputs node and context click on
    FASTA(sequence)
  • Choose Add Datatype FASTA(sequence) to the
    workflow
  • A FASTA datatype will be added to the workflow
    and the appropriate links created

82
  • Notice the datatype FASTA on the left of the
    workflow
  • Since the datatype FASTA hasa String, a String
    was also added to our workflow and the
    appropriate connection was made
  • We will now have to add another workflow input
    and connect it to the String component of FASTA.

83
  • A workflow input sequence was added to the
    workflow and a connection was made from the
    workflow input to the input port value of
    String.
  • We also removed the link between
    MOBYSHoundGetGenBankFasta and getDragonBlastText
    by context clicking on the link in the Advanced
    model explorers Data links and choosing to
    remove the link
  • Now when we choose to run our workflow, we will
    also have the chance to enter a FASTA sequence

84
  • Go ahead an enter any FASTA sequence as the input
    to the workflow input sequence
  • Run the workflow

85
  • Any results can be saved by simply choosing to
    Save to disk
  • You will be prompted to enter a directory to save
    the results.
  • Each workflow output will be saved in a folder
    with the same name as a workflow output and the
    contents of the folder will be the results
  • You can also choose Excel, which produces an
    Excel worksheet with columns representing the
    workflow outputs and with rows that represent the
    actual data.
Write a Comment
User Comments (0)
About PowerShow.com