Ab-initio Architecture PowerPoint PPT Presentation

presentation player overlay
About This Presentation
Transcript and Presenter's Notes

Title: Ab-initio Architecture


1
(No Transcript)
2
What Does Ab Initio Mean ?
  • Ab Initio is Latin for From the Beginning
  • From the beginning the software was designed to
    support a complete range of business
    applications, from simple to the most complex.
  • The graphical development environment and a
    powerful set of components allows the customers
    to get valuable results from the beginning.

3
Ab Initio Provides For
  • Distribution - a platform for applications to run
    on collections of cpus
  • Complexity - the ability for applications to run
    in parallel on any combination of single-CPU
    computers, multi-CPU computers, and networks of
    computers.

4
Typical Ab-Initio System Configuration
  • Ab Initio software consists of two main programs.
  • CogtOperating System, which your system
    administrator installs on a host UNIX or Windows
    NT Server, as well as on processing nodes. (The
    host is also referred to as the control node).
  • Graphical Development Environment (GDE), which
    you install on your PC (client node) and
    configure to communicate with the host (control
    node).

5
Abinitio Architecture
6
Applications of Ab Initio Software
  • Big Data processing.
  • Parallel execution of existing applications.
  • Parallel sort/merge processing.
  • Data transformation.

7
Applications of Ab Initio Software
  • Big Data processing.
  • Parallel execution of existing applications.
  • Parallel sort/merge processing.
  • Data transformation.

8
Parallel Processing
  • Parallel processing refers to the simultaneous
    performance of multiple operations. The
    CogtOperating System uses three kinds of parallel
    processing
  • Component-level parallelism A graph with
    multiple components running on separate data uses
    component-level parallelism.
  • Data parallelism A graph that deals with data
    divided into segments and operates on each
    segment simultaneously uses data parallelism.
  • Pipeline parallelism A graph with multiple
    components running simultaneously on the same
    data uses pipeline parallelism.

9
File Types
10
Serial Files
  • A serial file is Abinitio online training
    software's term for flat, non-parallel files.

11
Multifile
  • A multifile is a parallel file that is composed
    of individual files.
  • The individual files are partitions of the
    multifile. Each multifile contains one Control
    partition and one or more data partitions.
  • Data in Multifile
  • The data in a multifile is usually divided
    across partitions by one of these methods
  • Random or round robin partitioning
  • Partitioning based on ranges or functions
  • Replication or broadcast, in which each partition
    is an identical copy of the serial data.

12
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com