Efficient XML Interchange - PowerPoint PPT Presentation

About This Presentation
Title:

Efficient XML Interchange

Description:

limited uptake of XML & friends in certain domains. performance is ... optimized codecs. type assigment through grammar. generic text coding always available ... – PowerPoint PPT presentation

Number of Views:97
Avg rating:3.0/5.0
Slides: 18
Provided by: danielvo7
Learn more at: https://lists.w3.org
Category:

less

Transcript and Presenter's Notes

Title: Efficient XML Interchange


1
Efficient XML Interchange
  • What is it?Why is it?How does it fit in?

2
What is Efficient XML Interchange?
  • Alternative Representation of XML Infoset
  • support full XML (Infoset) data model
  • not a subset
  • no really, not a subset!
  • Interchange Format
  • optimized for data exchange
  • transmission, storage, processing
  • can use Schema, conventional compression

3
Why?
  • Expand the Web
  • limited uptake of XML friends in certain
    domains
  • performance is problem
  • noteworthy domains
  • mobile, embedded, scientific,
  • Lesson From Binary XML Formats
  • real need, and real solutions
  • widely applicable, win-win
  • multiple formats cause segregation, limit adoption

4
Integration into XML Stack
  • Same Data Model
  • merely an alternative encoding
  • Open Issues
  • format, or encoding?
  • content negotiation?
  • schema knowledge vs content negotiation
  • modes, configurability (e.g. simple types)

5
WebAPI / EXI?
  • Impact on
  • APIs
  • initalisation encoding modes, schema info?
  • XMLHttpRequest
  • again modes, schema info?
  • diversity of formats?
  • Are data models in sync?
  • HTML as XML?
  • REX
  • fragment support?

6
(No Transcript)
7
Efficient XML Interchange
  • Format Basics

8
Efficient XML Interchange
  • Goal(s)
  • maintain XML (Infoset) data model
  • seamless integration into XML software stack
  • improve compaction AND processing
  • Observation
  • smallness has multiple benefits
  • e.g. energy consumption during transmission
  • allows XML deployment in new scenarios
  • Underlying Philosophy
  • exploit a-priori knowledge of (likely) content

9
How does it work?
  • Exploit Knowledge, at Several Different Levels
  • XML knowledge
  • copious syntactic redundancy
  • Schema knowledge
  • schema describes content in detail
  • heuristics
  • e.g. (declared) elements gtgt processing
    instructions
  • e.g. repeated string elements
  • e.g. small numbers gtgt large numbers
  • Cooperation with Conventional Compression
  • heavily biased data stream as compressor input

10
EXI Base Format
  • Coding Grammars
  • generic grammar describe full XML Infoset
  • arbitrary elements, PIs, comments, entity
    references, etc.
  • schema-derived grammar
  • describes a specific format
  • content-derived grammar
  • add rules depending on encountered elements
  • splice these together, at very fine granularity
  • allow anything, but know what is (currently)
    likely
  • likely content more efficient encoding

11
EXI Base FormatBuilt-in, Generic Element Grammar
12
EXI Base FormatA Schema-Based Grammar
  • Element Content Model
  • (optional) attribute color
  • (optional) element desc
  • (mandatory) elements quantity, price

13
EXI Base FormatMerged Generic Schema Derived
Grammar
SE(quantity)
SE(price)
SE(quantity)
SE(desc)
SE(), CH, ER, CM, PI
SE(), CH, ER, CM, PI
SE(), CH, ER, CM, PI
EE
EE
14
Other, Major EXI Features
  • Simple Type Values
  • optimized codecs
  • type assigment through grammar
  • generic text coding always available
  • string / value tables
  • Bit-Packed vs byte-aligned codec
  • biased input into deflate compression

15
(No Transcript)
16
(No Transcript)
17
Impact on the XML Stack
  • Questions
  • content negotiation, header
  • http integration?
  • what do you need? what would be a problem?
  • pre-shared schemas
  • which formats? samples?
  • (X)HTML? AJAX?
  • need hooks in the specification?
  • options / variables
  • different schemas, different options?
Write a Comment
User Comments (0)
About PowerShow.com