Title: Overcoming the Limitations of the XML Document Type Definition: XML Schema Generator
1. Overcoming the Limitations of the XML Document Type Definition: XML Schema Generator
- CS689
- Hae-Soon Kwon
- 11/16/2000
2. Contents
- Research Problem
- Research Goal
- Literature Review
- Significance of the Study
- Research Design
- Data Analysis
- Schedule
- Facilities and Resources
- Deliverables
3. Research Problem
- Extensible Markup Language (XML) was developed by the W3C to replace conventional web languages.
- XML documents are judged on two properties: well-formedness and validity.
- If a Document Type Definition (DTD) is declared for an XML document and the document conforms to it, the document is valid.
- A DTD is the grammar that freely available parsers use to check the XML document.
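The difference between well-formedness and validity can be sketched with the JAXP parser that ships with Java. This is a minimal illustration, not part of the proposed tool: the class name `XmlChecks` and its two methods are hypothetical, and the DTD is assumed to be declared inline in the document's DOCTYPE.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.ErrorHandler;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.SAXParseException;

// Hypothetical helper class, used only for illustration.
class XmlChecks {
    /** True if the text parses as XML; the DTD is not used to validate. */
    static boolean isWellFormed(String xml) {
        return parses(xml, false);
    }

    /** True if the text is well-formed and conforms to the DTD it declares. */
    static boolean isValid(String xml) {
        return parses(xml, true);
    }

    private static boolean parses(String xml, boolean validate) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setValidating(validate); // turn DTD validation on or off
            DocumentBuilder builder = factory.newDocumentBuilder();
            // Validity problems arrive as recoverable errors, so rethrow them.
            builder.setErrorHandler(new ErrorHandler() {
                public void warning(SAXParseException e) { }
                public void error(SAXParseException e) throws SAXException { throw e; }
                public void fatalError(SAXParseException e) throws SAXException { throw e; }
            });
            builder.parse(new InputSource(new StringReader(xml)));
            return true;
        } catch (Exception e) {
            return false; // not well-formed, or not valid when validating
        }
    }
}
```

A document whose tags nest correctly but whose content breaks the declared DTD is well-formed and not valid; a document with mismatched tags fails both checks.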
4. Research Problem (continued)
- Several weaknesses of the XML DTD
- - Difficult to implement
- - Does not guarantee validation of the document
- - Not supported by a regular language grammar
- To overcome these weaknesses
- - The W3C released XML Schema (October 2000).
- - Several studies and tools generate DTDs automatically, but each has its own limitations.
- - New effort is required to provide a method for generating XML Schema.
5. Research Goal
- To build an automatic XML Schema generator
- To produce valid XML documents
- To give XML users a convenient way to create XML Schemas
6. XML Schema Compared with DTD
- XML Schema element names carry the prefix xsd (e.g. <xsd:element> </xsd:element>).
- XML Schema distinguishes between element, complexType, simpleType, and so on.
- The type of an element's value is specified more precisely (e.g. <xsd:element name="age" type="xsd:integer"/>).
- For more information, see the W3C Schema Recommendation (http://www.w3.org/TR/xmlschema).
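Because XML Schema gives element values a specific type, a generator has to choose one for each element. The sketch below is one simple way to guess a built-in type from a sample value. The class `XsdTypeGuess`, its methods, and the three-type rule (integer, decimal, otherwise string) are assumptions made for illustration and are not taken from the slides.

```java
// Hypothetical helper class, used only for illustration.
class XsdTypeGuess {
    /** Guesses a built-in XML Schema simple type for one element's text content. */
    static String inferXsdType(String text) {
        String value = text.trim();
        // Optional sign followed by digits only.
        if (value.matches("[+-]?\\d+")) {
            return "xsd:integer";
        }
        // Optional sign, digits with a decimal point, no exponent.
        if (value.matches("[+-]?(\\d+(\\.\\d*)?|\\.\\d+)")) {
            return "xsd:decimal";
        }
        // Anything else is treated as plain text.
        return "xsd:string";
    }

    /** Renders an element declaration using the guessed type. */
    static String declare(String name, String sampleValue) {
        return "<xsd:element name=\"" + name + "\" type=\""
                + inferXsdType(sampleValue) + "\"/>";
    }
}
```

A single sample value is weak evidence; a fuller implementation would presumably look at every occurrence of an element before settling on a type.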
7. Basic Technical Terms
<person>
  <name>
    <first_name>Chris</first_name>
    <last_name>Allen</last_name>
  </name>
  <age>25</age>
  <gender>male</gender>
</person>
Next: XML DTD
8. Basic Technical Terms
<!ELEMENT person (name, age, gender)>
<!ELEMENT name (first_name, last_name)>
<!ELEMENT first_name (#PCDATA)>
<!ELEMENT last_name (#PCDATA)>
<!ELEMENT age (#PCDATA)>
<!ELEMENT gender (#PCDATA)>
Next: XML Schema
9. Basic Technical Terms
<xsd:schema xmlns:xsd="http://www.w3.org/1999/XMLSchema">
  <xsd:element name="person" type="person"/>
  <xsd:complexType name="person">
    <xsd:element name="name" type="name"/>
    <xsd:complexType name="name">
      <xsd:element name="first_name" type="xsd:string"/>
      <xsd:element name="last_name" type="xsd:string"/>
    </xsd:complexType>
    <xsd:element name="age" type="xsd:integer"/>
    <xsd:element name="gender" type="xsd:string"/>
  </xsd:complexType>
</xsd:schema>
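To show what a schema for the person document buys in practice, the sketch below validates an instance with the standard `javax.xml.validation` API. Two adaptations are assumptions on my part: the schema on this slide uses a 1999 draft namespace and places element declarations directly inside complexType, whereas the Java validator expects the final W3C syntax, so the sketch uses the `http://www.w3.org/2001/XMLSchema` namespace, wraps children in `xsd:sequence`, and declares the `name` type at the top level. The class name `PersonSchemaCheck` is hypothetical.

```java
import java.io.StringReader;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;
import javax.xml.validation.Validator;

// Hypothetical helper class, used only for illustration.
class PersonSchemaCheck {
    // The slide's schema, adapted to the final XML Schema syntax (assumption).
    static final String SCHEMA =
        "<xsd:schema xmlns:xsd=\"http://www.w3.org/2001/XMLSchema\">"
      + "<xsd:element name=\"person\" type=\"person\"/>"
      + "<xsd:complexType name=\"person\"><xsd:sequence>"
      + "<xsd:element name=\"name\" type=\"name\"/>"
      + "<xsd:element name=\"age\" type=\"xsd:integer\"/>"
      + "<xsd:element name=\"gender\" type=\"xsd:string\"/>"
      + "</xsd:sequence></xsd:complexType>"
      + "<xsd:complexType name=\"name\"><xsd:sequence>"
      + "<xsd:element name=\"first_name\" type=\"xsd:string\"/>"
      + "<xsd:element name=\"last_name\" type=\"xsd:string\"/>"
      + "</xsd:sequence></xsd:complexType>"
      + "</xsd:schema>";

    /** True if the XML text conforms to SCHEMA; false on any parse or validation error. */
    static boolean conforms(String xml) {
        try {
            SchemaFactory factory =
                SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
            Schema schema = factory.newSchema(new StreamSource(new StringReader(SCHEMA)));
            Validator validator = schema.newValidator();
            // With no error handler set, validate() throws on the first error.
            validator.validate(new StreamSource(new StringReader(xml)));
            return true;
        } catch (Exception e) {
            return false;
        }
    }
}
```

Unlike the DTD, which treats age as (#PCDATA), this schema rejects a document whose age element is not an integer.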
10. Literature Review
- DTD-Miner (Chuang-Hue Moh et al., 2000)
- - Automatic DTD generator
- Using Regular Tree Automata as XML Schema (Boris Chidlovskii, 2000)
- - Gives a method to convert tree automata to DTDs
- DTD Creation from URL or Simple Text (Keith E. Shafer, 1995)
- - Gives methods to communicate with clients and to handle errors
- Using a Custom DTD (Web Design Group, 1999)
11. Significance of the Study
- Current trends call for automatic generation of XML Schemas, and this study may become a cornerstone in this field.
12. Research Design: Framework
[Diagram: system architecture. Components: browser for user, web interface, interface module, and a schema generator module made up of the pre-processor module, document tree extraction module, union tree construction module, and schema construction module. Data passed between components: XML document, set of tags, schema file. Branch conditions: max repetition greater than 1; max repetition = 1.]
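The pipeline named in the framework (document tree extraction, union tree construction, schema construction) can be sketched in a few dozen lines. This is a rough illustration under stated assumptions, not the proposed algorithm: the class `SchemaGeneratorSketch` and its members are hypothetical, every leaf is typed `xsd:string`, the output uses the final 2001 XML Schema syntax, and the "max repetition" branch in the diagram is read here as deciding whether an element gets `maxOccurs="unbounded"`.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;

// Hypothetical sketch of the generator pipeline, used only for illustration.
class SchemaGeneratorSketch {
    /** One union-tree node: all same-named elements under the same parent, merged. */
    static class UnionNode {
        final String name;
        final Map<String, UnionNode> children = new LinkedHashMap<>();
        int maxRepetition = 1; // most occurrences seen under a single parent element
        UnionNode(String name) { this.name = name; }
    }

    /** Document tree extraction: parse the XML text and return its root element. */
    static Element extractTree(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml))).getDocumentElement();
        } catch (Exception e) {
            throw new IllegalArgumentException("input is not well-formed XML", e);
        }
    }

    /** Union tree construction: fold an element and its descendants into the union node. */
    static void merge(Element element, UnionNode union) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Node n = element.getFirstChild(); n != null; n = n.getNextSibling()) {
            if (n.getNodeType() != Node.ELEMENT_NODE) continue; // skip text and comments
            Element child = (Element) n;
            String name = child.getTagName();
            UnionNode childUnion = union.children.computeIfAbsent(name, UnionNode::new);
            int count = counts.merge(name, 1, Integer::sum);
            childUnion.maxRepetition = Math.max(childUnion.maxRepetition, count);
            merge(child, childUnion);
        }
    }

    /** Schema construction: emit one element declaration per union-tree node. */
    static void emit(UnionNode node, String indent, StringBuilder out) {
        String occurs = node.maxRepetition > 1 ? " maxOccurs=\"unbounded\"" : "";
        if (node.children.isEmpty()) {
            out.append(indent).append("<xsd:element name=\"").append(node.name)
               .append("\" type=\"xsd:string\"").append(occurs).append("/>\n");
            return;
        }
        out.append(indent).append("<xsd:element name=\"").append(node.name)
           .append("\"").append(occurs).append(">\n");
        out.append(indent).append("  <xsd:complexType><xsd:sequence>\n");
        for (UnionNode child : node.children.values()) {
            emit(child, indent + "    ", out);
        }
        out.append(indent).append("  </xsd:sequence></xsd:complexType>\n");
        out.append(indent).append("</xsd:element>\n");
    }

    /** Runs the three steps and returns the schema text. */
    static String generate(String xml) {
        Element root = extractTree(xml);
        UnionNode union = new UnionNode(root.getTagName());
        merge(root, union);
        StringBuilder out = new StringBuilder();
        out.append("<xsd:schema xmlns:xsd=\"http://www.w3.org/2001/XMLSchema\">\n");
        emit(union, "  ", out);
        out.append("</xsd:schema>\n");
        return out.toString();
    }
}
```

The sketch leaves out the pre-processor and the web interface, and it does not handle optional children, attributes, mixed content, or siblings that appear in different orders; those are the cases a real union-tree algorithm would need to address.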
13. Research Design: Implementation
- To communicate with users
- - HTML and CGI script
- To generate the trees and the Schema
- - Java
14. Data Analysis
- The Schema generator will be tested to measure how correctly it works.
- XML documents supplied both by URL and as text will be used as input.
- Each line produced by the generator will be compared by hand with the original Schema.
- Every implementation step will be tested, and the results fed back into development.
15. Schedule
[Chart: timeline with markers at Jan, Mar, May, July, and Oct, covering the tasks below.]
- Algorithm development
- Web-related programming
- Schema generator module implementation
- Test and feedback
16. Facilities and Resources
- Tree automata algorithm
- DTD-Miner
- Fred, an automatic DTD creation tool
- DTD/Schema Editors/Tools
- Miscellaneous
17. Deliverables
- Automatic XML Schema generator
- Its algorithm
18. Thank You
Questions?
Happy Thanksgiving