Tools for acquisition, organization and maintenance of knowledge in an environment of heterogeneous - PowerPoint PPT Presentation

Loading...

PPT – Tools for acquisition, organization and maintenance of knowledge in an environment of heterogeneous PowerPoint presentation | free to download - id: dfe10-M2JlM



Loading


The Adobe Flash plugin is needed to view this content

Get the plugin now

View by Category
About This Presentation
Title:

Tools for acquisition, organization and maintenance of knowledge in an environment of heterogeneous

Description:

Design and verify by pilot applications new ways of information ... ontological data to XML and XML is further transformed to HTML via XSL. Click. SemanticLog ... – PowerPoint PPT presentation

Number of Views:42
Avg rating:3.0/5.0
Slides: 27
Provided by: Yil2
Learn more at: http://www.infostat.sk
Category:

less

Write a Comment
User Comments (0)
Transcript and Presenter's Notes

Title: Tools for acquisition, organization and maintenance of knowledge in an environment of heterogeneous


1
Tools for acquisition, organization and
maintenance ofknowledge in an environment of
heterogeneous information resources
Principal investigator Slovak University
ofTechnology in Bratislava Istitute of
Informatics SAS Bratislava Pavol Jozef
afárikUniversity in Koice
2
Objectives
  • Design and verify by pilot applications new ways
    of information and knowledge processing in
    heterogeneous environment, in particular
    acquisition, representation, organization and
    maintenance of actual knowledge, and to develop
    tools for supporting new model of heterogeneous
    environment.

3
The research areas
  • models of heterogeneous environment (uncertainty,
    systems for modelling imperfect information,
    models of application domain, user models,
    context models, navigation models, metadata and
    ontologies, multilanguage sources, and multiagent
    systems),
  • knowledge acquisition (information
    recommendation, acquisition of user
    model/environment model, special languages for
    flexible query, and ontology creation),
  • knowledge organization (ontologies, various
    inductive methods, and small world networks), and
  • knowledge presentation (adaptive navigation,
    adaptive content presentation, and virtual and
    enriched reality).

4
The main outcome
  • design and experimental realization of the tools
    for information and knowledge processing within
    the domain of job offers
  • The tools operate with data that can be seen in
    the sequence
  • from primary data on the Internet, or given by
    user,
  • through acquired documents,
  • the documents containing relevant data with
    respect for application domain (which are, in our
    case, job offers),
  • in direction towards the offers chosen from
    documents with job opportunity offers,
  • up to effective presentation of the offers to
    user.

5
Transformation chain of data
6
Application domain
  • Job offers heterogenous environment of
    resources
  • Goals
  • to improve a process of a job offer searching
  • to increase a chance to find the job according
    users requirements
  • to enable companies to find the right applicant
    for the job position

7
An example of the job offer procesing
Offer downloading
Korporatívna pamät
Indexing
Evaluation of relevancy
Záznamy o ponukách
Index
Semantic annotation
Bratislava
Cathegorizing, indexing and searching
PHP
Presentation
Zhluk ponuky na pozíciu systémový správca
Zhluk ponuky na pozíciu programátor
Job Offer
Hladám prácu programátora v PHP. Min. plat
20000Sk/mes. Lokalita Bratislava a okolie
Job position Web Developer Job type
Full-time Salaray 120000-150000 Locality
Bratislava
URL www.jobsusa.com/it/offers?id022143 Kópia
/storage/html/jobusa_it_id_022143.html Dátum
20.05.2005
8
Intelligent management in the job offer domain
  • How to map users requirements to job offers
  • How to organize and present offers regarding to
    the users preferences
  • How to maintain offers within the continuously
    changing environment within the domain of job
    offers
  • How to design user friendly interface

9
Corporate memory as a framework for data oriented
integration
Corporate Memory
Interaction layer
XML-RPC Connector / WS Connector (SOAP) Java
Connector
Manipulation layer
Reasoning
File API
RDQL/Ontology API
SQL/ DB API
Jena/Sesame
Physical layer
File storage
RDB storage
RDF DB/ RDF-RDB Mapping
10
(No Transcript)
11
The role of tools for transformation data from web
12
General architecture of a tool
13
Methods and tools for data and offers acquisition
  • RIDAR (Relevant Internet Data Resource
    Identification) connects to existing search
    engines and identify relevant web resources
  • WebCrawler and ERID (Estimate Relevance of
    Internet Documents) recursively explore web
    resources and store
  • DocConverter transforms documents from one format
    to another format. At the moment, it transforms
    HTML documents to TXT documents for the need of
    other tools.
  • OSID (Offer Separation for Internet Documents)
    extract offers (e.g. job offers) ExPoS - from
    documents, which contain more job offers. In the
    next phase, the tools orientates also to offer
    selection that provides clean offers without
    page header, footer, menu, banners and other
    offer not related stuff.
  • Wraper extraction data and information from web
    pages
  • JOE, JOP manual ontology data

14
Tools for data analyzing and organizing
  • Clustering
  • Aspekt probability clustering using aspect
    models
  • ClusterNavigator navigation in the organization
    and companies maps and clustering of graphs
  • OSFCL cluster hierarchy of word form the domain
    using modified Rice-Siff method

15
Tools for data analyzing and organizing
  • Indexing
  • DaiRFTS indexing of text documents and
    searching within them in the frame of OGSA-DAI
  • JdbSearch indexing and querying in the set of
    documents using SQL queries and the original
    algorithm of indexing.

16
Another tools for data analyzing and organizing
  • Categorizing and searching
  • IGAP finding user preferences or new relations
    within application domain on the basis of
    monotonic classification task computing
  • topK - finding top k the best objects from the
    more sorted lists of evaluated objects
  • CriteriaSearch search for offers and data on
    the basis of the users criteria and requirements
  • Omin evaluation and sorting of elements in
    xhtml documents on the basis of the visual
    importance
  • RWM Support utility to manage related words for
    Morphonary or other tools

17
Semantic data processing
  • OnTeA
  • OntoSim, ConCom
  • OntoCase
  • NATAN
  • SQex
  • SOWA
  • RD2Onto

18
Support tools for language processing
  • NALIT- Identification of natural language on the
    basis of language profiles
  • Tvaroslovník Morphonary support of slovak
    language in the 2nd pilot application

19
Presentation tools
  • Tools Prescott and FACTIC (faceted semantic
    browser) support presentation, which transforms
    ontological data to XML and XML is further
    transformed to HTML via XSL
  • Click
  • SemanticLog

20
Semantic annotation - OnTeA
21
OnTeA
  • Nacíta sa obsah dokumentu v pôvodnej alebo
    textovej forme.
  • V texte sa hladajú regulárne výrazy a ak sú
    nájdené v korporatívnej pamäti sa vyhladá
    zodpovedajúci prvok (individual), ktorý sa
    priradí do mnoiny nájdených prvkov.
  • Ak prvok nie je nájdený, pri niektorých
    regulárných výrazok kde je definovaná aj
    trieda prvku je moné prvok vytvorit, pricom sa
    vytvorí iba jednoduchý prvok urcitej triedy
    s vlastnostou rdflabel.
  • Proces sa opakuje pre vetky regulárne výrazy
    a výsledkom je mnoina objavených prvkov.
  • Vytvorí sa prázdny prvok ponuky. Zistia sa vsetky
    moné vlastnosti ponuky.
  • Prechádza sa cez kadú vlastnost a ak sa trieda
    vlastnosti zhoduje s triedou objaveného prvku
    tento je pridany ako vlastnost ponuky.
  • Toto sa urobí pre vetky vlastnosti ponuky ako aj
    pre vetky objavené prvky z mnoiny.

22
OnTeA
23
Integration via CM
24
(No Transcript)
25
Conclusion
  • Results
  • Implemented tools integrated within 1st pilot
    application
  • Project webpage
  • http//nazou.fiit.stuba.sk/
  • Proceedings Tools for acquisition, organisation
    and presenting of information and knowledge

26
  • Thank you for your kind attention.
About PowerShow.com