An overview of The IBM Intelligent Miner for Data - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

An overview of The IBM Intelligent Miner for Data

Description:

Discovering similar time sequences. Steps for mining data using the Data ... patterns are used to find predictable patterns of behavior over a period of time. ... – PowerPoint PPT presentation

Number of Views:219
Avg rating:3.0/5.0
Slides: 29
Provided by: sxr0
Category:

less

Transcript and Presenter's Notes

Title: An overview of The IBM Intelligent Miner for Data


1
An overview of The IBM Intelligent Miner for Data
  • By Neeraja Rudrabhatla
  • 11/04/1999

2
Mining Features supported by the Data Miner
  • Association Rules
  • Clustering - Demographic, Neural networks
  • Predicting classifications - Neural Networks,
    Decision Trees
  • Predicting values
  • Discovering sequential patterns
  • Discovering similar time sequences

3
Steps for mining data using the Data Miner
  • Creation of data
  • Analyze and prepare data for mining
  • Mine the data using one or a combination of
    mining techniques
  • Visualize mining results using advanced graphical
    techniques

4
Main Window of the Data Miner
5
Database used for mining association rules
Store ID Customer Date(yymmdd)
Transaction ItemID 001 0000007 950109
00982 122 001 0000007 950109 00982
125 001 0000007 950109 00982
133 001 0000007 950109 00982 150
001 0000003 950109 00983 153 001
0000003 950109 00983 154 001 0000003
950109 00983 162 001 0000003
950109 00983 166 001 0000005 950109
00984 147 001 0000005 950109
00984 174 001 0000005 950109 00984
191 001 0000005 950109 00984
198 001 0000008 950109 00985 147 001
0000008 950109 00985 174 001
0000008 950109 00985 182 001 0000008
950109 00985 184 001 0000006
950109 00986 174 001 0000006 950109
00986 186 001 0000006 950109
00986 187 001 0000006 950109 00986
188 001 0000002 950109 00987 109
6
Name Mapping
7
Results of mining for associations
8
Results on the automobile Database
9
Another view
10
Database used for Clustering
  • Gender Age Siblings Income Type
    Product
  • female 18.02 1 97 red
    2
  • female 13.03 6 490
    green 3
  • male 11.0 3 647
    red 4
  • female 47.5 2 3192
    green 5
  • male 11.07 5 736
    blue 6
  • female 24.0 3 22358 blue
    7
  • female 62.1 0 3936
    green 8
  • female 04.08 1 516 pink
    1
  • female 40.1 0 9478 red
    2
  • female 04.08 0 193
    pink 3
  • female 45.8 5 16984 green
    4
  • male 21.07 0 10428 blue
    5
  • male 07.02 0 960
    blue 6
  • female 42.5 0 10835 pink
    7
  • female 36.9 2 37083 green
    8
  • male 10.03 3 877
    blue 1
  • male 02.03 0 10
    blue 2
  • female 20.0 0 15432 green
    3

11
Clustering - Demographic
Max clusters 9 Accuracy 5
12
Details of Cluster 7
13
Detailed pie-chart for attribute Type
14
Detailed bar-graph of attribute Age
15
Output obtained with Clustering using Neural
Networks
16
Details of Cluster 6
17
Database used for Classification
  • Day Outlook Temperature Humidity Wind
    PlayTennis
  • D1 Sunny Hot High Weak
    No
  • D2 Sunny Hot High
    Strong No
  • D3 Overcast Hot High Weak
    Yes
  • D4 Rain Mild High Weak
    Yes
  • D5 Rain Cool Normal Weak
    Yes
  • D6 Rain Cool Normal Strong
    No
  • D7 Overcast Cool Normal Strong
    Yes
  • D8 Sunny Mild High Weak
    No
  • D9 Sunny Cool Normal Weak
    Yes
  • D10 Rain Mild Normal Weak
    Yes
  • D11 Sunny Mild Normal Strong
    Yes
  • D12 Overcast Mild High Strong
    Yes
  • D13 Overcast Hot Normal Weak
    Yes
  • D14 Rain Mild High Strong
    No

18
Classification using Decision Tree
19
A view of a leaf node of the decision tree
20
Classification using neural network
In-sample 4 Out-Sample 1 Accuracy 80 Error
10 Learning Rate 0.1 Momentum 0.9
21
Viewing the results in bar-graphs
22
Database for Value Prediction
D1 Sunny 80 High Weak
No D2 Sunny 75 High
Strong No D3 Overcast 70 High
Weak Yes D4 Rain
55 High Weak Yes D5
Rain 32 Normal Weak
Yes D6 Rain 35 Normal
Strong No D7 Overcast 40
Normal Strong Yes D8 Sunny
60 High Weak No D9
Sunny 20 Normal Weak
Yes D10 Rain 67 Normal Weak
Yes D11 Sunny 62 Normal
Strong Yes D12 Overcast 58 High
Strong Yes D13 Overcast 74
Normal Weak Yes D14 Rain
61 High Strong No
23
Results of PlayTennis
In-sample 2 Out-sample 1
24
One partition of the PlayTennis-Prediction
25
Textual Representation of a single partition
26
Sequential Patterns Mining and Time Sequence
Mining
  • Sequential patterns are used to find predictable
    patterns of behavior over a period of time.
  • (A certain behavior at a given time is likely to
    produce another behavior or a sequence of
    behaviors within a certain time-span)
  • Time sequences help find all occurrences of
    similar subsequences in a database of time
    sequences.

27
Sequences
  • Combine several objects into a single object that
    you can run
  • The benefit is that you can combine several steps
    into one step
  • If you combine several functions into a sequence,
    you need run only the sequence, which then runs
    each of the objects within it

28
Applications
  • The Intelligent Miner offerings are intended for
    use by Data Analysts and Business Technologists
    in the following areas
  • Perform database marketing
  • Streamline business and manufacturing processes
  • Detect potential cases of fraud
  • Helps in customer relationship management
Write a Comment
User Comments (0)
About PowerShow.com