Combining Shape and Physical Models for Online Cursive Handwriting Synthesis - PowerPoint PPT Presentation

1 / 43
About This Presentation
Title:

Combining Shape and Physical Models for Online Cursive Handwriting Synthesis

Description:

Concatenate letters with their neighbors to form a cursive handwriting. can't be easily achieved ... In the concatenation process, the trajectories of letters ... – PowerPoint PPT presentation

Number of Views:127
Avg rating:3.0/5.0
Slides: 44
Provided by: hly24
Category:

less

Transcript and Presenter's Notes

Title: Combining Shape and Physical Models for Online Cursive Handwriting Synthesis


1
Combining Shape and Physical Models for Online
Cursive Handwriting Synthesis
  • Jue Wang (University of
    Washington)Chenyu Wu (Carnegie Mellon
    University)Ying-Qing Xu (Microsoft
    Research Asia)Heung-Yeung Shum (Microsoft
    Research Asia)

International Journal on Document Analysis and
Recognition (IJDAR) 2004
2
Introduction
  • Handwriting computing techniques (pen-based
    devices)
  • Handwriting recognition
  • make it possible for computers to understand the
    information involved in handwriting
  • Handwriting modulation
  • handwriting editing, error correction, script
    searching

3
Introduction
  • Handwriting Modeling Synthesis
  • Movement-simulation techniques
  • base on motor models and try to model the process
    of handwriting production
  • focus on the representation and analysis of real
    handwriting signals rather than handwriting
    synthesis

4
Introduction
  • Shape-simulation methods
  • consider the static shape of handwriting
    trajectory
  • more practical than movement-simulation tech
    when dynamic information is not available
  • straight forward approach synthesize form
    collected handwritten glyphs
  • learning-based cursive handwriting synthesis
    approach

5
Introduction
  • Successful handwriting synthesis algorithm
  • shapes of letters vs. training samples
  • connection between synthesized letters
  • A novel cursive handwriting synthesis tech
  • Combine the advantages of the shape-simulation
    and the movement-simulation methods

6
Outline
  • Sample collection and segmentation
  • Learning strategies
  • Synthesis Strategies
  • Experimental results
  • Discussion and Conclusion

7
Sample Collection
  • About 200 words
  • Each letter has appeared more than 5 times
  • These handwriting samples firstly pass through a
    low pass filter and then be re-sampled to
    produce equidistant points

8
Sample Segmentation
  • Overview
  • Segmentation-based recognition method
  • Recognition-based segmentation
  • (rely heavily on the performance of the
    recognition engine)
  • Level-building
  • simultaneously outputs the recognition and
    segmentation results
  • segmentation and recognition are merged to give
    an optimal result

9
A Two-level Framework
  • Framework of traditional handwriting segmentation
    approaches
  • Temporal handwriting sequence
  • is a low level feature that denotes the
    coordinate and velocity of the sequence at time t

10
Segmentation
  • The segmentation problem is to find the identity
    string I1,,In, with the corresponding segments
    of the sequence S1,,Sn, S1 z1,,zt1,,
    Snztn-1,, zT,that best explain the sequence

11
Segmentation
  • For the training of the writer-independent
    segmentation system
  • low-level feature-based segmentation algorithm
    works well for a small number of writers
  • A script code is calculated from handwriting data
    as the middle-level feature

12
Middle Level Feature
  • Five kinds of key points are extracted
  • points of maximum/minimum x-coordinate (X,X-)
  • points of maximum/minimum y-coordinate (Y,Y-)
  • crossing points ( )
  • Average direction of the interval sequence
    between two adjacent key points

13
Script Codes Examples
14
Middle Level Feature
  • Samples of each character are divided into
    several clusters
  • those in the same cluster have a similar
    structural topology
  • Since the length of script code might not be the
    same in all cases ? cant directly compute the
    similarity
  • The script code is modeled as a homogeneous
    Markov chain

15
Middle Level Feature
  • Given two script codes T1, T2
  • We may compute the stationary distributions ,
    and transition matrix A1, A2
  • The similarity between two script codes is
    measured as

16
Middle Level Feature
  • The position of , , A1, A2 are enforced
    symmetrically
  • balance the variance of the KL
    divergence and the difference in code length
  • If both the stationary distribution and the
    transition matrix of two script codes are matched
    well, and their code lengths are almost the same
    ? d(T1, T2) is close to 1

17
Segmentation
  • After introducing the script code as middle-level
    features, the optimization problem becomes
  • improve the accuracy of segmentation
  • dramatically reduce the computational complexity
    of level-building

18
Graph Model
19
Result
20
Outline
  • Sample collection and segmentation
  • Learning strategies
  • Synthesis Strategies
  • Experimental results
  • Discussion and Conclusion

21
Learning Strategies
  • Data alignment
  • Trajectory matching
  • Training set alignment
  • Shape models

22
Trajectory Matching
  • Segmentation and reconstruction of on-line
    handwritten scripts (1998, Pattern Recognition)

Each piece is simple arc, points can be
equidistantly sampled from it to represent the
stroke
23
Trajectory Matching
  • Landmark-point-extraction method
  • pen-down, pen-up points
  • local extrema of curvature
  • inflection points of curvature
  • A handwriting sample can be divided into as many
    as six pieces
  • The same character are mostly composed of the
    same number of pieces and they match each other
    naturally

24
Trajectory Matching
  • A handwriting sample can be represented by a
    point vector
  • s number of static pieces segmented from the
    sample
  • ni number of points extracted from the i th piece

25
Trajectory Matching
  • The following is to align different vector into a
    common coordinate frame
  • estimate an affine transform for each samplethat
    transforms the sample into the coordinate frame
  • Affine transformations translation, rotation,
    scaling

26
Training Set Alignment
  • Iterative algorithm(Learning from one example
    through shared densities on transforms (IEEE CVPR
    2000) )
  • Deformable energy based criterion is defined as

27
Training Set Alignment - Algorithm
  • Maintain an affine transform matrix Ui for each
    sample, which is set to identity initially
  • Compute the deformable energy-based criterion E
  • Repeat until convergence
  • For each one of the six unit affine matrixes14,
    Aj, j 1,,6
  • Let
  • Apply to the sample and recalculate the
    criterion E
  • If E has been reduced, accept ,
    otherwise
  • Let and apply
    again,If E has been reduce, accept ,
    otherwise revert to Ui
  • End

28
Shape Models
  • By modeling the distribution of aligned vectors,
    new examples can be generated that are similar to
    those in the training set
  • Like the Active Shape Model, principal component
    analysis is applied to the data
    (PCA)(Statistical models of appearance for
    computer vision, Draft report, 2000)

29
Shape Model
  • Formally, the covariance of the data is
    calculated as
  • Then the eigenvectors and corresponding
    eigenvalues of S are computed and sorted so
    that
  • The training set is approximated by
  • represent the
    t eigenvectors corresponding to the largest
    eigenvalues
  • b is a vt-dimensional vector given by
  • By varying the elements in b, new handwriting
    trajectory can be generated from this model
  • apply limits of to the elements bi

30
Outline
  • Sample collection and segmentation
  • Learning strategies
  • Synthesis Strategies
  • Experimental results
  • Discussion and Conclusion

31
Synthesis Strategies
  • Generate each individual letter in the word
  • Then the baselines of these letters are aligned
    and juxtaposed in a sequence
  • Concatenate letters with their neighbors to form
    a cursive handwriting
  • ?cant be easily achieved
  • To solve this problem, a delta log-normal model
    based conditional sampling algorithm is proposed

32
Individual Letter Synthesis
33
Delta Log-normal Model
  • A powerful tool in analyzing rapid human
    movements
  • With respect to handwriting generation, the
    movement of a simple stroke is controlled by
    velocity
  • The magnitude of the velocity is described
    as(Why handwriting segmentation can be
    misleading?, 13th international conference on PR,
    1996)

log-normal function (on a logarithmic scale axis)
34
Delta Log-normal Model
  • The angular velocity can be expressed as
  • The angular velocity is calculated as the
    derivative of
  • Give , the curvature along a stroke piece
    is calculated as
  • The static shape of the piece is an arc,
    characterized by

initial directionc0 constant
(arc length)
35
Delta Log-normal Model-Example
Why Handwriting Segmentation Can Be Misleading,
1996 IEEE ICPR
36
Conditional Sampling
  • First, the trajectories of synthesized
    handwriting letters are decomposed into static
    pieces
  • The first piece of a trajectory is called head
    piece, and the last piece is called the tail
    piece
  • In the concatenation process, the trajectories of
    letters will be deformed to produce a natural
    cursive handwriting,by changing the parameters
    of the head and the tail pieces from

37
Conditional Sampling
  • A deformation energy of a stroke is defined as
  • A concatenation energy between the i th letter
    and the (i1) th letter is defined as
  • By minimizing the second and the third items, the
    two letters are forced to connect with each other
    smoothly and naturally

38
Conditional Sampling
  • The concatenation energy of a whole word is
    calculated as
  • We must ensure that the deformed letters are
    consistent with models
  • The sampling energy is calculated as
  • The whole energy formulation is finally given as

39
Synthesis-Iterative Approach
  • Randomly generate a vector b(i) for each letter
    initially
  • Generate trajectories Si of letters and calculate
    an affine transform Ti for each letter (transform
    it to its desired position)
  • For each pair of adjacent letters Si, Si1,
    deform the pieces in these letters to minimize
    the concatenation energy Ec(i, i1)
  • Project the deformed shape into the model
    coordinate frame
  • Update the model parameters
  • If not converged return to step 2

40
Experimental Results
41
Discussion Conclusion
  • Performance is limited by samples used for
    training since the shape models can only generate
    novel shapes within the variation of training
    samples
  • Although some experimental results are shown, it
    is still not known how to make an objective
    evaluation on the synthesized scripts and compare
    different synthesis approaches

42
Markov chains
  • Markov chain on a space X with transitions T is a
    random process (infinite sequence of random
    variables) (x(0), x(1),x(t),) that satisfy
  • That is, the probability of being in a particular
    state at time t given the state history depends
    only on the state at time t-1
  • If the transition probabilities are fixed for all
    t, the chain is considered homogeneous

43
Stationary distribution
  • Consider the Markov chain given above
  • The stationary distribution is
Write a Comment
User Comments (0)
About PowerShow.com