The Value of Knowing a Demand Curve: Regret Bounds for Online Posted-Price Auctions - PowerPoint PPT Presentation

About This Presentation
Title:

The Value of Knowing a Demand Curve: Regret Bounds for Online Posted-Price Auctions

Description:

( Our motivation came from ticketmaster.com.) 10. Online Posted-Price Auctions ... ( Our motivation came from ticketmaster.com. ... – PowerPoint PPT presentation

Number of Views:89
Avg rating:3.0/5.0
Slides: 53
Provided by: robertkl
Category:

less

Transcript and Presenter's Notes

Title: The Value of Knowing a Demand Curve: Regret Bounds for Online Posted-Price Auctions


1
The Value of Knowing a Demand Curve Regret
Bounds for Online Posted-Price Auctions
  • Bobby Kleinberg and Tom Leighton

2
Introduction
  • How do we measure the value of knowing the demand
    curve for a good?

3
Introduction
  • How do we measure the value of knowing the demand
    curve for a good?
  • Mathematical formulation What is the difference
    in expected revenue between
  • an informed seller who knows the demand curve,
    and
  • an uninformed seller using an adaptive pricing
    strategy
  • assuming both pursue the optimal strategy.

4
Online Posted-Price Auctions
  • 1 seller, n buyers, each wants one item.
  • Buyers interact with seller one at a time.
  • Transaction
  • Seller posts price.

5
Online Posted-Price Auctions
  • 1 seller, n buyers, each wants one item.
  • Buyers interact with seller one at a time.
  • Transaction
  • Seller posts price.
  • Buyer arrives.

6
Online Posted-Price Auctions
  • 1 seller, n buyers, each wants one item.
  • Buyers interact with seller one at a time.
  • Transaction
  • Seller posts price.
  • Buyer arrives.
  • Buyer gives
  • YES/NO
  • response.

YES
7
Online Posted-Price Auctions
  • 1 seller, n buyers, each wants one item.
  • Buyers interact with seller one at a time.
  • Transaction
  • Seller posts price.
  • Buyer arrives.
  • Buyer gives
  • YES/NO
  • response.
  • Seller may update price

YES
10
after each transaction.
8
Online Posted-Price Auctions
  • A natural transaction model for many forms of
    commerce, including web commerce. (Our
    motivation came from ticketmaster.com.)

10
9
Online Posted-Price Auctions
  • A natural transaction model for many forms of
    commerce, including web commerce. (Our
    motivation came from ticketmaster.com.)
  • Clearly strategyproof, since agents strategic
    behavior is limited to their YES/NO responses.

10
10
Informed vs. Uninformed Sellers
Uninformed
11
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.8
.8
.8
12
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.5 .8
.8
.8
13
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.8
.8
14
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.75 .8
.8
15
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.7 .75 0 .8 0
.8
16
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.7 .75 0 .8 0
.6 .8
17
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.7 .75 0 .8 0
.8 .6 .6 .8 .8
18
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .8 .8
.7 .75 0 .8 0
.8 .6 .6 .8 .8
1.1
1.6
Ex ante regret 0.5
19
Informed vs. Uninformed Sellers
Value Ask Revenue Ask Revenue
.9 .5 .5 .7 .7
.7 .75 0 .7 .7
.8 .6 .6 .7 .7
1.1
2.1
Ex post regret 1.0
20
Definition of Regret
  • Regret difference in expected revenue between
    informed and uninformed seller.
  • Ex ante regret corresponds to asking, What is
    the value of knowing the demand curve?
  • Competitive ratio was already considered by Blum,
    Kumar, et al (SODA03). They exhibited a
    (1e)-competitive pricing strategy under a mild
    hypothesis on the informed sellers revenue.

21
3 Problem Variants
  • Identical valuations All buyers have same
    threshold price v, which is unknown to seller.
  • Random valuations Buyers are independent
    samples from a fixed probability distribution
    (demand curve) which is unknown to seller.
  • Worst-case valuations Make no assumptions about
    buyers valuations, they may be chosen by an
    oblivious adversary.
  • Always assume prices are between 0 and 1.

22
Regret Bounds for the Three Cases
Valuation Model Lower Bound Upper Bound
Identical O(log log n) O(log log n)
Random O(n1/2) O((n log n)1/2)
Worst-Case O(n2/3) O(n2/3(log n)1/3)
23
Identical Valuations
Valuation Model Lower Bound Upper Bound
Identical O(log log n) O(log log n)
  • Exponentially better than binary search!!
  • Equivalent to a question considered by Karp,
    Koutsoupias, Papadimitriou, Shenker in the
    context of congestion control. (KKPS, FOCS
    2000).
  • Our lower bound settles two of their open
    questions.

24
Random Valuations
1
Demand curve D(x) Pr(accepting price x)
D(x)
0
x
1
25
Best Informed Strategy
Expected revenue at price x f(x) xD(x).
26
Best Informed Strategy
If demand curve is known, best strategy is fixed
price maximizing area of rectangle.
27
Best Informed Strategy
If demand curve is known, best strategy is fixed
price maximizing area of rectangle.
Best known uninformed strategy is based on the
multi-armed bandit problem...
28
The Multi-Armed Bandit Problem
  • You are in a casino with K slot machines. Each
    generates random payoffs by i.i.d. sampling from
    an unknown distribution.

29
The Multi-Armed Bandit Problem
  • You are in a casino with K slot machines. Each
    generates random payoffs by i.i.d. sampling from
    an unknown distribution.
  • You choose a slot machine on each step and
    observe the payoff.

0.3
0.7
0.4
0.5
0.1
0.6
0.2
0.2
0.7
0.3
0.8
0.5
0.6
0.1
0.4
30
The Multi-Armed Bandit Problem
  • You are in a casino with K slot machines. Each
    generates random payoffs by i.i.d. sampling from
    an unknown distribution.
  • You choose a slot machine on each step and
    observe the payoff.
  • Your expected payoff is compared with that of the
    best single slot machine.

0.3
0.7
0.4
0.5
0.1
0.6
0.2
0.2
0.7
0.3
0.8
0.5
0.6
0.1
0.4
31
The Multi-Armed Bandit Problem
  • Assuming best play
  • Ex ante regret ?(log n)
  • Lai-Robbins, 1986
  • Ex post regret ?(vn)
  • Auer et al, 1995
  • Ex post bound applies even if the payoffs are
    adversarial rather than random.
  • (Oblivious adversary.)

0.3
0.7
0.4
0.5
0.1
0.6
0.2
0.2
0.7
0.3
0.8
0.5
0.6
0.1
0.4
32
Application to Online Pricing
  • Our problem resembles a multi-armed bandit
    problem with a continuum of slot machines, one
    for each price in 0,1.
  • Divide 0,1 into K subintervals, treat them as a
    finite set of slot machines.

33
Application to Online Pricing
  • Our problem resembles a multi-armed bandit
    problem with a continuum of slot machines, one
    for each price in 0,1.
  • Divide 0,1 into K subintervals, treat them as a
    finite set of slot machines.
  • The existing bandit algorithms have regret O(K2
    log n K-2n), provided xD(x) is smooth and has a
    unique global max in 0,1.
  • Optimizing K yields regret O((n log n)½).

34
The Continuum-Armed Bandit
  • The continuum-armed bandit problem has algorithms
    with regret O(n¾), when exp. payoff depends
    smoothly on the action chosen.

Finite- Armed 2?0- Armed
Ex Ante ?(log n) O(n¾)
Ex Post ?(vn)
35
The Continuum-Armed Bandit
  • The continuum-armed bandit problem has algorithms
    with regret O(n¾), when exp. payoff depends
    smoothly on the action chosen.
  • But Best-known lower bound on regret was O(log
    n) coming from the finite-armed case.

Finite- Armed 2?0- Armed
Ex Ante ?(log n) O(log n) O(n¾)
Ex Post ?(vn)
36
The Continuum-Armed Bandit
  • The continuum-armed bandit problem has algorithms
    with regret O(n¾), when exp. payoff depends
    smoothly on the action chosen.
  • But Best-known lower bound on regret was O(log
    n) coming from the finite-armed case.
  • We prove O(vn).

Finite- Armed 2?0- Armed
Ex Ante ?(log n) O(vn) O(n¾)
Ex Post ?(vn)
?
37
Lower Bound Decision Tree Setup
38
Lower Bound Decision Tree Setup
0.3
½
¼
¾
?
?
?
?
39
Lower Bound Decision Tree Setup
0.2
½
¼
¾
?
?
?
?
40
Lower Bound Decision Tree Setup
0.4
½
¼
¾
?
?
?
?
41
Lower Bound Decision Tree Setup
vi ALG OPT Reg.
0.3 0 0.3 0.3
0.2 0 0 0
0.4 0.125 0.3 0.175
0.125 0.6 0.475
½
¼
¾
?
?
?
?
42
How not to prove a lower bound!
  • Natural idea Lower bound on incremental regret
    at each level

½
¼
¾
?
?
?
?
43
How not to prove a lower bound!
  • Natural idea Lower bound on incremental regret
    at each level
  • If regret is O(j-½) at level j, then total regret
    after n steps would be O(vn).

½
1
¼
¾

?
?
?
?
v?
1 v½ v? O(vn)
44
How not to prove a lower bound!
  • Natural idea Lower bound on incremental regret
    at each level
  • If regret is O(j-½) at level j, then total regret
    after n steps would be O(vn).
  • This is how lower bounds were proved for the
    finite-armed bandit problem, for example.

½
1
¼
¾

?
?
?
?
v?
1 v½ v? O(vn)
45
How not to prove a lower bound!
  • The problem If you only want to minimize
    incremental regret at level j, you can typically
    make it O(1/j).
  • Combining the lower bounds at each level gives
    only the very weak lower bound Regret O(log
    n).

½
1
¼
¾
½
?
?
?
?
?
1 ½ ? O(log n)
46
How to prove a lower bound
  • So instead a subtler approach is required.
  • Must account for the cost of experimentation.
  • We define a measure of knowledge, KD such that
    regret scales at least linearly with KD.
  • KD ?(vn) ? TOO COSTLY
  • KD o(vn) ? TOO RISKY

½
¼
¾
?
?
?
?
47
Discussion of lower bound
  • Our lower bound doesnt rely on a contrived
    demand curve. In fact, we show that it holds for
    almost every demand curve satisfying some
    generic axioms. (e.g. smoothness)

48
Discussion of lower bound
  • Our lower bound doesnt rely on a contrived
    demand curve. In fact, we show that it holds for
    almost every demand curve satisfying some
    generic axioms. (e.g. smoothness)
  • The definition of KD is quite subtle. This is
    the hard part of the proof.

49
Discussion of lower bound
  • Our lower bound doesnt rely on a contrived
    demand curve. In fact, we show that it holds for
    almost every demand curve satisfying some
    generic axioms. (e.g. smoothness)
  • The definition of KD is quite subtle. This is
    the hard part of the proof.
  • An ex post lower bound of O(vn) is easy. The
    difficulty is solely in strengthening it to an ex
    ante lower bound.

50
Open Problems
  • Close the log-factor gaps in random and
    worst-case models.

51
Open Problems
  • Close the log-factor gaps in random and
    worst-case models.
  • What if buyers have some control over the timing
    of their arrival? Can a temporally strategyproof
    mechanism have o(n) regret? Parkes

52
Open Problems
  • Close the log-factor gaps in random and
    worst-case models.
  • What if buyers have some control over the timing
    of their arrival? Can a temporally strategyproof
    mechanism have o(n) regret? Parkes
  • Investigate online posted-price combinatorial
    auctions, e.g. auctioning paths in a graph.
    Hartline
Write a Comment
User Comments (0)
About PowerShow.com