MIT LIBRARIES DUPL
14
i-i
B
u
JIn
e>i.."M
IT
MIT
Sloan School
of
Management
Sloan
Working
Paper
4181-01
eBusiness@MIT
Working
Paper
102
October
2001
ANALYZING
THE
ECONOMIC
EFFICIENCY
OF
EBAY-LIKE
ONLINE
REPUTATION
REPORTING
MECHANISMS
Chrysanthos
Dellarocas
This
paper
isavailable
through
theCenter
foreBusiness(aJMIT
web
siteat thefollowing
URL:
http://ebusiness.mit.edu/research/papers.html
This
paper
alsocan be
downloaded
without
charge
from
theSocial
Science
Research
Network
Electronic
Paper
Collection:
http://papers.ssm.coin/abstract=289968
Analyzing
theeconomic
efficiencyofeBay-iikeonlinereputationreportingmechanisms^
ChrysanthosDellarocas
SloanSchool of
Management
MassachusettsInstituteofTechnology Cambridge,
MA
02139,USA
dellfemitedu
Abstract
This paperintroducesa
model
foranalyzing marketplaces,suchaseBay,which
relyonbinary reputationmechanisms
forqualitysignalingandqualitycontrol. Inourmodel
sellerskeeptheir actualqualityprivateand choose
what
quality to advertise.The
reputationmechanism
isprimarilyusedtoinducesellerstoadvertisetruthfully.Buyers basetheirratings
on
thedifferencebetween
expectedandactual quality.Furthennore.ratersare lenientand
do
not post negativeratingsunless transactionsend upexceptionally bad. ItISshown
that, insucha setting, the fairnessofthemarketoutcome
isdeterminedby
the relationshipbetweenratingleniencyand correspondingstrictness
when
assessinga seller's feedbackprofile. Ifbuyers judgesellerstoostrictly(relative tohow
lenientlytheyrate)then,atsteadystate,sellerswillbeforcedtounderstatetheirtrue quality.
On
theother hand,ifbuyersjudgetoolenientlythensellerscangetaway
withconsistently overstatingtheirtrue quality.
An
optimaljudgment
rule,which
resultsinoutcomes
where,atsteadystate,buyersaccurately predictthe truequalityofsellers,istheoreticallypossibleto
denve
forallleniencylevels.Furthennore,ifbuyersjudgesellersusing that rule,thenthe
more
lenientbuyersarewhen
ratingsellers,the
more
likelyitisthat sellers will finditoptimal tosettledown
tosteady-state qualitylevels,asopposedto oscillatingbetween
good
qualityand badquality.However,
it isarguedthatthisoptimalruledepends onseveral parameters,
which
are difficult toestmiatefromtheinformationthateBay
currently
makes
availabletoitsmembers.
Itistherefore questionabletowhatextent unsophisticated buyersarecurrently using
eBay
feedbackinfonnation inan optimalway.'Thisis
work
inprogress. Suggestions andcomments
verywelcome.
Please addresscorrespondenceto1. Introduction
Online reputation reportingsystemsareemergingasan important quality signalingandqualitycontrol
mechanism
inonline tradingcommunities (Kollock 1999;Resnicket. al.2000
). Reputationsystemscollect feedback frommembers
of anonlinecommunity
regarding past transactionswithothermembers
ofthatcommunity
Feedbackisanalyzed,aggregatedandmade
publicly availabletothecommunity
in theform ofmember
feedbackprofiles. Ifoneacceptsthatpastbehaviorisa relatively reliablepredictoroffuturebehavior,then theseprofilescanactas apowerfulqualitysignalingandqualitycontrol
mechanism,
essentiallyactingas the digitalequivalentofa
member's
reputation.eBay
reliesonitsreputationmechanism
almostexclusivelyinordertobothproducetrust and inducegood
behavioronthe partofits
members.
eBay
buyersandsellersareencouragedto rateone anotherattheend of eachtransaction.A
ratingcanbeadesignationof"praise","complaint"or "neutral", togetherwithashort text
comment.
eBay makes
thesums
ofpraise,complaintandneutral ratingssubmittedforeachmember,
aswellasallindividualcomments,
publicly availabletoall itsusers.Anecdotalandempiricalresults
seem
todemonstratethateBay'sreputationsystemhasmanaged
toprovideremarkablestabilityinanotherwiseveryrisky tradingenvironment
(Dewan
andHsu
2001; Resnick and Zeckhauser2001).The
risingpractical importanceofonline reputationsystemsnotonlyinvitesbut rather necessitatesrigorous research
on
theirfunctioningand consequences.Are
suchmechanisms
trulyreliable?Do
theypromoteefficientmarketoutcomes']'
To
whatextent are theymanipulablebystrategicbuyersand sellers?What
isthe bestway
todesignthem?
How
shouldbuyers (andsellers)usethe informationprovidedby
such
mechanisms
intheirdecision-makingprocess?Thisisjustasmall subsetofunanswered
questions,which
inviteexcitingandvaluable research.The
studyofreputation asamechanism
forinducinggood
behaviorinmarkets with asymmetricinfonnationiscertainlynotnew.Severaleconomistshave publishedimportant
works
analyzingitsproperties(Rogerson, 1983; Schmalensee, 1978; Shapiro, 1982;
Smallwood
andConlisk, 1979; Wilson,1985,just to
name
afew).Nevertheless,althoughpast
work
ineconomicshas studiedsome
oftheoveralleffectsofreputation,ithaspaid verylittleattentiontotheanalysisofspecific
mechanisms
forformingandcommunicating
reputation,inpartbecauseintraditionalbrickand mortarsocietiessuch
mechanisms
arelargely informal (theyareoften referred toas"word-of-mouthadvertising")and defydetailedmodeling.
The
few publishedresultsfocusingonthe effectsofspecificpropertiesofreputation
mechanisms
clearlymake
thepointthatsuchpropertiescan havesignificant effectsonthemarket outcome. For example,
Rogerson
(1983)shows
thatreputationbasedonsubjective binaryratings(e.g.good/bad,praise/complaint) createsanexternality,
which
affectsthe entiremarket. Shapiro (1982)
shows
that,unless themechanism by which
reputationisformedsatisfiescertainproperties, sellers
may
finditoptimaltocontinuouslyoscillatein quality,periodicallybuilding
good
reputationand subsequently milkingit.On
theotherhand, the design and implementation ofonline reputationsystemshas sofarbeentheresearchdomain
ofcomputer
scientists(seeBreseeet. al., 1998;Sarwaret.al.,2000; Schaferet. al.2001 foroverviews ofpast work).
The
emphasis ofpastwork
inthearea hasbeen on developingalgorithmsand systemsforcollecting,aggregatingandextracting useful infonnation fromsetsofuserratings,drawing fromwork
in infonnationretrieval,datamining andcollaborative filtering.The
analysisand evaluationoftheproposedalgorithmsistypically
done
in termsof computational complexity andstatisticalmetrics,suchas theirrunningtime,
memory
requirements,averagerecall andprecision,averagebias,etc.We
believethatthereisaneedforwork
thatbridgesthetwo
disciplines:research,which
takesintoaccountthealgorithmicdetailsofspecificreputationsystemsbut alsomodels
how
thesesystemsareembedded
inside tradingcommunities andinvestigatestheireffectivenessandimpact, notonlyintermsof
computationalandstatisticalproperties,butrather intermsoftheir overall impactintheefficiencyofthe
market andthewelfareofthevariousclassesofmarketparticipants.
Given
thatreputation systems were conceivedinorderto assistchoiceinenvironmentsofimperfect information,theirimpact inthoselattermarket dimensions should betheultimatedeterminantofsuccessof any
new
proposednew
algorithmandsystem.
This papercontributesin thisdirectionby proposinga
model
foranalyzingtheeconomic
efficiencyofbinary reputation systems, suchas theone used
by
eBay.Section2 introducesthe
model
anditsunderlying assumptions.We
assume
thatbuyersatisfactiononeBay
isafunctionofthedifferencebetweentheadvertisedand truequalityofanitem. Insucha setting, the
reputation
mechanism
isprimarilyusedtoinducesellers toadvertisetruthfully.Section 3 formalizesthismiuition
mto
anumber
ofpropertiesthateBay-like reputationmechanisms
shouldsatisfy, inordertobeconsidered wellfiinclioning.
Section4appliesour
model
inordertodetemiine underwhat
circumstances suchmechanisms
canindeedbewell functioning.Itis
shown
that the fairnessofthemarketoutcome
isdeterminedbytherelationshipbetweenratingleniencyand
judgment
strictnesswhen
assessinga seller'sfeedbackprofile.An
optimaljudgment
rule,which
resultsinoutcomes
where,atsteadystate,buyersaccurately predictthe truequalityofsellers, istheoreticallypossibletoderiveforallleniency levels.
A
rathersurprisingconclusionof ouranalysisisthat, ifbuyersusethisoptimal
judgment
rule,thenthemore
lenientbuyersarewhen
ratingsellers,the
more
likelyit isthat sellers will finditoptimaltosettledown
tosteady-statequality levels,asopposedto oscillating
between
good
qualityand badquality. In thatsense,raterleniencybenefitsthe overallstabilityofthesystem.However,
it isarguedthatthisoptimalruledependson
severalparameters,which
are difficult toestimate fromthefeedbackinformationthateBay
currentlymakes
availabletoitsmembers
Itistherefore questionabletowhatextentunsophisticatedbuyersarecurrently usingeBay
feedback infomiationin anoptimalway.
Section5considers the implicationsofrelaxing
some
ofthesimplifyingassumptions onwhich
our analysisisbased.Finally, Section6
summarizes
thecontributionsandconclusionsofthepaper.2.
A
model
of reputation-mediated marketplaces with binaryfeedback
This section introducesa
model
foranalyzing marketplaces,which
relyexclusivelyon
abinary reputationmechanism
forqualitysignalingandqualitycontrol.A
binaryreputationmechanism
isamechanism where
ratersaregiventheopportunityto ratepasttransactions usingone of
two
values,commonly
interpretedas"positive"(i.e.satisfactory)and"negative"(i.e.unsatisfactory,problematic).
Our
intentionistousethismodel
inordertostudy theeconomic
impact ofreputationmechanisms
similartotheone usedby eBay
(seeResnick and Zeckhauser 2001 for adetailed description) .
Inour model,qualitiesarenon-negativereal-valuedquantities,
which
subsume
aspectsof both productqualityandservicequality.
We
assume
thateachsellerproducesitems,whose
real quality q^ isunknown
tobuyersand can only be determined with accuracyafterconsumption.
We
furtherassume
thatallbuyersprefer higher qualitytolowerquality,althoughtheymightdifferintheextentto
which
theyarepreparedtopay foran extraunitofquality. Finally,
we
assume
thatalthoughthe realqualityofitemsisnotcommunicated
tobuyers,sellersdo inform buyers by advertising.On
eBay,advertisingcorrespondstotheseller-supplied description,
which
accompaniesallitems.The
advertised quality q^ of anitemiscompletelycontrolled
by
the seller(i.e.thereisno
validationofany kind bya thirdparty)andmay
ormay
notcorrespondto itsreal quality.
'
Inadditionto"positive" (praise)and"negative" (complaint)ratings,eBay'sreputation
mechanism
alsosupports "neutral"ratings(which,however,arerarelyusedinactual practice).
As
willbecome
apparentbelow, our
model subsumes
raterswho
would
submit"neutral"ratingsoneBay
intothesetofraterswho
Sellersaimsto
maximize
thepresentvalueoftheirpayofffunction ;r(x,^ ,q )= G{x,q
,q)-c(x,q
)where
.visthevolume
ofsales, C(.)isthegrossrevenuefunctionandc(.)isthe costftinction.We
assume
that dc/dq^
>0
imdd/r/dq^ >
forallsellers.Under
theabove assumptions,sellershave anincentivetoover-advertisequality.The
marketwould
thendegeneratetoa"marketforlemons" (Akerlof1970). Inordertoavoidthis from happening, buyersare
giventheoptionto rateeachtransactionusinga"positive" or "negative"rating.
A
reputation system, operatedby
atrustworthythird party,accumulatesallratings into afeedbackprofileR
=
(Z ,1. ,Z )foreachseller,where
Z
isthesum
ofallpositiveratingsreceivedfor that seller+ - norating +^ * '-'
duringthemostrecenttime
window,
X
isthesum
ofallnegativeratingsreceivedduringthesame
periodand
Z
isthenumber
oftransactionsforwhich no
ratingwas
submitted".Time
windowing
isusedinnorating ^ ^
ordertoaddress thepossibility that sellers
may
improveor deterioratetheirbehavior overtime.For example,on
eBay, feedbackprofilesdisplay thesums
ofratingsreceivedduringthe past6months
only.Buyer
utility from purchase ofa singleitemismodeled by [/=
^
9-
p
,where
/;istheprice,^
isabuyer'squalitysensitivityand qisthelevelofqualityperceived
by
thebuyerafterconsumption.When
consideringapurchase,buyers
combine
all theinfomiationthat isavailabletothem, i.e.anitem's advertised qualityandaseller'sfeedbackprofile,inordertofomiasubjectiveassessmentof anitem'sestimatedqualityq ,where;
9,
=/(?„. R)
(1)Armed
withknowledge
ofpricesandestimatedqualities,buyersproceedtopurchaseone ofthe available items,presumablytheonewhich maximizes
theirexpectedutilityU
=
6
q-
p
. Followingapurchase,buyers observetheitem'sperceivedquality q
=
q+
£,where
f
isanormallydistributed errorterm withstandard deviation
a
.The
introductionof anerrortermisintendedtocollectivelymodel
anumber
ofphenomena, which
occurinactual practice.For example:• buyers
may
misinterpretaseller'sadvertised quality(thisshouldbemodeled
as ^=q
+
e,however,ouranalysisisidentical if
we
addthe errortermto q instead)• sellers
may
exhibit small variationsinactual quality from onetransactiontoanother• buyers
may
havesmall differencesin theirperceptionofqualitybased,say, ontheirmoods
thatday•
some
aspectsofperceived qualitydepend on
factorsbeyond
aseller'scontrol(e.g.post-office delays)Finally,buyers decide whetherto rale atransactionaswellas
what
rating to give.Our model
assumesthatratingsare a functionofabuyer's satisfaction relative toher expectations.
We
defineabuyer'ssatisfactionfromagiventransactiontobethedifferencebetweenperceivedand expectedutility. Thatis,
S
=U-U=9(q
-q
+£)
.Under
theabove assumptions,5
isanormallydistributedrandom
variablewith
mean
(q-
q,) andstandard deviation6a.
One
interestingpropertyof eBay,which
hasbeenreportedon
severalempiricalstudies,isthatmost buyersgiveveryfewnegativeratings tosellers.Resnickand Zeckhauser (2001)reportthat lessthan
0.5%
of buyersand 1.2%
ofsellerspostneutral or negativefeedback abouttheirtradingpartners (see Figure 1).They
tentativelyconcludethat raterseither rategenerouslyor prefertorefrainfromratingatallafterbadexperiences.
WF2:
Assuming
WFl
holds,underallsteady-statesellerstrategies{q^.qj
the qualityofsellersasestimated
by
buyersbefore transactions takeplace,isequaltotheirtruequality(i.e.q^=
q^). Beforewe
proceed,letusjustifytheabovedefinitionby
providinga brief rationaleforthedesirabilityofproperties
WFl
andWF2.
First,thevalueofreputation
mechanisms
ingeneralrelies ontheassumptionthatpastbehaviorisa reliablepredictoroffuturebehavior (Wilson 1985). If oscillations
were
optimal, the predictive valueof cumulativefunctionsofpastratings,suchas
1,1
,w ouldbegreatly diminished.Inenvironmentswhere
theprimary(oronly)
mechanism
for certifyingandcontrollingsellerqualityisbasedonreputation,it is,therefore,desirablethat sellers find itoptimal tosettle
down
toasteady-statebehaviorratherthanto oscillate.Second,
accordmg
toour model,buyersmake
purchasedecisionsbased onknowledge
ofpricesandestimatedqualities. Ifit
were
possiblefor sellers tosettledown
intoasteady-state strategythatwould
consistentlydeceivebuyersintoestimating q > q ,thenthis
would
allowsellerstoearn additionalprofitsattheexpense ofbuyers.Inthepresenceofcompetitive marketplaces,buyers
would
theneventually leavethemarketplaceinfavorofothermarketswithbetterinformation.
On
theotherhand,if,underallpossible steady-stateseller strategies the effectofthereputationmechanism was
such,sothatbuyersestimatedq
<q
,thentheoppositeeffectwould
takeplace:buyerswould
realizeextra surplusattheexpense ofsellers.
Once
again,we
would
thenexpectthat sellerswould
desert themarketplace in favorofother,more
transparentmarkets.
The
onlyfairsteady-statestrategy, therefore,isonewhere
q^-
q^.
4.
Can
binary reputationmechanisms
be
wellfunctioning?Thissectionwilldemonstratethat,givena rating function,
which
hasthegeneralform givenby
(2),whether aneBay-like binary reputationsystemsatisfiesproperty
WF2
depends onthe relationshipbetween(2)andthe quality estimation function <?
= /(^
,R). Furthermore,we
willshow
that, ifbuyersare lenientenough
when
theyrateand correspondinglystrictwhen
theyjudgeseller profiles, sellers will finditoptimaltosettle
down
tosteady-statetrueandadvertised qualitylevelsifsuch anequilibriumexistsunderperfect information.4.1 Estimatedvs.real qualities insteadystate
Let us firstfocusourattention
on
thecircumstances underwhich
abinary reputationmechanism
satisfiescondition
WF2.
We
areassumingthatWFl
holds.Therefore, thereexistsatleastonesteady-statestrategy iq ,q ) foreachseller''.A
steady-state strategyisastrategythatoptimizesa seller'spayofffunction,whileatthe
same
timeresultinginanestimated quality q^= f{q^,R)
,which
isstableovertime.Denote
q
=
q+ ^
,where ^
isthedeceptionfactor,thatis,thedistortionbetween
estimatedandrealqualityatsteadystate.If.^
>
thenbuyersoverestimatea seller's true quality,whereasif.,<
thenbuyers underestimatetrue quality. LetA'bethetotalnumber
ofsalestransactionsofagivenseller inthemostrecenttime
window.
ItiseasytoseethatA'=
Z
+
1
+
S
.Assuming
thatbuyersrateaccordingto •'-t- - noruling ^
(2),forlarge
N
atsteadystatethefollowingwillhold:I
=N
?Tob[S>Q]
=
N^[(q^
-<? )/o-]=
/V0[-^/cr]
Z
=
N
ProliS<-X\
=
N-
0[(9^-q^)la-Xlieo)]
=
N-^{^lo-Xl(e-
a)]where
<t>(.) isthestandardnormalCDF.
''
Section 4.2willexploretheconditionsunder
which
sellers willindeedfinditoptimalto settledown
toaGiven
that q= f(q
,R),satisfactionofconditionWF2
dependson
thequalityassessment function/More
specifically,/must
bechosensothat,forallsteady-statestrategies(q^,qj
theequation:q=q^+^
= f(q^,R(4))
(4)hasauniquesolutionat
^ =
.eBay
doesnotspecify,orevenrecommend,
aspecificqualityassessment function/ Itsimplypublishesthequantities
S
andI
foreachsellerandallowsbuyerstouseany assessmentrulethey seefit.Itisimportanttonoteat thispointthat
eBay
doesnot currently publishthequantity£„„^„„„^(andtherefore
N)
foraseller.
As
we
willshow
below,knowledge
ofA'isessential forconstructingreliablequalityassessmentfunctions.
The
resultsofthispaper, therefore,make
astrongargumentthatthenumber
oftransactionsthathavereceived
no
ratingshouldbe addedtotheprofileinformationpublishedby
eBay
andsimilarsystems.
In thispaper,
we
willexploreonegeneral familyofqualityassessmentfunctionswhich,when
implementedcorrectlyand used
m
conjunction withrating rule(2),satisfyWF2.
We
willfurtherexploredifferentways
in
which
usersofa reputationmechanism
canuse I.^,2
and
TV inordertocorrectlyimplement
thosefunctions.
The
generalform of ourqualifyassessmentfunctions isgivenby:,=/„..R,
=
k
i'f'^»
(5)[O
if^=(R)>0
Inotherwords, buyersassessthequalifyof an itemtobeequalto thatadvertisedbytheseller, if,basedon
the seller's profile,theyconcludethat the sellerdoesnot over-advertise.Otherwise, buyers
assume
thattheseller liesandassess
minimum
quality. Function(5)thereforeuses the infonnalion providedby
thereputation
mechanism
inordertoderivea(binary)assessmentoUruthfiilnessinadvertising.Assuming
that sellershaveaway
ofreliably inferring thesignof^
from feedbackprofileinfomiation,sellers
who
over-advertisetheir quality will quickly seetheirestimated qualityfallto zero.Therefore, if/isgiven
by
(5),equation(4)hasnosolutionfor^ >
.Note
thatfunction(5)doesnot preventsellersfromunder-advertisingtheirqualifybecausefor ^=
^+ ^
,all
^ <
arealso solutionsofequation(4).However,
giventhatwe
haveassumed
that drtldq^>
,we
would
not expectany profit-maximizingsellertounder-advertise. Therefore, theonlysteady-statesellerstrategy for sellers
would
beto truthfullyadvertisetheir real quality. In that case,buyerswould
estimateq
=
o=
,adesirableoutcome,which
satisfiesWF2.
Let us
now
explore threedifferentways
inwhich
buyers canuseZ
,S
andA' inordertoestimate the signof^
Assessment
based on
thenumber of
positivesOne way
toestimatesellerhonestyistorequirethatthefractionofpositiveratingsofgood
sellersexceeda threshold.From
(3)we
canseethat f]=
Y. /A' canbeinterpretedas apoint estimatorof^[-^1
o].Given
that <i>[-4/a]
<
0.5 for alli^>
,assessmentofthesignof^
reducestotestingthestatisticalhypothesisH
://>
0.5 given fj.The
correspondingqualityassessmentfunction thenbecomes:{q
ifH
acceptedif
H^
rejected (6)where
H^:t]>
0.5given^7h
X
/NHypothesis
H
canbetestedusingone oftheknown
techniquesforcomputing
confidenceintervalsofproportions followingbinomialdistributions(e.g.BlythandStill 1983).
Function(6)isan appealing
method
forassessingseller qualitybecause ofitsrelativesimplicity.Note
thatitscomputation doesnot require
knowledge
ofthemodel
parametersA,dand
a
.However,
(6)isdifficultto
compute
reliablywithoutknowledge
ofM
thetotalnumber
ofrated plus unrated transactionsofaseller.As
was
mentioned,eBay
doesnotmake
A'known
toitsmembers.
Taking f]=
^JC^^
+
^_)would
resultin largeoverestimationof<t>[-^/a],especially because,dueto ratingleniency, 5^„^„„„|, isexpectedtobe
quitesignificant (inthe datasetofFigure 1 S^^^„„ /
N
= 48.3%
). Forthatreason,onewould
infer thatqualityassessment based
on
thenumber
ofpositiveratingsisnot(and shouldnot be)widely usedon
eBay. Thishypothesisisconsistentwithempirical observations(Dewan
andHsu
2001).Section42 willdiscussanother disadvantage offunction(6),
which
isthatitmakes
iteasierfor sellers tooscillatebetweenperiods
where
theymilktheirgood
reputationbyoverstatingtheirqualityanddeceivingbuyersandperiods
where
theyrestore theirreputationby
offeringbetterquality thanwhat
buyersexpect.Assessment
based
on
thenumber of
negativesInananalogous manner,
we
expectgood
sellers tohavefewnegativeratings.Therefore,anotherway
toestimatesellerhonestyistorequirethatthefractionofnegativeratingsof
good
sellersstaybelow
athreshold.
From
(3)we
canseethat^
=
Z
/A'canbeinterpreted asapoint estimatorofcl)[^/(T
-
/I /(6* cr)].Given
that cl)[,,^/cr-A/(6' o")]>
0[-/l/(6' cr)] for all.,=> ,assessmentofthesignof^^ reducestotestingthestatisticalhypothesis
H
[ :^
<
<^[-
A
1(9a
)\ givenC
The
correspondingqualityassessment functionthenbecomes:\q if
H'
acceptedjo if//' rejected
vhere
H[: ( <k' =^[-Al{d a)\g\\m (
=
1. IN
^'
|0 if//' rejected (7)
Let uscall k' =0[-/l/(6' cr)J the
optimum
trustworthinessthreshold. A* isamonotonicallydecreasing functionoftheleniency factorX
.Therefore,themore
lenientbuyersarewhen
theyrate,thelowerthe thresholdofnegativeratings totransactionsabove which theyshouldnottrustsellers,andviceversa.ThisISa result thatcorrespondswellto
documented
empiricalfindings: mosteBay
buyersweigh
negativeratings
much
more
heavilythan positiveratingswhen
assessingthetrustworthinessofaprospectiveseller(Dewan
andHsu
2001).Given
thattheyseem
toberather lenientwhen
they ratethosesellers,accordingto(7),
we
would
expectthem
tobeslnctwhen
assessing the qualityofsellers,andthereforetoberelativelyintolerantofnegativeratings.
The
big question,however,iswhether buyersuse therightthresholdwhen
theyjudgesellers (inotherwords,arebuyers capable of
making
the"nght"judgment
oijusthow
many-negative ratingsaretoomany?).From
equation(7)we
canalsoseethatanoptimum
k' canbederived foreveryA
.One way
ofinterpretingthisresultisthat satisfactionof
WF2
isalwayspossibleno
matterhow
lenient(orstrict)buyersare
when
theyrate,providedthatiheystriketherightbalance betweenrating leniencyand
qualityassessmentstrictness.ln thenextsection,
we
shallprovethat,more
lenientrating(andcorrespondinglystrictassessment)increasesthelikchhood that sellers will finditoptimalto settle
down
toasteady-state behavior.Some
degree ofleniency,therefore,can bebeneficialto the stabilityofthemarketplace.Itisalso importanttopointoutthat, unlessbuyers usethe rightthreshold k'
when
evaluatingthenumber
ofnegativeratingsofaseller,
WF2
willnotbesatisfied. Ifbuyersuseathreshold k> k' thenthere willbe some<^ > forwhich
//' willbesatisfiedandsellerswillbeabletoconsistentlydeceive buyersby
over-advertisingtheir quality. In contrast, \ik<k'
,therewillbe^^^ < suchthat //^'^ willberejectedforall
(^
> ^
, Inthelattercase, topreventtheirestimated qualityfrom
droppingtozero, sellers willbeforcedtounder-advertise and, therefore,beconsistentlyunder-appreciatedby |^„ |•
We
see,therefore, thatthechoiceofthe"right" k' iscrucial tothewell functioningof
thereputationmechanism,
and ofthemarketplaceingeneral.Itisimportanttoaskwhether buyers can be reasonably expectedtobeabletocorrectly deriveit.From
equation(7),calculationofk' requiresknowledge
ofthemodel
parametersi,^
anda
Itisunlikelythatbuyerswould
haveaccurateunderstandingandknowledge
ofthoseparameters(especially
o
,which
partly reflectspropertiesoftheseller). Nevertheless,evenifthemodel
parametersarenotknown,
itispossibletoestimate thevalueof^[-Xl(da)]
from2
,S_ andA'.
From
(3):-^la
=<^CLJN)
IA-' =a>[-/l/(6ia)]=*[<D"'(^ /yV)+
0'(I
IN)] (8)-Al(e cr)^^la
=^-\ZJN)
J ^ ^ n v y , i y . n{iN
issmallthenaconfidenceintervalshouldbeconstructedfor k'.
Even
withthehelpofequation (8).buyersstill needtoknow
A'inordertoproperlycompute
function(7)Overall,function(7)definesa rather fragile rule forassessingsellerqualityefficiently.
Given
thatalotofeBay
buyersareheavilybasingtheir sellerqualityassessmentsonthenumber
ofnegativeratingson
thesellers' feedbackprofile,it isveryinteresting toask
what
methods
theyusetocompute
theirtrustworthinessthresholds and,even
more
important,whethertheirtrustworthiness thresholdsdo
indeedcome
closetosatisfyingWF2.
Clearly,these areimportant questions,which
invitefurtherempiricaland experimentalresults tocomplement
the resultsofthiswork.Assessment
based on
theratiobetweennegativesand
positivesIn both previouscases,correctimplementationofthequalityassessmentfunction required
knowledge
ofM
One
mightthinkthat,by basingqualityassessment onthe ratiobetween
negativeandpositiveonemay
beabletoderivean optimal assessmentfunction from
Z^
andZ
only.We
willshow
that thisisnotpossible.From
(3)we
get:Function p(^)isnon-negative and monotonicallyincreasingin
^
. Furthermore p(Q)=
2 <t>l-Ai{d a)].
H"
.p<l
0[-/li{fi a)] givenp
= Z_
/Z^
.The
correspondingqualityassessmentfunctionthenbecomes:
{q
if//' acceptedif//^rejected (10)
where
Hl:p<2
*[-/l1(6 a)]givenp
=
Y. lY.^Unless buyers have
knowledge
ofthemodel
parametersA,6mAa
,calculationof2 <i>[-Xl{d a)\ from(8)requires
knowledge
ofM
Therefore, usingthe ratioofnegativestopositivesisverysimilartousingthefractionofnegativesandisequallytricky to"getright"without
knowledge
ofA'.4.2Existenceofsteady-statebehavior
The
analysisofSection4.1 hasbeen based ontheassumptionthat sellers settledown
tosteady-staterealandadvertised qualitylevels.Thissectionwillinvestigatetheconditionsunder
which
sellers will indeedfindItoptimalto
do
so.The
alternativeisto oscillatebetweenbuildingagood
reputationandthenmilking itby
over-advertisingrealquality.As
we
arguedinSection3,reputation-mediatedmarketplaces should be designedinordertoinducesellerstosettledown
tosteadystate behavior(otherwise infomiation aboutpastbehaviorwillnotbevery helpfulas a
way
ofpredictingthe future).The
principalresultofthissection isthatwhen
qualityassessmentisbased onfunctions(7)or(10),which
involve negativeratings, then,ifthe ratingleniencyfactor
A
islargeenough,sellerswillfinditoptimaltosettle
down
tosteadystatebehavior. In contrast,thereisno such guaranteewhen
qualityassessmentisbasedon function(6),
which
only involves positiveratings.Thisresultshows
thatmore
lenientrating(coupled with
more
strictqualityassessment)supportsstability inthe system. Forthesame
reason,although
more
fragileanddifficultto"getnght", functions(7)and(10),i.e. functionswhich
basesellerqualityassessmentonthe
number
ofnegativeratings, arepreferredtofunction(6),which
only looksattheseller'spositiveratings.
Inordertoderiveourresult, letus consider
ways
inwhich
sellersmay
attempttorealizeadditional profitsthrough oscillatingbehavior.
Assume
thata sellerisabletoperform N^ transactionsbeforeratingsofthose transactionsarepostedtoherfeedbackprofile.Thisnumber
depends onthefrequency oftransactionsandthedelay
between
transactionsandthepostingofratingsby
buyers (oneBay,thisdelayistypically 2-3 weeks).Let us consider aseller
who,
attheend ofperiod0,has completedA'transactionsinthe currenttimewindow
andhasaccumulatedagood
reputation,by producing andadvertising itemsofqualityq ,thequality thatoptimizesprofitsassumingsteady-state behavior. Let usfurther
assume
thatbuyersassess qualitybased onfunction(7). Attheend ofperiod0:I
=
—
(^=
0)=
cD[-/l/(^a)]=
A: (11)N
period
Atthebeginningofperiod 1 the sellerdecidestomilk her reputation
by
choosinga realquality q'andthenover-advertisingherquality
by ^
sothatherprofitisma.ximizedrelative tothesteadystatecase.Given
theseller's
good
past reputation,initiallybuyerswillbedeceived.However,
aftertheypurchasetheseller'sitems,theywillrealize their inferiorqualityandwillpostproportionally
more
negativeratings.Therefore,A'
ptTloJI
'nTn.
>k
(12)and the seller'ssubsequentestimated qualitywill fall tozero
Assuming
thatsome
buyersarewillingtobuy
from
somebody
with zero qualityifthe priee islow enough, oursellerwillslayin business.Inordertoincrease her reputationonceagain,sheneedstoreducethe ratio
Z_/
N
tobelow
thethreshold A*.The
onlyway
shecan achievethisistogo
throughaperiodwhere
sheproduceshigher quality items but receiveslowerprices,surpassing buyers' expectations(who
now
expect q=
)by .5 ,. Let usassume
thatit
would
takeN,
"redeeming"transactionsbeforeZ
/A'<
A'. Attheend ofperiod2:N
0[-A/(e a)]+N^ *[^,/a-A/(0
a)]+ N^ <i>[-4Ja-A./(0 a)]N
+N
+iV,k'=0[-A/{O a)] (13)
A
profit-maximizingsellerwillchooseto oscillateifthe profitfromthe"deceiving"transactionsrelative tothe steady-stateprofitexceedsthe lossfromthe"redeeming"transactionsrelative to thesteady-stateprofit.
Ifthese
two
quantitieshaveafiniteratio,then,providedthatthenumber N^
of"redeeming"transactionsthatarenecessaryinorderto
"undo"
thereputationeffectsofA^^ "deceiving"transactionsishighenough,sellers willnotfinditprofitable to oscillateandwill settle
down
tosteady-staterealandadvertised qualitylevels.
From
(13)aftersome
algebraicmanipulation,we
get:A,
0[£/cr-A/(0a)]-0[-A/{0
a)]N. «[-/?/{0 a)]
-fl>[-^,
/a-A/{0
(T)]=
gU.4r4,)
(14)After
some
manipulationwe
getdg/dA>0
and3'g/3/l">
.In fact g(.)grows
exponentiallywith/IFigure2 plotsg(A)for
^ =
<T-
' ""''c-rno ronrocsntatU/P ^;QlllCCnf,<=-
.^-
,= andsome
representative valuesof1,=
c,=
c,.05
115
2Leniency
factor(lambda)ksi=1 ksi=2 ksi=3 ksi=4 ksi=5
Figure2:
Minimum
ratioof"redeeming"
to"deceiving" transactionsneeded
inorder
torestoreone'sgood
reputation followingaperiodof qualityover-reporting.^
Furthermore,foragiven
A
,g{.)grows
rapidlywithi^^ anddecreasesveryslowly with i^,. Thismeans
thatthe
minimum
necessaryratioof redeemingtodeceiving transactionsgrows
with theamount
ofinitialFrom
Figure2itisevidentthatinmarketplaceswhere
buyersrate leniently(and assess qualitystrictly),sellersneed
many
more
"redeemingtransactions"inorderto restore theirgood
reputation following afew"deceivingtransactions".
The
relativenumber
of redeemingtransactionsincreases exponentiallywiththeleniencyfactor Otherwisesaid, the larger the
A
,themore
difficultitisfor sellers to restore theirreputationoncethey loseit.Consequently, if
A
issufficiently large, sellers will finditoptimaltosettledown
tosteady-staterealandadvertised qualitylevels.Q.E.D.A
similarresultcanbederivedifbuyers basequalityassessmenton
theratioZ_ /I^
.Incontrast, ifbuyersbasequalityassessment
on
function(7),ouranalysisgives:A', _
1-2
0[^, /CT](15)
Equation(15) gives /V,
=
A', for i^^=
<^^and/V, slightly lessthanN^for ^^^<
^^. Otherwisesaid, followinga setof deceivingtransactions, ittakes thesame number
of(orfewer)redeemingtransactionsinorderto restoreone's
good
reputation. Insucha setting, itismore
likely thatsome
sellers willhaveprofitfunctionsfor
which
itwillbeoptimalto oscillate. Therefore,oneexpectsthatinreputation-mediatedmarketplaces
where
buyersuse(6) toassessseller quality,therewillbeless stabilitythaninmarketplaceswhere
sellersuse(7)or(10).The
resultsofthissectionprovidesome
interestingargumentsforbothratingleniencyaswell asforbasingthe qualityassessmentofsellersontheirnegative,ratherthantheirpositiveratings.
5.Realitychecks
and
some
recommendations
The
resultsoftheprevious sectionshave beenderivedby making
anumber
ofsimplifyingassumptionsabout buyerbehavior.
More
specifically,we
haveassumed
thatallbuyers havethesame
qualitysensitivityandleniency factor
A
. Furthemiore,we
haveassumed
thatbuyers always submitratingswhenever
theirsatisfaction rises
above
zero orfallsbelow
-A
.Both assumptionsarenotlikely tohold ina realmarketplace. Buyers havedifferent personalities,andtherefore, areexpectedtohavedifferentquality
sensitivities,as wellasleniency parameters.Furthermore,ratings
do
incuracost(timetologon and submitthem)and
some
buyersdonot botherrating,evenwhen
transactions turnoutreallygood
orverybad. Inthissection,
we
will injectabitofrealitytoourmodel
andwillexplorehow
ourresultschange ifwe
takeintoaccounttheaboveconsiderations.
Reality
Check
#1:Some
buyersneverrateWe
needtomodify ourratingfunctionr(S)in (2), sothatwhen
S>
, r(S)="+" withprobability /? andr(S)
=
norating with probability (1-
/9). Similariy,when
S<-A,
r{S)="-" with probability y andr(S)
=
norating withprobability (1-
7).Under
thisnew
ratingfunction,thestatistical hypothesisin (6)becomes
H^:r]>^
0.5 ,whilethehypothesisin(7)becomes
H'^:C<7
0[-A
/(d a)].We
seethatournew
assumptionintroducestwo
additionalparameterstoour model.The
parametersneedtobereliablyestimatedinorderforproperty
WF2
tobesatisfied.Reality
Check
#2:Buyersdiffer inqualitysensitivityand
leniencyLet's define a;
= A/
(9andlet's call p{co) theprobability distnbutionoftt»among
buyers.Then
(3)mustbemodifiedas:
1
=NProb[S>0]
=
N<t>[-^/a]
(16)
Ifqualityassessmentisbased onthe fractionofpositiveratingsusing(5),thenrealitycheck#2 doesnot introduce additional complications.
However,
ifqualityassessmentisbasedonthe fractionofnegative ratmgs,which,m
thepresenceoflenient ratingsisthe rulemost
likelyto result in stable sellerbehavior, thenthmgs
dobecome
considerablymore
complicated.More
specifically,itiseasytoseethatthestatisticalhypothesistobetestedin(6)must
become
H'^:^<
k'=
j<I)[-a»/cr]p{w)
dco. Inordertocalculatethe"right" k',one needs
knowledge
ofp(QJ).
Things
become
evenmore
complicatedifwe
combine
realitychecks#1 and#2,which would
bethesituation thatmostcloselycorrespondsto actual reality.
Of
course,one can begintothinkofways
inwhichindividualbuyers might beabletoestimate,
maybe
withsome
degreeoferror,the additionalmodel
parameters /3,yandp(6;) from
Z
,I
andjV .However,
insteadofembarking
inthisdirection,atthisstagewe
believethatwe
areprovidedenough
argumentsto
make
one ofthemain
pointsofthispaper: Binaryreputationmechanisms
canintheoryhewellfunctioningundertheassumption of simpleratingand assessmentrules,butonlyifbuyersusethe
rightthresholds
when
judgingsellertrustworthiness. Calculatingthe rightthreshold fromS
,S alone, theonlyinformation currentlyprovided by eBay,isverydifficult. Calculatingthe rightthresholdfrom
I
,Z andA' ispossibleunderthesimplifyingassumptionsofSection 2 butbecomes
more
andmore
difficultasour
models
approachreality. In realistic cases, thecorrectassessmentruledependsnotonlyon
thefeedbackprofileofa sellerbutalsoonpropertiesofthe raterpopulation.
Given
thattheefficiencyofthemarketplacecruciallydepends onthe selectionofcorrectassessmentthresholdsonthe partofthebuyer,
themostsensiblecourseofresearch thereforeshould betothinkofadditional informationthatthe reputation
mechanism
can provideto raters, inordertomake
thiscalculationeasier.One
ideaisforthe operatorofthemarketplace, orsome
trusted third party, to introduceasmallnumber
ofhonestsellers into themarket.
The
behaviorofthosesellersmust be completely underthecontrolofthemarketplaceoperator buttheir identitiesshouldbe
unknown
tobuyers(sothatbuyersarenotbiasedwhen
theyratethosesellers).
Given
thatthemarketplaceknows
thatforthosesellers ,^=
,itcanusetheirfeedbackprofiles inordertoderive estimatesofy5,7,p(ft;)andk'
Those
estimates, or suitablychosenderivatives,canthenbe
communicated
tothebuyersforthepurposeoffacilitating their sellerassessmentprocess.
6.Conclusions
The
objectiveofthispaperwas
toexploretowhat
extent binary reputationmechanisms, suchastheone usedateBay, arecapableof inducingefficientmarketoutcomes
in marketplaceswhere
(a) truequality informationisunknown
tobuyers,(b)advertised quality iscompletelyunderthe controlofthesellerand(c)theonlyinformation availabletobuyersisanitem's advertised quality plus theseller'sfeedbackprofile.
The
firstcontributionofthepaperisthedefinitionofa setofconditionsforevaluating thewell functioningofareputation
mechanism
issuchsettings.We
considerareputationmechanism
tobewell-functioningifit(a) inducessellers tosettle
down
to asteady-statebehavior assumingitisoptimal forthem
todo
sounderperfect qualityinformationand(b)atsteady-state, sellerqualityasestimated
by
buyersbefore transactions takeplaceisequal totheirtrue quality.The
secondcontributionofthepaperisananalysisof whetherbinary reputationmechanisms
canbe well-functioningundertheassumptionsthat(a)ratings arebasedonthedifferencebetween
buyers'trueutilityfollowingatransactionandtheirexpectations beforethetransactionand(b)buyersarerelatively lenient
when
theyrateand correspondinglystrictwhen
they assessa seller'sfeedbackprofile.Our
firstconclusionisthatifbinaryfeedbackprofilesareusedtodecidewhethera selleradvertisesassess quality equalto the
minimum
quality),then, intheory, binary reputationsystems canbewell functioning,providedthatbuyersstrikethe rightbalance betweenratingleniencyandqualityassessmentstrictness Furthermore,assumingthatbuyersbasetheir
judgment
onthe ratioofnegativeratingsreceivedbyaseller, ifbuyersare lenient
enough
when
theyrateand correspondinglystrictwhen
theyjudgesellerprofiles,
we
haveshown
that sellers will finditoptimaltosettledown
tosteady-state qualitylevelsifsuchanequilibriumexistsunderperfectinfomiation.Thisisaninteresting
way
inwhich
(a)judgingsellertrustworthinessbased ontheirnegativeratingsispreferabletobasingitontheirpositiveratingsand(b)
some
degreeofratingleniency helps bringstability to thesystem.Our
second conclusion isthat,unlessbuyersusethe"nght"threshold parameterswhen
theyjudgesellerprofiles,binary reputation
mechanisms
willnot function wellandthe resultingmarketoutcome
willbeunfairfor either thebuyersor thesellers.Inthatsense,althoughbinary reputation
mechanism
can bewell functioningin theory,theyareexpectedtobequite fragileinpractice.The
crucialquestion thereforebecomes
whetherbinary feedbackprofilesprovidesellers(esp. relativelyunsophisticatedones) withenough
infonnaliontoderive the"right" sellerjudgment
rules.We
have foundthatthe "right"judgment
rule(e.g.what
isthe rightnumber
ofnegativeratingsabove
which
a sellershouldnotbetrusted?) isdifficultto infercorrectlyfromknowledge
ofthesum
ofpositiveandnegativeratingsalone,
which
istheonlyinformation currentlyprovidedby eBay
toitsmembers.
Ifknowledge
ofthesum
ofunrated transactionsisaddedtofeedbackprofiles, then,underanumber
ofsimplifyingassumptions,itispossibletoderivenon-obviousbutrelativelysimple "optimal"
judgment
rules
which
result inwell functioning reputationmechanisms.However,
ifthesimplifying assumptionsaredropped,calculationofthe right
judgment
rulefromS^
,£_ andN
onceagainbecomes
difficult,asitrequires
knowledge
notonlyofsellerratingsbutofthe raterpopulationas well.Our
findings leadto therecommendation
thatmore
informationshouldbe providedto assist ratersof such marketplacesusefeedbackprofilesinthe"right"way.One
idea isfortheoperatorofthemarketplace, orsome
trusted third party, to introduceasmallnumber
ofhonestsellersunderitscontrolintothemarket. Ifone knows,a priori, that a sellerishonest,herfeedbackprofilecanthenbe usedbythemarketplaceto
estimateandpublisha
number
ofadditional"market-wide" parameters which,togetherwith2^
,S_ andN
,willhelpbuyers
more
reliablyassess the qualityofothersellers inthe system.The
theoretical resultsofthispaperraisesome
intriguing questionsrelated to the efficiency, fairnessandstabilityofeBay-like electronic marketplaces.
The
authorwould
welcome
experimental andempiricalevidencethat willshed
more
lightintothequestionsraisedandwould
validate the conclusionsdrawn
fromhismodels.
References
Akerlof,G. (1970)
The
marketfor'lemons': Quality uncertaintyandthemarketmechanism.
Quarterly JournalofEconomics
84, pp.488-500.Blyth,C.R.and StillH.A. (1983) Binomial ConfidenceIntervals.Journal
of
theAmerican
StatisticalAssociation78, pp. 108-116.
Bresee,J.S.,
Heckerman,
D.,and Kadie, C. (1998)EmpiricalAnalysisofPredictiveAlgorithmsforCollaborativeFiltering.InProceedings ofthe14'''Conference
on
UncertaintyinArtificialIntelligence (UAl-98),pp.43-52,SanFrancisco, July 24-26, 1998.Dew
an, S and Hsu,V
(2001)Trustin ElectronicMarkets:PriceDiscoveryinGeneralistVersusSpecialtyOnlineAuctions.
Working
Paper.January31,2001.Kollock,P.(1999)
The
ProductionofTrustinOnlineMarkets. InAdvances
inGroup
Processes(Vol. 16), eds. E.J.Lawler,M. Macy,
S.Thyne, andHA.
Walker, Greenwich,CT:
JAIPress.Resnick,P.,Zeckhauser,R.,Friedman, H.,
Kuwabara,
K.(2000) [Reputation Systems.Communicaiions
ofthe
ACM,
Vol. 43,(12),December
2000,pp.45-48.Resnick, P.and Zeckhauser,R. (2001)Trust
Among
StrangersinInternetTransactions: Empirical AnalysisofeBay'sReputation System.
Working
PaperfortheNBER
workshop
onempirical studiesofelectroniccommerce
January 2001.Rogerson,
W
P (1983) Reputation and productquality.BellJounuilof
Economics,Vol. 14(2),pp.508-16.Sarwar, B. M., Karypis,G.,Konstan,J.A.,andRiedl,J.(2000).Analysisof
Recommender
AlgorithmsforE-Commerce.
ACM
E-Commerce
2000
Conference, Minneapolis,MN,
Oct. 17-20, 2000.Schmalensee,R. (1978).Advertisingand ProductQuality.Joz/rm;/ o/Po/;7;ca/fcowowi', Vol. 86,pp.
485-503.
Schafer,J.B., Konstan,J.,andRiedl,J., (2001)Electronic
Commerce Recommender
Applications.Journalof Data Mining
and
Knowledge
Discovery. January,200
1.
Shapiro, C. (1982)
Consumer
Information,ProductQuality,andSellerReputation. BellJournalof
Economics
13(1),pp
20-35,Spring1982.Smallwood,
D.andConlisk,J.(1979).ProductQualityinMarketsWhere
Consumers Are
Imperfectly Informed. QuarterlyJournalof
Economics.Vol. 93,pp. 1-23.Wilson, Robert(1985). Reputationsin
Games
andMarkets.InGame-TheoreticModels of
Bargaining,DEC
?00?
MIT LIBRARIES
IInilI mil