Showing posts with label mturk. Show all posts
Showing posts with label mturk. Show all posts

Tuesday, November 8, 2016

Trump and Clinton Supporters Agree on Relative Morality ... Mostly

When looking at the endless scandals swirling around the heads of the two rival candidates Hillary Clinton and Donald Trump, it can seem like the two candidates are equally tainted. Many people throw up their hands pleading for some other option.

How can we evaluate the alleged actions of these two candidates?

Is there some kind of objective way to do so?

And how does the decision to support a candidate affect the perception of an offense?

In order to attempt to address these questions I recruited 137 submissions on Amazon Mechanical Turk who submitted around 1100 five way "relative offense" rankings of five randomly matched actions from a list of 150. 46 submissions  were from individuals supporting Clinton, 37 from individuals supporting Trump, and 54 other or not supplied. Ranking was from 1 "Least Offensive" to 5 "Most Offensive".

Within any item set of five actions only one action could be matched with an individual ranking.
 For each action the mean number of times that action was classified under each ranking was calculated. That value was multiplied by the action ranking and summed across all levels to create an index.

The smallest index are the least offensive to non-offensive actions while the highest rankings are those actions respondents considered the most offensive.

Table 1: This table shows all 150 actions ranked from least offensive on average to most offensive as rated by all respondents. The Trump, Clinton, and Other columns are how each of these respective supporters rank the actions. A .5 means that two actions were ranked the same. The Trump_Clinton column is calculated by taking the rankings of Trump supporters for an action and subtracting the rank for those actions for Clinton supporters. Values in which Trump supporters and Clinton supporters diverge by more than 30 ranks are highlighted.
AllTrumpClintonOtherTrump_Clinton*ScandalAction
1522.53Eating meat.
21215-1Posting a much younger picture of yourself on a dating website.
36.55.581Not wearing your seat belt when driving.
415.52613.5Dating someone of a different race.
52.5152.5-12.5Donald TrumpSwearing in public.
68.5172.5-8.5Offering to pay someone 10 cents for every paper they deliver.
76.51217.5-5.5Stealing candy from a baby.
8122510-13Jumping in line in front of someone else when waiting for customer service.
92.52721-24.5Not tipping a waiter after average service.
1041433.5-10Hoard the armrest when sitting next to a stranger on a plane or movie theater.
1160.55.5555Having sex with someone of the same gender.
122783819Trey RadelBuying illegal drugs for personal consumption.
1325.51128.514.5Using toilet paper to vandalizing a stranger's house.
1437132524Taking the cab hailed for someone else.
152927122Leaving chewing gum under a public table.
161357.520-44.5Exaggerating the size of your penis in order to convince someone to have sex with you.
1725.544521.5Illegally copying copyrighted music.
181816322Not voting.
192294813Illegally streaming copyrighted videos.
201721.530.5-4.5Publishing with permission a different person's work as your own.
213051.512-21.5Hunting a wild animal.
228.52045-11.5Using a bad or ineffective preparedness test to screen potential students.
2357.536.51221Sticking gum in someone's hair.
244340.5222.5Having sex with your landlord to pay the rent.
2520750.513Not flushing a public toilet after pooping.
2646.536.51410Hillary ClintonContinuing to stay married to your spouse after that person had multiple affairs.
27.53233.519-1.5Paying a prostitute for sex.
27.510.51827-7.5Spending money on something you do not need such as entertainment when you know there are people dying of hunger next door.
2915.5669-50.5Donald TrumpEarning 1000 times more money per hour as your lowest paid full-time employees.
3046.53977.5Voting illegally twice.
3110.52940.5-18.5IRSUsing public funds to pay for professional development conferences which may not be very productive.
32212726-6Hillary ClintonRefusing to release the transcripts of paid speeches you gave.
33286117.5-33Earning 1000 times more money per hour as your lowest paid contractor.
343440.523-6.5Lying about damages in order to keep a tenant's security deposit.
3563642.5-1FBISelling guns with GPS trackers in them to a dangerous illegal organization with the intention of using the data to prosecute the organization.
361919650Urinating on a toilet seat and not cleaning it.
3774.5612413.5Donald TrumpMisrepresenting your success to convince others to invest in you.
386551.528.513.5Donald TrumpReposting images from a white supremacist group.
3923.55440.5-30.5Writing false review's on a business's website because you are angry with them.
4049.531.530.518Sneaking out of a resturant without paying for a meal.
411421.581-7.5Copying a fellow student's homework.
423557.539-22.5Hillary ClintonA politician lying or falsely representing one quarter (25%) of his/her public statements.
433236.558-4.5Stealing products from a convenience store. (Shoplifting)
445577.542.5-22.5Donald TrumpFalsely reporting the magnitude of your donations to charities to make yourself look better.
45.552.545.5527Donald TrumpPublicly shaming someone for being fat or ugly.
45.581.5691612.5Refusing to renounce the endorsement of a white supremacist group.
4723.54372-19.5Speeding through a school zone.
4852.549453.5Promising special interest groups to vote for them if they donate to your campaign.
4979245355Masturbating in a public bathroom.
5057.56136-3.5Donald TrumpLying about the number of floors in a building you own in order to charge higher rent.
51417160-30Using spray paint to vandalize a stranger's car.
524136.5684.5Smoking in an area around non-smokers where "no smoking" signs are posted.
5374.5424832.5Spitting in someone's soup when they are not looking.
54416455.5-23Publishing without permission a different person's work as your own.
554411250.5-68Publicly renouncing homosexuality while secretly having sex with gay prostitutes.
5674.5484826.5Publicizing false claims about a person because you are angry with that person.
5766.54755.519.5Chris ChristieUsing your position to control how public funds are used in to punish a political rival.
583251.563-19.5Providing alcohol to minors (a person less than 18 years old).
5992443548A public official hiring a less qualified friend over a well qualified stranger.
6074.55642.518.5A fully mobile person refusing to give up a seat to a disabled or infirm person.
6146.59333.5-46.5Publicizing someone's address in an attempt to intimidate someone else.
6239108929Donald TrumpJudging a person on their appearance.
6352.56861-15.5Selling illegal drugs.
6474.531.57543Hillary ClintonImproperly storing national secrets.
6552.56477.5-11.5Donald TrumpA politician lying or falsely representing three quarters (75%) of his/her public statements.
6681.5307051.5Donald TrumpEnjoying firing people from their occupation.
676886.554-18.5Stealing large sums of money from a company you work for.
6846.510057-53.5Donald TrumpEarning over 10 million dollars in a year and not paying any federal income taxes.
6960.58964-28.5Donald TrumpFiling bankruptcy in order to avoid paying the people who worked for you.
706351.583.511.5Sabotaging a competitors work.
7181.573738.5Setting up fake accounts for your customers in order to increase profits.
7281.5239358.5Publicly exposing yourself.
73100.510137-0.5Donald TrumpEnter the occupied changing room of members of the opposite sex without consent.
7449.58377.5-33.5Clinton/TrumpA politician accepting money from special interest groups.
75377499-37A police officer accepting a bribe in exchange for not writing a ticket.
76106.5816725.5Stealing someone's car.
779595.566-0.5
Driving drunk or high.
7866.577.580-11IRSUsing public funds designated for tax collection to produce a parody video.
7957.59277.5-34.5
Using racial epitaphs.
8057.533.59424Cheating on a test.
8190597431Not giving food to a starving person in front of you.
82638562-22A politician lying or falsely representing half (50%) of his/her public statements.
83957286.523Anthony WeinerA married person sending photographs of his/her genitalia to someone who is not that person's spouse.
84111.5678244.5Giving a blind person inaccurate change because they cannot tell the difference.
85106.5847122.5Urging someone who is attempting to remain sober to take a drink.
86937083.523Without authorization setting up and using a credit card in another person's name.
87100.545.510655Masturbating in public.
8884.586.577.5-2Using public resources entrusted to your care to enrich yourself.
89108756933Mocking a disabled person for a physical handicap.
9084.577.5857Bribing a public official to enrich oneself.
91109105.5593.5Donald TrumpSetting up a fake educational institution in order to enrich yourself.
923794111.5-57Torturing criminals as punishment for crimes.
939088952Bill ClintonCheating on your wife/husband.
9469.510291-32.5Stalking someone you know in order to intimidate that person.
958855111.533Doctor KevorkianKilling someone who is in pain and going to die within the next six months and wants help dying.
969890.586.57.5Tonya HardingHiring someone to break the leg of a rival athlete.
9774.590.5101-16Using threats of lawsuits to silence a woman who claims to have been assaulted by you.
98114829832 Hillary ClintonUsing a private email server resulting in the risk of compromised national security.
998797107.5-10Writing laws to prevent people who do not support you from voting.
10074.5107105-32.5Requiring a starving person attend your religious gathering before providing giving food.
1019798.5104-1.5Donald TrumpUsing your superior physical strength to force a person to kiss you.
10286103.5107.5-17.5Stealing someone's identity in order to commit a crime.
10310380113.523Burning down your house to claim the insurance money.
104111.511490-2.5Using public resources entrusted to your care to enrich an ally or friend.
105100.511597-14.5Preventing someone from registering their child in your school because of that person's skin color.
1061161091007Adolf HitlerEncouraging racially motivated violence to promote your political aspirations.
107104120102-16Bribing a public official to protect oneself.
10874.5118115-43.5Rejecting someone's rental application because of that person's skin color.
10990124.5109-34.5Bill CosbySneaking a drug into someone's food or drink in order to force that person to have sex with you.
11011912888-9Killing a healthy and well behaved pet for because you don't have to the time to take care of it.
111100.5105.5110-5Stealing someone's needed pain medication for personal use.
112139.51239616.5Killing a healthy and well behaved pet because you find the animal annoying.
113143103.59239.5Edward SnowdenPublicizing national security secrets which might result in lives being lost.
11411495.5121.518.5Donald TrumpThreatening to sue someone in order to keep them from telling the truth.
115132124.51037.5A person of influence encouraging a crowd to physically attack someone or a group of people he/she does not like.
11669.5117127-47.5George W BushTorturing terrorists with the hope of gathering information about future terrorist plots.
11712177.5143.543.5Ordering the assassination of a dictator.
118105120126-15Recording someone having sex without their knowledge.
119114122121.5-8Restricting the use of life saving technology in order to increase profits.
120136108116.528Misrepresenting the effectiveness of a life saving drug in order to increase profits.
12113398.513134.5Killing someone who is in pain and going to die within the next six months but does not want to die.
122117110146.57Not reporting an instance of known child abuse when you are legally mandated to report.
123130.5136.5113.5-6Physically abusing your spouse.
124128.511212516.5Sending someone's spouse to the front line to die in order to marry the surviving window.
125122135120-13Hiding information about the health risks of deadly product you sell.
126124.5134123.5-9.5Stealing large sums of money from a mentally infirm client.
127110144135-34A medical doctor refusing to treat a patient who needs urgent care because they are unable to pay.
128119129123.5-10Significantly raising the price of a life saving drug in order to increase profits.
129139.5131116.58.5Viewing child pornography.
130141.51381183.5Killing a healthy and well behaved pet for fun.
131119143128-24Someone with a known dangerous sexually transmitted disease having unprotected sex with someone who is unaware of the condition.
132123133131-10Using your superior physical strength to hold an unwilling person while you grabbed their genitalia.
133128.511613112.5Brock TurnerTaking and sharing naked pictures of a person who is unconscious.
134124.5131136.5-6.5Brock TurnerHave involuntary sex with a person you find unconscious.
135.5130.5112136.518.5Vladimir PutinAssassinating a political rival.
135.5127145129-18Physically forcing your spouse to have sex with you.
13795149.5141.5-54.5Harry S TrumanDropping an atomic bomb on a civilian city controlled by a rival nation
138148.51391199.5Allowing the executing of a convict if you have secret knowledge of the person's innocence.
139.513812014918A 40 year old having sex with a fifteen year old.
139.5148.5136.514812Using your power to pressure an employee to have sex with you.
141137140141.5-3Not reporting an instance of known child sexual abuse.
142148.512613822.5Killing innocent people to further your political agenda.
143144.5142139.52.5Adolf HitlerEncouraging people who follow you to attack and kill people you do not like.
144144.5127139.517.5Killing your spouse to claim the life insurance premium.
145134.5141146.5-6.5Killing someone for their money.
146126131145-5George WashingtonPossessing a personal slave who has no human rights.
147134.5147134-12.5Donald TrumpGroping without warning the genitalia of someone else.
148148.51461332.5Making child pornography.
149141.5148143.5-6.5Killing innocent people for personal reasons such as curiosity or entertainment.
150146149.5150-3.5Killing somebody to protect your reputation.
* Please note that I am making no claim to the veracity of the scandal.
** Upon viewing this list I realize that there are a number of scandals I forgot to include related to Hillary. I am unconvinced that any of them would have been ranked very high on the list. However they should have been included. Sorry team Trump.

General Differences
Looking at the table we can see that generally it agrees with out intuition with innocuous or generally non-offensive actions being ranked at the top, while actions that involve doing significant harm to others being ranked at the bottom.

Interestingly the rankings between Trump and Clinton supporters agree generally with a 84% correlation in ranking values.

From the differences in rankings of actions between Clinton and Trump supporters we start to get an idea on how these two different groups think. The largest difference is 68 ranks of difference with Trump supporters much more accepting than Clinton supporters of "Publicly renouncing homosexuality while secretly having sex with gay prostitutes". Clinton supporters seem less tolerant of deception in general with exaggerations of penis size and the writing of false reviews on business webpages being ranked much more offensive than their Trump counterparts.

Clinton supporters also find it generally more offensive to do harm to others such as dropping the atomic bomb on civilian populations and torturing terrorists or criminals.

The two camps have fiercely different perspectives on money. Clinton supporters are very concerned with the perceived injustices caused by unequal access to resources. These supporters are much more prone to rank as more offensive inequality in earnings as well as politicians accepting public donations. These supporters find it much more offensive for a doctor to refuse treatment on the basis of insufficient funds.

There also appears to be a difference in how concerned Trump supporters are with racial inequality with Trump supporters much less concerned with the act of rejecting an applicant due to the color of skin or minding the use of racial epitaphs.

Scandals
Perhaps unsurprisingly for the largest scandals Trump supporters and Clinton supporters seem to disagree with how offensive the actions of their candidates are. Clinton supports find the surprise groping of genitalia one of the worse actions someone can take while Trump supporters place it a bit lower on the list. For Trump supporters, improperly storing national secrets and using a private email server are ranked as much more offensive than for Clinton supporters.

Summary
While Clinton and Trump supporters mostly agree in generally how they rank objectionable actions, they do seem to disagree in some areas that seem consistent with differences in popular representation. Clinton supporters being concerned with economic, social, and political justice. Trump supporters being concerned with protecting economic rights as well as individual freedoms such as the right to offend others through word or deed.

Tuesday, January 19, 2016

Who are Turkopticon's Top Contributors?

In my most recent post "Turkopticon: Defender of Amazon's Anonymous Workforce" I introduced Turkopticon, the social art project designed to provide basic tools for Amazon's massive Mechanical TURK workforce to share information about employers (requesters).

Turkopticon, has a been a runaway success with nearly 285 thousands reviews submitted by over 17 thousand reviewers since its inception in 2009. Collectively these reviews make up 53 million characters which maps to about 7.6 million words as 5 letters per average word plus two spaces. At 100 words every 7 minutes this represents approximately 371 days collectively spent just writing reviews. It is probably safe to considered this estimation an underestimation.

So given this massive investment of individuals in writing these reviews, I find myself wanting to ask, "who is investing this kind of energy producing this public good?"

In general, while there are many contributors, 500 contributors represent 54% of the reviews written. With the top 100 reviewers making up 30% of the reviews written and the top 15 representing 11% of all reviews written.

Figure 1: Using this graph we can find the Gini coefficient for number of submissions at around 82% indicating that a very few individuals are doing nearly all of the work.
Within Turkopticon there is no ranking system for reviwer quality so it is not obvious who are the top contributors and what their reviewing patterns look like. In this article we will examine some general features of the top contributors.

Table 1: A list of the Top 15 Turkopticon review contributors. Rank is the reviewer rank by number of reviews written. Name is the reviewer's name. Nrev is the number of reviews written. DaysTO is the number of days between the oldest review and the most recent review. Nchar is the average number of characters written in each review. FAIR, FAST, PAY, and COMM are quantitative scales that Turkopticon requests reviewers rank requesters by. Fair indicates how the requester was at either rejecting or failing to reject work. Fast indicates how quickly the requester approved or rejected work. Pay indicates how the reviewer perceived the payment scheme for work was. And Comm refers to communication which indicates, if the worker attempted to communicate with the requester, how well that requester addressed the worker's concerns.

Rank
Name
Nrev
DaysTO
Nchar
FAIR
FAST
PAY
COMM
1
bigbytes
5236
294
219
4.89
4.99
2.74
3.26
2
kimadagem
3732
327
490
4.98
4.95
3.23
2.45
3
worry
2637
649
186
4.97
4.87
3.29
3.84
4
jmbus...@h...
2539
538
110
3.10
3.05
3.08
1.55
5
surve...@h...
2488
177
344
4.85
4.77
4.13
4.27
6
jaso...@h...
2100
260
78
4.98
4.90
4.78
4.73
7
shiver
1721
303
139
4.94
4.89
4.44
3.81
8
Thom Burr
1594
434
288
4.69
4.81
4.54
3.52
9
jessema...@g...
1539
467
157
4.96
4.70
3.64
4.00
10
absin...@y...
1320
309
75
4.97
4.91
3.99
3.78
11
Rosey
1313
634
101
4.80
4.76
4.35
4.18
12
CaliBboy
1281
83
201
4.02
4.07
2.71
3.84
13
ptosis
1278
367
110
3.00
3.04
2.89
3.29
14
NurseRachet (moderator)
1274
669
351
4.76
4.70
3.91
3.72
15
TdgEsaka
1234
523
258
4.75
4.81
3.73
3.00

Find the full list as a google document here (First Tab).

From Table 1 we can see that all of the top 15 reviewers have contributed over 1,200 reviews with bibytes being the most prolific reviewer contributing over 52 hundred. In terms of the reviewer active on Turkopticon the longest, NurseRachet (a forum moderator) has been on the longest followed by worry and Rosey. In terms of the longest winded kimadagem has the longest average character count per review at 490 characters or  approximately 70 words per review while CaliBboy has the shortest reviews at only 75 characters or around 10 words.

In terms of the averages the four rating scales there is a fair bit of diversity between the top reviewers with jaso...@h.. having the highest average score between the four scales of 4.8 and jmbus...@h... having the lowest average scores, around 2.7 followed by ptosis with a average a tiny bit higher than 3.

So now we have a pretty good idea of what in general the top contributors to Turkopticon look like.

But what of the quality of the contributions?

In order to understand what a quality contribution in Turkopticon looks like we must consider the standards that the community has come up with after years of trial and error.
1. The four different scales should be distinct categories. That is a high pay rate should not cause someone to automatically rank a high Fairness or visa versa.
2. To this end what is referred as 1-Bombs an attempt to artificially drop a requesters score by ranking all scales 1 should be avoided. Similarly, 5-Bombs should also be avoided.
3. Within Turkopticon there is also the ability to flag reviews as problematic. If one of your reviews is flagged, it means someone has a problem with it.
4. In general we would like reviews to be approached with a level head so that reviewers write independent reviews rather than ones based on their current mood.
5. Finally, in general we would like reviewers to review as many categories as they can when writing reviews.

Variables
From these 5 guidelines, I will attempt to generate variables that measure each of these targets.
1. For different scales I will focus on the relationship between pay and the other three scales for individual requesters (FairPay, FastPay, and CommPay for the correlations between Fair, Fast, and Comm with pay respectively). The reason I focus on Pay is that it seems to be the scale often times that concerns Mturk workers the most.

Table 2: For reviewers the average correlation between Pay and other scales.
ALL
Top 100
Top 15
FAIRPAY
0.80
0.56
0.44
FASTPAY
0.73
0.48
0.37
COMMPAY
0.81
0.67
0.61

From Table 2 we can see that the average reviewer has a very strong positive correlation between Pay and the other scales with FAIR, FAST, and COMM in the .73-.81 range. In contrast the Top 100 and especially the Top 15 all have much lower correlations. We should not necessarily hope for a zero correlation between these factors since one might expect a requester who pays too low might also act unfairly, not respond quickly to submissions, or have poor communication habits.

2. 1-Bombs and 5-Bombs are easy to observe in the data in terms of all 1s or all 5s. However, it is worth noting that all of either 1s or 5s might actually be a valid review given the circumstances. Variables 1Bomb and 5Bomb will be a variable measuring the likelihood that an individuals review will be either of the two categories.

3. Flags are also a variable that can be directly observed. Multiple flags can be featured on a single review. The highest flag hit in my data has 17 flags. The variable FLAG is the average/expected number of flags for an individual reviewer's reviews.

Table 3:  The prevalence rates of 1-Bombs, 5-Bombs, and Flags.
ALL
Top 100
Top 15
1BOMB  
0.192
0.038
0.019
5BOMB  
0.179
0.049
0.025
FLAGS
0.014
0.005
0.005

From Table 3 we can see the prevalence rates of 1-Bombs, 5-Bombs, and Flags is much higher among the general reviewers than that of the Top 100 and especially among the top 15.

4. In order to attempt to measure "level-headedness" I will just look at how reviews trend from a rating perspective. That is, is the value of the current review correlated (either positively or negatively) with the value of the next review?

Table 4: The auto-regressive one step correlation between review levels. In this case the "ALL" category only includes the 3,700 reviewers who have written more than 10 reviews.

 ALL
Top 100
Top 15
FAIRar1
 0.00
0.10
0.10
FASTar1
 0.00
0.09
0.12
PAYar1
 0.02
0.10
0.07
COMMar1
-0.07
0.04
0.04


From Table 4 we can see that inter-review correlation is pretty small especially when compared with the correlation between pay and other scales within the same review (Table 2). Interestingly for the average reviewer, there is almost no correlation across reviews. This might be a result of reviewers writing less reviews in general, thus spacing them more widely and therefore less likely to be sequentially influenced by personal psychological trends.

5. Finally in terms of completeness we can easily measure completeness in terms of how frequently reviews of individual scales were not completed.

Table 5: The completion rates of individual scales.

ALL
Top 100
Top 15
FAIRC
0.849
0.665
0.705
FASTC
0.825
0.651
0.695
PAYC
0.901
0.916
0.918
COMMC
0.605
0.147
0.081

From Table 5 we can see that the completion rates of all scales are more or less equivalent between that of the general reviewers and that of the Top 100 and Top 15 except in the case of COMM. In this case we can see that the top reviewers are much less likely to rate communication.

Constructing A Quality Scale

In order to construct the best scale given our data, we will choose those variables and values that seems to typical of the top 15 most prolific reviewers. From Tables 2 and 3 we can see very distinct differences between the average reviewer and top reviewers. However, for our auto-correlation and completeness rates we see very little differences in general except that the top reviewers are much less likely to rate communication. I can't know exactly why this is the case but I suspect it is a combination of top reviewers avoiding 1-Bombs and 5-Bombs perhaps in combination with top reviewers finding it not typically worth their time to directly communicate with requesters.

So here is my proposed index using standardized coefficients (x/sd(x)):
ReviewerProblemIndex = 3*Flag + 3*1Bomb + 1/2*5Bomb +
                                          1*FairPay + 1*FastPay + 1*CommPay

Because we have standardized the coefficients we can read the scalars in front as directly representing the weight of that variable. Flags, I will weight the strongest as they are an indicator that someone in the community has a problem with the review. Next highest rating are 1Bombs which are widely regarded as a serious problem and frequently discussed on the Turkopticon forum.

5Bombs, FAIRPay, FastPay, and CommPay are also discussed but not considered as important (Turkopticon Discuss). I have caused the 5Bombs to be half as important as FairPay, FastPay, and CommPay variables as it seems cruel to penalize someone for being generous with reviews.

So let's apply our index and see how our top 15 reviewers score!

Table 6: The top 15 most prolific contributors ranked based on the ReviewerProlemIndex (Index, RPI). IRank is the ranking of reviewers in terms of the RPI. Name is reviewer name. Nrev is the number of reviews written. Rank is the reviewers ranked in terms of number of reviews written. The other variables are described above.

IRank  Index   Name Nrev  Rank  Flag  1Bomb  5Bomb  FairPay  FastPay  CommPay
1 1.9 jessema...@g... 1539 9 0.001 0.001 0.016 0.12 0.09 0.20
2 2.1 kimadagem 3732 2 0.002 0.000 0.014 0.05 -0.01 0.27
3 3.2 worry 2637 3 0.000 0.003 0.006 0.11 0.11 0.53
4 3.5 absin...@y... 1320 10 0.000 0.000 0.007 0.24 0.13 0.55
5 3.5 bigbytes 5236 1 0.001 0.000 0.007 0.20 0.04 0.54
6 4.0 surve...@h... 2488 5 0.001 0.001 0.008 0.32 0.29 0.34
7 6.4 shiver 1721 7 0.001 0.005 0.015 0.50 0.33 0.76
8 6.6 jaso...@h... 2100 6 0.001 0.004 0.070 0.41 0.27 0.83
9 10.9 Thom Burr 1594 8 0.002 0.013 0.030 0.87 0.84 0.92
10 11.0 Rosey 1313 11 0.004 0.009 0.022 0.81 0.81 0.85
11 12.4 NurseRachet (moderator) 1274 14 0.016 0.022 0.078 0.39 0.32 0.46
12 12.7 CaliBboy 1281 12 0.022 0.004 0.005 0.20 0.21 0.47
13 13.1 TdgEsaka 1234 15 0.015 0.016 0.029 0.57 0.40 0.73
14 13.4 ptosis 1278 13 0.009 0.039 0.034 0.80 0.78 0.73
15 17.2 jmbus...@h... 2539 4 0.003 0.170 0.020 0.99 0.98 0.92

From Table 6 we can see that in general the more prolific reviewers also tend to be higher ranked on the RPI with a few exceptions. One exception is "jmbus", despite being the fourth most prolific contributor he/she is ranked at the bottom of the top 15 contributors list. This is likely due to having the highest 1-Bomb rate of the index with 17% of reviews being 1Bombs. His/her reviews also seem to be almost entirely correlated with Pay as FairPay, FastPay, and CommPay are all correlated upwards of 90%.

Similarly, "jessema" though only being the 9th most prolific reviewer seems to have the highest quality of reviews (slightly ahead of "kimadagem") with very low Flag, 1Bomb, and 5Bomb rates as well as very low correlation between the scales Fair, Fast, and Comm with that of Pay. Interestingly, though both "Thom Burr" and "Rosey" have very high correlation rates between Pay and the other scales, because the have relatively low Flag, 1Bomb, and 5Bomb rates they are ranked near the middle.

Overall, except for a few exceptions, I am very impressed that the top contributors seem to score so well on the RPI index.

Table 7: The Top 100 most prolific contributors ranked based on the Reviewer Problem Index (RPI).
Rank  Index   Name Nrev  Rrank  Flag  1Bomb  5Bomb  FairPay  FastPay  CommPay
1 -0.13 seri...@g... 488 64 0.000 0.000 0.006 0.00 -0.05 0.00
2 1.67 james...@y... 365 98 0.000 0.000 0.000 0.29 0.00 0.18
3 1.72 donn...@o... 1064 23 0.001 0.000 0.006 0.04 0.04 0.27
4 1.85 jessema...@g... 1539 9 0.001 0.001 0.016 0.12 0.09 0.20
5 1.94 iwashere 689 44 0.003 0.000 0.017 0.00 0.05 0.12
6 2.03 kimadagem 3732 2 0.002 0.000 0.014 0.05 -0.01 0.27
7 2.06 mmhb...@y... 422 79 0.005 0.000 0.009 0.00 0.00 0.00
8 2.21 aristotle...@g... 579 51 0.002 0.000 0.010 0.10 0.11 0.19
9 2.90 Kafei 561 55 0.002 0.000 0.027 0.16 0.13 0.27
10 2.93 turtledove 1188 19 0.001 0.000 0.012 0.32 0.04 0.34
90 15.28 Anthony99 571 53 0.005 0.014 0.391 1.00 1.00 1.00
91 15.83 cwwi...@g... 543 57 0.011 0.070 0.026 0.84 0.85 0.84
92 16.25 rand...@g... 490 63 0.002 0.157 0.051 0.97 0.97 0.99
93 16.76 trudyh...@c... 378 95 0.008 0.140 0.056 0.87 0.84 0.80
94 16.79 jmbus...@h... 2539 4 0.003 0.170 0.020 0.99 0.98 0.92
95 17.30 hs 945 28 0.010 0.115 0.098 0.87 0.86 0.89
96 17.94 ChiefSweetums 691 43 0.010 0.185 0.054 0.68 0.68 0.81
97 21.49 Playa 414 85 0.010 0.239 0.014 0.93 0.90 1.00
98 31.56 Tribune 360 99 0.053 0.011 0.108 0.76 0.61 0.97
99 35.74 taintturk. (moderator) 1176 21 0.027 0.499 0.014 0.89 0.87 0.73
100 40.53 Taskmistress 698 42 0.017 0.755 0.020 0.91 0.91 0.96


Find the full list of Top 100 ranked here (Second Tab).

In Table 7 we can see how reviewers score on the RPI across all of the Top 100 reviewers. The Top 10 have great scores with SERI having the top ranked score with over 488 reviews written and no Flags or 1Bombs and only three 5Bombs. For SERI there is also no correlation between Fair or Comm with an amazingly negative correlation with Fast.

The worse 10 reviewers is much more interesting mostly due to tainturk a Turkopticon moderator and Tribune a former moderator being on the list. Everybody on the worse 10 list suffer from very high correlations between the other scales and Pay. Tainturk though also suffers from having 50% of his/her reviews being 1Bombs (for those reviews in which all of the scales were completed). This is not the worse as Taskmistress has 75% 1Bombs but this was surprising. Looking back at the early reviews I see that 1Bombs seem to be common earlier in Turkopticon and are intended to reflect a Amazon Terms of Service violation, something that has since been implemented.

Similarly Tibune has one of the highest flag count rates in the entire list with an expected numbe rof flags of 5% on his/her reviews. However, as Tribune was invited to be a moderator despite this spotted history, we can only assume that my rating system has some serious flaws.

Overall, I would therefore take the RPI ranking with a grain of salt. Perhaps some of the longer time contributors to Turkopticon are suffering from changing standard over time. If I have time I will revisit the rating system looking at only reviews within the last year or two.