BadCosmonaut wrote:I'm guessing I quoted you before you edited your post.
No. If that were the case, you would see the
"last edited by [...]" footnote.
BadCosmonaut wrote:I think it's a little disingenuous
I never reply to ad-hominem stuff.
Just a quote from yourself to someone else (which apparently doesn't apply to yourself):
BadCosmonaut wrote:What practical result do you think can come from this conversation?
Also, it's rare where a worthwhile discussion happens after personal attacks/name calling.
I will reply to you regardless.
BadCosmonaut wrote:you edited your post then to act like I misquoted you...
Please stop your conspiracy theories, ok. I never said you misquoted me, let alone on purpose. You just missed my edit and I tried to point you to what you have missed, alright.
BadCosmonaut wrote:I think it actually does handle skewed distributions fundamentally different than the old system.
What practical result do you think can come from this conversation? You keep on repeating this on and on and don't seem willing to try to think through my explanations at all. Maybe we should just leave it at that. You think, that I'm talking nonsense and I think that there are misconceptions in your reasoning.
This discussion has gotten off its rails and is becoming a distraction.
We should either stop it or concentrate on the facts, which I'll try one more time below.
BadCosmonaut wrote:The problem of skewed distribution was that the old system would merge (into the same tier) movies that were very differently rated in order to squeeze them into the same 10% chunk.
I'm sorry, but we have been over this a couple of times already. I cannot explain it any better than I already did.
Nobody and nothing tries to "squeeze" anything. It's just math.
Simply put:
The old tier system was a 10% percentile system.
The new system is a 1% percentile system.
You get finer granularity, and thus higher accuracy. But that's really it. Other than that, it's the same system.
mpowell even said as much:
both systems can put very high scores into very low tiers or percentiles. Have you even looked at my examples above??
e.g:
• mrkwst22: 80 = 10%
• dushanj: 52 = 3%
Both of these scores would very likely be in tier 1 (in the old system). Now they are in the 3% and 10% percentile (corresponding to tier 0.3 and tier 1, if you will). Why are you unable to see, that you get finer granularity (1% steps instead of 10% steps), but both systems deal with skewed distributions in the very same manner. If you don't see that by now, I'll give up. Someone else will have to jump in.
BadCosmonaut wrote: For example, if a user had just 5 (out of 1,000) movies rated below a 50, then the old system would probably put all those movies into the same Tier 1. This could be a problem since those 5 movies could have been very differently rated. Like the spread could be 1, 10, 20, 30, 49. That's a huge gap between those scores to put them all into one tier. Yet that's what would happen under the old system. I don't think that problem can happen under the new system, since every single different rating possible will have its own tier.
Not only do you still fail to see, that fabiovisnadi, AFlickering, cke and myself were discussing something else entirely, what you say (
"every single different rating possible will have its own tier") is just plain wrong:
• If you have > 100 ratings, than some ratings obviously have to share the same percentile.
• And no,
"every single different rating possible will have its own tier" would only be possible if 1 = 1%*, 2 = 2%, 3 = 3% and so on ... but if you look at my long ranking list above, you'll see that this very obviously not the case.
(*actually 1 = 0%, but I'll try to keep it simple)
BadCosmonaut wrote: Using my example above, the movie rated 1 would have its own tier. The movie rated 10 would have its own. 20 would have its own. 30 its own. And 49 its own. Since there are 100 possible tiers, each of those would get their own. They wouldn't be merged into one.
That's also wrong.
The user in your example has 1000 rankings: 1, 10, 20, 30, 49 and 995 higher rankings.
In the old tier system rankings 1-49 (1, 10, 20, 30, 49) all fall into tier 1.
In the new percentile system rankings 1-49 (1, 10, 20, 30, 49) all fall into the 0% percentile.
BadCosmonaut wrote: The question that remains then is do any of the 100 tiers merge into something like the 10% chunks for purposes of comparison. If so, then there very well could be a problem with the new system, but I haven't seen an example of this yet, and I'm not sure if only MPowell could answer this.
I have offered you repeated demonstrations to the contrary, which you all ignore. Yes, go ahead and ask mpowell, since you don't take it from me.
If you don't trust my maths, why don't you just verify it by recalculating ?
Since you don't take it from me, maybe you'll take it from an actual user:
Matthias99:
https://www.criticker.com/profile/Matthias99total ratings:
7308x score of 30 ==> in 2% percentile
1x score of 40 ==>
also in 2% percentile
The worst part is, that you are not only wrong, but completely missing the point.
In fact, we didn't even discuss that matter, but something else entirely. Nevermind.
BadCosmonaut wrote:I'm curious, do you know of any users who have no ratings below 50 and who also consider 50 to be mediocre or above?
Yes, quite a few. But it gets even better, this user rates from 87-100:
https://www.criticker.com/rankings/getshort78BadCosmonaut wrote:you said:
livelove wrote:Ok, but that is not what the above discussion was about.
What was discussed was most users watch only films they like and thus only rate films they like, often leading to rating scales ranging e.g. between 50-100 with no ratings below 50.
You know what, I'll take that back. You *DO* misquote me on purpose. The first time it could have been an oversight on your behalf. But afterwards I told you, and now you misquote the very same thing again. I said
"no OR very few ratings below 50"BadCosmonaut wrote:livelove wrote:Ok, but that is not what the above discussion was about. What was discussed was most users watch only films they like and thus only rate films they like, often leading to rating scales ranging e.g. between 50-100 with no ratings below 50.
Since you said most users have no ratings below 50, I asked for an example.
I
also never said that! Please read carefully. I said
"most users watch only films they like and thus only rate films they like, often leading to [skewed] rating scales e.g. ..." e.g. means example given, which means that I give 1 specific example as a random example for a skewed rating scale. I never said that most users have no ratings below 50. Big difference!
And ... Jesus! Again: I didn't say
"no ratings below 50", I said
"... or very few". How many times are you actually going to repeat that incorrectly on purpose !?
Just to let you know: If you misquote me again, I'll stop replying.
And I already gave you plenty of examples, prior to this post as well as in this post (e.g. user getshort78 above).
BadCosmonaut wrote:I'm curious how many though, since you also said most users are like that.
Repeating it doesn't make it true.
BadCosmonaut wrote:most users are not like that. Most users do have some low ratings.
If you keep on misquoting me, that's not my fault. I said
"... or very few ratings below 50" which is equal to
"some low ratings".
BadCosmonaut wrote:Specifically, most users have ratings that cover the full spectrum of their personal scale.
you are just guessing
BadCosmonaut wrote:If this is true, then I'm still not seeing the problem with the current system other than the potential problem I discussed above.
Jesus! Nobody (definitely not me at any rate) said there were a problem!
We were all very nuanced in our discussion. We just mused aloud about the challenge of imbalanced voting distributions. mpowell will be the first to confirm that this is indeed a challenge and that there is no perfect solution for this and that he probably has found the best one of all imperfect solutions.
BadCosmonaut wrote:I'm trying to understand what you're saying the problem is
Again, we never said the new system has a problem (since it works just like the old one, just with higher precision and thus higher prediction accuracy). We wondered out aloud how to deal with users who mostly watch films they like (fabiovisnadi estimates that ~80% of films are ranked this way).
BadCosmonaut wrote:preferably with a specific example.
I already gave you plenty. Here is one of them:
user:
getshort78 ranks only films he likes.
His 121 rankings range from 87-100.
His worst ranked film, Sleepy Hollow, has a score of 87 which translates to 1% percentile.
How meaningful is it to assign such a low percentage value (1%) to such a high score (87) ?
Particularly when the user obviously thinks Sleepy Hollow is a very good movie.
Heck, he even says exactly that in his mini-review:
"Very good movie."The old tier system puts his score-87-film into his tier 1.
The new percentile system puts his score-87-film into the 1% percentile.
That goes to show, that both systems work in the same manner in that regard.
What the system does, is to "normalize" / "unskew" / "distribute evenly" the scores across the tiers/percentiles as best as the granularity (10 vs 100) permits, in an attempt to translate user'A scores into
"what user A really wanted to say IN THE LANGUAGE OF USER B (i.e. in the context of user B's voting scheme)".
Now the question is whether getshort78's score of 87 for Sleepy Hollow "really means" 1% (as Criticker assumes) or really means "a very good movie with a score of 87" as the user himself says.
THAT's what was up for debate (between the 3 above-mentioned users and myself).
The problem is, that you can find (many) examples where "what Criticker assumes" really reflects better what the user actually tries to express by his scores (e.g. score of 58, 3% percentile, and user says it's a very bad movie).
The challenge is to find a method that works in BOTH cases.