Welcome, Guest. Please Login or Register.
Nov 22nd, 2024, 7:12am

Home Home Help Help Search Search Members Members Login Login Register Register
Arimaa Forum « Ratings: quick fix for intransitivity »


   Arimaa Forum
   Arimaa
   General Discussion
(Moderator: supersamu)
   Ratings: quick fix for intransitivity
« Previous topic | Next topic »
Pages: 1  Reply Reply Notify of replies Notify of replies Send Topic Send Topic Print Print
   Author  Topic: Ratings: quick fix for intransitivity  (Read 1881 times)
Fritzlein
Forum Guru
*****



Arimaa player #706

   
Email

Gender: male
Posts: 5928
Ratings: quick fix for intransitivity
« on: Feb 22nd, 2006, 4:36pm »
Quote Quote Modify Modify

In another thread we discussed how to reduce the ratings distortion caused by someone picking a single opponent (particularly a bot) and playing that opponent over and over.  We tossed around some complicated formulas that may some day be implemented, but here's an easy band-aid:
 
Whenever two players sit down to a game, scan the last 50 rated games of each.  If either player has the other player 7 or more times in their last 50 rated games, then the game automatically converts to unrated.  In other words, you can play a given opponent all you like, but no more than 14% of your rating can be based on playing that opponent.
 
We might want to make an exception for both bots and humans 1500 or under, but once your rating starts to climb, you should have to start mixing it up between a variety of opponents for the games to count in your ratings.
 
What do you think?
IP Logged

jdb
Forum Guru
*****



Arimaa player #214

   


Gender: male
Posts: 682
Re: Ratings: quick fix for intransitivity
« Reply #1 on: Feb 22nd, 2006, 5:28pm »
Quote Quote Modify Modify

Nice idea.
 
Maybe tweak it a bit, only make the game unrated, if one side is winning a "large" percentage of the games. If the last 7 games are split 4 to 3, then it would be ok to make the game rated. If the last 7 are 7-0, then play unrated.
IP Logged
PMertens
Forum Guru
*****



Arimaa player #692

   
WWW

Gender: male
Posts: 437
Re: Ratings: quick fix for intransitivity
« Reply #2 on: Feb 22nd, 2006, 6:09pm »
Quote Quote Modify Modify

Imagine the possibility, that a total noob is playing only against a top-player and gets personal training ... his rating will not go up ?
In fact his rating would first go down and then freeze even when he is starting to win once in a while ...
I do not think that "rating" should be an incentive to play in a certain way - we got PotM for that ...
If I decide in the future to play only against Top 2000 players, than that might be a loss for the community but certainly not for my quality of play.
Since we do not really have that many Top2000 ... plenty of games would be lost ...
 
IP Logged
omar
Forum Guru
*****



Arimaa player #2

   


Gender: male
Posts: 1003
Re: Ratings: quick fix for intransitivity
« Reply #3 on: Feb 22nd, 2006, 7:16pm »
Quote Quote Modify Modify

You would really have to try it out to know how this effects the whole system and how much it fixes the problem. Plus you would want to try it with different constants. We can't just make changes to the system based on our mental simulations. But we can use mental simulations to guide us in deciding which actual simualtions to try.
« Last Edit: Feb 22nd, 2006, 7:19pm by omar » IP Logged
frostlad
Forum Senior Member
****



Arimaa player #1704

   


Gender: male
Posts: 46
Re: Ratings: quick fix for intransitivity
« Reply #4 on: Feb 22nd, 2006, 8:20pm »
Quote Quote Modify Modify

How many cases have there been of a player playing one opponent over and over to boost their ratings? I am just wondering as it seems like a lot of hassle to fix a problem if it doesn't really happen.
 
If you are looking at really fixing it I like the idea of looking at the last few times the two opponents have played together and let the game stand if the winning percentage isn't grossly tilted in one players favor.
IP Logged
Ryan_Cable
Forum Guru
*****



Arimaa player #951

   


Gender: male
Posts: 138
Re: Ratings: quick fix for intransitivity
« Reply #5 on: Feb 23rd, 2006, 12:19am »
Quote Quote Modify Modify

Personally, I don't want to do something that is such a hack unless we have run out of other options.  Moreover, I don't think that this will really fix the problem.
 
Currently there are seven bots rated above 1700.  5 are Bombs and 2 are Cluelesses.  I could beat any of these bots 95% of the time, if I focused on it.  Except, perhaps, for the Blitz bots, I think the entire bot ladder is mostly consistent with me being rated 2300+.  There is no significant intransitivity between me and the bots.
 
However, intransitivity becomes very important when we add top humans to the picture.  I am 1-22 against 99of9, Fritzlein, robinson, and Adanac, and 2-29 when you add in PMertens and blue22.  I have probably gained in skill since a few of those losses, but still it seems like some, if not all, of these people should be 400+ above me.  However, it doesn't seem reasonable to have the top human 900+ above the top bot.  To some degree, I think this is less intransitivity than just the fact that the sigmoid curve (in the ELO formula) is an imperfect model of skill distribution.
 
The upshot of this is that the key to rating inflation is not selection of opponents but rejection of them.  The secret to getting a high rating is to lock your invite flag in the off position and avoid skilled humans like the plague.  The only real way I see to solve this is to remove bots from the rating system (which I think would be a terrible idea for other reasons).  Anything that only degrades the importance of bot bashing is likely to just make skilled humans more threatening to a highly ratings-conscious player.
 
I think we should create a secondary rating system for use in selecting rankings for tournaments and other such things.  Something simple like the 20-game human performance rating or the 90-day human performance rating would probably work reasonably well.  Flaws like having a win cause a drop in rating or having ratings changing without any playing would be easier to overlook in a secondary system.
 
on Feb 22nd, 2006, 8:20pm, frostlad wrote:
How many cases have there been of a player playing one opponent over and over to boost their ratings? I am just wondering as it seems like a lot of hassle to fix a problem if it doesn't really happen.

Arimanator and I both took advantage of the bots to inflate our ratings.  When I first played Bomb2005Blitz, I was rated 109 points below it.  I used the Bait and Tackle strategy just to give myself a fighting chance at 15s per move.  However, I soon discovered that Bomb could be induced to maneuver its E back against the edge of the board into a crushing blockade.  With this discovery, Bomb became a much weaker bot and therefore became highly overrated.  I then transferred this overratedness to myself by playing many games against BombBlitz and BombFast.  I managed to push myself all the way to 2120, very briefly being the second highest rated human in the world.
 
In part, I did it just for the experience of playing and winning a bunch of Blitz games against a fairly skilled opponent.  In fact, I think I gained a surprising amount of tactical skill, even though I was playing approximately the same game every time.  I think I am currently able to beat BombBlitz without using the Bait and Tackle 80% of the time, whereas before all of the bot bashing, I doubt I could have beaten it 20% of the time.
 
In part, I did it out of an obsessive desire to see how far I could inflate my rating.  And in part, I did it to get a high ranking (3) in the World Championship, to help me achieve my goal of winning 2+ games (I won exactly 2).  It is this last item that makes it somewhat important to limit the possibilities for rating inflation.
IP Logged
Pages: 1  Reply Reply Notify of replies Notify of replies Send Topic Send Topic Print Print

« Previous topic | Next topic »

Arimaa Forum » Powered by YaBB 1 Gold - SP 1.3.1!
YaBB © 2000-2003. All Rights Reserved.