Welcome, Guest. Please Login or Register.
May 18th, 2024, 5:09am

Home Home Help Help Search Search Members Members Login Login Register Register
Arimaa Forum « 2009 Arimaa Challenge »


   Arimaa Forum
   Arimaa
   Events
(Moderator: supersamu)
   2009 Arimaa Challenge
« Previous topic | Next topic »
Pages: 1 ... 5 6 7 8  Reply Reply Notify of replies Notify of replies Send Topic Send Topic Print Print
   Author  Topic: 2009 Arimaa Challenge  (Read 7812 times)
99of9
Forum Guru
*****




Gnobby's creator (player #314)

  toby_hudson  


Gender: male
Posts: 1413
Re: 2009 Arimaa Challenge
« Reply #90 on: Mar 25th, 2009, 8:10pm »
Quote Quote Modify Modify

on Mar 25th, 2009, 4:43pm, omar wrote:
Ron, from your ruling I am assuming that game 100123 (bot_clueless vs ChrisB) should also be continued. Please confirm.

No, that had to be scrapped and restarted because both bots were playing on the same server.  Ron's phrase "bot play was not tainted before the timeout" is the key to deciding which should be continued and which should be scrapped.  The games that were already scrapped including this one were either due to bots running on the same server, or  under the wrong time control.
 
The game ChrisB needs to continue is 100645.
« Last Edit: Mar 25th, 2009, 8:13pm by 99of9 » IP Logged
ChrisB
Forum Guru
*****



Arimaa player #2339

   


Gender: male
Posts: 147
Re: 2009 Arimaa Challenge
« Reply #91 on: Mar 25th, 2009, 9:42pm »
Quote Quote Modify Modify

on Mar 25th, 2009, 8:10pm, 99of9 wrote:

The game ChrisB needs to continue is 100645.

 
Yes, Omar and I already corresponded about continuing 100645, and the likely time for continuing that game will be this Friday at 0400 GMT (11 pm Thursday for me).  I also mentioned to Omar that I'm not sure whether the March 25 TD decision brings game 100123 back into play.  I see arguments both for and against continuing that game, so whatever Ned decides will be fine with me.
IP Logged

99of9
Forum Guru
*****




Gnobby's creator (player #314)

  toby_hudson  


Gender: male
Posts: 1413
Re: 2009 Arimaa Challenge
« Reply #92 on: Mar 26th, 2009, 12:53am »
Quote Quote Modify Modify

on Mar 25th, 2009, 9:42pm, ChrisB wrote:
 I also mentioned to Omar that I'm not sure whether the March 25 TD decision brings game 100123 back into play.

I very much doubt that that is what Ned meant, but am happy to wait for him to clarify.  Of course if that game came back into play, we might have to resurrect a whole bunch, some which have already been replayed!
IP Logged
RonWeasley
Forum Guru
*****




Harry's friend (Arimaa player #441)

   


Gender: male
Posts: 882
Re: 2009 Arimaa Challenge
« Reply #93 on: Mar 26th, 2009, 3:43am »
Quote Quote Modify Modify

on Mar 25th, 2009, 4:43pm, omar wrote:

Ron, from your ruling I am assuming that game 100123 (bot_clueless vs ChrisB) should also be continued. Please confirm.

 
No, not this one where multiple bots were running on the same server.  This game is nullified.
IP Logged
omar
Forum Guru
*****



Arimaa player #2

   


Gender: male
Posts: 1003
Re: 2009 Arimaa Challenge
« Reply #94 on: Mar 26th, 2009, 1:49pm »
Quote Quote Modify Modify

Thanks for the clarification.
IP Logged
RonWeasley
Forum Guru
*****




Harry's friend (Arimaa player #441)

   


Gender: male
Posts: 882
Re: 2009 Arimaa Challenge
« Reply #95 on: Mar 27th, 2009, 5:01am »
Quote Quote Modify Modify

on Mar 25th, 2009, 4:43pm, omar wrote:

The silver server is scheduled to be terminated at the end of this month. So after that time there will only be one server to play on. Also the official challenge match games are scheduled to begin on April 5th, so I will have to stop the screening on April 4th.

 
Noted.  I approve the end of screening to be April 4.
IP Logged
woh
Forum Guru
*****



Arimaa player #2128

   


Gender: male
Posts: 254
Re: 2009 Arimaa Challenge
« Reply #96 on: Mar 28th, 2009, 2:12am »
Quote Quote Modify Modify

Omar, the link in the gameroom to the bot scores page isn't working. I think you need to leave out the 's' at the end.
IP Logged

Arimabuff
Forum Guru
*****



Arimaa player #2764

   


Gender: male
Posts: 589
Re: 2009 Arimaa Challenge
« Reply #97 on: Mar 28th, 2009, 9:13am »
Quote Quote Modify Modify

on Mar 28th, 2009, 2:12am, woh wrote:
Omar, the link in the gameroom to the bot scores page isn't working. I think you need to leave out the 's' at the end.

 
True, when I remove the "s" by hand the link works.
IP Logged
aaaa
Forum Guru
*****



Arimaa player #958

   


Posts: 768
Re: 2009 Arimaa Challenge
« Reply #98 on: Mar 28th, 2009, 8:16pm »
Quote Quote Modify Modify

It's amazing to note that despite having made no less than 6(!) attempts to play a preliminary game, it now looks that I will have made no official contribution whatsoever! Still, considering the purposes of this screening, I think, even leaving it like that (and I'm not changing my mind here), I've done good if I do say so myself; with respect to giving the defenders games to study, six games, even including incomplete ones, should be gnarly, no? Especially since, as you'll see below, they unfolded quite diversely. Also, given the whole mess that has plagued my games, it's probably for the best that I won't be responsible for any differentiation of the two bots. Chalking that first infamous game against Clueless up to me getting back into Arimaa after a short hiatus, it's hard to identify the stronger (anti-human) bot of the two. If, however, GnoBot manages to qualify with exactly one point difference, it will still leave a bad taste in my mouth.
 
Since the multitude of my games seems to have caused a bit of a confusion here, here's a summary of them:
 
Game 99880: After inflicting massive (what I would call fatal) material damage, Clueless times out, at first seemingly because it was being too cavalier managing its time; the game is later voided due to a wrong time control system being in use. I came close to resigning that game and the advantages I enjoyed in the other timed-out games come nowhere close to Clueless's in this one. So I reject the criticism that I'm being inconsistent; it's a fact of life that a later resumption of a game will advantage the human player over the computer, so only playing out a game that's clearly won by the latter should not be dubious.
 
Game 99969: In a much more promising position for me, Clueless times out again. At first judged as a legitimate outcome, some time afterwards more timeouts point to an interfering cron job by the server as the culprit. Now ruled to be resumed.
 
Game 100048: I win against GnoBot in an entertaining and (more importantly) legitimate game.
 
Game 100135: I finally get a game against Clueless to reach its natural ending (with me successfully converting a horse frame), turns out the bot was sharing computer resources for some time. Game gets discounted (and unrated).
 
Game 100273: This time, I'm the one having my horse framed, in a game against GnoBot. I still manage to win eventually. Counts.
 
Game 100358: Clueless times out yet again, this time early on. It is ruled to be resumed due to a mysterious network lag. At the time, I enjoyed a reasonable advantage, which Bomb2005P2, I believe, evaluated as 2.97, no doubt due to the horse I'm framing. Aided by a substantial amount of (unfair) pondering, I'm somewhat more troubled by the position.
 
Tentatively, I could hypothesize that GnoBot is relatively strong in the opening (not in a small part due to its book) but has trouble following through on any advantage accrued in that stage, whereas Clueless seem to be more of an all-rounder.
IP Logged
Fritzlein
Forum Guru
*****



Arimaa player #706

   
Email

Gender: male
Posts: 5928
Re: 2009 Arimaa Challenge
« Reply #99 on: Mar 29th, 2009, 2:59pm »
Quote Quote Modify Modify

Aaaa, I'm sorry to hear that you are still declining to finish your clueless timeouts as requested by the TD, but thanks anyway for your six attempts at participation.  Although you haven't contributed to the score of the bots or helped choose between them, you have indeed shed some light on their strengths and weaknesses, which is one of the explicit purposes of the screening period.
 
Will you also decline to participate in any future screening period, because your decision to withdraw is not based on what is expedient at present, but rather based on the principle that the rules are inherently unfair?
« Last Edit: Mar 30th, 2009, 6:32pm by Fritzlein » IP Logged

omar
Forum Guru
*****



Arimaa player #2

   


Gender: male
Posts: 1003
Re: 2009 Arimaa Challenge
« Reply #100 on: Apr 3rd, 2009, 9:39am »
Quote Quote Modify Modify

Game 101769 between sforry and bot_clueless timed out today.
 
I think this game needs to be continued since a network problem caused the timeout.
 
http://gold.arimaa.com/~clueless/logs/3426.netLog
« Last Edit: Apr 3rd, 2009, 9:41am by omar » IP Logged
RonWeasley
Forum Guru
*****




Harry's friend (Arimaa player #441)

   


Gender: male
Posts: 882
Re: 2009 Arimaa Challenge
« Reply #101 on: Apr 3rd, 2009, 9:50am »
Quote Quote Modify Modify

on Apr 3rd, 2009, 9:39am, omar wrote:
Game 101769 between sforry and bot_clueless timed out today.
 
I think this game needs to be continued since a network problem caused the timeout.
 
http://gold.arimaa.com/~clueless/logs/3426.netLog

Yes.  Please continue the game.
IP Logged
99of9
Forum Guru
*****




Gnobby's creator (player #314)

  toby_hudson  


Gender: male
Posts: 1413
Re: 2009 Arimaa Challenge
« Reply #102 on: Apr 4th, 2009, 4:45am »
Quote Quote Modify Modify

Congratulations in advance to Clueless/Jeff for winning the screening.  I'll try to give you a bigger challenge next year Smiley.  After all the technical problems, I'm glad the bots ended up separated by a decent margin.  I'm also glad they both got a few wins in the end - the start of the screening was looking quite humiliating for both bots.
IP Logged
Fritzlein
Forum Guru
*****



Arimaa player #706

   
Email

Gender: male
Posts: 5928
Re: 2009 Arimaa Challenge
« Reply #103 on: Apr 4th, 2009, 6:36am »
Quote Quote Modify Modify

Sadly, the screening period closed on another invalid game.  Woh's epic 144-move win over GnoBot, lasting six hours and thirty-nine minutes, can't count because woh was assigned the wrong color.  By my count that makes nine invalidated games for the tournament: woh's wrong-color game, four timeouts that weren't completed, and four games that overlapped on the same server.
 
Fortunately we had such enthusiastic participation that many games were completed and the bots were clearly discriminated.  We had fifty-one valid games, including two server timeouts that were restarted and played to completion, so we beat last year's participation in spite of all the issues.  
 
2008 2009
---- ----
  19   21  Participants
  41   51  Valid games
  16   21  Valid Pairs
   7    7  Discriminating Pairs

 
Every participation statistic increased except for the number of discriminating pairs, i.e. the number of times a player beat one bot and lost to the other with the same color.  It may be that when the two bots are closer in strength, as they were this year, it is less likely for any pair of games to provide discrimination.  Last year bomb beat sharp 6-1 in the discriminating pairs; this year clueless beat GnoBot 5-2.
 
Clueless finished with a record of 15-10 in valid games, for a performance rating of 1941.  Clueless' performance was very consistent.  Other than splitting two games with woh (who is rated right near clueless' performance), Clueless defeated everyone rated 1910 or below while losing to everyone rated 2002 and above.  This seems to peg clueless' true strength (at this time control and hardware) in a rather narrow range.
 
GnoBot finished with a record of 12-14 in valid games, for a performance rating of 1791.  GnoBot's performance was much more erratic, with two wins against over-2000 players (The_Jeh and naveed) but also five losses to under-1900 players (Tuks, Simon x2, sforry, and joe).  One must wonder whether some of the holes in GnoBot's strategy could be plugged without undermining its strengths.
 
For comparison, during the 2008 screening, bomb's performance rating was 1918, and sharp's was 1576.  So this year the race was not only closer, the bar was higher.
 
All in all the screening period fully fulfilled its function.  The strengths and weaknesses of both bots were probed, giving the Challenge defenders much useful information.  The bot that would be easier for the defenders to defeat was eliminated.  This is precisely what is supposed to happen during the screening.
« Last Edit: Apr 4th, 2009, 6:39am by Fritzlein » IP Logged

aaaa
Forum Guru
*****



Arimaa player #958

   


Posts: 768
Re: 2009 Arimaa Challenge
« Reply #104 on: Apr 4th, 2009, 7:22am »
Quote Quote Modify Modify

on Apr 4th, 2009, 6:36am, Fritzlein wrote:
By my count that makes nine invalidated games for the tournament: woh's wrong-color game, four timeouts that weren't completed, and four games that overlapped on the same server.

You forgot about four games which didn't count because they used the wrong time control.
IP Logged
Pages: 1 ... 5 6 7 8  Reply Reply Notify of replies Notify of replies Send Topic Send Topic Print Print

« Previous topic | Next topic »

Arimaa Forum » Powered by YaBB 1 Gold - SP 1.3.1!
YaBB © 2000-2003. All Rights Reserved.