1. Standard memberRagnorak
    For RHP addons...
    tinyurl.com/yssp6g
    Joined
    16 Mar '04
    Moves
    15013
    04 Jul '06 12:341 edit
    Ok, so I finally got my head around the formula. For the new players on the rise, we should err on the side of caution, remembering that all non-banded tournaments are open to everybody.

    So, say our New Player's highest opponents were 1600s players.

    Count number of 1600s players played = GameCount (say 10)
    Get Average opponent rating of these games, AvOppRating (say 1650)
    Get record against these players, ie: won 9 out of 10 = 90%

    Insert AvOppRating as OppRating in Win Expectancy Formula.

    So the formula would look like:
    0.90= 1/(10^((1650-NewPlayerRating)/400)+1)
    That implies that NewPlayerRating would be guesstimated for banding purposes = 2033.

    So New Players band would be 2000-2099.

    If New Player won 70% of his games against players with an average rating of 1650, then his projected rating would work out as about 1850.

    This formula would need to be applied until New Players graph has levelled out somewhat. I haven't thought too much about this aspect, but I'm sure its fairly easy to ascertain when a player is pretty much playing at his/her rating.

    D
  2. Standard memberXanthosNZ
    Cancerous Bus Crash
    p^2.sin(phi)
    Joined
    06 Sep '04
    Moves
    25076
    04 Jul '06 12:411 edit
    If we input Win Expectancy as 1 (or whatever record our new player holds against his highest rated opponents, then we might get a good indication of their rating. For some reason, I can't seem to get my head around the equation this morning.

    I don't blame you for not wanting to rearrange the formula. It's annoying.

    It becomes this:

    OpponentsRating - 400 * (log (1 - WinExp) - log (WinExp)) = YourRating

    Now this doesn't work if you have a WinExp of 1 as log 0 doesn't exist. But it does give something of an estimate.
    For example a player who has in the past shown a Win Expectancy of 0.99 against 1500 players would have an estimated skill level (not rating) of 2298.
    Of course this could be horribly inaccurate however as the Win Expectancy approaches 0.5 (from either direction) the accuracy will increase.

    EDIT: Yeah, I get the same banding as you do in your examples.
  3. Joined
    11 Nov '05
    Moves
    43938
    04 Jul '06 13:181 edit
    Question:
    How many rating points gets the lowest rated player at RHP to the highest rated player if he wins? How many rating points is it possible to lose or win at maximum?

    In real life noone can lose more than 31 points and noone cannot win more than 31 points. Does the equation used at RHP say that too?
  4. Standard memberXanthosNZ
    Cancerous Bus Crash
    p^2.sin(phi)
    Joined
    06 Sep '04
    Moves
    25076
    04 Jul '06 14:09
    Originally posted by FabianFnas
    Question:
    How many rating points gets the lowest rated player at RHP to the highest rated player if he wins? How many rating points is it possible to lose or win at maximum?

    In real life noone can lose more than 31 points and noone cannot win more than 31 points. Does the equation used at RHP say that too?
    If you actually looked at the rating formula you'd know the answer. The most points you can win or lose in a single game is the K value your rating confers (32,24 or 16).
  5. Joined
    11 Nov '05
    Moves
    43938
    04 Jul '06 14:30
    Originally posted by XanthosNZ
    If you actually looked at the rating formula you'd know the answer. The most points you can win or lose in a single game is the K value your rating confers (32,24 or 16).
    Does that mean that RHP does not obey the rating calculations in tournament OTH games?
    I don't think anyone can gain 32 pints, only 31.
    When you lose a game, you will always lose at least one rating point, no matter the difference in ratings.

    Correct me if I'm wrong. (But don't mind my spelling...)
  6. Standard memberXanthosNZ
    Cancerous Bus Crash
    p^2.sin(phi)
    Joined
    06 Sep '04
    Moves
    25076
    04 Jul '06 14:48
    Originally posted by FabianFnas
    Does that mean that RHP does not obey the rating calculations in tournament OTH games?
    I don't think anyone can gain 32 pints, only 31.
    When you lose a game, you will always lose at least one rating point, no matter the difference in ratings.

    Correct me if I'm wrong. (But don't mind my spelling...)
    You're wrong. Topalov will not gain any points from beating a 1600 rated player and the 1600 player will not lose any points either.

    http://en.wikipedia.org/wiki/ELO_rating_system
  7. Standard memberRagnorak
    For RHP addons...
    tinyurl.com/yssp6g
    Joined
    16 Mar '04
    Moves
    15013
    04 Jul '06 14:49
    Originally posted by FabianFnas
    Does that mean that RHP does not obey the rating calculations in tournament OTH games?
    I don't think anyone can gain 32 pints, only 31.
    When you lose a game, you will always lose at least one rating point, no matter the difference in ratings.

    Correct me if I'm wrong. (But don't mind my spelling...)
    Please take this to the help forum, as I don't see the relevance to the Banded Tournament problem.

    D
  8. Standard memberMarinkatomb
    wotagr8game
    tbc
    Joined
    18 Feb '04
    Moves
    61941
    04 Jul '06 17:44
    Originally posted by Ragnorak
    Ok, so I finally got my head around the formula. For the new players on the rise, we should err on the side of caution, remembering that all non-banded tournaments are open to everybody.

    So, say our New Player's highest opponents were 1600s players.

    Count number of 1600s players played = GameCount (say 10)
    Get Average opponent rating of these game ...[text shortened]... ure its fairly easy to ascertain when a player is pretty much playing at his/her rating.

    D
    Hmm, you are definately going in the right direction! Perhaps this calculation should be used with all players who have completed less than 50 games? Well thought out, rec for you! 🙂
  9. Standard memberMarinkatomb
    wotagr8game
    tbc
    Joined
    18 Feb '04
    Moves
    61941
    04 Jul '06 23:30
    Originally posted by David Tebb
    Ragnorak's suggestion of a rating floor based on the player's highest average of 10 (or more) continuous games is excellent.

    I also agree with his comments about making the system more automated. Tournament mods are a step in the wrong direction.

    However, I disagree with his other suggestion for dealing with new players. IMO, provisional players s ...[text shortened]... eted at least 20 games, there just isn't enough information to assess their true strength.
    Why just the top band? Wouldn't that completely cripple this band? How many 2100 players want to play provisional players? None of them do!
  10. Joined
    07 Jun '05
    Moves
    5301
    05 Jul '06 22:57
    Originally posted by marinakatomb
    Hmm, you are definately going in the right direction! Perhaps this calculation should be used with all players who have completed less than 50 games? Well thought out, rec for you! 🙂
    Why not just use the most probable rating, over as many games as you want? This bit of perl is close to what is needed:

    $min= 500;
    $best_guess=0;

    for ($i = 50; $i < 2500; $i++) {
    $sum=0;
    for ($j=0; $j< @opponents; $j++) {
    $result =eval_rating($i-$opponents[$j])-$results[$j];
    $sum += $result * $result;
    }
    print "$i, $sum\n";
    if($sum
  11. Standard memberRagnorak
    For RHP addons...
    tinyurl.com/yssp6g
    Joined
    16 Mar '04
    Moves
    15013
    06 Jul '06 00:12
    Originally posted by gezza
    Why not just use the most probable rating, over as many games as you want? This bit of perl is close to what is needed:

    $min= 500;
    $best_guess=0;

    for ($i = 50; $i < 2500; $i++) {
    $sum=0;
    for ($j=0; $j< @opponents; $j++) {
    $result =eval_rating($i-$opponents[$j])-$results[$j];
    $sum += $result * $result;
    }
    print "$i, $sum\n";
    if($sum
    Pseudocode would be much better.

    D
  12. Standard memberXanthosNZ
    Cancerous Bus Crash
    p^2.sin(phi)
    Joined
    06 Sep '04
    Moves
    25076
    06 Jul '06 01:33
    Originally posted by Ragnorak
    Pseudocode would be much better.

    D
    It would be although it's not that hard to understand even with no background directly in perl with a bit of puzzling. It would also be nice if RHP hadn't cut off the code halfway through due to a < or > followed by a character.
  13. Standard memberMarinkatomb
    wotagr8game
    tbc
    Joined
    18 Feb '04
    Moves
    61941
    06 Jul '06 10:18
    Originally posted by gezza
    Why not just use the most probable rating, over as many games as you want? This bit of perl is close to what is needed:

    $min= 500;
    $best_guess=0;

    for ($i = 50; $i < 2500; $i++) {
    $sum=0;
    for ($j=0; $j< @opponents; $j++) {
    $result =eval_rating($i-$opponents[$j])-$results[$j];
    $sum += $result * $result;
    }
    print "$i, $sum\n";
    if($sum
    Slight problem, often new users play a lot of provisional players when they arrive. What if a 2200 strength player played their first 20 games against p1200's?
  14. Standard memberRagnorak
    For RHP addons...
    tinyurl.com/yssp6g
    Joined
    16 Mar '04
    Moves
    15013
    06 Jul '06 14:281 edit
    Originally posted by marinakatomb
    Slight problem, often new users play a lot of provisional players when they arrive. What if a 2200 strength player played their first 20 games against p1200's?
    I think it would be unusual for a player's first 20 games to be against p1200s, but I guess it happens the odd time.

    Assuming, he wins them all, then once he is no longer provisional, his rating would be 1400 (average rating of all games as per current provisional system), and his projected skill level according to my above formula would be roughly 2200.

    Its highly unlikely that he'd meet 5 players better than 2200 all rated at p1200, but assuming this worst case scenario, where he only won 75% of the games, then his skill level would be estimated as 1600. The chances of this worst case scenario would be astronomical.

    In reality, he might encounter 1 or 2 players better than him while they are all rated at p1200. Assuming 1 defeat, then his skill level would be projected to be around 2000, 2 defeats and the projected skill level would be around 1800.

    I think the above scenarios are acceptable.

    D
  15. Joined
    07 Jun '05
    Moves
    5301
    06 Jul '06 18:202 edits
    Originally posted by Ragnorak
    Pseudocode would be much better.

    D
    Drat. Sorry that the script was cut off (I'll go back and edit it). I was not paying enough attention last night. Pseudocode follows, although most of you will have figured it out:

    for ( min_rating to max rating )
    for ( all opponents )
    expected_result = function ( opponents rating, test rating )
    error = expected_result - actual_result
    sum_of_squares = sum_of_squares + (error * error)
    end for loop
    plot sum_of_squares
    end for loop

    So for each rating, calculate the square of the difference between actual and expected results for all oppponents. Plot a graph of this against the rating. Take the minimum of the graph, and that gives you a least squares fit (evidently).
    The only real issue is that draws need to be counted as a loss followed by a win (can't remember why ATM).

    If a 2000 ability player plays twenty 1200 rated players, he ought to (80% chance) win all the games. As I count it, a 1781 player has a 50-50 chance of winning all his games against twenty 1200 rated players, but feel free to count up and correct that. If someone loses a game, the script gives a fair estimate of their performance.

    If someone wins all their games, the script above does not work. Winning all your games really just puts a lower limit on their strength, not much more. From earlier in the thread:
    by XanthosNZ
    The trend eventually stops at 2169 after 253 total games.
    This is consistant. The results from someone winning straight games against 1500 rated players is telling you nothing new - their rating is well above 1500, and they should pick on someone their own size.

    Gezza
Back to Top

Cookies help us deliver our Services. By using our Services or clicking I agree, you agree to our use of cookies. Learn More.I Agree