FilterHN

shric

1 year ago

[-]

The creator admits it early on -- it's measuring rarity based on the specific notation everyone uses, which greatly influences the classification of rarity.

Fundamentally all chess moves are a piece moving from one source to another destination including:

- castling as a king move with a distance greater than 1

- pawn moves to the 8th or 1st rank with the additional datum of a new piece

- en passant is the same as a regular pawn capture, it just requires the victim pawn to have moved two squares previously.

Algebraic notation also has an arbitrary and reasonable amount of extraneous detail despite dropping the source location if it's unambiguous.

For example, the captures (x), check(+) and checkmate(#) symbols are all unnecessary given the previous state of the board is always known. With en passant it's also unnecessary to have a special symbol indicating an en passant capture, and indeed there isn't one.

I was initially hoping to get some insight on e.g. which pairs of squares had the fewest moves for a given piece etc.

That being said, I thoroughly enjoyed the video. It was beautifully illustrated and explained everything clearly.

jerf

1 year ago

[-]

"I was initially hoping to get some insight on e.g. which pairs of squares had the fewest moves for a given piece etc."

This may not be quite what you were asking for, but it's close, and has the advantage that I can link it right now. Tom7's Elo World chess video has where pieces start and end up, and their survival rates, as a chart: https://youtu.be/DpXy041BIlA?si=Zdh6Rh6mekatp2-q&t=815

Retric

1 year ago

[-]

He did find a move that occurred a single time including the specific game that included it. He also showed many moves that occurred zero times from every single game played on lichess.org.

So, depending on your definition either could reasonably qualify. Which you pick as the rarest is simply an arbitrary definition.

You could consider different notions, but run into the issue of defining what is unambiguous. IE You could say e2 to e4 is unambiguous for a given game state but that would imply game state must be included in the definition for of a move. Defining what the minimum game state is for an unambiguous move would be a video of its own.

zarzavat

1 year ago

[-]

What occurred once is not what chess programmers would call a “move”, but rather what chess players would call “move”.

His definition of a move is one ply of algebraic notation. From a chess programming perspective algebraic notation is just a data format and doesn’t have any greater significance.

In programming terms a move is a data structure that allows you to derive one position from another according to the rules of chess.

In Stockfish a move is a 14 bit number, the first 5 bits are the destination square, the next 5 bits are the origin square, the next 2 bits are the promotion piece, and the last 2 bits are the move type (normal, promotion, en passant or castle)

https://github.com/official-stockfish/Stockfish/blob/master/...

maverwa

1 year ago

[-]

I was wondering on how Stockfish encodes the destination and origin square in 5 bits each. I think they don't, at least the code on github uses 6 bits each, which actually gives you 64 possible values, so works out fine:

Thats 16 bits. Thank you for sending me down that (small) rabbit hole!

zarzavat

1 year ago

[-]

Yes you are correct, I had 2^5 = 64 in my head for some reason.

Retric

1 year ago

[-]

Comparing games with that notation alone would make moving your queen e2-e3 in game A the same as moving a pawn e2-e3 in game 2.

I don’t think anyone would actually agree those are the same move.

What Stockfish considers a move includes the full board state, it simply doesn’t need to pass that information around internally. Thus removing that ambiguity but means there’s a great number of moves that have only occurred once.

Also, it’s 6bit + 6bit + 2bit + 2bit = 16 bits which isn’t an arbitrary number. There’s no need to actually encode that something is a promotion because it can be inferred from the board state, but there’s zero cost to pass an extra bit around so it’s included anyway.

anikan_vader

1 year ago

[-]

Some sources do write ep after en passant captures. As you point out, it’s no more redundant than notating checks.

Sesse__

1 year ago

[-]

Notating checks is not even redundant; it can disambiguate which piece is to move without additional information (e.g. Rac1 and Rhc1; only one of them might give a discovered check, so Rc1+ could then be an unambiguous notation where the check is not redundant). The PGN spec is clear that SAN disambiguates legal moves and not pieces (if moving one of those rooks would put yourself in check, you should not disambiguate when you move the other one), but I don't know whether it considers the check part of the move for those purposes.

1 year ago

[-]

I see what you mean obviously, but neither of those moves could possibly give a discovered check, right? If the rook starts in the corner of the board, nothing can hide behind it or attack from behind it.

Sesse__

1 year ago

[-]

Point, I should have written Rbc2, not Rac1.

stevage

1 year ago

[-]

It'd be a particularly cool position if, say these three moves were legal and distinct:

- Re4

- Re4+

- Re4#

EDIT

Ok, this seems to do it:

8/3Q4/4R3/6pp/2R2Pk1/6P1/4R1K1/3B4 w - - 0 1

Annoyingly, Lichess uses the rank or file notation in all cases.

Sesse__

1 year ago

[-]

I looked up; PGN 8.2.3.5 says:

> Neither the appearance nor the absence of either a check or checkmating indicator is used for disambiguation purposes. This means that if two (or more) pieces of the same type can move to the same square the differences in checking status of the moves does not allieviate the need for the standard rank and file disabiguation described above. (Note that a difference in checking status for the above may occur only in the case of a discovered check.)

1 year ago

[-]

Right, you'd need to look at board state transitions as opposed to move notation.

I'd imagine remarkably foolish moves from board states that only quite sophisticated users would get to would be up there

SamBam

1 year ago

[-]

Presumably there are a mind-bogglingly-huge number of unique board state transitions. It's virtually impossible for the same game of chess to be played twice, except for silly scholar's-mate type games. Almost every single game in a chess database will have many unique board state transitions.

1 year ago

[-]

I bet it's way less than you think - orders of magnitude.

What could happen versus what does happen are entirely different.

Doing some algebraic permutations computation here would be like claiming a 50,000 letter English document has 27^50,000 possibilities.

I mean no, there's words, and they only go in certain orders and there's all these rules.

Here's another approach: humans are pretty lousy when remembering large amounts of anything so let's say there's in practice, only been 100,000 unique games played over and over. Without the help of a computer or careful tabulation, I'm pretty sure no human would realize it because no human can remember 100,000 unique games.

Anyways, it's worth digging into the data to see what the variation really is. I bet the 90th percentile is embarrassingly small with a long tail that's far shorter then most think

edit: so I actually took 7.7 million games from https://www.ficsgames.org/download.html and did some basic processing on them. These are people ostensibly with ELOs over 2000 which is pretty decent just to see if I'll eat crow on this one.

Going in, I was expecting a uniqueness level to be something like 50-70%. Actual percentage of unique games over 7.7 million? 98.7%.

Alright fine.

Although I could try to do 1 billion games, I expected the distribution to be readily visible around 7 million.

Now as an artifact of the data, I made the games as compact as possible, potentially leading to ambiguity maybe. So a game might look like so

    a3a5b3b6Bb2Bb7c4e6Nc3Nf6d4Be7Nf3O-Oe3c5d5exd5Nxd5d6Nxe7+Qxe7Bd3d5O-Odxc4Bxc4Nc6Qe2Rad8Rad1Ne4h3Nd6Ba6Bxa6Qxa6Qb7Qd3f6Qc3Rfe8Rd3Ne4Qc4+Kh8Rxd8Nxd8Rd1Qe7Qd5Nf7b4axb4axb4cxb4Qc6h6Rd7Qf8Qxb6Rd8Rxd8Qxd8Qxb4Qe8Bd4Kg8Qc4Nd6Qd5Kh8Nh4Kh7Qb3Qe4f3Qe6Qd3+f5e4g6exf5Nxf5Nxf5Qxf5Qxf5gxf5Bf2Kg6g4fxg4hxg4h5Kg2hxg4fxg4Ne5Kg3Nf7Kh4Ne5Be3Nxg41/2-1/2

Given this we can just run uniq with incrementing numbers and find out how things increase. I'm doing this on a pretty old laptop (3rd gen intel) so excuse me for cutting things a bit short

number of characters / unique entries / percentage duplicates

         99 7470034 0.039416039621657628
         98 7462496 0.040385363441781119
         97 7454437 0.041421683508957363
         96 7446241 0.042475620631500677
         95 7437517 0.043597454142611958
         94 7428583 0.044746291899176449
         93 7419379 0.045929849399894973
         92 7409325 0.047222709798876217
         91 7399016 0.048548361067336399
         90 7388091 0.049953224789125783
         89 7376939 0.051387278814333581
         88 7364995 0.052923177422393386
         87 7352785 0.054493281408027117
         86 7340118 0.056122151775432672
         85 7326671 0.057851323625950024
         84 7312421 0.059683754567414482
         83 7297272 0.061631789397747494
         82 7281670 0.063638076243272224
         80 7247260 0.068062914748240111
         79 7228770 0.07044057426456829
         78 7209233 0.072952869233227302
         77 7187947 0.075690070989017588
         76 7166719 0.07841981442939705
         75 7144539 0.081271977115830896
         74 7119854 0.084446261873027284
         73 7094951 0.087648579608837096
         72 7068178 0.091091363720824936
         71 7039566 0.094770627867995505
         70 7009607 0.098623104961001351
         69 6978335 0.10264442288391196
         68 6944221 0.10703119826195528
         67 6909408 0.11150785919986417
         66 6871309 0.1164070722832925
         65 6831848 0.12148142718723132
         64 6790088 0.12685141428305979
         63 6744878 0.13266504255419009
         62 6698251 0.13866088518630681
         61 6648086 0.14511168505848671
         60 6594883 0.15195314634822232
         59 6540926 0.15889156573829932
         58 6481469 0.16653723917595897
         57 6419070 0.17456122923325301
         56 6355249 0.18276807660975847
         55 6282195 0.19216221064468775
         54 6209896 0.20145925798763076
         53 6133584 0.21127234360201919
         52 6048154 0.22225792783565479
         51 5963420 0.23315401228435984
         50 5870519 0.24510030469790289
         49 5770633 0.25794480975187595
         48 5668750 0.27104611232094422
         47 5558836 0.28518013439112821
         46 5443035 0.30007117547551587
         45 5323607 0.31542861845637304
         44 5192507 0.33228698311784588
         43 5064129 0.34879532132158775
         42 4925937 0.36656565792950735
         41 4777302 0.38567887708631909
         40 4626169 0.40511331817237839
         39 4465105 0.42582480288508218
         38 4301802 0.44682420429097458
         37 4126391 0.46938059333470927
         36 3948981 0.49219403707682896
         35 3770310 0.5151696348833128
         34 3581328 0.5394711411415466
         33 3387960 0.5643366503548165
         32 3202702 0.58815928132701434
         31 3008299 0.61315788289287476
         30 2810171 0.63863548833641626
         29 2617450 0.66341779875536144
         28 2413377 0.68965988152851743
         26 2044415 0.73710531205655982
         25 1849362 0.76218749819167997
         24 1674682 0.78464988674290859
         23 1499682 0.80715342462054207
         22 1324053 0.82973784664289008
         21 1167329 0.84989124361622848
         20 1005747 0.87066933880105002
         19 862832 0.88904701374837569
         18 733254 0.90570966192613567
         17 599187 0.9229495579983682
         16 489782 0.93701812692123954
         15 395141 0.94918816879710877
         14 300280 0.9613865008348812
         13 233128 0.97002168698093183
         12 167915 0.97840753392729818
         11 115118 0.98519678700915769
         10 78474 0.98990889924908909
         9 46588 0.9940091724420389
         8 26581 0.99658190548385495
         7 14316 0.99815908200996462
         6 5390 0.99930689103336889
         5 2659 0.99965807481590496
         4 408 0.9999475346088339
         3 180 0.99997685350389731
         2 20 0.99999742816709969
         1 9 0.9999988426751949

It would be interesting to see later, when I crunch larger numbers on a more capable machine, if these distributions generally hold. Of course it won't, it's not possible. But I'm wondering if it's greater than what the Shannon limit would predict.

An ancillary analysis would be to compare it to the possible legal permutations at a character count although this would of course require a board and rule model.

I would expect those percentages to decrease as the length increases and perhaps such a function can give more predictive heft to the actual "language" of chess in practice

1 year ago

[-]

It's also worth noting that unique string != board state.

Proof: Both black and white could move the left rook pawn as their first move and right rook pawn as the second move.

Now reset the board and do right rook first and left rook second. Same board state, different game string.

In practice unique board states is a strict subset of number of moves but given how far off I was on my first assessment ... I wonder if we're talking about another < 2% hit.

All of this is dependent on an actual engine that can process the notation. There's apparently lots of options for pgn.

I'd also like to develop a heatmap based on statistical analysis. I'd imagine this would not only be way less than equally distributed but there'd be no way to slice the data to make it appear equally distributed

bobmcnamara

1 year ago

[-]

Funny enough, en passant is the only capture that takes a piece not in the destination square.

Someone

1 year ago

[-]

It would be fun to have a chess variant where en passant applied to every move:

- you play bishop a3×e7 taking my queen

- I reply with bishop a7×c5, taking your bishop en passant, getting my queen back (your bishop got taken before it reached my queen)

- you reply with knight a4×b6, taking my bishop while it’s on route to intercept your bishop that took my queen. You get back your bishop, it does end up on e7, and I do lose my queen again

- I reply, taking your knight while it moves through a5. Your bishop dies again, I get back my queen.

- etc.

For knight’s moves, I think you’d have to either make a hard rule as to what square they move over, or let the player say how they moved on every move. Also, two pieces could be taken in one move (a piece on the target square and the knight that just hopped over it)

Standard chess already has some of this in the rules for castling. There, you aren’t allowed to move your king through a position where it would be attacked by an enemy piece. That’s like saying it can be taken en passant.

https://www.chessvariants.org/difftaking.dir/enpassant.html

jerf

1 year ago

[-]

From: https://www.chessvariants.org/alphabet.html

Standard Wiki/TVTropes-esque warning applies.

thaumasiotes

1 year ago

[-]

> I reply, taking your knight while it moves through a5

Knights don't move through intermediate spaces. They are unlike every other chess piece in that regard.

Dylan16807

1 year ago

[-]

That's arguable. The motivation for the rule, and especially the name of the rule, suggest the pawn is not all the way there yet.

CrazyStat

1 year ago

[-]

In what sense is the pawn not all the way there? It occupies the square, prevents any other piece from occupying the square, can deliver check or checkmate from the square, and can be captured on the square.

Someone

1 year ago

[-]

The OP refers to the fact that “en passant” is french for “in passing”, so the move sort-of refers to the idea that the pawn takes the other pawn while it is passing through the third or seventh row, as if the capture starts while the previous move still is in progress.

Also, the pawn can’t deliver checkmate, can it, if it can be taken en passant? It probably is possible to construct a position where taking en passant would bring the king into check in another way, but in those cases, the en passant move isn’t possible.

vitus

1 year ago

[-]

> Also, the pawn can’t deliver checkmate, can it, if it can be taken en passant? It probably is possible to construct a position where taking en passant would bring the king into check in another way, but in those cases, the en passant move isn’t possible.

I believe I've managed to construct a situation where this is the case. The key is that the pawn that would be able to take en passant is being pinned (e.g. by a queen or rook) with the king directly behind it, such that the pawn cannot perform any captures. Then, you just need to make sure all of the squares adjacent to the king are threatened, and finally actually put the king into check via a pawn advancing two squares.

Technically, the c4 pawn cannot be taken en passant (i.e. this is an illegal move) because it would expose the black king to a different check. But I think this is in the spirit of your question.

Contrived game (which you should be able to import into https://www.chess.com/analysis or https://lichess.org/analysis):

1. e4 d6 2. e5 Kd7 3. h4 Kc6 4. Rh3 d5 5. a4 d4 6. Raa3 Kc5 7. Rhg3 Kb4 8. Rg4 Qe8 9. Rag3 Qd8 10. Rg5 Kxa4 11. Rd3 Kb5 12. Ra3+ Kb6 13. d3 Kb5 14. Be2 Kc5 15. Bg4 Qd5 16. Rb3 Qe4+ 17. Ne2 Qxd3 18. Qxd3 Kd5 19. Rb6 a6 20. Nbc3+ Kc5 21. Na4+ Kd5 22. c4#

Or, if you just want to see the end state: blob:https://www.chess.com/20486c7e-a582-41ef-9f1a-bb6fdea2ca36 (seems to work in Chrome, not in Firefox)

(also: https://lichess.org/editor/rnb2bnr/1pp1pppp/pR6/3kP1R1/N1Pp2...)

https://lichess.org/editor/rnbqkbnr/pp3ppp/8/3pp3/2pPP3/4K3/...

Supermancho

1 year ago

[-]

Idioms do not have to be interpreted as literal, so I don't understand why anyone would think this. As I understand it:

the King is never in check, for the purposes of the game. The piece removed isn't in the destination square.

bongodongobob

1 year ago

[-]

It's because pawns used to be able to move only one square. En passant was created when they were allowed to move two squares, sort of pretending that it only moved one square and is why you can only do it immediately after the first pawn move, kind of where the pawn "should" be.

Dylan16807

1 year ago

[-]

In the sense that a pawn that's in the perfect position can strike while it is "passing", but if that doesn't happen then it finishes the move and it's too late, the opportunity is gone forever.

SamBam

1 year ago

[-]

Sure, but we don't have the same rule for a rook or bishop "passing" through a square where a pawn could strike it in passing.

It's just it's own unique rule, born from a period of transition between pawns only being able to take one step and being able to take two.

Dylan16807

1 year ago

[-]

Though castling does take all kinds of strikes in passing into account for the king.

SamBam

1 year ago

[-]

True. It's as if any piece could kill the king in transition.

And yet other pieces can't kill that pawn in passing. Only another pawn.

We shouldn't try to find too much logic.

kevindamm

1 year ago

[-]

Clearly there's some Heisenberg uncertainty principle where the pawn occupies both the third (or sixth) and fourth (or fifth) rank, in a kind of superposition that only an opponent pawn situated in a certain position would be able to observe.

FartyMcFarter

1 year ago

[-]

I think the logic is based on pragmatism. A different piece has a chance of capturing the pawn later, but a pawn would never be able to since it can't go backwards.

bobmcnamara

1 year ago

[-]

That's a good way to look at it.

Practically, game engines model this with an extra state variable in the tree indicating the capturable square.

billforsternz

1 year ago

[-]

This video is really about the rarest Standard Algebraic Notation (SAN) corner cases, which is more than a little different from rarest moves. But the author basically acknowledges this, and 'rarest moves' is so hard to really define anyway. So kudos it's otherwise a great video.

I'm about equally impressed with the statistical analysis and the video construction and presentation. I can imagine tackling the former, but not the latter. I did notice a few mistakes in the video presentation though (eg a Bd4# that he presented as Be4#). I imagine at some point he just thought "I've polished this thing enough" and stopped.

1 year ago

[-]

Rarest I think is pretty easy. It's just board state transition.

You'd probably need 15 orders more magnitude of data to get there, but the definition itself is pretty straight forward.

incorrecthorse

1 year ago

[-]

Board state itself becomes unique pretty quickly, so you would just end up with a gigantic lot of "moves" played only 1 time.

EDIT: so you could define "rare" moves as the biggest difference of occurrences between state N and state N+1.

1 year ago

[-]

I bet it becomes unique far far less often then most people think.

Computing the number of permutations is thoroughly unconvincing.

For instance, there's 20 possible first moves and of those only probably 2 are played 95% of the time. You can certainly compute what the rates open is and the rarest response that's actually played or the rarest response to the most common open

philipswood

1 year ago

[-]

Thinking about it, one could also have a canonical state transition based on the game state transitions.(i.e. expand the game tree).

This would include not only the board state, but all the moves made up to reaching it.

It has a useful meaning for openings.

But it is so sparsely populated by actual games that "rarest" becomes too easy - many moves have been made only once under this definition.

https://en.m.wikipedia.org/wiki/Perfect_information

1 year ago

[-]

You don't need to know anything before. Game theory has a word for this type of game called "perfect information"

It might be interesting, but the previous board state should not be a relevant factor when considering rarest move

[1] https://www.chess.com/analysis/game/live/108840009759?tab=re... [2] https://www.reddit.com/r/chess/comments/1claxsm/its_me_viih_... [3] https://en.wikipedia.org/wiki/Brandon_Jacobson [4] https://www.chess.com/analysis/game/live/108394316331?tab=re... [5] https://www.chess.com/blog/Utkarsho/a-grandmaster-account-ge...

mrslave

1 year ago

[-]

Interesting, but I initially expected this to be about the unusual opening employed to victory by Magnus Carlson against Kacper Piorun on May 7, 2024[1] (1. a4 e5 2. Ra3).

It's even more interesting because an unknown IRL Chess.com player named Viih_Sou (since revealed to be Brandon Jacobson[2][3]) used this opening to defeat Daniel Naroditsky on May 2, 2024[4] only to be subsequently banned for violating the Fair Play Policy[5].

filleduchaos

1 year ago

[-]

Why would that be the rarest chess move? It's a pretty common way to try to start a chess game, so much so that kids have to be taught not to do it because it's flat-out terrible.

thebruce87m

1 year ago

[-]

Can you explain the ban to a layman?

xyst

1 year ago

[-]

I just want to say that the format of this video is beautiful and easy to follow along. A topic that is easily boring and dry is presented in a way to keep the viewer interested

groggo

1 year ago

[-]

Yes, the text bothered me a little once I noticed it though. It's just moving a tiny bit, not necessary.

1 year ago

[-]

His definition of rare is an artifact of the notation where board state requires disambiguation, as in it includes externalities.

I feel like the question remains unanswered

kevincox

1 year ago

[-]

The problem is that there must be hundreds of ways to describe "a move" from the complete board state before and after to the distance that the piece moved (moving moving a bishop and queen 2 diagonally is the same move).

So while I agree that the notation is pretty arbitrary and puts lots of emphasis on how implicitly a move can be recorded I don't think it is fundamentally worse than any other definition. Yes, personally I would have picked something more directly tied to the game than if the notation requires more disambiguation, but I don't think it really makes the video any less interesting or the result and less useful (probably no use either way).

matt-attack

1 year ago

[-]

Agreed. Just the notion of double disambiguation is meaningless to a chess player who doesn’t use the notation. This is fundamentally not about chess. But about one particular way that certain people use to write down chess moves.

billforsternz

1 year ago

[-]

Another count against statistically analysing SAN strings to death is some of the rules of that format. There's a lot of weight given to disambiguation in this video. In SAN you must not (if you're following the rules, some software including ChessBase doesn't follow the rules) disambiguate if one of the pieces is pinned to the king. So 1.d4 e6 2.c4 Bb4+ 3.Nbd2 Nf6 4.Nf3. Not 4.Ngf3.

This muddies the waters I think, these doubly disambiguated moves he lovingly isolated would be recorded differently if a marginally involved piece was pinned.

More significantly, there's no SAN notation for stalemate, or en-passant (or many other things of course) of great chess significance, perhaps more worthy of analysis.

The video was still a great technical achievement, and very entertaining for a chess nerd like myself.

jameshart

1 year ago

[-]

You’re welcome to scan the game archives to determine the actual game state and find the case where someone had three Bishops on the same color, positioned in three corners of a square, moved one to create a discovered checkmate, but the move was notated without disambiguation because the two other bishops were pinned.

Rarestest move for sure.

billforsternz

1 year ago

[-]

Well, only one of the other bishops would have to be pinned else single disambiguation would do, but otherwise - fair point!

thih9

1 year ago

[-]

A different take on rare chess moves and perhaps more rare than what is presented in the video: https://en.wikipedia.org/wiki/Joke_chess_problem

paxys

1 year ago

[-]

The problem with using a dataset consisting of all games on lichess.org is that most/all instances of these moves are most certainly from people who are trying them out in a noncompetitive game just to see what happens. In fact he himself likely polluted the data further just to make this video, maybe even enough to change the answer.

There needs to be a minimum bar for the data to be meaningful, e.g. by restricting to players above a certain rating threshold, or considering tournament games only.

tavavex

1 year ago

[-]

I don't think including "noncompetitive" games is an issue. For a game with so many possible states, it only makes sense to ask about what moves have been played at all, and not the context that these moves were played in.

Plus, restricting the dataset introduces more biases and ambiguities. What exact ELO should be "good enough" for consideration? Why not a point higher or lower? Should they have accounted for time control too, because people in speed chess play worse and can get into weird situations they otherwise wouldn't have been in?

shric

1 year ago

[-]

He stated that he tried using master tournament games but the dataset was way too small.

But yes indeed, the single example game he showed was indeed a result of the winner playing very silly moves and the loser allowing it rather than resigning.

antaviana

1 year ago

[-]

In bullet games at Lichess it is not that uncommon to play on lost positions to try to either flag the opponent or to offload as many own pieces as possible to seek a stalemate with the frenzy. Conversely, the winning side then tries to delay the win by promoting a bunch of unneeded pieces and sort of demonstrating who is really in charge. It’s even fun.

shric

1 year ago

[-]

Yes, I've done it many times!

At the time of writing based on total games played at https://database.lichess.org/ one in every 272,369 games are mine :p

elijahbenizzy

1 year ago

[-]

This is delightful. I think that the hardest part (that, honestly, he absolutely nailed) is defining "rare" and "move" -- not only did he come up with reasonably satisfactory definitions, but he also was able to walk us through his discovery process.

The rest is a fun programming problem (which he largely glossed over), and it's clear he put a massive amount of care into the video. Thank you!

[1]: https://en.wikipedia.org/wiki/Streisand_effect

alhirzel

1 year ago

[-]

I am all about the hands used in this video! The bandage after getting hit by lightning is very immersive.

I bet this video will have a Streisand-like effect [1] on the indicated capture scenarios.

munch117

1 year ago

[-]

Kinda misunderstands algebraic notation: You are by no means required to use the minimal unambiguous notation. In fact, the canonical form is something like "Ng1-f3". "Nf3" is a context-dependent abbreviation of that.

Dig up an old chess book if you don't believe me. You'll find books that only use the long form.

dh5

1 year ago

[-]

Notation has changed over the years. Even twenty years ago you could pick up (admittedly dated) books using descriptive notation (e.g. P-K4). It's pretty clear what is the accepted standard for notation today though.

munch117

1 year ago

[-]

Algebraic notation hasn't changed notably in the last (at least) 50 years. The abbreviated form that most people use today was also the form that most people used 50 years ago. (A change to indicate draw offers is the only one I can remember.)

The point is that there are multiple ways of writing the same move, and none of them are more correct than the other. Basing your definition of a move on the precise text that the computer happened to spit out when the games were retrieved, that's just being lazy.

stevage

1 year ago

[-]

By coincidence, I was exploring a couple of very related questions a few months ago:

- [what are the commonest checkmating moves](https://chess.stackexchange.com/questions/44959/what-are-the...)

- [Have any full rank-and-file piece specifications ever been required in a masters level game of chess?](https://chess.stackexchange.com/questions/44961/have-any-ful...)

I think this question here is perhaps the more interesting one though, and it's so well explored.

foobarian

1 year ago

[-]

Huh. I wonder if the set of possible checkmate positions is quite limited compared to the full game.

[1] https://github.com/lichess-org/scalachess/blame/master/core/...

psuter

1 year ago

[-]

Great investigation and excellent production value!

Brings back good memories too, as 10 years ago my first contribution to lichess was to improve the PGN notation for disambiguation. [1]

gweinberg

1 year ago

[-]

OP misses a couple important things when he's discussing notation: when a pawn gets promoted, you have to specify what it gets promoted to, and also if the move results in stalemate you note that. Stalemates are much rarer than checkmates, and underpromotions are very rare also, so I imagine the stalemates analogous to the checkmates he considers must be rarer still, if they ever happen at all.

pimlottc

1 year ago

[-]

The game and move in question is here: https://lichess.org/NoXEwGi8#136

1 year ago

[-]

I wonder if the winner was specifically going for wacky notation when he played that, or just going for a crazy checkmate.

pimlottc

1 year ago

[-]

I wonder too, if the first promotion had been a queen (or a rook), that would have been checkmate right there.

kevincox

1 year ago

[-]

Yup. They were clearly not "playing seriously". They had a few obvious opertunities to win that they didn't take to get an odd checkmate in the end.

bjourne

1 year ago

[-]

If you define a "move" as a piece moving from one square to another, which I think is the correct definition, then the rarest move is probably an illegal move. Like black king jumping from b4 to h8 or something. Or a piece moving to an already occupied square of the same color. In over the board chess illegal moves happen all the time in bullet chess, but not online of course.

alberto-m

1 year ago

[-]

Chess historian Tim Krabbé ran an analysis with a different notation, using an extended algebrical system (start and end square + kind of captured piece, if any). Here his results: https://timkr.home.xs4all.nl/chess2/diary_6.htm (entry 105).

His site is a trove of information for people liking chess trivia.

none_to_remain

1 year ago

[-]

Do chess players actually consider this "double disambiguation" from the notation to be a different "move"?

bongodongobob

1 year ago

[-]

No.

bmacho

1 year ago

[-]

tl;dw

Creator of the video uses the standard chess notation, which encodes if a move

  - is a capture
  - is a check
  - is a checkmate
  - needs a disambiguation
  - and other things

It is arguable how much the rest of the board is part of a move (see the current top comment), but let's say moves are different if their standard chess notation is different.

Then it turns out that bishop double disambiguation moves are very rare. They require 2 same color bishop underpromotion for the start. And they doing captures and/or checkmates are even rarer. A lot of them never ever happened on lichess.

That's about the content. Now about the format.

The video features hands expressive hands all along, which makes it stimulating to watch. If you are into presentation, go and check it out.

animal531

1 year ago

[-]

It's been a long time since I've last played chess, but I've really enjoyed Anna and her mum Pia commentating on each other's games recently.

Props to them and all the other chess creators for bringing more influence and people to the game.

https://en.wikipedia.org/wiki/Anna_Cramling

alberto_ol

1 year ago

[-]

For those who, like me, don't know who are Anna and Pia

https://en.wikipedia.org/wiki/Pia_Cramling

venusenvy47

1 year ago

[-]

I prefer Gotham Chess. He analyzes recent games and makes it exciting with the way he vocalizes his descriptions. https://youtube.com/@gothamchess

bitcurious

1 year ago

[-]

Fun video. An interesting follow up would to do this would be to ‘disambiguate’ every move, such that a move was a single piece’s movement and reflected less of the state of the game.

AnimalMuppet

1 year ago

[-]

"doubly disambiguated bishop capture checkmate"

It's a 30 minute video. If you care that much about the topic, enjoy. If you just wondered what the move was... you're welcome.

f154hfds

1 year ago

[-]

It's a 17 minute video.. that's quite the round up. Just saying because for me the difference from 15 minutes to 30 minutes tends to go from: "yeah I'll check this out" to "boy this is an investment".

AnimalMuppet

1 year ago

[-]

Huh. I stand corrected. Not sure where I got 30 from.

(I wasn't rounding that far, I was giving what I thought was an accurate number. Accuse me of bad info, not bad math.)

LeifCarrotson

1 year ago

[-]

Interesting, I'd assumed it was going to be a pawn promotion to something esoteric like a knight that didn't create a check, but that's a couple layers deeper.

Thanks for the TLDR.

tavavex

1 year ago

[-]

On a scale of billions of games, your situation would be be fairly common. The author of the video got to the conclusion by stacking multiple insanely rare occurrences - the player would need to underpromote to a bishop (the rarest promotion) and then capture and checkmate while these bishops are placed in a specific configuration. The author get into weird territory fast when fishing for the strangest game out of billions.

CalChris

1 year ago

[-]

Did he include under-promotion captures resulting in zugzwang? If you’re not including zugzwang then you shouldn’t include checks and mates. The problem becomes a lot simpler.

1 year ago

[-]

Zugzwang has no notation. The author states in the first 2 minutes or so that the video is about the rarest edge cases in notation.

CalChris

1 year ago

[-]

Nf3 can be ambiguous but resolved by the position itself. That ambiguity has no notation either.

FWIW, I wrote the XBoard PGN parser.

1 year ago

[-]

Right, the video talks about that and doesn’t consider the ambiguous but resolved by position one different from any other Nf3

mathgradthrow

1 year ago

[-]

parent is referring to stalemate. Underpromotion to a bishop to achieve stalemate was my guess for rarest, since you can always stalemate with a queen just as well as a bishop, so theres no reason to underpromote if you goal is a draw on the turn in which the promotion occurs.

1 year ago

[-]

Stalemate doesn’t have notation either, so the OP doesn’t consider it different than any old bishop underpromotion

ViktorRay

1 year ago

[-]

This video is excellent. It’s one of those videos that will become a classic of YouTube in the future. The kind that’s recommended to millions of people

mehmetalianil

1 year ago

[-]

I find it odd that now that this video is out, all of these moves will be executed and it will be impossible to know for sure anymore.

skilled

1 year ago

[-]

Watched this earlier today myself, fantastic work by the author.