The hot hand, in code · deep dive

Gilovich, Vallone & Tversky (1985) asked a clean question: after a streak of makes, is the next shot more likely to go in than after a streak of misses? They found no difference and concluded the hot hand was an illusion. The conclusion held for thirty years. The estimator they used does not hold up.

1 · The estimator is biased on a coin

Take a shooter with provably no hot hand, independent flips of a fair coin, and run the exact GVT statistic: the probability of a make after k prior makes, minus the probability after k prior misses, computed within each record and averaged across records. If the method were sound, the result would be zero. It isn't: the shots that follow a streak inside a finite sequence form a biased sample, so on average the next shot is a make less often than the true rate. The longer the streak you condition on, the worse it gets.
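The selection effect can be checked exactly on a tiny case. The sketch below is an illustration, not the code behind this page: it enumerates every fair-coin record of length n and averages the per-record make rate after a streak of k makes. The function names are hypothetical, and records with no shot following a streak are dropped, which is an assumption about how undefined records are handled.

```python
from fractions import Fraction
from itertools import product


def make_rate_after_streak(seq, k, outcome):
    """Proportion of makes among shots that directly follow k consecutive `outcome`s.

    Returns None when no shot in `seq` follows such a streak.
    """
    followers = [seq[i] for i in range(k, len(seq))
                 if all(s == outcome for s in seq[i - k:i])]
    if not followers:
        return None
    return Fraction(sum(followers), len(followers))


def expected_rate_after_makes(n, k):
    """Average the per-record rate over all 2**n equally likely fair-coin records."""
    rates = [make_rate_after_streak(seq, k, 1) for seq in product((0, 1), repeat=n)]
    rates = [r for r in rates if r is not None]  # assumption: undefined records are dropped
    return sum(rates) / len(rates)


print(expected_rate_after_makes(3, 1))  # 5/12
```

For three flips and k=1, six of the eight records have a make in the first two positions, and their rates are 1, 1/2, 0, 0, 1 and 0. The average is 5/12, not 1/2, even though every flip is fair.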

On a fair coin, the statistic reads negative, and the gap steepens with streak length. At a 100-shot record it is about −8 percentage points at k=3 and −26 at k=5. None of this is a hot hand; it is the estimator measuring itself.
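A seeded Monte Carlo run is one way to reproduce the sign and rough size of this bias. The following is a minimal sketch under stated assumptions, not the simulation behind the chart: the function names, the seed, the number of records, and the rule of discarding records where either conditional is undefined are all choices made here for illustration.

```python
import random


def gvt_difference(shots, k):
    """P(make | k prior makes) - P(make | k prior misses) within one record.

    Returns None if either conditional is undefined for this record.
    """
    after_makes = []   # outcomes of shots that follow k straight makes
    after_misses = []  # outcomes of shots that follow k straight misses
    run_value, run_length = None, 0
    for shot in shots:
        if run_length >= k:
            (after_makes if run_value == 1 else after_misses).append(shot)
        if shot == run_value:
            run_length += 1
        else:
            run_value, run_length = shot, 1
    if not after_makes or not after_misses:
        return None
    return sum(after_makes) / len(after_makes) - sum(after_misses) / len(after_misses)


def mean_bias(n_shots, k, n_records=5_000, seed=0):
    """Average the GVT difference over many independent fair-coin records."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_records):
        shots = [rng.getrandbits(1) for _ in range(n_shots)]
        d = gvt_difference(shots, k)
        if d is not None:  # assumption: undefined records are dropped
            diffs.append(d)
    return sum(diffs) / len(diffs)


print(f"k=3, 100 shots: {mean_bias(100, 3):+.3f}")
```

With these settings the printed value should land near the −8 points quoted above, with some Monte Carlo noise. Raising `k` shrinks the number of usable records, so longer streaks need more simulated records to give a stable average.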

2 · The bias lives at the operating point

Here is the part that turns a curiosity into a law. The bias is not a fixed quirk: it is tied to the sample size, and it explodes precisely where the data is thin, which is exactly the regime a real study lives in. Give the method tens of thousands of shots and it nearly behaves; give it a realistic game record and it is off by tens of points.

−31 points at a 20-shot record, fading toward zero only at ~800 shots. The estimator is honest in the regime you never operate in, and badly biased in the one you do.
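The dependence on record length can be sketched by sweeping the number of shots per record on a fair coin. This is an illustrative sketch, not the page's own simulation: k=3, the seed, the fixed budget of simulated shots per length, and the dropping of undefined records are assumptions, and the exact figures it prints will differ from the chart.

```python
import random


def gvt_difference(shots, k):
    """P(make | k prior makes) - P(make | k prior misses) within one record, or None."""
    after_makes, after_misses = [], []
    run_value, run_length = None, 0
    for shot in shots:
        if run_length >= k:
            (after_makes if run_value == 1 else after_misses).append(shot)
        if shot == run_value:
            run_length += 1
        else:
            run_value, run_length = shot, 1
    if not after_makes or not after_misses:
        return None
    return sum(after_makes) / len(after_makes) - sum(after_misses) / len(after_misses)


def bias_by_record_length(lengths, k=3, total_shots=400_000, seed=1):
    """Map each record length to (mean GVT difference, share of usable records)."""
    rng = random.Random(seed)
    result = {}
    for n in lengths:
        n_records = total_shots // n  # same simulated shot budget at every length
        diffs = []
        for _ in range(n_records):
            d = gvt_difference([rng.getrandbits(1) for _ in range(n)], k)
            if d is not None:  # assumption: undefined records are dropped
                diffs.append(d)
        result[n] = (sum(diffs) / len(diffs), len(diffs) / n_records)
    return result


for n, (bias, usable) in bias_by_record_length([20, 100, 800]).items():
    print(f"n={n:4d}  mean difference={bias:+.3f}  usable records={usable:.0%}")
```

The share of usable records is reported alongside the bias because short records often contain no qualifying streak at all, which is a second way thin data degrades the statistic.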

The method is honest where you don't need it and wrong where you do: the same trap we keep measuring across finance, networks and memory.

3 · A measured zero hides a real streak effect

Now inject a genuine hot hand of known size and see what the GVT statistic reports. Because the estimator starts about 8 points low (at k=3 on a 100-shot record), it takes a real +8-point streak effect just to drag the measurement up to zero. So the original "no difference" is not evidence of no hot hand; it is evidence for a hot hand of roughly the size that was being dismissed.
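One way to run this experiment is with a hypothetical streak shooter whose make probability shifts after a streak. The model below is an assumption made for illustration, not the one used for the chart: the shooter makes shots at a base rate of 0.5, at base + effect/2 after k straight makes, and at base − effect/2 after k straight misses, so the true streak effect equals `effect`. The correction shown, subtracting the simulated no-hot-hand bias, is a crude stand-in and not the analytic correction of Miller & Sanjurjo.

```python
import random


def gvt_difference(shots, k):
    """P(make | k prior makes) - P(make | k prior misses) within one record, or None."""
    after_makes, after_misses = [], []
    run_value, run_length = None, 0
    for shot in shots:
        if run_length >= k:
            (after_makes if run_value == 1 else after_misses).append(shot)
        if shot == run_value:
            run_length += 1
        else:
            run_value, run_length = shot, 1
    if not after_makes or not after_misses:
        return None
    return sum(after_makes) / len(after_makes) - sum(after_misses) / len(after_misses)


def simulate_record(n_shots, k, effect, rng, base=0.5):
    """One record from the hypothetical streak shooter described above."""
    shots = []
    run_value, run_length = None, 0
    for _ in range(n_shots):
        p = base
        if run_length >= k:
            p += effect / 2 if run_value == 1 else -effect / 2
        shot = 1 if rng.random() < p else 0
        shots.append(shot)
        if shot == run_value:
            run_length += 1
        else:
            run_value, run_length = shot, 1
    return shots


def mean_measured_effect(effect, n_shots=100, k=3, n_records=5_000, seed=2):
    """Average GVT difference reported for a shooter with a known true effect."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_records):
        d = gvt_difference(simulate_record(n_shots, k, effect, rng), k)
        if d is not None:  # assumption: undefined records are dropped
            diffs.append(d)
    return sum(diffs) / len(diffs)


null_bias = mean_measured_effect(0.0)  # what the statistic reads with no hot hand
for true_effect in (0.0, 0.04, 0.08, 0.12):
    measured = mean_measured_effect(true_effect)
    corrected = measured - null_bias  # crude correction: subtract the simulated null bias
    print(f"true={true_effect:+.2f}  measured={measured:+.3f}  corrected={corrected:+.3f}")
```

Under this model the measured column should sit well below the true column at every row, and the row with a true effect of +0.08 should read close to zero.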

What the study would have reported (green) against the shooter's true streak effect. The honest estimator is the dashed line. The crossing is the headline: a real +8 reads as a flat 0.

What it generalizes to

This is one instance of a pattern we found by rebuilding two dozen claims: a standard method, calibrated on easy data, whose error is coupled to the very stress that defines the hard case. Here that stress is small samples; elsewhere it is heavy tails in diversification, correlation in the wisdom of crowds, scarcity in an agent's memory. The number you quote from the demo is a benign-regime mirage. Miller & Sanjurjo (2018) proved the hot-hand correction analytically; this page is the same result in a form you can run.

Every chart here is the output of a small, seeded simulation: no hand-tuning, no cherry-picking. The verdict sits in the public ledger alongside its code, next to the other claims we've put on the bench.

See the full Crucible ledger →