Chess & History · 2026-04-21

From Deep Blue to AlphaZero: How AI Reinvented Chess Strategy

In 1997 a machine out-calculated us. Twenty years later, one taught itself to play — and changed how chess is understood.

When Deep Blue beat Kasparov in 1997, it won by brute force, calculating further than any human could. Two decades later a very different machine arrived — and it didn't just beat us at chess. It taught us new things about the game itself.

A machine that taught itself

In 2017, DeepMind unveiled AlphaZero. Where Deep Blue relied on human-tuned evaluation and raw speed, AlphaZero was given only the rules and learned entirely by playing itself — millions of games, with no opening books and no human games to study. In about four hours of self-play it surpassed the strongest traditional engine, and it went on to beat Stockfish in a 100-game match with 28 wins, 72 draws and zero losses.

A different kind of mind

The most striking part was how it played. AlphaZero examined only about 80,000 positions per second — against Stockfish's 70 million — yet won by 'understanding' rather than counting, using a neural network to focus on the most promising ideas. Its style stunned grandmasters: bold, long-term piece sacrifices and pawn pushes that looked almost human, even romantic, reviving ideas masters had long dismissed.

Engines built on these ideas now shape opening theory and train every top player. The machine that learned chess by itself ended up teaching it back to us.

From rival to teacher

The arc from Deep Blue to AlphaZero is the story of AI in miniature: first matching human skill by force, then surpassing it by learning. For chess, the result isn't the death of the game many feared in 1997 — it is a richer one, with human creativity and machine insight feeding each other.

In short: In 2017 DeepMind's AlphaZero learned chess from scratch by self-play in about four hours and beat Stockfish with 28 wins, 72 draws and 0 losses — examining far fewer positions but 'understanding' more, reshaping modern strategy.

Frequently asked questions

What was AlphaZero?

A 2017 program by DeepMind that taught itself chess from only the rules, through self-play, with no opening books or human games. It quickly surpassed the strongest traditional engines.

How did AlphaZero do against Stockfish?

In a 100-game match it scored 28 wins, 72 draws and no losses, despite examining only about 80,000 positions per second versus Stockfish's 70 million, relying on a neural network.

How is AlphaZero different from Deep Blue?

Deep Blue (1997) won by brute-force calculation using human-designed evaluation. AlphaZero (2017) learned chess by playing itself and won through pattern-based 'understanding', influencing modern strategy.

Una máquina que aprendió sola

En 2017, DeepMind presentó AlphaZero. Donde Deep Blue dependía de una evaluación afinada por humanos y de pura velocidad, a AlphaZero solo se le dieron las reglas y aprendió por completo jugando contra sí mismo — millones de partidas, sin libros de aperturas ni partidas humanas que estudiar. En unas cuatro horas de autojuego superó al motor tradicional más fuerte, y luego venció a Stockfish en un match de 100 partidas con 28 victorias, 72 tablas y cero derrotas.

Otro tipo de mente

Lo más llamativo fue cómo jugaba. AlphaZero examinaba solo unas 80.000 posiciones por segundo — frente a los 70 millones de Stockfish — y aun así ganaba por «comprensión» más que por conteo, usando una red neuronal para centrarse en las ideas más prometedoras. Su estilo asombró a los grandes maestros: sacrificios de piezas a largo plazo y avances de peón audaces que parecían casi humanos, hasta románticos, recuperando ideas que los maestros llevaban tiempo descartando.

Los motores construidos sobre estas ideas hoy moldean la teoría de aperturas y entrenan a todos los mejores. La máquina que aprendió ajedrez sola acabó enseñándonoslo de vuelta.

De rival a maestra

El arco de Deep Blue a AlphaZero es la historia de la IA en miniatura: primero igualar la habilidad humana por la fuerza, luego superarla aprendiendo. Para el ajedrez, el resultado no es la muerte del juego que muchos temieron en 1997 — es un juego más rico, con la creatividad humana y la intuición de la máquina alimentándose mutuamente.

En resumen: En 2017 AlphaZero de DeepMind aprendió ajedrez desde cero jugando contra sí mismo en unas cuatro horas y venció a Stockfish con 28 victorias, 72 tablas y 0 derrotas — examinando muchas menos posiciones pero «comprendiendo» más, y reinventando la estrategia moderna.

Preguntas frecuentes

¿Qué fue AlphaZero?

Un programa de 2017 de DeepMind que aprendió ajedrez solo a partir de las reglas, jugando contra sí mismo, sin libros de aperturas ni partidas humanas. Pronto superó a los motores tradicionales más fuertes.

¿Qué resultado tuvo AlphaZero contra Stockfish?

En un match de 100 partidas logró 28 victorias, 72 tablas y ninguna derrota, pese a examinar solo unas 80.000 posiciones por segundo frente a los 70 millones de Stockfish, apoyándose en una red neuronal.

¿En qué se diferencia AlphaZero de Deep Blue?

Deep Blue (1997) ganó por cálculo de fuerza bruta con una evaluación diseñada por humanos. AlphaZero (2017) aprendió ajedrez jugando contra sí mismo y ganó por «comprensión» basada en patrones, influyendo en la estrategia moderna.

También te puede interesar

Fuentes

Juega a History's Gambit →Más curiosidades