So I played the game 2048 and thought it would be fun to create an AI for it.  I wrote my code in Mathematica because it is a fun, terse language with easy access to numerical libraries, and I was thinking that some simple linear or logistic regression ought to give me some easy-to-code strategies.

The game itself is pretty easy to code in Mathematica.  The game is played on a 4×4 grid of tiles.  Each tile in the grid can contain either a blank (I will hereafter use a _ to denote a blank) or one of the numbers 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024, or 2048.  On every turn, you can shift the board right, left, up, or down. When you shift the board, any adjacent matching numbers will combine.

The object of the game is to get one 2048 tile, which usually requires around a thousand turns.

Here’s an example.  Suppose you have the board below.

start

 

If you shift right, the top row will become _, _, 4, 16, because the two twos will combine.  Nothing else will combine.
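For concreteness, here is what that shift-and-merge rule looks like in code.  My implementation was in Mathematica, but this Python sketch captures the same logic; the starting top row _, 2, 2, 16 is an assumption inferred from the result described above, since the board image isn't reproduced here.

```python
def shift_row_right(row):
    """Slide a row's tiles to the right, merging adjacent equal pairs once.
    Blanks are represented by 0."""
    tiles = [t for t in row if t != 0]       # drop the blanks
    merged, i = [], len(tiles) - 1
    while i >= 0:                            # scan from the right edge
        if i > 0 and tiles[i] == tiles[i - 1]:
            merged.append(tiles[i] * 2)      # equal neighbors combine
            i -= 2                           # each tile merges at most once
        else:
            merged.append(tiles[i])
            i -= 1
    merged.reverse()
    return [0] * (len(row) - len(merged)) + merged

# The (assumed) top row _, 2, 2, 16 shifted right:
print(shift_row_right([0, 2, 2, 16]))  # [0, 0, 4, 16]
```

Scanning from the right edge makes the rightmost pair merge first, which matters for rows like 2, 2, 2 where only one merge should happen.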

movEr

 

Shifting left gives the same result as shifting right except all of the tiles are on the left side instead of the right.

Shift Left

If we look back at the original image (below), we can see that there are two sixteen tiles stacked vertically.

original

So the original board (above) shifted down combines the two sixteens into a thirty-two (below), setting up some possible follow-up moves like combining the twos on the bottom row or combining the two thirty-twos into a sixty-four.

Shifted down

There are a few other game features that make things more difficult.  After every move, if the board changes, then a two or four tile randomly fills an empty space. So, the sum of the tiles increases by two or four every move.
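That spawn rule can be sketched in a few lines of Python (again, my code was in Mathematica).  The 90/10 split between 2s and 4s matches the original game, but the text above only says "a two or four," so treat that ratio as an assumption.

```python
import random

def spawn_tile(board):
    """Place a 2 (or occasionally a 4) on a random empty square, in place.
    The 90/10 ratio is assumed; the post only says a 2 or a 4 appears."""
    empties = [(r, c) for r in range(4) for c in range(4) if board[r][c] == 0]
    if not empties:
        return board                     # no space left; board unchanged
    r, c = random.choice(empties)
    board[r][c] = 2 if random.random() < 0.9 else 4
    return board

board = [[0] * 4 for _ in range(4)]
spawn_tile(board)                        # exactly one cell now holds a 2 or 4
```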

 

Random Play

The first strategy I tried was random play.  A typical random-play board will look something like this after 50 moves.

rand50

 

If you play for a while, you will see that this position is a little difficult because the high numbers are not clumped, so they might be hard to combine.  A typical random game ends in a position like this one.

randEnd

In the board above, the random player is in deep trouble.  His large tiles are near the center, the 128 is not beside the 64, there are two 32 tiles separated by the 64, and the board is almost filled.  The player can only move up or down.  In this game, the random player incorrectly moved up, combining the two twos in the lower right, and the new two tile appeared in the lower-right corner square, ending the game.

I ran 1000 random games and found that the random player ends the game with an average tile total of 252.  Here is a histogram of the resulting tile totals at the end of all 1000 games.

histRand

(The random player’s tile total was between 200 and 220 in 127 of the games, and between 400 and 420 in 13 of them.)

The random player’s largest tile was 256 5% of the time, 128 44% of the time, 64 42% of the time, 32 8% of the time, and 16 0.5% of the time.  It seems almost impossible to have a max of 16 at the end.  Here is how the random player achieved this ignominious result.

randEnd16
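The random-play experiment can be reproduced with a short self-contained sketch.  This is Python rather than my Mathematica code, and the 90/10 split for spawning 2s versus 4s follows the original game and is an assumption not stated above.

```python
import random

def shift_row_right(row):
    """Slide one row's tiles right, merging adjacent equal pairs once."""
    tiles = [t for t in row if t]
    out, i = [], len(tiles) - 1
    while i >= 0:
        if i > 0 and tiles[i] == tiles[i - 1]:
            out.append(tiles[i] * 2)
            i -= 2
        else:
            out.append(tiles[i])
            i -= 1
    out.reverse()
    return [0] * (len(row) - len(out)) + out

def move(board, direction):
    """Return a new board shifted 'left', 'right', 'up', or 'down'."""
    flip = lambda b: [r[::-1] for r in b]
    transpose = lambda b: [list(r) for r in zip(*b)]
    if direction == 'right':
        return [shift_row_right(r) for r in board]
    if direction == 'left':
        return flip([shift_row_right(r) for r in flip(board)])
    if direction == 'down':
        return transpose(move(transpose(board), 'right'))
    if direction == 'up':
        return transpose(move(transpose(board), 'left'))

def spawn(board):
    """Drop a 2 (or, less often, a 4) on a random empty square, in place."""
    empties = [(r, c) for r in range(4) for c in range(4) if board[r][c] == 0]
    r, c = random.choice(empties)
    board[r][c] = 2 if random.random() < 0.9 else 4

def random_game():
    """Play one game picking uniformly among legal moves; return the tile total."""
    board = [[0] * 4 for _ in range(4)]
    spawn(board)
    spawn(board)                         # the game starts with two random tiles
    while True:
        legal = [d for d in ('left', 'right', 'up', 'down')
                 if move(board, d) != board]
        if not legal:                    # board full, nothing combines: game over
            return sum(t for row in board for t in row)
        board = move(board, random.choice(legal))
        spawn(board)
```

Averaging `random_game()` over 1000 runs is how a tile-total figure like the 252 above would be estimated.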

The Greedy Strategy

So what’s a simple improvement?  About the simplest possible improvement is to try to score the most points every turn—the greedy strategy.  You get the most points by making the largest tile possible.  In the board below, the player has the option of going left or right to combine the two 32 tiles, or moving up or down to combine the two 2 tiles in the lower left.

combine2048

combine2048Opt

The greedy strategy would randomly choose between left and right, combining the two 32 tiles.  The greedy action avoids splitting the 32 tiles.
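A greedy chooser can be sketched by scoring each legal move by the largest tile it creates and breaking ties at random.  This is a Python sketch, not my Mathematica code, and the test board below (two 32s side by side, two 2s stacked in the lower left) is an assumed layout matching the description above.

```python
import random

def slide_right_scored(row):
    """Slide one row right; also return the largest tile created by a merge."""
    tiles = [t for t in row if t]
    out, best, i = [], 0, len(tiles) - 1
    while i >= 0:
        if i > 0 and tiles[i] == tiles[i - 1]:
            out.append(tiles[i] * 2)
            best = max(best, tiles[i] * 2)   # track the biggest new tile
            i -= 2
        else:
            out.append(tiles[i])
            i -= 1
    out.reverse()
    return [0] * (len(row) - len(out)) + out, best

def move_and_score(board, direction):
    """Shifted board plus the largest tile that the shift created."""
    flip = lambda b: [r[::-1] for r in b]
    transpose = lambda b: [list(r) for r in zip(*b)]
    if direction == 'right':
        pairs = [slide_right_scored(r) for r in board]
        return [p[0] for p in pairs], max(p[1] for p in pairs)
    if direction == 'left':
        new, s = move_and_score(flip(board), 'right')
        return flip(new), s
    if direction == 'up':
        new, s = move_and_score(transpose(board), 'left')
        return transpose(new), s
    if direction == 'down':
        new, s = move_and_score(transpose(board), 'right')
        return transpose(new), s

def greedy_move(board):
    """Among board-changing moves, pick one creating the largest tile."""
    options = []
    for d in ('left', 'right', 'up', 'down'):
        new, score = move_and_score(board, d)
        if new != board:                 # only legal (board-changing) moves
            options.append((score, d, new))
    best = max(s for s, _, _ in options)  # assumes at least one legal move
    return random.choice([(d, n) for s, d, n in options if s == best])

board = [[0, 0, 0, 0],
         [32, 32, 0, 0],
         [2, 0, 0, 0],
         [2, 0, 0, 0]]
direction, new_board = greedy_move(board)
print(direction)  # 'left' or 'right': the 32+32=64 merge beats 2+2=4
```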

I ran 1000 greedy player games.  The average tile total at the end of a greedy player simulation was 399—almost a 60% improvement over random play. Here is a histogram of the greedy player’s results.

GreedyResults

If you look closely, you will see that there is a tiny bump just above 1000.  In all 1000 games, the greedy player did manage to get a total above 1000 once, but never got the elusive 1024 tile.  He got the 512 tile 2% of the time, the 256 tile 43% of the time, the 128 tile 44%, the 64 tile 11%, and in three games his maximum tile was 32.

I ran a number of other simple strategies before moving on to the complex look-ahead strategies like UCT.  More on that in my next post.

 

  1. Enjoying John Baez’s blog Azimuth.  Especially the posts on good research practices and an older post on levels of mathematical understanding.
  2. García-Pérez, Serrano, and Boguñá wrote a cool paper on primes, probability, and integers as a bipartite network.
  3. Loved the idea behind the game theoretical book “Survival of the Nicest”  (see Yes Magazine for a two page introduction).
  4. Scott Young is learning Chinese quickly.
  5. Cyber warriors to the rescue.
  6. Mao, Fluxx, and Douglas Hofstadter’s Nomic are fun games.
  7. Healy and Caudell are applying category theory to semantic and neural networks.
  8. Some MOOCs for data science and machine learning.
  9. Here is an old but good free online course on the Computational Complexity of Machine Learning.
  10. Great TeX graphics.
  11. Watch this Ted Video to learn anything in 20 hours (YMMV).
  12. Where are all the Steeler fans?  Cowboy fans?  ….
  13. Productivity Hints.
  14. Copper + Magnets = Fun
  15. Stray dogs on the subway.
  16. Deep learning on NPR.
  17. Happy 40th birthday D&D
  18. Google is applying deep learning to images of house numbers
  19. Deep learning in your browser.
  20. How to write a great research paper.
  21. Do Deep Nets Really Need to be Deep?
  22. A variation on neural net dropout.
  23. Provable algorithms for Machine Learning
  24. 100 Numpy Exercises
  25. Learn and Practice Applied Machine Learning | Machine Learning Mastery

 

What would you do with 2880 cores?

Vasik Rajlich, author of the legendary chess program Rybka, used them to show that chess champion Bobby Fischer was right about the chess opening King’s Gambit (see “A bust for King’s Gambit“).

 

King’s Gambit

 

Vasik proved that King’s Gambit is a loss for white unless white plays 3. Be2 after black takes the pawn.  (In that case, it’s a draw.)  For the details, click on the link below.

http://en.chessbase.com/post/rajlich-busting-the-king-s-gambit-this-time-for-sure

In September 2008, I met John Dobson in Harrisburg.  We slept overnight at his friend Zoungy’s place and then enjoyed the beautiful drive to Cherry Springs for the Black Forest Star Party.  John was perhaps the most entertaining, charming, and inspiring person I have ever met.  He had a million stories about astronomy, philosophy, science, and life in general.  At age 93, he was recovering from a stroke, meeting new people, travelling alone with strangers like me, and giving entertaining, informative speeches to crowds numbering in the hundreds.  He would charm everyone he met.

John — rest in peace.

http://www.skyandtelescope.com/news/home/John-Dobson-1915ndash2014-240456881.html

http://www.universetoday.com/108150/john-dobson-inventor-of-the-popular-dobsonian-telescope-dead-at-98/

http://www.astronomy.com/news/2014/01/astronomy-popularizer-john-dobson-dies

 

polarplot2

Equation from Teacher’s Choice

So I have been wanting to understand Category Theory (see [1], [2], [3]) mainly because I thought it would help me understand advanced functional programming in Haskell and because of the book “Physics, topology, logic and computation: a Rosetta Stone” by Baez and Stay (2011).  A number of the constructs in Haskell were derived from Category Theory so that was enough motivation for me to learn it.  But also, monoidal categories are useful in several fields: physics, computer science, pure mathematics, and statistics. The biggest application for me is the idea of generalizing probabilistic graphical models through either Ologs or monoidal categories / tensor networks (see Brendan Fong’s thesis “Causal Theories: A Categorical Perspective on Bayesian Networks“).

To start my investigation of Category Theory, I began with the $20 thin book “Basic Category Theory for Computer Scientists” by Benjamin Pierce (see also the free online version “A taste of category theory for computer scientists” (1988)).

I really enjoyed the book.  Dr. Pierce’s style is a little informal compared to pure math books like Mac Lane’s “Categories for the Working Mathematician” (pdf / Amazon), but I enjoy that more relaxed style of writing when I am first learning a field.

The biggest obstacle to learning category theory is the fact that category theory generalizes a lot of areas of pure mathematics like topology, abstract algebra, and geometry.  It’s hard to generalize before you have examples to generalize from, but the examples being generalized in category theory are mostly from higher-level mathematics found in senior-level undergraduate and graduate courses. Pierce ameliorates this problem by introducing some of the most basic categories first: sets, ordered sets, partially ordered sets, groups, monoids, vector spaces, measure spaces, topological spaces, proofs, and a simple functional computer language.  He takes the time to explicitly define most of these ideas, so, in theory, you could read this book without a background in theoretical mathematics, but it would be hard.

After defining categories and introducing the most basic categories, Pierce describes and defines the most basic ideas in category theory: subcategories, commutative diagrams, monomorphisms, epimorphisms, isomorphisms, initial/terminal objects, products, coproducts, universal constructions, equalizers, pullbacks, pushouts, limits, cones, colimits, cocones, exponentiation, and closed Cartesian categories. These ideas are spelled out over the thirty pages of chapter one including illuminating homework exercises. The homework exercises varied significantly in difficulty.  Many of the exercises were trivial and there are two or three that I am still working on despite investing several hours of thought.  Generally, I found the exercises to be a bit harder than those in Mac Lane’s book, but Pierce’s book required less of a background in mathematics.  A couple of the exercises were incorrectly stated or impossible.

Chapter two introduced functors, natural transformations, adjoints, and F-algebras. After reading this chapter, I was finally able to understand the definition of monads which are an important part of the computer language Haskell!  Pierce provides many examples of each of these ideas and enjoyable homework exercises to increase understanding.  Pierce’s definition of adjoints is much easier to understand than the standard definitions using counit adjunction or Hom Sets.

The last major chapter concerns applications of category theory to computer science–specifically lambda-calculus and programming language design.  

The first two chapters of the book give a reasonable condensed introduction to category theory for students that have taken a course in abstract algebra.  A course in topology or linear algebra would be another useful prerequisite.  I carried around the light 100-page book for a few months so that I could learn something whenever I had some extra time.  I had hoped that once I had proven that several functors were monads, I would then really understand monads, but a full understanding still eludes me.  Similarly, I have proven that several functor pairs are adjoint, but I still don’t feel as though I understand adjoint functors.  I guess more work and learning are needed.

Have a great year.  –  Cheers, Hein

 

Check out Markus Beissinger’s blog post “Deep Learning 101”.  Markus reviews a lot of deep learning basics derived from the papers “Representation Learning: A Review and New Perspectives” (Bengio, Courville, and Vincent 2012) and “Deep Learning of Representations: Looking Forward” (Bengio 2013). Beissinger covers the following topics:

  • An easy intro to Deep Learning
  • The Current State of Deep Learning
  • Probabilistic Graphical Models
  • Principal Component Analysis
  • Restricted Boltzmann Machines
  • Auto-Encoders
  • “Challenges Looking Ahead”

This is a great intro and I highly recommend it.

If you want more information, check out Ng’s lecture notes, Honglak Lee’s 2010 NIPS slides, and Hinton’s videos ([2009] [2013]).

Zygmunt Zając at fastml.com has a great post titled “13 NIPS Papers that caught our eye.” Zając provides a short, readable summary of his favorite NIPS 2013 papers.  (NIPS 2013 just ended last week.) The papers are:

  1. Understanding Dropout  by Baldi and Sadowski
  2. Training and Analysing Deep Recurrent Neural Networks by Hermans and Schrauwen
  3. RNADE: The real-valued neural autoregressive density-estimator by Uria, Murray, and Larochelle
  4. Predicting Parameters in Deep Learning by Denil, Shakibi, Dinh, Ranzato, and de Freitas
  5. Pass-efficient unsupervised feature selection by Maung and Schweitzer
  6. Multi-Prediction Deep Boltzmann Machines by Goodfellow, Mirza, Courville, and Bengio
  7. Memoized Online Variational Inference for Dirichlet Process Mixture Models by Hughes and Sudderth
  8. Learning word embeddings efficiently with noise-contrastive estimation by Mnih and Kavukcuoglu
  9. Learning Stochastic Feedforward Neural Networks by Tang and Salakhutdinov
  10. Distributed Representations of Words and Phrases and their Compositionality by Mikolov, Sutskever, Chen, Corrado, and Dean
  11. Correlated random features for fast semi-supervised learning by McWilliams, Balduzzi, and Buhmann
  12. Convex Two-Layer Modeling by Aslan, Cheng, Zhang, and Schuurmans, and
  13. Approximate inference in latent Gaussian-Markov models from continuous time observations by Cseke, Opper, and Sanguinetti

I suggest reading Zając’s summaries before diving in.

Reddit has this pretty cool subreddit ELI5 = “Explain like I’m 5” where people try to explain topics as if they are talking to a five-year-old.  It’s an interesting exercise to see how much of a difficult topic like nuclear energy or solar physics you could explain to a five-year-old.  Here is my attempt to explain entropy and mutual information to a bright five-year-old who isn’t afraid of big words or long sentences.

(This post is going to be amazingly long and very simplistic, so I suggest you don’t bother reading it unless you think it might be fun to consider how to explain a complex concept like mutual information to a five-year-old.)

 

Random Variables

In order to explain (information theoretic) entropy we need to talk about random variables.  A random variable is a number that you get from a repeatable experiment.  For example, if you roll a die, the outcome is 1, 2, 3, 4, 5, or 6.  We say that the result of each die roll is a random variable.

 

Entropy

So random variables have something called entropy.  In order to talk about entropy it’s helpful to first talk about the doubling numbers.  The doubling numbers are 1, 2, 4, 8, 16, ….  I call them doubling numbers because 1+1 = 2, 2+2 =4, 4+4 =8, and so on.  (The official name of doubling numbers is “the powers of two” but let’s just call them doubling numbers for now.)

 

BITS

If a random variable has two equally likely outcomes, we say that the variable has an entropy of 1 bit.  If it has four equally likely outcomes, then the variable has 2 bits of entropy; eight outcomes give 3 bits, and 16 outcomes give 4 bits.  Of course, some random variables have, say, 6 outcomes, and 6 is not a doubling number, so we could say that such a variable has about two and a half bits because 6 outcomes is halfway between 4 outcomes (2 bits) and 8 outcomes (3 bits).
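For grown-ups: the number of bits for n equally likely outcomes is the base-2 logarithm of n, which is exactly the "halfway between doubling numbers" idea above.  A quick check in Python:

```python
import math

# Entropy (in bits) of n equally likely outcomes is log2(n)
for n in (2, 4, 6, 8, 16):
    print(n, "outcomes ->", round(math.log2(n), 2), "bits")
# 6 outcomes come out to about 2.58 bits, a little more than "two and a half"
```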

How did we get the name bits?  Well, in a computer bits are little electronic parts that are either on or off.  We use the code 0 for off and 1 for on.  (This code is called the binary code.)  We could represent the result of an experiment (a random variable) with these bits.  Suppose that our experiment is flipping a coin and we call tails 1 and heads 0.  Then we would need one bit to record the results of each coin flip.

 

Three bits for an eight sided DIE

The number of bits needed to record the roll of an eight-sided die would be larger. It can be done with three bits as follows.  If you roll a 1, set the bits to off-off-on (001).  If you roll a 2, set the bits to 010.  For a 3, set them to 011; 4 -> 100, 5 -> 101, 6 -> 110, 7 -> 111, and 8 -> 000.  Notice that we were able to represent every possible outcome with only 3 bits.  That’s why we say the entropy of an eight-sided die roll is three bits.
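The same encoding can be written down in one line of Python; the wrap of 8 around to 000 follows the table above.

```python
# Three-bit codes for an eight-sided die, with 8 wrapping around to 000
codes = {roll: format(roll % 8, '03b') for roll in range(1, 9)}
print(codes[1], codes[2], codes[3], codes[8])  # 001 010 011 000
```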

 

A multi-stage experiment

More complicated experiments would require some more difficult computations. Sometimes, the outcomes are not all equally likely.  Suppose our experiment has two stages.  In stage one we flip a nickel.  If the result is tails, we quit.  If the result is heads, we flip a quarter.  There are three possible results for this experiment:

  1. The nickel is tails
  2. The nickel is heads and the quarter is tails.
  3. The nickel is heads and the quarter is heads.

The nickel is tails about half the time and heads half the time, so we only have to record the quarter half of the time.  The average number of bits needed to record this experiment is one and a half: one bit for the nickel and one half bit for the quarter, because we only need to write down the quarter flip outcome (1 bit) half of the time.
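The same number drops out of the standard entropy formula (a step beyond the five-year-old version): entropy is the sum of −p·log2(p) over the outcomes.

```python
import math

# Outcomes: tails (p = 1/2), heads-then-tails (1/4), heads-then-heads (1/4)
probs = [0.5, 0.25, 0.25]
entropy = -sum(p * math.log2(p) for p in probs)
print(entropy)  # 1.5 bits, matching "one and a half" above
```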

 

MUTUAL INFORMATiON

Sometimes random variables share bits.  When that happens, we say that they have mutual information. Most random variables do not share information. In that case, we say the variables are independent and the mutual information (shared information) is zero.

 

A COMPLICATED EXAMPLE

Suppose that Harry and Sally are doing an experiment.  They flip a penny, a nickel, a dime, and a quarter.  After the experiment, Harry writes down the results of the penny flip and the nickel flip.  Sally writes down the results of the nickel, the dime, and the quarter.  There are four possible outcomes for Harry: Head-Head, Head-Tail, Tail-Head, and Tail-Tail.  We say that Harry’s recording uses 2 bits, so his entropy is 2 bits.  Sally could have 8 different results, but she would only need 3 bits, one for each coin, to record her results, so her entropy is 3 bits.  The mutual information between Harry and Sally is the nickel flip, which is just 1 bit (1 for heads, 0 for tails).  The overall entropy of Harry and Sally together is 4 bits because there are four coins.  The joint entropy of Harry and Sally together is the number of bits needed to record both Harry’s result and Sally’s result.  Harry uses 2 bits and Sally uses 3 bits, but only 4 bits are needed because the nickel is shared.

 

THE FORMULA

There is a general formula for the mutual information between Harry and Sally.

Mutual Information = Harry’s Entropy + Sally’s Entropy – The Joint Entropy.

For this experiment the mutual information is 2 + 3 – 4, which is 1 bit.
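The whole Harry-and-Sally calculation can be checked by brute force over the 16 equally likely flip combinations.  This Python sketch computes each entropy from outcome counts and then applies the formula above.

```python
import math
from itertools import product
from collections import Counter

# All 16 equally likely flips of (penny, nickel, dime, quarter)
flips = list(product('HT', repeat=4))

def entropy(outcomes):
    """Entropy in bits of an equally weighted list of outcomes."""
    counts = Counter(outcomes)
    n = len(outcomes)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

harry = [(p, n) for p, n, d, q in flips]       # penny and nickel
sally = [(n, d, q) for p, n, d, q in flips]    # nickel, dime, and quarter
joint = list(zip(harry, sally))                # both records together

mi = entropy(harry) + entropy(sally) - entropy(joint)
print(entropy(harry), entropy(sally), entropy(joint), mi)  # 2.0 3.0 4.0 1.0
```

The shared nickel is what keeps the joint entropy at 4 bits instead of 2 + 3 = 5, and that 1-bit gap is exactly the mutual information.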

 

 End Note

Well, that’s it.  Maybe I gave an explanation appropriate for twelve-year-olds. Maybe not.  I had my eighth-grade son read it and then I gave him a six-question quiz. After I corrected two of his answers, he was able to calculate the mutual information of two fairly simple random variables.

 
