
Study of Game Strategy Emergence by using Neural Networks

Ladislav Clementis
Institute of Applied Informatics
Faculty of Informatics and Information Technologies
Slovak University of Technology in Bratislava
Ilkovičova 2, 842 16 Bratislava, Slovakia
clementis@fiit.stuba.sk

Abstract

In artificial intelligence systems, various machine learning algorithms are used as learning algorithms. The most used artificial intelligence approaches are symbolic rule-based systems and subsymbolic neural networks. The main objective of this work is to study game strategy emergence by using the subsymbolic approach - neural networks. From the viewpoint of artificial intelligence, games in general are interesting. Games are often complex even if their definitions, rules and goals are simple. In this work we are concerned with the Battleship game. The Battleship game is a representative of games with incomplete information. We will design and implement solutions based on the subsymbolic artificial intelligence approach to solve the Battleship game. We will use machine learning techniques such as supervised learning and reinforcement learning for this purpose. We will compare the machine learning techniques used by means of simulation results and statistical data of human players.

Categories and Subject Descriptors

F.1.1 [Theory of Computation]: Computation by Abstract Devices - Models of Computation; G.3 [Mathematics of Computing]: Probability and Statistics - Markov processes, Probabilistic algorithms (including Monte Carlo), Stochastic processes; I.2.6 [Computing Methodologies]: Artificial Intelligence - Learning; I.2.8 [Computing Methodologies]: Artificial Intelligence - Problem Solving, Control Methods, and Search; I.5.1 [Computing Methodologies]: Pattern Recognition - Models

Keywords

game strategy, neural network, probability based heuristic, reinforcement learning, supervised learning

Recommended by thesis supervisor: Prof. Vladimír Kvasnička. Defended at the Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava on December 30, 2014.

© Copyright 2011. All rights reserved.

1. Introduction

Games in general are suitable for studying many AI approaches. Games, especially board games, are simple and well defined, and playing strategies are well comparable. Therefore it is simple to evaluate game strategies developed by AI systems, especially by machine learning techniques.

Popular breakthroughs have been achieved by AI systems playing board games [4, 11, 16, 21, 27, 28, 33].

In our work we study the Battleship game as an AI problem. We will study the Battleship game by symbolic and subsymbolic AI approaches. We will use multiple machine learning techniques. We will compare these approaches by solving the Battleship game.

We define our work objectives in section 2.

2. Objectives of the Work

We generalize the main objectives of our work. The main objectives are the following:

- Design and describe a probability-based heuristic for the Battleship game strategy
- Use a subsymbolic artificial feed-forward network adapted by supervised learning (the gradient descent method) to solve the Battleship game
- Use a subsymbolic artificial feed-forward network adapted by reinforcement learning to solve the Battleship game
- Compare the quality of the game strategies learned by the subsymbolic approaches used
- Compare the game strategy emergence effectiveness of the subsymbolic approaches used

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific permission and/or a fee. Permissions may be requested from STU Press, Vazovova 5, 811 07 Bratislava, Slovakia.

3. Artificial Intelligence Approaches

Artificial intelligence (AI) [1, 2, 11, 15, 18, 20] is a wide field of study. AI is usually defined as "the study and design of intelligent agents". AI problems are mainly based on natural intelligent systems and processes.

The emergence of the AI research field is linked to the development and progress in computer technology that enabled advanced AI development.

The AI goal is to study and design artificial systems concerning:

- perception
- deduction
- reasoning
- problem solving
- knowledge representation and knowledge transformation
- learning
- planning
- natural language processing
- motion and manipulation
- social intelligence
- general intelligence - strong AI

The AI field is interdisciplinary and it overlaps with many other disciplines such as mathematics, computer science, psychology, philosophy, linguistics, neuroscience, cognitive science, etc.

AI uses many adjustable tools to design advanced systems:

- logic programming
- probabilistic and statistical tools
- search and optimization algorithms
- evolutionary computation
- machine learning techniques
- classifiers
- expert systems
- neural networks, etc.

AI uses these tools to develop advanced AI systems. Advanced AI systems are used for theoretical, research and practical purposes.

We distinguish between cybernetic, high-level symbolic, low-level subsymbolic and hybrid AI approaches.

3.1 Subsymbolic Artificial Intelligence Approach

Connectionist subsymbolic AI uses parallel and/or sequentially processed information. In subsymbolic theory, information is represented by a simple sequence of pulses. Transformations are realized by simple calculations performed by artificial neurons to solve AI problems.

Theoretical concepts of subsymbolic AI have their origin in neuroscience. Networks of neurons, called neural networks, are inspired by animal central nervous systems (in particular the brain) [3, 13, 15, 22, 24, 26, 33].

A single neuron (a nerve cell) is composed of:

- dendrites - input branched connections
- a cell body, which processes pulses
- an axon - output connection

These biological neurons and biological neural networks have inspired the subsymbolic AI approach, creating computational models of artificial neurons and artificial neural networks (ANNs).

The simple ANN can be formally defined as a parametric mapping of an input vector x to an output activity y, as shown by equation 1:

y = G(x; w, ϑ)    (1)

In equation 1, x is the input vector, w is the vector of neural weights and ϑ is the vector of neural thresholds.

The simple ANN, as shown in figure 1, consists of three types of layers:

- input layer
- hidden layer(s)
- output layer

Figure 1: The simple three-layer ANN with the input layer x, the single hidden layer u and the output layer y. The output layer y consists of a single neuron y.

An artificial neuron, shown in figure 2, is a mathematical function, shown by equation 2, based on:

- the input vector x
- the vector of neural weights w
- the threshold ϑ
- the transfer function t()
- the output (activity) y

Figure 2: The simple artificial neuron with the input vector x, the vector of weights w, the threshold ϑ and the output activity y.

y = t(Σ_{i=1..n} w_i x_i − ϑ)    (2)

An artificial neuron function can be represented by its linear combination, shown by equation 3, which is used as an input to a transfer function t():

y = t(w_1 x_1 + w_2 x_2 + w_3 x_3 + ... + w_{n−1} x_{n−1} + w_n x_n − ϑ)    (3)

In equations 4 and 5 we use the weighted sum u of all n neuron inputs, where w is the vector of weights and x is the input vector:

u = Σ_{i=1..n} w_i x_i    (4)

y = t(u − ϑ)    (5)

There are many types of ANN transfer functions; the most used are:

- the sigmoid function
- the step function
- a linear combination

The simple step function, usually used by the perceptron, is described by equation 6:

y = 1 if u ≥ ϑ;  y = 0 if u < ϑ    (6)

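To make equations 2 to 6 concrete, here is a minimal Python sketch of the artificial neuron with the step transfer function; the function names and the example numbers are illustrative assumptions, and only the weighted sum and the threshold rule follow the equations above.

```python
# Minimal sketch of the artificial neuron of equations 2-6 (illustrative only).

def step_transfer(u, threshold):
    """Step transfer function of equation 6: 1 if u >= threshold, else 0."""
    return 1 if u >= threshold else 0

def neuron_output(x, w, threshold):
    """Single neuron output y = t(sum_i w_i * x_i - threshold), equations 2-5."""
    u = sum(wi * xi for wi, xi in zip(w, x))   # weighted sum u, equation 4
    return step_transfer(u, threshold)         # y = t(u - threshold), equation 5

# Example with made-up values: two inputs, one neuron.
print(neuron_output([1.0, 0.0], [0.7, 0.4], threshold=0.5))   # prints 1, since u = 0.7 >= 0.5
```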

Simple ANNs can be designed to solve simple AI problems. When dealing with complex AI problems, ANNs are usually adapted by machine learning techniques:

- supervised learning
- unsupervised learning
- reinforcement learning
- deep learning

When we are dealing with complex issues, it is difficult to define symbolic rules explicitly. We can use subsymbolic approaches to develop a knowledge base by using machine learning techniques.

4. The Battleship Game

The Battleship game is a guessing game for two players. It is known worldwide as a pencil-and-paper game which dates from World War I. It has been published by various companies as a pad-and-pencil game since 1931. We should mention some known commercial origins of this game:

- The Starex Novelty Co. of NY published the game as Salvo (1931)
- The Strathmore Co. published the game as Combat, The Battleship Game (1933)
- The Milton Bradley Company published the pad-and-pencil game Broadsides, The Game of Naval Strategy (1943)
- The Battleship game was released as a plastic board game by the Milton Bradley Company (1967)

The game has been popularized under the name "The Battleship game" by the Milton Bradley Company, as it was published as a plastic board game in 1967.

4.1 The Original Battleship Game

We provide a description of the Battleship game as an interactive game for two players, Player1 and Player2. The game is iterative, with both players performing the following tasks simultaneously:

1. initial deployment of Player1's and Player2's ships
2. iterative shooting into Player1's and/or Player2's battlefield until an enemy ship is hit
3. complete ship destruction in Player1's and/or Player2's battlefield

Task 1 is performed just once by both players, initially at the beginning of each game, before the "action" part starts. Tasks 2 and 3 are repeated sequentially until all ships in Player1's or Player2's battlefield are destroyed (i.e. one player destroys the opponent's ships).

Player1 wins if he reveals all ships in Player2's battlefield completely first, i.e. before Player2 does the same in Player1's battlefield. Similarly, Player2 wins if he reveals all ships in Player1's battlefield completely first.

Originally (in the Milton Bradley Company version), ships were placed in a battlefield of size 10 × 10. This battlefield included a set of linear (oblong) shaped ships, with vertical and horizontal orientations allowed. Originally, the ship shapes were as follows (all with a width of 1):

- Aircraft Carrier (length of 5)
- Battleship (length of 4)
- Submarine, and Cruiser or Destroyer (length of 3)
- Destroyer or Patrol Boat (length of 2)

Many definition and/or rule variations are present in the history of this game, for example:

- after a successful hit, the player gains one more hit attempt in a row
- different battlefield sizes (7 × 7, 8 × 8, 8 × 10, 10 × 12, etc.)
- many different ship shapes, etc.

4.2 Simplified Modification of the Battleship Game as an Optimization Task

We can consider the Battleship game from the perspective of a single player, who is solving just a single instance of the "shooting" position decision problem at a time [23]. In this case, our optimization task is to minimize the cumulative number of our hit attempts while revealing all of the opponent's ships completely.

The player, whose task is to reveal the ships in the enemy battlefield, makes a decision each time before a hit attempt. Respectively, his task is to pick the position in the enemy battlefield to be shot at next. This decision making can be formally described as a Markov decision process (MDP), because the player's decision making is independent of the result of the decision made in the previous iteration. Since the problem space is not completely visible, this process can be described as a partially observable Markov decision process (POMDP) [8, 9, 17, 26, 32]. All currently available information is stored in the current state of the environment. Therefore, no information about previous decision making is needed. Apart from the current state of the environment, the sequence order of previous hit attempts is irrelevant.

The player is formally modeled as an agent. The enemy battlefield (including the ship placements) is taken into account as a problem instance. The current view of the agent (whose view is incomplete information represented by the partially revealed problem instance) is considered as the current state of the environment. A ship represents a pattern present in the problem instance. The ship placement may not be completely known because of the incomplete information about the problem instance, represented by the current state of the environment. A ship placement in the environment is formally considered as a pattern placement (rotation and position) in the problem instance.

We provide a simplified modification of the Battleship game for description and simulation purposes (shown in figure 3). Our modification is based on a changed environment size, changed pattern sizes and a changed pattern quantity.

Figure 3: Example of a problem instance, with pattern placements corresponding to the Battleship game rules (pattern deployment rules). This example corresponds with our modification (an environment with a size of n × n = 7 × 7 and two "L"-shaped patterns with a size of 2 × 3).

Figure 4: The current state of the environment corresponding to the problem instance shown in figure 3. Gray cells are not revealed. Black and white cells are already revealed. A white cell does not contain a part of a pattern. A black cell contains a part of a pattern.

We can simply summarize this modification as follows:

- only a single-player optimization perspective
- environment size of n × n = 7 × 7 = 49 "cells" (the maximum number of hit attempts is 49)
- "L"-shaped patterns with a size of 4 cells arranged in 2 × 3 shapes (with all 8 rotation permutations allowed)
- two patterns are present in the environment

All other trivial rules of the Battleship game remain unchanged (including the deployment rules):

- pattern overlap (even partial) is not permitted
- patterns may not share a common edge
- the pattern placement configuration remains stable while solving a single problem instance (so making a hit attempt at the same position twice is pointless)
- the response from the environment to a hit attempt is always truthful and undeniable (like an oracle which always responds "YES" or "NO" truthfully), etc.

We provide the example problem instance shown in figure 3. This example problem instance is in correspondence with our simplified modification of the Battleship game. We will be using this example problem instance in this work for simulations, further explanations and descriptions.

Let us define an example of an environment state. In an environment state we have incomplete information about the problem instance. The example environment state which we will be using in this contribution is shown in figure 4.

In the current environment state, we have already performed hit attempts at cells D3, B4, G5 and D4. The first three hit attempts at cells D3, B4 and G5 (the order is irrelevant) were unsuccessful, until the hit attempt at cell D4 was finally successful. At this stage, we (or the agent solving this problem) should perform some reasonable decision making to maximize the probability that the next hit attempt will be successful. We provide a description of the probabilistic heuristic in section 4.3. Note that the initial hit attempts (before the first successful hit) should also follow at least some reasonable strategy to increase the success probability of a hit attempt.

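To make the example environment state concrete (misses at D3, B4 and G5, a hit at D4), the following is a minimal sketch of one possible state representation; it is not the paper's implementation, and the cell encoding and helper names are assumptions used only for illustration.

```python
# Sketch of a 7 x 7 environment state: each cell is UNKNOWN (not yet revealed),
# MISS (revealed, contains no pattern part) or HIT (revealed, contains a pattern part).
UNKNOWN, MISS, HIT = 0, 1, 2

ROWS = "ABCDEFG"   # row labels as used in figures 3 and 4
N = 7              # environment size n x n = 7 x 7

def empty_state():
    return [[UNKNOWN] * N for _ in range(N)]

def mark(state, cell, value):
    """Record the result of a hit attempt, e.g. mark(state, "D4", HIT)."""
    row, col = ROWS.index(cell[0]), int(cell[1]) - 1
    state[row][col] = value

state = empty_state()
for miss in ("D3", "B4", "G5"):    # the three unsuccessful hit attempts
    mark(state, miss, MISS)
mark(state, "D4", HIT)             # the first successful hit attempt
```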

4.3 Probability-Based Heuristic to the Simplified Battleship Game

We provide a description of the probabilistic heuristic approach [5, 6, 7, 19, 29, 31] to the simplified Battleship game described in section 4.2. The first successful hit attempt has already been performed in the example environment. The current state of the environment is shown in figure 4.

Cells D3, B4 and G5 do not contain a part of a pattern, but cell D4 does contain a part of a pattern. What we do not know is the specific pattern rotation (rotation and flip, actually) and position. Therefore, in this case we cannot determine the cell to be shot at next with a 100% probability of a successful result. But we can maximize the successful hit probability by a probability-based heuristic.

According to the simplified Battleship game described in section 4.2, the patterns placed in the environment are "L"-shaped patterns with a size of 2 × 3 squares, with 8 possible rotations (or permutations). All these possible permutations together form the set P = {P1, P2, P3, P4, P5, P6, P7, P8}. All possible pattern permutations are shown in figure 5.

Figure 5: All 8 possible pattern permutations, 4 vertical and 4 horizontal.

In correspondence with the example environment state shown in figure 4, multiple pattern permutations match this environment state. Furthermore, multiple pattern permutations have more than one possible position which matches the current state of the environment. The numbers of possible pattern positions according to the pattern permutations are shown in table 1.

As shown in table 1, there are in total 17 possible pattern placements (all possible pattern permutations with their corresponding possible positions) that match the current state of the environment (PosConf = 17), such that just one pattern is considered and cell D4 contains its part. Each one of the 17 possible placements covers cell D4 and three other cells in addition. Therefore, each cell in the environment is covered by some number of possible pattern placements. In our example, each cell has a number which is bounded by the range from 0 to 17. These non-zero numbers (Coverings) for each cell are shown in figure 6 and in table 2.

Figure 6: The current state of the environment enriched by the non-zero numbers of pattern placement coverings of each cell. The white numbers placed in dark-gray cells are pattern placement coverings.

The CoveringProbability of each cell can be calculated as the number of Coverings for the cell divided by the sum of all Coverings. This calculation is given by equation 7:

CoveringProbability_i = Coverings_i / Σ_i Coverings_i    (7)

The HitProbability of each cell is the overall probability that a hit attempt at this position will result in a successful hit. Because in the current state of the environment we are missing three remaining parts of the current pattern (MissingParts = 3), the HitProbability of each cell is three times higher than its CoveringProbability. The calculation is given by equations 8 and 9:

HitProbability_i = CoveringProbability_i × MissingParts    (8)

HitProbability_i = (Coverings_i × MissingParts) / Σ_i Coverings_i    (9)

The sum Σ_i Coverings_i is three times (MissingParts times) higher than the number of possible pattern placements PosConf (equation 10). Therefore, the HitProbability can be described by equation 11:

Σ_i Coverings_i = MissingParts × PosConf    (10)

HitProbability_i = Coverings_i / PosConf    (11)

For each cell i, the non-zero value of Coverings_i and the calculated CoveringProbability_i and HitProbability_i of all n × n = 7 × 7 cells are shown in table 2.

According to the HitProbability values shown in table 2, the most reasonable cell for the next hit attempt is cell E4, with a HitProbability of 0.5882. Cells C4 and D5 are acceptable too, with HitProbability values of 0.4118. Shooting at these positions is reasonable according to this probability-based heuristic.

Note that if a hit attempt at a cell with a high HitProbability value results in an unsuccessful hit, a high number of pattern placements will be excluded from the possible pattern placements in the current state of the environment, and fewer pattern placements will remain.

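The covering counts and hit probabilities of equations 7 to 11 can be computed mechanically by enumerating every pattern placement that matches the current state. The sketch below reuses the UNKNOWN/MISS/HIT encoding and grid size N from the earlier state sketch; it is an assumed illustration, not the author's implementation, and the enumeration of all placements of the 8 permutations over the 7 × 7 grid is left to the caller.

```python
from collections import Counter

def consistent(placement, state):
    """A placement (a set of (row, col) cells of one pattern permutation at one
    position) matches the current state if it covers every revealed HIT cell
    and avoids every revealed MISS cell."""
    hits = {(r, c) for r in range(N) for c in range(N) if state[r][c] == HIT}
    misses = {(r, c) for r in range(N) for c in range(N) if state[r][c] == MISS}
    return hits <= placement and not (misses & placement)

def hit_probabilities(placements, state):
    """Count Coverings_i over all consistent placements and return
    HitProbability_i = Coverings_i / PosConf, i.e. equation 11."""
    matching = [p for p in placements if consistent(p, state)]
    pos_conf = len(matching)                      # PosConf (17 in the example)
    coverings = Counter()                         # Coverings_i per unrevealed cell
    for placement in matching:
        for cell in placement:
            if state[cell[0]][cell[1]] == UNKNOWN:
                coverings[cell] += 1
    return {cell: count / pos_conf for cell, count in coverings.items()}
```

For the example state, the value returned for cell E4 should come out as 10/17 ≈ 0.5882, matching table 2.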

Table 1: Numbers of possible pattern positions in correspondence with the current state of the environment. The numbers correspond to all possible pattern permutations.

Pattern permutation option:      P1  P2  P3  P4  P5  P6  P7  P8  |  Σ
Number of possible placements:    2   3   2   2   2   2   2   2  |  17

Table 2: CoveringProbability and HitProbability distribution in the current state of the environment. Cells C4, D5 and E4 have the highest probability values. Note that the sum of the HitProbability values equals 3 because there are 3 unrevealed parts of the current pattern.

Position_i   Coverings_i   CoveringProbability_i   HitProbability_i
B1            1            1/51 ≈ 0.0196           1/17 ≈ 0.0588
C2            1            1/51 ≈ 0.0196           1/17 ≈ 0.0588
C3            2            2/51 ≈ 0.0392           2/17 ≈ 0.1176
C4            7            7/51 ≈ 0.1373           7/17 ≈ 0.4118
C5            3            3/51 ≈ 0.0588           3/17 ≈ 0.1765
C6            2            2/51 ≈ 0.0392           2/17 ≈ 0.1176
D5            7            7/51 ≈ 0.1373           7/17 ≈ 0.4118
D6            4            4/51 ≈ 0.0784           4/17 ≈ 0.2353
E2            1            1/51 ≈ 0.0196           1/17 ≈ 0.0588
E3            2            2/51 ≈ 0.0392           2/17 ≈ 0.1176
E4           10            10/51 ≈ 0.1961          10/17 ≈ 0.5882
E5            3            3/51 ≈ 0.0588           3/17 ≈ 0.1765
E6            2            2/51 ≈ 0.0392           2/17 ≈ 0.1176
F3            1            1/51 ≈ 0.0196           1/17 ≈ 0.0588
F4            3            3/51 ≈ 0.0588           3/17 ≈ 0.1765
F5            2            2/51 ≈ 0.0392           2/17 ≈ 0.1176
Σ            51            51/51 = 1               51/17 = 3

5. Solving the Simplified Battleship Game by Using a 3-Layer Neural Network Adapted by Using the Gradient Descent Method

In the current state of the environment shown in figure 4 and figure 6, the area in which the complete pattern is hidden ranges from B2 to F6. This area has a size of 5 × 5 = 25 cells and contains the remaining parts of the pattern.

The neural network input information consists of 7 × 7 = 49 cells. We will use the information about the cells as an input for the feed-forward neural network shown in figure 7. The information will be enriched by the successful next hit attempt performed.

We will use the three-layer neural network shown in figure 7 as a cognitive decision-making player [14]. In the current state of the environment shown in figure 4, we will create all states that are possible next states (each possible next state with one more uncovered square, with a successful hit result predicted). The area where the remaining pattern parts can be is within the 5 × 5 = 25 area. We will use the configurations of these states as an input for the neural network. This concept is described in figure 8.

The possible next state with the highest output activity is considered the most probable to fit the current state of the environment, as shown by equation 12:

y_opt = arg max_i y_i    (12)

By taking advantage of back-propagation in the gradient descent method, calculating the partial derivatives of the output error with respect to the weights and thresholds, we approximate the weight and threshold values. Therefore, the neural network shown in figure 7 works as a correct parametric mapping of the input vector x to an output activity y when the adaptation process is finished. The desired accuracy ε should be set properly to make the final mapping precise enough with respect to the effectiveness of the adaptation process.

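The decision step of equation 12 can be sketched as follows: each candidate next state (the current state with one more cell assumed to be a hit) is fed through a three-layer network, and the candidate with the highest output activity y is chosen. This is an illustrative sketch only; the hidden-layer size, the sigmoid transfer function, the random stand-in parameters and all names are assumptions, and the gradient descent adaptation that would produce the real weights and thresholds is not reproduced here.

```python
import math
import random

N_IN, N_HIDDEN = 49, 10     # 7 x 7 input cells; the hidden-layer size is an assumption

random.seed(0)              # illustrative random parameters (stand-ins for adapted ones)
W_HIDDEN = [[random.uniform(-1, 1) for _ in range(N_IN)] for _ in range(N_HIDDEN)]
TH_HIDDEN = [random.uniform(-1, 1) for _ in range(N_HIDDEN)]
W_OUT = [random.uniform(-1, 1) for _ in range(N_HIDDEN)]
TH_OUT = random.uniform(-1, 1)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def network_output(cells):
    """Three-layer feed-forward pass: 49 input cells -> hidden layer -> single output y."""
    hidden = [sigmoid(sum(w * x for w, x in zip(row, cells)) - th)
              for row, th in zip(W_HIDDEN, TH_HIDDEN)]
    return sigmoid(sum(w * h for w, h in zip(W_OUT, hidden)) - TH_OUT)

def choose_next_state(candidate_states):
    """Equation 12: y_opt = arg max_i y_i over the candidate next states,
    each candidate being a flat list of 49 cell values."""
    return max(candidate_states, key=network_output)
```
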
6. Solving the Simplified Battleship Game by Using a 3-Layer Neural Network Adapted by Using Reinforcement Learning

In machine learning, various approaches are used. When using supervised learning, input/output pairs are presented (labeled examples). When using reinforcement learning [8, 9, 10, 25], which is inspired by behaviorist psychology, the policy is learned on-line by performance.

Reinforcement learning is used in symbolic and also in subsymbolic approaches. A symbolic rule-based system like the Learning Classifier System (LCS) [5, 6, 12] uses a reinforcement learning method to evaluate classifiers during their evolutionary process. The LCS uses environment feedback as a reward response to an action performed. A neural network, as a member of the subsymbolic approaches, is also capable of learning an output policy by using feedback as a reward response to an action performed.

In general, the reinforcement learning concept is based on:

- a set of environment states S
- a set of actions A
- state transition rules
- transition reward rules
- agent observation rules

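One way to picture how these five components map onto the simplified Battleship game is sketched below (an assumed illustration reusing the cell encoding from the earlier state sketch, not the paper's code): a state is the partially revealed grid, an action is the cell shot at next, the transition reveals that cell, and the reward uses the +1/−1 values chosen later in equation 15.

```python
def step(state, action, pattern_cells):
    """One environment transition: apply action (row, col) to the current state
    and return (next_state, reward). pattern_cells is the hidden set of cells
    occupied by the patterns, known only to the environment."""
    row, col = action
    next_state = [list(r) for r in state]        # copy the observed grid
    hit = (row, col) in pattern_cells            # truthful environment response
    next_state[row][col] = HIT if hit else MISS
    return next_state, (1.0 if hit else -1.0)    # reward values as in equation 15
```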

Figure 7: The artificial neural network mapped to the cells of the environment.

In the current state of the environment s_t ∈ S, a set of possible next states S_{t+1} ⊆ S is defined by the state transition rules. Our decision problem is to choose the next state s_{t+1} ∈ S_{t+1} with the highest (predicted) reward response. For this purpose we develop a policy heuristic on-line, which is built on the state-action-reward history.

Performing an action a_t ∈ A (as an allowed action in state s_t) defines the transition between states s_t and s_{t+1}, as described by equation 13:

s_{t+1} = a_t(s_t);  s_t ∈ S;  s_{t+1} ∈ S_{t+1} ⊆ S;  a_t ∈ A    (13)

Reward feedback is a scalar environment response value. The environment responds to an action performed, i.e. to a state transition. The reward is defined by the transition reward rules, as defined by equation 14:

r_{t+1}(s_t, a_t, s_{t+1}) ∈ ℝ;  s_t ∈ S;  s_{t+1} ∈ S_{t+1} ⊆ S;  a_t ∈ A    (14)

If the state space is completely observable to the agent, his next-state decision making is described as an MDP. If the state space is only partially observable, by reason of the observation rules, its stochasticity, complexity and/or size, the decision making can be described as a POMDP.

The agent makes decisions based on a policy he develops during his history. Reinforcement learning itself requires clever exploration mechanisms. The decision policy can be deterministic or stochastic. From the theory of MDPs it is known that the search can be restricted to the set of so-called stationary policies. A policy is called stationary if the action distribution returned by it depends only on the current state of the environment.

A stochastic policy, e.g. roulette selection or a random decision (with a small non-zero probability 1 − ε, where 0 < ε < 1), is appropriate to ensure exploration diversity.

After an action is performed in the environment, the environment responds by providing a reward. This reward is directly dependent on the shooting result. The transition reward rules and values depend on the implementation. We choose the values shown in equation 15:

r_{t+1}(s_t, a_t, s_{t+1}) = +1 if the action has resulted in a hit;  −1 if the action has not resulted in a hit    (15)

The state s_{t+1} information is used to update the current state, s_t ← s_{t+1}, which will be the new current state in the next iteration.

For the neural network adaptation we will use a Q-learning-inspired approach [30] for updating the neural network weights. As when using the gradient descent method, we will use the three-layer neural network shown in figure 7 as a cognitive [14] decision-making player. In the current state of the environment shown in figure 4, we will create all states that are possible next states (each possible next state with one more uncovered square, with a successful hit result assumed). This concept is shown in figure 8.

Figure 8: Diagram showing how all k possible states with successful hit prediction are created. These states are evaluated by the neural network for the action decision.

The Q-learning algorithm uses Q-values, which are updated by equation 16:

Q_{t+1}(s_t, a_t) = Q_t(s_t, a_t) + α(s_t, a_t) [r_{t+1}(s_t, a_t, s_{t+1}) + γ max_a Q_t(s_{t+1}, a) − Q_t(s_t, a_t)]    (16)

The Q-learning algorithm is parameterized by the learning rate α(s_t, a_t) to control the Q-value change rate. The parameter γ, called the discount factor, is used to reduce the influence of the future reward in the Q-value update.

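For reference, the reward of equation 15 and the tabular update of equation 16 can be sketched as follows. This is the standard Q-learning update that the weight-and-threshold update described next is inspired by, not the paper's neural-network update itself; the dictionary-based Q-table and the default values of α and γ are assumptions.

```python
def reward(hit):
    """Equation 15: +1 if the action has resulted in a hit, -1 otherwise."""
    return 1.0 if hit else -1.0

def q_update(Q, s_t, a_t, r_next, s_next, next_actions, alpha=0.1, gamma=0.9):
    """Equation 16: Q(s_t, a_t) += alpha * (r + gamma * max_a Q(s_next, a) - Q(s_t, a_t)).
    States and actions are assumed to be hashable (e.g. tuples)."""
    best_next = max((Q.get((s_next, a), 0.0) for a in next_actions), default=0.0)
    q_old = Q.get((s_t, a_t), 0.0)
    Q[(s_t, a_t)] = q_old + alpha * (r_next + gamma * best_next - q_old)
```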

We use a similar approach to the Q-learning algorithm to modify the neural network weights and thresholds. If an action performed has resulted in a hit, the success information is back-propagated into the neural network by modifying the neural network weights and thresholds. The update starts from the output layer and continues through the hidden layer, updating the weights of the active neurons which have actively participated in the current decision making. Thus the neural network learns the environment response by reinforcing the neural network's inner preferences. Conversely, if the environment response is negative, the activities of the active neurons are suppressed.

This approach is similar to the value back-propagation used in the gradient descent method approach. The difference is that only binary (success or no success) information is used for the neural network update. Information about the probability distribution and correct input/output pairs is unavailable. Only the success or no-success information is used to learn the policy.

7. Simulation Results

We provide data, presented by diagrams, showing the effectiveness of the approaches when revealing two patterns of the simplified Battleship game. The neural network is adapted by two techniques, supervised learning and reinforcement learning, for comparison. We also include the results of humans playing the simplified Battleship game.

For the purpose of comparing the approaches and the quality of the game strategy learned, we will use these metrics:

- visualized data - graphs of the success rate according to hit attempts
- the average number of hit attempts performed to reveal both patterns
- our custom score

Our custom score is defined as the average success rate according to hit attempts, evaluated by the vector k shown by equation 17:

k = (49/49, 48/49, 47/49, ..., 2/49, 1/49) = (1, 0.98, 0.96, ..., 0.04, 0.02)    (17)

7.1 Human Solving the Simplified Battleship Game

We have performed a survey including 12 human players. We have played 100 games in total. Each human was informed about the probability-based heuristic described in section 4.3. The average human results are shown in figure 13.

The average results have shown that the human players have the following characteristics:

- average number of hit attempts performed to reveal both patterns: 21.02150

7.2 Neural Network Results of Solving the Simplified Battleship Game by Using the Supervised Learning

After performing 40000 learning iterations, the results have shown that the neural network was adapted.

7.3 Neural Network Results of Solving the Simplified Battleship Game by Using the Reinforcement Learning

Figure 11 shows the average success rate of the neural network adapted by the reinforcement learning. The analysis of 10000 adaptation runs has shown that the neural network adapted by the reinforcement learning has:

- a sufficient number of learning iterations: 160000
- an average number of hit attempts performed to reveal both patterns: 20.87423
- our custom score: 6.03007

The average number of hit attempts performed to reveal both patterns was slightly higher than when using the neural network adapted by the supervised learning. The RL average number of hit attempts was 20.87423, while the SL average number of hit attempts was 19.85320. The graphical difference in the average success rate of the NN adapted by RL and by SL is shown in figure 12.

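To illustrate the custom score of equation 17 under one assumed reading (the transcription does not spell out the aggregation): the average success rate observed after the i-th hit attempt is weighted by k_i and the weighted values are summed, so early success contributes more than late success. The sketch below follows that assumption and is not guaranteed to reproduce the reported values such as 6.03007.

```python
# Weighting vector k of equation 17: k_i = (50 - i) / 49 for i = 1..49,
# i.e. (1, 0.98, 0.96, ..., 0.04, 0.02).
K = [(50 - i) / 49 for i in range(1, 50)]

def custom_score(success_rates):
    """Assumed scoring: success_rates[i - 1] is the average success rate after
    i hit attempts (a value in [0, 1]); the score is the k-weighted sum."""
    assert len(success_rates) == len(K)
    return sum(k * s for k, s in zip(K, success_rates))
```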
