CS 285: Multi-Agent Systems


CS 285: Multi-Agent Systems. Fall 2013, Lecture 1. Prof. David C. Parkes, Harvard SEAS

Lecture 1: Lesson plan
- What is a MAS?
- A retrospective on early MAS research
- Class outline

What is a Multi-Agent System?
- A system with multiple autonomous entities, with distributed information, computational ability, and possibly divergent interests.
- Agents :: artificial or human, cooperative or self-interested

One view of an agent (Russell 1997)

Two themes of MAS research
- Design of intelligent agents that coordinate or compete with each other
- Design of the coordination environment

Early example: UM Digital Library (Weinstein, Birmingham and Durfee 1996-98)

Agents (as viewed by the UMDL)
- May team with each other to achieve goals
- Encapsulate well-defined services
- Can make decisions according to preferences
- May use “mentalistic concepts” such as belief, desire and intention
- Proactive (initiate actions to achieve goals)

cf. “agent-oriented programming” (Shoham; Jennings and Wooldridge)

MAS: A Brief History
- ContractNet (Davis and Smith ’81)
- Consensus (Ephrati and Rosenschein ’91)
- Distr. CSP (Yokoo et al. ’92-95; ’97-05)
- Org. design (Decker and Lesser ’93-95)
- Contracts & coalitions (Sandholm & Lesser ’93-98)
- Market-oriented programming (Wellman ’93)
- Rules of encounter (Zlotkin and Rosenschein ’93)
- Multi-agent Inf. Diagrams (Milch and Koller ’00-01)

ContractNet (Smith and Davis ‘81)

Motivation
- Distributed problem solving
  – No one has sufficient info to solve entire problem
  – Control and data distributed
- “How can systems that are perfectly willing to accommodate one another act so as to be an effective team?”
- Nodes (KS’s) cooperate by sharing subtasks of the overall problem

ContractNet (Smith and Davis 1981)

“Connection problem”
- Nodes with tasks to execute can find the most appropriate idle nodes to execute them
- Crucial to maintaining the focus of the problem solver
- “Most appropriate [agent] to invoke for a task cannot be identified a priori”

ContractNet
- Processors do not get in each other’s way in trying to solve identical subproblems while other subproblems are ignored
- The subproblems that eventually lead to solutions should be processed in preference
- Specific details of how to bid are not specified
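The announce–bid–award cycle can be sketched as follows. This is a minimal, centralized caricature; the `Node` class and its load-based bid rule are illustrative assumptions, not details from the paper:

```python
from dataclasses import dataclass

@dataclass
class Node:
    name: str
    load: int = 0  # tasks already contracted to this node

    def bid(self, task):
        # illustrative bid: idle nodes bid low, busier nodes bid higher
        return self.load + len(task)

def contract_net(tasks, nodes):
    """One manager announcing tasks; nodes bid; lowest bid wins each contract."""
    awards = {}
    for task in tasks:
        # 1. manager announces the task; 2. each node responds with a bid
        bids = {node.name: node.bid(task) for node in nodes}
        # 3. manager awards the contract to the most appropriate (lowest) bidder
        winner = min(bids, key=bids.get)
        next(n for n in nodes if n.name == winner).load += 1
        awards[task] = winner
    return awards
```

With two idle nodes and two equal-length tasks, the first award raises the winner's load, so the second task goes to the other node — the "connection problem" being solved by negotiation rather than a priori assignment.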

Consensus (Ephrati and Rosenschein ‘91)

Motivation
- Autonomous agents need to reach consensus in order to coordinate action
- Bypass negotiation – use a “group choice mechanism” to select the result
- Want one that cannot be manipulated by an untruthful agent

World in state S0; can move to S1-S6.

World in state S0; can move to S1-S6. Goals: g_1 = {At(G,3), At(W,2)}; g_2 = {On(W,G), On(R,W)}. v_i(S) = cost_i[reach goal, S0] - cost_i[reach goal, S]; e.g., v_1 = (2, 0, 1, 0, -2, 2)

Clarke tax – collect bids and levy a tax equal to the portion of the bid that made a difference
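The Clarke (pivotal) tax can be sketched directly: choose the alternative maximizing total reported value, and charge each agent the difference its bid made to the others. A minimal sketch; the value matrix in the test is a made-up example:

```python
def clarke_mechanism(values):
    """values[i][k]: agent i's reported value for alternative k.
    Returns the chosen alternative and each agent's Clarke tax."""
    n_alt = len(values[0])
    totals = [sum(v[k] for v in values) for k in range(n_alt)]
    choice = max(range(n_alt), key=totals.__getitem__)
    taxes = []
    for vi in values:
        # what the others would get without agent i's bid, vs. what they get now
        others = [totals[k] - vi[k] for k in range(n_alt)]
        taxes.append(max(others) - others[choice])  # zero unless i is pivotal
    return choice, taxes
```

With values [[10, 0], [0, 6], [0, 3]] the first alternative wins; only the first agent is pivotal (it flips the outcome), so it alone pays a tax of 9, the net harm its bid imposed on the others.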

Discussion
- How to generate alternatives
- Different ways to determine “worth”
- Handling tax waste
- Work distribution

Distributed CSP(Yokoo et al. ‘92-95; ‘97-05)

Distr. Constraint Satisfaction (Yokoo, Durfee, Ishida, Kuwabara 1992-95)
- n variables x_1, …, x_n
- Finite domains D_1, …, D_n
- Each variable belongs to one agent
- Constraint predicates p_k(x_1, …, x_m) distributed amongst agents
- Goal: assign values to variables so that all predicates are satisfied

DCSP :: Motivation
- Coordination of artificial automated agents; “Important infrastructure in DAI”
- Examples:
  – Distributed truth maintenance: assign “IN” or “OUT” to data, some data shared
  – Resource allocation: assign plans to the task(s) of each agent s.t. all plans can be executed simultaneously

Toy example: n-Queens
- Asynchronous Weak Commitment
  – Assign, send messages; if in conflict then try to fix (reduce constraints) and increment priority
  – Priority by agent ID if priority numbers are the same

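The local repair step behind weak commitment resembles min-conflicts search. Here is a centralized min-conflicts sketch for n-queens — not the distributed AWC protocol itself, and the restart/step limits are arbitrary:

```python
import random

def n_queens_min_conflicts(n, seed=0, max_restarts=50, max_steps=500):
    """Min-conflicts repair for n-queens: the queen in row i is an agent
    whose value is its column; conflicted agents revise locally."""
    rng = random.Random(seed)

    def num_conflicts(cols, i, v):
        # conflicts queen i would have in column v (same column or diagonal)
        return sum(1 for j in range(n) if j != i
                   and (cols[j] == v or abs(cols[j] - v) == abs(j - i)))

    for _ in range(max_restarts):
        cols = [rng.randrange(n) for _ in range(n)]
        for _ in range(max_steps):
            conflicted = [i for i in range(n) if num_conflicts(cols, i, cols[i]) > 0]
            if not conflicted:
                return cols  # consistent assignment found
            i = rng.choice(conflicted)
            # move queen i to a column minimizing its conflicts (ties at random)
            best = min(num_conflicts(cols, i, v) for v in range(n))
            cols[i] = rng.choice([v for v in range(n)
                                  if num_conflicts(cols, i, v) == best])
    return None
```

AWC adds agent priorities on top of a repair step like this, so that a stuck low-priority agent can raise its priority instead of backtracking.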

Extension: Optimization! (Yokoo et al. 1997- ; Shen, Tambe, Yokoo 2003-05)

Organization Design for Task-Oriented Environments (Decker and Lesser ’93-95)

TAEMS :: Motivation
- Organizational-based framework for representing coordination problems in a formal, domain-independent way
- Tool for building and testing computational theories of coordination
  – Task interrelationships (hard – enables; soft – facilitates)
  – Task group, task (set of subtasks), executable method

Example: Hospital scheduling
- Units – scheduling agents minimize patients’ stays
- Ancillary agents – maximize equipment use, minimize setup times

Example: Airport scheduling

Task reallocation (Sandholm and Lesser ‘93-98)

Marginal-cost Based Contracting (Sandholm and Lesser 1993-98)

- Cluster contract
- Swap contract
- Multi-agent contract
Find “IR” paths that (a) avoid local suboptimality, and (b) have the “anytime” property and avoid the need to backtrack

Claim: even M contracts are insufficient.
- agent 1 (H): Task
- agent 2 (L): No task
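The individual-rationality test behind a single-task (O) contract can be sketched as: moving task t is IR when the contractor's marginal-cost saving exceeds the contractee's marginal cost, with the payment set anywhere in between. The cost functions below are made-up placeholders:

```python
def ir_o_contract(t, contractor_tasks, contractee_tasks, cost1, cost2):
    """An O-contract moving task t from agent 1 to agent 2 is individually
    rational iff agent 1's marginal-cost saving exceeds agent 2's
    marginal cost of taking t on."""
    saving = cost1(contractor_tasks) - cost1(contractor_tasks - {t})
    added = cost2(contractee_tasks | {t}) - cost2(contractee_tasks)
    return saving > added

# Toy additive cost functions: per-task costs of 3 and 2 for the two agents.
c1 = lambda tasks: 3 * len(tasks)
c2 = lambda tasks: 2 * len(tasks)
```

The claim on the slide is that sequences of IR contracts from these restricted classes can still get stuck in a local optimum — hence the interest in richer contract types.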

Dynamic Coalition Formation (Sandholm and Lesser 1995)
Motivations:
- “small transaction commerce on the Internet”
- “industrial trend towards dynamic, virtual enterprises that can take advantage of economies of scale”

Three interrelated challenges:
– Generate coalitions
– Solve the optimization problem for each coalition (anytime algorithms)
– Divide the value of the generated solution

Market-oriented programming (Wellman ‘93)

Market-oriented programming (Wellman 1993)
- Consumer and producer agents
- Competitive equilibrium: agents best respond, and total consumption = total production
- WALRAS tatonnement algorithm
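The price-adjustment idea in WALRAS can be sketched as textbook tatonnement: raise the price of goods in excess demand, lower those in excess supply. A minimal pure-exchange version with Cobb-Douglas consumers — a hypothetical setting where convergence is well behaved, not the WALRAS implementation itself:

```python
def tatonnement(endowments, weights, alpha=0.1, tol=1e-8, max_iter=20000):
    """Pure-exchange tatonnement with Cobb-Douglas consumers.
    endowments[i][g]: consumer i's endowment of good g;
    weights[i][g]: Cobb-Douglas preference weight (rows sum to 1)."""
    n_goods = len(endowments[0])
    p = [1.0] * n_goods
    for _ in range(max_iter):
        # excess demand z_g = sum_i a_ig * (p . e_i) / p_g  -  sum_i e_ig
        z = []
        for g in range(n_goods):
            demand = sum(w[g] * sum(pk * ek for pk, ek in zip(p, e)) / p[g]
                         for e, w in zip(endowments, weights))
            z.append(demand - sum(e[g] for e in endowments))
        if max(abs(x) for x in z) < tol:
            break  # market (approximately) clears
        # raise prices of goods in excess demand, lower the rest
        p = [max(pg + alpha * zg, 1e-9) for pg, zg in zip(p, z)]
        p = [pg / p[0] for pg in p]  # good 0 is the numeraire
    return p
```

With two consumers holding endowments (1,0) and (0,1) and preference weights (0.7, 0.3) and (0.2, 0.8), prices converge to roughly (1, 1.5).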

Example: Transportation
- Network of the economy
- Sub-optimality: over-use of (2,3)

Introducing “carriers” (producers): set price on goods at marginal cost

Rules of Encounter (Zlotkin and Rosenschein ‘93)

Rules of Encounter (Rosenschein and Zlotkin 1993-94)

Multi-agent Inf. Diagrams (Milch and Koller ‘00-01)

Motivation
- Settings with explicit self-interest: game theory!
- Succinct representation
- Detect structure; allow efficient computation

TreeKiller example: two agents, Alice (Poison, Build) and Bob (Doctor).

Relevance graph
- If D relies on D’, there is an edge in the graph from D to D’.
- To optimize for D, need to know the decision rule for all children: solve TreeDoctor, then BuildPatio, then PoisonTree.
- Backward induction (if acyclic relevance graph); solve “components” if there are cycles.
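The solving order for an acyclic relevance graph is just a topological order. A sketch using Python's `graphlib`; the edge set is my reading of the TreeKiller example and may differ from the paper's exact graph:

```python
from graphlib import TopologicalSorter

# Edge D -> D' means decision D relies on D', so D' must be solved first.
relies_on = {
    "PoisonTree": {"BuildPatio", "TreeDoctor"},
    "BuildPatio": {"TreeDoctor"},
    "TreeDoctor": set(),
}

# static_order() emits each decision only after everything it relies on,
# i.e., the backward-induction solving order for an acyclic relevance graph.
order = list(TopologicalSorter(relies_on).static_order())
print(order)  # TreeDoctor before BuildPatio before PoisonTree
```

With cycles, `TopologicalSorter` raises `CycleError`; that is the case where one must instead solve strongly connected "components" jointly.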

Modern Examples
- Multi-robot “pick-pack-ship” systems
- Port security (LAX, Boston Harbor, …)
- Smart Power Grid (agents in the home)
- Internet advertising markets (bidding for ads)
- Opportunistic commerce (e.g., agents advising whether to route to get gas)

Example: Opportunistic Commerce (Kamar et al. ’08). Dynamic matching with location-specific services.

Course Goals
- Broad and rigorous introduction to the theory, methods and algorithms of multi-agent systems
- Main intellectual connections with AI, Econ/CS and microeconomic theory
- Emphasize computational perspectives
- Provide a basis for research
- Research seminar – we’ll read and discuss papers!

Class participation
Submit comments on the assigned reading before each class:
– what is the main contribution of the paper?
– what was the main insight in getting the result?
– what is not clear to you?
– what are the most important assumptions; are they limiting?
– what extensions does this suggest?
Start this Thursday! (Google form)

Student presentations
- You will present 1-2 papers
- Greg Stoddard and I will meet with you to discuss before class
- We will have a joint discussion, driven through your presentation

Homeworks
- Will be two or three problem sets
- Relatively short (more theoretical than computational)
- Start in around two weeks

Final Paper
- Study a research problem related to class
- Computational, theoretical, experimental or empirical
- Two (3?) people per group (by permission)
- Can be an exposition paper on two related technical papers
- Logistics
  – Submit a proposal 11/12
  – Short presentations 12/3-5
  – Paper due: 12/9

Grade breakdown
- 20% problem sets – two to three of these
- 40% participation – comments, discussion, presentation, Piazza post on something topical
- 40% final project

Requirements
- CS 181 or CS 182 (or by permission)
- Some background in algorithms, complexity theory, and probability theory
- Background in economic theory useful but not required
- Reasonable level of mathematical sophistication

Office hours
David Parkes (parkes@eecs.harvard.edu): 11.30-12.30p on 9/3, 9/5 and 9/10 in MD 229 – Today!!
- Regularly: 2.30-4pm on Tue/Thur, primarily to discuss this week’s papers with student presenters
Greg Stoddard (gstoddard@seas.harvard.edu): 1.30-2.30p, MD 219

Related AI and Econ/CS Classes
- CS 182 (AI; Fall), CS 181 (ML; Spring)
- CS 186 (EconCS; Spring)
- CS 284r (Networks AGT; Fall)
- CS 281 (Adv. ML; Fall)
- CS 279 (HCI; Fall)
- CS 280r (Planning; Spring)
- CS 286r (AGT Spring ’14, AMD Fall ’14)
- CS 289 (Bio-inspired; Spring)

Next Class
- “Distributed constraint handling and optimization”
- Required reading before class: Chapter 12 of “Multiagent Systems,” ed. Gerhard Weiss, MIT Press, 2013, 2nd edition
- Comments on reading due by midnight Wed 9/4
  – One paragraph would be fine
  – Come prepared to discuss

