Lifelong Machine Learning Systems: Beyond Learning Algorithms

Lifelong Machine Learning: Papers from the 2013 AAAI Spring Symposium

Daniel L. Silver
Jodrey School of Computer Science, Acadia University, Wolfville, Nova Scotia, Canada B4P 2R6

Qiang Yang and Lianghao Li
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong

Abstract

Lifelong Machine Learning, or LML, considers systems that can learn many tasks from one or more domains over their lifetimes. The goal is to sequentially retain learned knowledge and to selectively transfer that knowledge when learning a new task so as to develop more accurate hypotheses or policies. Following a review of prior work on LML, we propose that it is now appropriate for the AI community to move beyond learning algorithms and more seriously consider the nature of systems that are capable of learning over a lifetime. Reasons for our position are presented and potential counter-arguments are discussed. The remainder of the paper contributes by defining LML, presenting a reference framework that considers all forms of machine learning, and listing several key challenges for and benefits from LML research. We conclude with ideas for next steps to advance the field.

Introduction

Over the last 25 years there have been significant advances in machine learning theory and algorithms. However, there has been comparatively little work on systems that use these algorithms to learn a variety of tasks over an extended period of time such that the knowledge of the tasks is retained and used to improve learning.

This position paper argues that it is now appropriate to more seriously consider the nature of systems that are capable of learning, retaining and using knowledge over a lifetime. In accord with (Thrun 1997), we call these lifelong machine learning, or LML, systems. We advocate that a systems approach is needed, taken in the context of an agent that is able to acquire knowledge through learning, retain or consolidate such knowledge, and use it for inductive transfer when learning new tasks.

We argue that LML is a logical next step in machine learning research. The development and use of inductive bias is essential to learning. There are a number of theoretical advances in AI that will be found at the point where machine learning meets knowledge representation. There are numerous practical applications of LML in areas such as web agents and robotics. And our computing and communication systems now have the capacity to implement and test LML systems.

This paper reviews prior work on LML that uses supervised, unsupervised or reinforcement learning methods. This work has gone by names such as constructive induction, incremental and continual learning, explanation-based learning, sequential task learning, never-ending learning, and most recently learning with deep architectures. We then present our position on the move beyond learning algorithms to LML systems, detail the reasons for our position and discuss potential arguments and counter-arguments. We then take some initial steps to advance LML research by proposing a definition of LML and a reference framework for LML that considers all forms of machine learning. We complete the paper by listing several key challenges for and benefits from LML research and conclude with two ideas for advancing the field.

Prior Work on LML

There exists prior research in supervised, unsupervised and reinforcement learning that considers systems which learn domains of tasks over extended periods of time.
In particular, progress has been made in machine learning systems that exhibit aspects of knowledge retention and inductive transfer.

Supervised Learning

In the mid 1980s Michalski introduced the theory of constructive inductive learning to cope with learning problems in which the original representation space is inadequate for the problem at hand (Michalski 1993). New knowledge is hypothesized through two interrelated searches: (1) a search for the best representational space for hypotheses, and (2) a search for the best hypothesis within the current representational space. The underlying principle is that new knowledge is easier to induce if the search is done using the right representation.

In 1989 Solomonoff began work on incremental learning (Solomonoff 1989). His system was primed on a small, incomplete set of primitive concepts that are able to express the solutions to the first set of simple problems. When the machine learns to use these concepts effectively, it is given more difficult problems and, if necessary, the additional primitive concepts needed to solve them, and so on.
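Solomonoff's staged approach can be pictured as a simple curriculum loop. The sketch below is purely illustrative; the learner interface, method names and mastery threshold are assumptions, not part of the original system.

```python
# Illustrative curriculum loop in the spirit of Solomonoff's incremental learning:
# the learner is primed with a few primitive concepts, masters a set of simple
# problems, and only then receives harder problems plus any new primitives they
# require. All interfaces here are hypothetical.

def incremental_training(learner, curriculum, mastery_threshold=0.95):
    """curriculum: ordered list of (problem_set, new_primitives) stages."""
    for problem_set, new_primitives in curriculum:
        learner.add_primitives(new_primitives)        # extend the concept vocabulary
        while learner.evaluate(problem_set) < mastery_threshold:
            learner.train(problem_set)                # keep practising the current stage
        # mastery reached; move on to the next, more difficult stage
```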

In the mid 1990s, Thrun and Mitchell worked on a lifelong learning approach they called explanation-based neural networks, or EBNN (Thrun 1996). EBNN is able to transfer knowledge across multiple learning tasks. When faced with a new learning task, EBNN exploits domain knowledge from previous learning tasks (back-propagation gradients of prior learned tasks) to guide the generalization of the new one. As a result, EBNN generalizes more accurately from less data than comparable methods. Thrun and Mitchell applied EBNN transfer to autonomous robot learning, where a multitude of control learning tasks are encountered over an extended period of time (Thrun and Mitchell 1995).

Since 1995, Silver et al. have proposed variants of sequential learning and consolidation systems using standard back-propagation neural networks (Silver and Poirier 2004; Silver, Poirier, and Currie 2008). A system of two multiple task learning networks is used: one for short-term learning that uses task rehearsal to selectively transfer prior knowledge, and a second for long-term consolidation that uses task rehearsal to overcome the stability-plasticity problem. Task rehearsal is an essential part of this system. After a task has been successfully learned, its hypothesis representation is saved. The saved hypothesis can be used to generate virtual training examples so as to rehearse the prior task when learning a new task. Knowledge is transferred to the new task through the rehearsal of previously learned tasks within the shared representation of the neural network. Similarly, the knowledge of a new task can be consolidated into a large domain-knowledge network without loss of existing task knowledge by using task rehearsal to maintain the functional accuracy of the prior tasks while the representation is modified to accommodate the new task.
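Task rehearsal lends itself to a compact sketch. The following is a hypothetical rendering of the idea, not the authors' implementation; the network and sampler interfaces are assumptions. A retained hypothesis for each prior task generates virtual examples that are interleaved with the new task's real examples during consolidation, so the shared representation can change without eroding prior task accuracy.

```python
import random

def make_virtual_examples(saved_hypothesis, input_sampler, n):
    """Generate virtual (x, y) pairs by querying a retained hypothesis."""
    xs = [input_sampler() for _ in range(n)]
    return [(x, saved_hypothesis.predict(x)) for x in xs]

def consolidate(long_term_net, new_task_data, saved_hypotheses, input_sampler,
                epochs=10, virtual_per_task=200):
    """Integrate a new task while rehearsing every previously learned task."""
    rehearsal = []
    for task_id, hypothesis in saved_hypotheses.items():
        rehearsal += [(task_id, x, y) for x, y in
                      make_virtual_examples(hypothesis, input_sampler, virtual_per_task)]
    new = [("new_task", x, y) for x, y in new_task_data]
    for _ in range(epochs):
        batch = rehearsal + new
        random.shuffle(batch)            # interleave virtual and real examples
        for task_id, x, y in batch:
            # shared internal representation, one output head per task (assumed API)
            long_term_net.train_step(task_id, x, y)
```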
Rivest and Schultz proposed knowledge-based cascade-correlation neural networks in the late 1990s (Shultz and Rivest 2001). The method extends the original cascade-correlation approach by selecting previously learned sub-networks as well as simple hidden units when growing the network. In this way the system is able to use past learning to bias new learning.

Unsupervised Learning

To overcome the stability-plasticity problem of forgetting previously learned data clusters (concepts), Carpenter and Grossberg proposed ART (Adaptive Resonance Theory) neural networks (Grossberg 1987). Unsupervised ART networks learn a mapping between "bottom-up" input sensory nodes and "top-down" expectation nodes (or cluster nodes). The vector of new sensory data is compared with the vector of weights associated with one of the existing expectation nodes. If the difference does not exceed a set threshold, called the "vigilance parameter", the new example is considered a member of the most similar expectation node. If the vigilance parameter is exceeded, then a new expectation node is used and thus a new cluster is formed.
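The vigilance test at the heart of ART can be caricatured in a few lines. The sketch below is a deliberately simplified, distance-based stand-in for the full ART dynamics (the real networks use bottom-up/top-down weight vectors and a resonance cycle); it is intended only to show how the vigilance parameter decides between refining an existing cluster and creating a new one.

```python
import numpy as np

def art_like_clustering(data, vigilance):
    """Simplified caricature of ART-style clustering on a sequence of numpy vectors.

    Each prototype plays the role of a top-down expectation node. A new input joins
    the most similar prototype when the mismatch stays within the vigilance
    threshold; otherwise a new prototype (cluster) is created, so established
    clusters are not overwritten by novel inputs.
    """
    prototypes, labels = [], []
    for x in data:
        if prototypes:
            dists = [np.linalg.norm(x - p) for p in prototypes]
            best = int(np.argmin(dists))
            if dists[best] <= vigilance:                       # mismatch small enough
                prototypes[best] = 0.5 * (prototypes[best] + x)  # refine the winner
                labels.append(best)
                continue
        prototypes.append(x.astype(float))                     # novel input: new cluster
        labels.append(len(prototypes) - 1)
    return prototypes, labels
```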
In (Strehl and Ghosh 2003), Strehl and Ghosh present a cluster ensemble framework to reuse previous partitionings of a set of objects without accessing the original features. By using the cluster labels but not the original features, the pre-existing knowledge can be reused to either create a single consolidated clustering or generate a new partitioning of the objects.

Raina et al. proposed the Self-taught Learning method to build high-level features using unlabeled data for a set of tasks (Raina et al. 2007). The authors used the features to form a succinct input representation for future tasks and achieved promising experimental results in several real applications such as image classification, song genre classification and webpage classification.

Carlson et al. (2010) describe the design and partial implementation of a never-ending language learner, or NELL, that each day must (1) extract, or read, information from the web to populate a growing structured knowledge base, and (2) learn to perform this task better than on the previous day. The system uses a semi-supervised multiple task learning approach in which a large number (531) of different semantic functions are trained together in order to improve learning accuracy.

Recent research into the learning of deep architectures of neural networks can be connected to LML (Bengio 2009). Layered networks of unsupervised Restricted Boltzmann Machines and auto-encoders have been shown to efficiently develop hierarchies of features that capture regularities in their respective inputs. When used to learn a variety of class categories, these networks develop layers of common features similar to those seen in the visual cortex of humans. Recently, Le et al. used deep learning methods to build high-level features for large-scale applications by scaling up the dataset, the model and the computational resources (Le et al. 2012). By using millions of high-resolution images and very large neural networks, their system effectively discovers high-level concepts such as a cat's face or a human body. Experimental results on image classification show that their network can use its learned features to achieve a significant improvement in classification performance over state-of-the-art methods.

Reinforcement Learning

Several reinforcement learning researchers have considered LML systems. In 1997, Ring proposed a lifelong learning approach called continual learning that builds more complicated skills on top of those already developed, both incrementally and hierarchically (Ring 1997). The system can efficiently solve reinforcement learning tasks and can then transfer its skills to related but more complicated tasks.

Tanaka and Yamamura proposed a lifelong reinforcement learning method for autonomous robots by treating multiple environments as multiple tasks (Tanaka and Yamamura 1999). Parr and Russell used prior knowledge to reduce the hypothesis space for reinforcement learning when the policies considered by the learning process are constrained by hierarchies (Parr and Russell 1997).

In (Sutton, Koop, and Silver 2007), Sutton et al. suggest that learning should continue during an agent's operation, since the environment may change and make prior learning insufficient. In their work, an agent adapts to different local environments as it encounters different parts of its world over an extended period of time. The experimental results suggest that continual tracking of a solution can achieve better performance than a solution learned only from prior training.
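The advantage of continual tracking over a solution that is learned once and then frozen can be seen in a toy, entirely illustrative example: when the target drifts, a learner that keeps updating with a constant step size stays close to it, while an estimate fixed after an initial learning phase falls behind.

```python
import random

def tracking_vs_frozen(steps=5000, step_size=0.1, drift=0.002, noise=0.5, seed=0):
    """Toy illustration: a constant-step-size learner tracks a drifting target."""
    rng = random.Random(seed)
    target = 0.0
    frozen = 0.0          # estimate fixed after an initial learning phase
    tracker = 0.0         # estimate that continues to learn during operation
    frozen_err = tracker_err = 0.0
    for t in range(steps):
        target += drift                                # the environment slowly changes
        sample = target + rng.gauss(0, noise)
        if t < 500:
            frozen += step_size * (sample - frozen)    # learning stops after step 500
        tracker += step_size * (sample - tracker)      # lifelong updating
        frozen_err += (frozen - target) ** 2
        tracker_err += (tracker - target) ** 2
    # the tracker's mean squared error is far lower than the frozen estimate's
    return frozen_err / steps, tracker_err / steps
```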

Moving Beyond Learning Algorithms

Our position is that it is now appropriate for the AI community to seriously tackle the LML problem, moving beyond the development of learning algorithms and on to systems that learn, retain and use knowledge over a lifetime. The following presents our reasons for a call for wider research on LML systems.

Inductive Bias is Essential to Learning

The constraint on a learning system's hypothesis space, beyond the criterion of consistency with the training examples, is called inductive bias (Mitchell 1980). Utgoff and Mitchell wrote in 1983 about the importance of inductive bias to concept learning from practical sets of training examples (Utgoff 1983). They theorized that learning systems should conduct their own search for an appropriate inductive bias using knowledge from related tasks of the same domain. They proposed a system that could shift its bias by adjusting the operations of the modeling language. Since that time, the AI community has come to accept the futility of searching for a universal machine learning algorithm (Wolpert 1996). Our proposal to consider systems that retain and use prior knowledge as a source of inductive bias promotes this perspective.

Theoretical Advances in AI: ML meets KR

In (Thrun 1997), Thrun writes, "The acquisition, representation and transfer of domain knowledge are the key scientific concerns that arise in lifelong learning." We believe that knowledge representation will play an important role in the development of LML systems. More specifically, the interaction between knowledge retention and knowledge transfer will be key to the design of LML agents. Lifelong learning research has the potential to make serious advances on a significant AI problem: the learning of common background knowledge that can be used for future learning, reasoning and planning. The work at Carnegie Mellon University on NELL is an early example of such research (Carlson et al. 2010).

Practical Agents/Robots Require LML

Advances in autonomous robotics and in intelligent agents that run on the web or in mobile devices present opportunities for employing LML systems. Robots such as those that go into space or travel under the sea must learn to recognize objects and make decisions over extended periods of time and under varied environmental circumstances. The ability to retain and use learned knowledge is very attractive to the researchers designing these systems. Similarly, software agents on the web or in our mobile phones would benefit from the ability to learn more quickly and more accurately as they are challenged to learn new but related tasks from small numbers of examples.

Increasing Capacity of Computers

Advances in modern computers provide the computational power for implementing and testing LML systems. The number of transistors that can be placed cheaply on an integrated circuit has doubled approximately every two years since 1970. This trend is expected to continue into the foreseeable future, with some expecting the power of computing systems to move to a log scale as computing systems increasingly use multiple processing cores. We are now at a point where an LML system focused on a constrained domain of tasks (e.g. product recommendation) is computationally tractable in terms of both computer memory and processing time. As an example, Google Inc. recently used 1,000 computers, each with 16 cores, to train very large neural networks that discover high-level features from unlabeled data (Le et al. 2012).
Counter Arguments

There are arguments that could be made against greater investment in LML research. Here we present two such arguments and make an effort to counter them.

First, some could argue that machine learning should focus on the fundamental computational truths of learning and not become distracted by systems that employ learning theory. The idea is to stick to the field of study and leave the engineering of systems to others. Our response is that the retention of learned knowledge and its transfer would seem to be important constraints on the design of any learning agent; constraints that would narrow the choice of machine learning methods. Furthermore, they may directly inform the choice of representation used by machine learning algorithms.

A second potential argument is that LML is too wide an area of investigation, with significant cost in terms of empirical studies. We agree that this has been a deterrent for many researchers. Undertaking repeated studies, where the system is tested on learning sequences of tasks, increases the empirical effort by an order of magnitude. However, the fact that it is hard does not make it impossible, nor does it decrease its relevance to the advance of AI. In recent years, there has been a growing appeal for a return to solving big AI problems; LML is a step in this direction. As more researchers become involved, shared software and hardware tools, methodologies and best practices will begin to offset the increase in experimental complexity.

The next four sections make contributions toward advancing the field of LML by proposing a definition of Lifelong Machine Learning, presenting essential ingredients of LML, developing a general reference framework, and outlining a number of key challenges and benefits to LML research.

Definition of Lifelong Machine Learning

Definition: Lifelong Machine Learning, or LML, considers systems that can learn many tasks over a lifetime from one or more domains. They efficiently and effectively retain the knowledge they have learned and use that knowledge to more efficiently and effectively learn new tasks.
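Read operationally, this definition suggests a minimal system shape. The skeleton below is one hypothetical way to frame it (the classes and method names are ours for illustration, not part of the definition): a long-term knowledge store is consulted for inductive bias before each new task is learned, and is updated, via consolidation, with what was learned.

```python
class LifelongLearner:
    """Hypothetical skeleton of an LML system: retain knowledge across tasks
    and selectively transfer it when learning each new task."""

    def __init__(self, base_learner, knowledge_base):
        self.base_learner = base_learner      # any supervised/unsupervised/RL learner
        self.knowledge_base = knowledge_base  # long-term store of retained knowledge

    def learn_task(self, task_id, examples):
        # 1. selective transfer: retrieve knowledge judged relevant to this task
        prior = self.knowledge_base.retrieve_related(task_id, examples)
        # 2. short-term learning biased by the retrieved knowledge
        hypothesis = self.base_learner.fit(examples, prior_knowledge=prior)
        # 3. retention/consolidation: integrate the new hypothesis without
        #    degrading what is already stored
        self.knowledge_base.consolidate(task_id, hypothesis)
        return hypothesis
```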

Knowledge transfer between tasks can take two broad forms. Representational transfer involves the direct or indirect assignment of known task representation to the model of a new target task (Silver and Mercer 1996). In this way the learning system is initialized in favour of a particular region of the hypothesis space of the modeling system (Ring 1993; Shavlik and Dietterich 1990; Singh 1992). Representational transfer often results in substantially reduced (efficient) training time with no loss in the generalization performance of the resulting hypotheses. In contrast to representational transfer, functional transfer employs the use of implicit pressures from training examples of related tasks (Abu-Mostafa 1995), the parallel learning of related tasks constrained to use a common internal representation (Baxter 1995; Caruana 1997), or the use of historical training information from related tasks (Thrun 1997; Naik and Mammone 1993). These pressures reduce the effective hypothesis space in which the learning system performs its search. This form of transfer has its greatest value in terms of developing more accurate (effective) hypotheses.

The systems approach emphasizes the interaction between knowledge retention and transfer learning, and that LML is not just a new learning algorithm. It may benefit from a new learning algorithm or from modifications to an existing algorithm, but it also involves the retention and organization of knowledge. We feel there is much to be learned in this regard from the writings of early cognitive scientists, artificial intelligence researchers and neuroscientists such as Albus, Holland, Newell, Langley, Johnson-Laird and Minsky. To emphasize this, consider that the form in which task knowledge is retained can be separated from the form in which it is transferred. For example, the retained hypothesis representation for a learned task can be used to generate functional knowledge in the form of training examples (Robins 1995; Silver and Mercer 2002). These training examples can then be used as supplementary examples that transfer knowledge when learning a related task.
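The two forms of transfer can be contrasted schematically. The sketch below is illustrative only; the model classes and helper functions are assumptions. Representational transfer seeds the new model's internal representation from a prior task's model, whereas functional transfer leaves the initial representation alone and instead shapes training through examples derived from related tasks.

```python
def representational_transfer(prior_model, new_model, new_examples):
    """Seed the new model's internal representation from a prior task's model,
    then fine-tune on the new task (typically faster training)."""
    new_model.set_shared_weights(prior_model.get_shared_weights())
    new_model.fit(new_examples)
    return new_model

def functional_transfer(new_model, new_examples, related_models, input_sampler, n=200):
    """Keep the initial representation, but train on the new task's examples
    together with virtual examples generated from related tasks (typically
    yielding more accurate hypotheses from the same amount of real data)."""
    virtual = []
    for m in related_models:
        xs = [input_sampler() for _ in range(n)]
        virtual += [(x, m.predict(x)) for x in xs]
    new_model.fit_multitask(primary=new_examples, auxiliary=virtual)
    return new_model
```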

Effective and Efficient Retention

An LML system should resist the introduction and accumulation of erroneous knowledge. Only hypotheses with an acceptable level of generalization accuracy should be retained in long-term memory, or else errors may take considerable time to be corrected. Similarly, the process of retaining a new hypothesis should not reduce its accuracy or that of prior hypotheses existing in long-term memory. In fact, the integration or consolidation of new task knowledge should increase the accuracy of related prior knowledge.

An LML system should also provide an efficient method of retaining knowledge, both in terms of time and space. The system must make use of its finite memory resources such that the duplication of information is minimized, if not eliminated. An LML system should also be computationally efficient when storing learned knowledge in long-term memory. Ideally, retention should occur online; however, in order to ensure efficient (consolidated) and effective (minimal-error) retention, this may not be possible.

Effective and Efficient Learning

An LML system should produce a hypothesis for a new task that meets or exceeds the generalization performance of a hypothesis developed strictly from the training examples. Preferably, the transfer of prior knowledge ...