BiCycle: Item Recommendation With Life Cycles

3y ago
48 Views
4 Downloads
2.13 MB
10 Pages
Last View : 25d ago
Last Download : 3m ago
Upload by : Sutton Moon
Transcription

BiCycle: Item Recommendation with Life CyclesXinyue Liu , Yuanfang Song† , Charu Aggarwal‡ , Yao Zhang§ and Xiangnan Kong WorcesterPolytechnic Institute, Worcester, MA, USAof Wisconsin-Madison, Madison, WI, USA‡ IBM T.J. Watson Research Center, Yorktown Heights, NY, USA§ Fudan University, Shanghai, Chinaxliu4@wpi.edu, ysong243@wisc.edu, charu@us.ibm.com, zhang yao15@fudan.edu.cn, xkong@wpi.edu† UniversityI. I NTRODUCTIONCollaborative filtering (CF) [1]–[8] is widely used techniquefor item recommendation, which recommends items to usersbased on their historical interactions, such as ratings, reviews,transactions. Conventional CF methods usually assume thatthe preferences of users and the characteristics of items arestationary, as shown in Fig. 2a. However, this is not alwaysthe case in the real-world systems. First of all, the interests ofusers may evolve over time. People may like cartoon moviesduring their childhood and early teen years, but may not likethem after they enter college. Moreover, even the characters ofitems are more stable than user interests, they can subject tochange due to various external factors. For instance, in movierecommendation system, the popularity of a movie may exhibita pattern resembles the well-known hype curve [9], in whichthe popularity increases rapidly after release until it reachesthe peak, then it decreases to its lowest point gradually, at lastRelease Year (Movie’s Life Cycle)Popularity213(2016)(2015) (1985)Age (User’s Life Cycle)Interests for MovieAbstract—Recommender systems have attracted much attention in last decades, which can help the users explore new itemsin many applications. As a popular technique in recommendersystems, item recommendation works by recommending items tousers based on their historical interactions. Conventional itemrecommendation methods usually assume that users and itemsare stationary, which is not always the case in real-world applications. Many time-aware item recommendation models have beenproposed to take the temporal effects into the considerationsbased on the absolute time stamps associated with observedinteractions. We show that using absolute time to model temporaleffects can be limited in some circumstances. In this work, wepropose to model the temporal dynamics of both users anditems in item recommendation based on their life cycles. Thisproblem is very challenging to solve since the users and itemscan co-evolve in their life cycles and the sparseness of the databecome more severe when we consider the life cycles of bothusers and items. A novel time-aware item recommendation modelcalled BiCycle is proposed to address these challenges. BiCycleis designed based on two important observations: 1) correlatedusers or items usually share similar patterns in the similar stagesof their life cycles. 2) user preferences and item characters canevolve gradually over different stages of their life cycles. Extensiveexperiments conducted on three real-world datasets demonstratethe proposed approach can significantly improve the performanceof recommendation tasks by considering the inner life cycles ofboth users and items.Index Terms—Item Recommendation; Collaborative Filtering;Life Cycle; Evolution Inference; Data MiningA(current time)BCFig. 1: An illustration of item recommendation with user anditem life cycles. The red dotted lines denote the current timein the absolute time-line.it bounds up to the average level and becomes relatively stableover time, as shown in Fig. 1.Given its significance, the modeling of temporal effects ofcollaborative filtering has been extensively studied. Remarkably, the latest progress in the Netflix Prize contest is attributedto a temporal model [3]. Some other works have been proposedin this direction [6]–[8], in which the researchers show thesuperiority of incorporating temporal effects into the CFmodels. Conventional methods for temporal CF mainly focuson exploiting the change of user’s interests or item charactersover the absolute time-line, as illustrated in Fig 2b. However,in many real-world applications, users start their involvementat different time and items are released at different time,which makes it challenging to learn a unified time-awaremodel for the system. Furthermore, the users and items areusually following their internal evolving patterns, which arecalled life cycles. They can exhibit similar preferences orcharacters in similar stages of their life cycles. We illustrate atoy example of movie recommendation systems in Fig. 1, inwhich three users are at different stages of their life cycles. Allof them share similar evolving patterns with respect to theirlife cycles. The three movies in Fig. 1 are also at differentstages of their life cycles, and their evolving patterns obey

HistoryTimeCurrent TimeItem Life Cycle ? ? UU(t)U(t 1)V(t 1) ?U(3)V(t)V(t-1)U(t-1)U(t-2) V(t-2)?Current TimeV(3)?U(2)TemporalSmoothnessCurrent TimeU(1)User Life Cycle(a) Static Collaborative Filtering [1]HistoryV(2)V(1)Observed RatingTimeHistoryTimeV? ?Life Cycle?(c) Collaborative Filtering with life cycles [This paper](b) Collaborative Filtering with absolute time [3]Fig. 2: Comparison of different recommendation models. The static CF model in (a) considers all observed ratings togetherand ignore the time. The temporal CF model in (b) considers learning separate factors of users and items in different timewindows. Our model illustrated in (c) considers the inherent life cycles of users and items, and it learns factors for differentstages of life cycles for users and items with temporal regularizations.the hype curve. It is easy for any CF model to group theseusers and movies together by simply looking at their historicalratings. However, even all three users shows some interests toromance movies, we may not want to recommend Movie 3to them since it is no longer a popular choice. Similarly, wemay not want to recommend any romance movies to UserC since he already lost the interest to the movie genre. ForUser A who is still young on romance, we can recommendMovie 1 and Movie 2 to him with high confidence sincewe believe his life cycle pattern will be similar to User Band User C (to have high interests in romance in the nearfuture). For User B, the situation is similar except Movie 1may be a dubious choice since they are very likely to miss eachother’s peak. It is obvious that other time-aware CF modelswhich do not consider the life cycle will generate differentrecommendations, for example, they may not recommend anyromance movies to User A since A resembles User C at currenttime and C no longer watches any romance movies. Hence, itis beneficial to model the life cycles in collaborative filtering.In this paper, we study the problem of time-aware itemrecommendation with life cycles, where the goal is to learna unified time-aware item recommendation model by considering the inherent life cycles of users and items. Thisproblem is very natural and important due to the universalexistence of life cycles in various applications, e.g., life cyclesof scholars and research topics (academia), life cycles ofconsumers and merchandises (on-line business) etc. Despitethe significance of it, collaborative filtering with life cycles ishighly challenging, as summarized below.Life Cycle versus Wall Time: Users and items may exhibitdramatically different characteristic along the wall time (absolute time-line), but they may behave similar in each stage oftheir life cycles (relative time-line). Exploiting such similarityacross the relative time-line may help the item recommendation task. However, it is still an open problem that howto incorporate the concept of life cycles into recommendationmodels given all the data distributed on the absolute time-line.It is even more challenging when we attempt to model the lifecycles of both users and items simultaneously.Sparsity: Considering the large number of users and items ina real-world application, the data of interactions between usersand items are relatively sparse throughout the absolute timeline. The sparseness aggravates the prediction problem whenwe allocate the data into different stages of life cycles to learnseparate factors for each life stage. If we do not have enoughdata in some life stages, there may not be enough informationfor us to learn the patterns for these life stages, which maylead to inaccurate recommendations.In this paper, we propose a novel method called BiCycle toaddress the above issues, as illustrated in Fig. 2c. To the bestof our knowledge, the collaborative filtering problem has notbeen studied in the context of life cycle in the literature.II. P ROBLEM F ORMULATIONIn this section, we first introduce the preliminary and theformulation of the conventional collaborative filtering. Thenwe extends the problem to the time-aware variation. Throughout this paper, we use capital alphabet in boldface, e.g., X,to denote a matrix, and xij refers to the entry of X at i-throw and j-th column. We use lowercase alphabet in boldface,e.g., x, to denote a column-based vector, and xi refers to thei-th entry of x. We use X F to denote the Frobenius normof X. We use calligraphic letters to denote sets, e.g., A, B, C.A. Collaborative FilteringIn a recommender system, there are m users and n items.Each observed feedback is a tuple (i, j, rij ) denoting therating assigned to item i by user u, where the user indexi {1, . . . , m}, the item index j {1, . . . , n}, and the ratingvalues rij R. We use Ω {(i, j) rij is observed} to denotethe the set of observed user-item interactions. The goal ofconventional collaborative filtering problem is to estimate theunobserved rating {rij (i, j) / Ω} based on the observedones. However, the feedbacks are implicit [4], [5] rather than

TABLE I: nDefinitionthe original rating matrixthe rating matrix of p-th user life stage and q-th item life stagethe user latent factor matrix of p-th life stagethe item latent factor matrix of p-th life stagethe smoothed rating matrix of p-th user life stage and q-th item life stagethe regularization parameterthe variance regularization parameterthe fused regularization parameterthe user interest decay parameterthe item interest decay parameterthe total number of user life stagesthe total number of item life stagesthe number of factors in the matrix factorization modelthe number of users in the systemthe number of items in the systemexplicit in many applications, e.g., , users’ video viewing andproduct purchase history. In such cases, the value of a ratingrij does not deliver any preference information of user i withrespect to item j, and it only denotes an interaction betweenthem is observed or not observed. In such cases, the goal ofcollaborative filtering is to estimate {rij R (i, j) / Ω} basedon the observed interactions, where each rij is a real valueindicating how likely user i will consume item j in the nearfuture.B. Matrix FactorizationOne of the most successful approaches to collaborativefiltering problem is based on a matrix factorization model[1], [2]. Matrix factorization models map uses and items toa joint low-rank latent factor space, such that the feedbacksare modeled as the inner products in that space. In this typeof models, each user i is assigned a vector ui RD , andeach item j is assigned a vector vj RD , where D is thedimensionality of the latent factor space. The rating of item jassigned by user i is predicted using the inner product of thecorresponding vectors in the latent factor space as r̂ij u i vj . )todenotethelatentfactormatrixIf we use U (u ,.,um1of all users, and V (v1 , . . . , vn ) to denote the latent factormatrix of all items in the system. Then the predicted matrixcan be computed as the product of the corresponding vectorsin the latent factor space: R̂ U V, which is a dense matrixwith all unobserved values in original rating matrix R filled.In the setting of implicit feedbacks, U and V can be learnedby solving the following optimization problem [4], [5]:minimizeU,V1λ R U V 2F ( U 2F V 2F )22(1)where λ is the parameter controls the extent of regularization,and can be tuned by cross-validation. It is worth noting thatEq. (1) is not a convex problem with respect to U and V, butit is bi-convex. Hence, one can either use alternating leastsquares (ALS) or stochastic gradient descent to obtain thesolutions U and V towards Eq. (1).C. Time-aware CFIn conventional CF problems, we do not consider temporalinformation about the interactions or we assume that the datado not contain such information. However, in many real worldapplications, the time stamp associated with each ratings,reviews or visiting records is very useful and informative,and should not be ignored. In this section, we extend theproblem of collaborative filtering to time-aware version. Toformally define the problem of time-aware CF, we first givethe definition of time window as follows.D EFINITION II.1 (Time Window): A time window is a leftclosed-right-open time interval [ts , te ) on the absolute timeline with the start time stamp ts and the end time stamp te .D EFINITION II.2 (Time-aware CF Problem): In a recommender system with m users and n items, given a set ofhistorical user-item feedbacks F {(i, j, rijt ) (i, j) Ω, t [ts , te )}, one want to estimate the the preference of the userstowards the same set of items in the next time window [te , tf ),i.e., what items will a user consume in the future time span[te , tf ). So the goal of time-aware CF problem is equivalent toinfer a m n preference matrix R̂, where r̂ij R denotes thepreferences of user i towards item j in the future time window[te , tf ). In the context of implicit feedbacks, r̂ij correspondsto the confidence level on user i will consume item j in thefuture time window [te , tf ). Thus, if r̂ij r̂ij 0 , it indicatesthat user i is more likely to consume item j compared to itemj 0 in [te , tf ).It is straightforward that the static matrix factorizationmodels we discussed in Sec. II-B can be applied to inferthe future preference matrix R̂ if we simply ignore the timestamps. However, the prediction results fails to capture thedynamic temporal effects exist in users as well as in items.III. P ROPOSED M ETHODIn this section we introduce the proposed model BiCycle.In Sec. III-A, we first introduce the concept of life cycle. InSec. III-B, we address the sparsity problem of the life-cycledmatrix using exponential smoothing. Finally, in Sec. III-C, wediscuss the temporal regularizations we propose and put allpieces together to present our solution.A. Life CycleA major drawback of the conventional matrix factorizationmodels is that they fail to consider temporal effects. One may

propose to simply partition the observed data by time windowand learn a separate model using the data in each partition.However this approach not only suffers from severe sparsity,but isolates the interactions from different time periods, whichmakes the model difficult to capture the evolution patterns. Aswe discussed in Sec. I, similar users may share similar patternsduring the same stage of their life cycles, and their evolutiontrend may also resemble each other. Similarly, similar itemsmay also share similar patterns of popularity, availability anddemand across life stages. Thus, it is beneficial to exploit theco-evolution patterns of users and items across the stages oftheir life cycle. In this work, we propose to model the coevolution of users and items in a system using the concept oflife cycles, which is formally defined as follows.D EFINITION III.1 (Life Cycle): In a system with users anditems, we assume there are life cycles exist in both user sideand item side, where each user has at most M life stageshis/her life cycle, and each item has at most N life stages in itslife cycle. And the concept of life cycle refers to the entire lifespan of users and items included the unobserved (future) time.Thus, given a certain time point, some users/items may alreadyexperienced all M or N life stages, while other users/itemsmay only experienced partial life stages.D EFINITION III.2 (Life Stage): There are multiple life stagesfor a user or item. The -th life stage of user/item i is denoted(i) (i)by time window [ts , te ). Specially, we consider the M th stage of user i or N -th stage of item j as a open time(i)(j)window [tsM , ) or [tsN , ) with no explicit end time. Thisis sensible since in real-world evolution patterns, the user oritem usually enter a stable stage eventually at some time point,e.g., the stable stage in the hype curve.There are many ways to determine the life stages of usersand items. In this paper, we use a fixed length for the firstM 1 or N 1 stages, with the length of each user stageequals to lu and the length of each item stage equals to li .We left the M -th or N -th stage with open end. Note that thedetermination of life stages is orthogonal to our model, anyother sensible approach can be adopted with minor efforts.Thus, for any user in the system, we can allocate theirhistorical interactions into at most M bins, with each bincorresponds to a life stage. Similarly, for any item in thesystem, we can distribute their historical interactions into atmost N stages.In the static matrix factorization method, the historical feedbacks are cast into a m n matrix R. Similarly, by consideringthe life cycles of both users and items, we can cast the samehistorical feedbacks into M N separate matrices of the samesize, i.e., {R(pq) Rm n p 1, . . . , M, q 1, . . . , N }.Hence, R(pq) contains all the ratings given by users of p-thlife stages to items of q-th life stages, i.e.,(rijt , rijt F where p SU (i, t), q SI (j, t).(pq)rij 0, otherwise(2)where SU (i, t) is a function computes which life stage is useri in at the time stamp t, while SI (j, t) computes which lifestage is item j in at time stamp t. In this paper, we defineSU (i, t) as follows(i)SU (i, t) min(M, dt t0e)lu(3)(i)where t0 is the time stamp of first observed interaction ofuser i. And SI (j, t) follows the same form. The reason thatwe determine the life stage based upon the first observedinteraction time is two-folds. First, the age of the users anditems is not always available, relying on this kind of sideinformation hurts the generality of the model. Second, thelatent characteristic of users or items are not necessarily tiedto their age, but to their experience, which can be learned fromthe historical interactions of them. For example, a user’s tasteon movies evolves while he/she watches more movies, and amovie’s popularity and status also changes while it receivesmore reviews and criticism.Hence, given the historical feedbacks of a system, we cancast them into a set of life-cycled preference matrices {R(pq) }.One can employ the matrix factorization model discussed inSec. II-B to learn a latent factor matrix of users for each stage,i.e., {U(1) , . . . , U(M ) }, and a latent factor matrix of items foreach stage, i.e., {V(1) , . . . , V(N ) }. Although this approachcan capture users and items patterns at different life stages,it has some limitations. Firstly, it suffers from severe sparsity,since the original data is sparse and we allocate them intoM N separate matrices of original size. With such sparsity,it is difficult for factorization model to learn meaningful latentfactors in each life stage. Secondly, it does not force anyregularization in the temporal dimension, which makes themodel highly possible to over-fitting the data.B. SmoothingTo address the first issue we discussed above, we need toalleviate the sparseness of R(pq) . Inspired by the works ofmodeling interest forgetting in music recommendation [10],we build a smoothed counterpart for each R(pq) as follows. (pq)R , if p q 1. R(pq) µR̃((p 1),q) , if q 1 and p 2R̃(pq) R(pq) π R̃(p,(q 1)) , if p 1 and q 2 (pq)R µ

existence of life cycles in various applications, e.g., life cycles of scholars and research topics (academia), life cycles of consumers and merchandises (on-line business) etc. Despite the significance of it, collaborative filtering with life cycles is highly challenging, as summarized below. Life Cycle versus Wall Time: Users and items may .

Related Documents:

Item: Paper Item: Stapler Item: Staples Transaction: 2 CC#: 3752 5712 2501 3125 Item: Paper Item: Notebook Item: Staples Transaction: 1 CC#: 3716 0000 0010 3125 Item: Paper Item: Stapler Item: Staples Transaction: 2 CC#: 3716 0000 0010 3125 Item: Paper Item: Notebook Item: Staples Before us

rexroth a10vo & a10vso parts information view: a item # 1: rotary group item # 2: control-ass. item # 3: pump housing item # 4: end cover-ports item # 5: cradel ass. item # 6: shaft - drive item # 7: washer item # 8: adjusting disc item # 9: tappered brg item # 10: tappered brg item # 11: bearing cradle item # 12: seal - shaft

Bicycle Program office, striped 28 miles of bicycle lanes, updated the City s Bicycle Master Plan, installed over 650 bicycle parking racks and 250 bicycle route signs, and i

3. Cycleway facility design 18 3.1 Bicycle path (one-way) 20 3.2 Bicycle path (two-way) 30 3.3 Quietway 40 3.4 Shared path 48 3.5 Shared zone 52 4. Public bicycle parking 56 4.1 Integrated bicycle parking 56 4.2 Types of public bicycle parking 56 4.3 Key locations for public bicycle

Landry's Bicycles Platinum 2008 Bicycle Shop 24Natick MA Michigan - Platinum Platinum MI Catalyst Partners Platinum 2013 Professional Services 8Grand Rapids MI Minnesota - Platinum Platinum MN Quality Bicycle Products Platinum 2008 Bicycle Industry 450Bloomington MN . Giant Bicycle Gold 2017 Bicycle Industry 55Newbury Park CA

accomplishes this through planning, engineering and implementing bicycle facilities, including bicycle parking, and educating the community and agencies about bicycle transportation. Livable Streets is responsible for reviewing and fulfilling short-term bicycle parking requests and coordinating and assessing bicycle parking with other City .

Item 4 Liquid Propellants (b) Fuels (c) Oxidizers Item 9 (c) Accelerometers Item 13 Digital Computer Item 14 A-D Converter Circut Boards Item 2 (c) Solid Rocket Motor Item 2 (c) Liquid Rocket Engine Item 2(f) SAFF Conventional HE Warhead (Not Controlled) Item 11 (c) Satellite Navigation Receiver Item 2 (d) Guidance Set Item 2 (a) Individual .

2014 Newark Bicycle Plan Developed by the Newark Bicycle Committee, City of Newark, and WILMAPCO . x Review bicycle parking requirements in zoning codes and recommend revisions as needed. x Identify locations where bicycle parking should be provided. Improve safety for bicyclin