IEEE TRANSACTIONS ON FUZZY SYSTEMS, VOL. 20, NO. 6, DECEMBER 2012

Fuzzy c-Means Algorithms for Very Large Data

Timothy C. Havens, Senior Member, IEEE, James C. Bezdek, Life Fellow, IEEE, Christopher Leckie, Lawrence O. Hall, Fellow, IEEE, and Marimuthu Palaniswami, Fellow, IEEE

Abstract—Very large (VL) data or big data are any data that you cannot load into your computer's working memory. This is not an objective definition, but a definition that is easy to understand and one that is practical, because there is a dataset too big for any computer you might use; hence, this is VL data for you. Clustering is one of the primary tasks used in the pattern recognition and data mining communities to search VL databases (including VL images) in various applications, and so clustering algorithms that scale well to VL data are important and useful. This paper compares the efficacy of three different implementations of techniques aimed at extending fuzzy c-means (FCM) clustering to VL data. Specifically, we compare methods that are based on 1) sampling followed by noniterative extension; 2) incremental techniques that make one sequential pass through subsets of the data; and 3) kernelized versions of FCM that provide approximations based on sampling, including three proposed algorithms. We use both loadable and VL datasets to conduct the numerical experiments that facilitate comparisons based on time and space complexity, speed, quality of approximations to batch FCM (for loadable data), and assessment of matches between partitions and ground truth. Empirical results show that random sampling plus extension FCM, bit-reduced FCM, and approximate kernel FCM are good choices to approximate FCM for VL data. We conclude by demonstrating the VL algorithms on a dataset with 5 billion objects and presenting a set of recommendations regarding the use of different VL FCM clustering schemes.

Index Terms—Big data, fuzzy c-means (FCM), kernel methods, scalable clustering, very large (VL) data.

Manuscript received September 6, 2011; revised January 27, 2012; accepted April 18, 2012. Date of publication May 25, 2012; date of current version November 27, 2012. This work was supported in part by Grant #1U01CA143062-01, Radiomics of Non-Small Cell Lung Cancer, from the National Institutes of Health, and in part by the Michigan State University High Performance Computing Center and the Institute for Cyber Enabled Research. The work of T. C. Havens was supported by the National Science Foundation under Grant #1019343 to the Computing Research Association for the CI Fellows Project.

T. C. Havens is with the Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824 USA (e-mail: havenst@gmail.com). J. C. Bezdek was with the Department of Electrical and Electronic Engineering, University of Melbourne, Parkville, Vic. 3010, Australia (e-mail: jcbezdek@gmail.com). C. Leckie is with the Department of Computer Science and Software Engineering, University of Melbourne, Parkville, Vic. 3010, Australia (e-mail: caleckie@csse.unimelb.edu.au). L. O. Hall is with the Department of Computer Science and Engineering, University of South Florida, Tampa, FL 33630 USA (e-mail: hall@csee.usf.edu). M. Palaniswami is with the Department of Electrical and Electronic Engineering, University of Melbourne, Parkville, Vic. 3010, Australia (e-mail: palani@unimelb.edu.au). Digital Object Identifier 10.1109/TFUZZ.2012.2201485

I. INTRODUCTION

CLUSTERING or cluster analysis is a form of exploratory data analysis in which data are separated into groups or subsets such that the objects in each group share some similarity. Clustering has been used as a preprocessing step to separate data into manageable parts [1], [2], as a knowledge discovery tool [3], [4], for indexing and compression [5], etc., and there are many good books that describe its various uses [6]-[10]. The most popular use of clustering is to assign labels to unlabeled data—data for which no preexisting grouping is known. Any field that uses or analyzes data can utilize clustering; the problem domains and applications of clustering are innumerable.

TABLE I. HUBER'S DESCRIPTION OF DATASET SIZES [11], [12].

The ubiquity of personal computing technology, especially mobile computing, has produced an abundance of staggeringly large datasets—Facebook alone logs over 25 terabytes (TB) of data per day. Hence, there is a great need for clustering algorithms that can address these gigantic datasets. In 1996, Huber [11] classified dataset sizes as in Table I.¹ Bezdek and Hathaway [12] added the very large (VL) category to this table in 2006. Interestingly, data with 10^12 objects are still unloadable on most current (circa 2011) computers. For example, a dataset representing 10^12 objects, each with ten features, stored in short-integer (4 byte) format would require 40 TB of storage (most high-performance computers have 1 TB of main memory). Hence, we believe that Table I will continue to be pertinent for many years.

¹Huber also defined tiny as 10^2 and small as 10^4.

There are two main approaches to clustering in VL data: distributed clustering, which is based on various incremental styles, and clustering a sample found by either progressive or random sampling. Each has been applied in the context of FCM clustering of VL data; these ideas can also be used for Gaussian-mixture-model (GMM) clustering with the expectation-maximization (EM) algorithm. Both approaches provide useful ways to accomplish two objectives: acceleration for loadable data and approximation for unloadable data.

Consider a set of n objects O = {o_1, ..., o_n}, e.g., human subjects in a psychological experiment, jazz clubs in Melbourne, or wireless sensor network nodes. Each object is typically represented by numerical feature-vector data of the form X = {x_1, ..., x_n} ⊂ R^d, where the coordinates of x_i provide feature values (e.g., weight, length, cover charge, etc.) describing object o_i.

A partition of the objects is defined as a set of cn values {u_{ki}}, where each value represents the degree to which object o_i is in the kth cluster.

The c-partition is often represented as a c × n matrix U = [u_{ki}]. There are three main types of partitions: crisp, fuzzy (or probabilistic), and possibilistic [13], [14]. Crisp partitions of the unlabeled objects are nonempty, mutually disjoint subsets of O such that the union of the subsets equals O. The set of all nondegenerate (no zero rows) crisp c-partition matrices for the object set O is

    M_{hcn} = \left\{ U \in R^{c \times n} \;\middle|\; u_{ij} \in \{0,1\}\ \forall i,j;\ 0 < \sum_{j=1}^{n} u_{ij} < n\ \forall i;\ \sum_{i=1}^{c} u_{ij} = 1\ \forall j \right\}    (1)

where u_{ki} is the membership of object o_i in cluster k; the partition element u_{ki} = 1 if o_i is labeled k and is 0 otherwise. When the rows of U are considered as vectors in R^n, we denote the kth row as u_k; when the columns are considered as vectors in R^c, we denote the ith column as (U)_i. Both u_k and (U)_i are treated as column vectors.

Fuzzy (or probabilistic) partitions are more flexible than crisp partitions in that each object can have membership in more than one cluster. Note that if U is probabilistic, say U = P = [p_{ki}], then p_{ki} is interpreted as the posterior probability p(k|o_i) that o_i is in the kth class. Since this paper focuses primarily on FCM, we do not specifically address this difference. However, we stress that most, if not all, of the methods described here can be directly applied to the GMM/EM algorithm, which is the most popular way to find probabilistic clusters. The set of all fuzzy c-partitions is

    M_{fcn} = \left\{ U \in R^{c \times n} \;\middle|\; u_{ij} \in [0,1]\ \forall i,j;\ 0 < \sum_{j=1}^{n} u_{ij} < n\ \forall i;\ \sum_{i=1}^{c} u_{ij} = 1\ \forall j \right\}.    (2)

Each column of the fuzzy partition U must sum to 1, thus ensuring that every object has unit total membership in a partition (Σ_k u_{ki} = 1).
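Since much of what follows manipulates such membership matrices, a tiny NumPy sketch of the M_fcn constraints in (2) may be helpful; the helper names are ours and are purely illustrative.

```python
import numpy as np

def random_fuzzy_partition(c, n, rng=None):
    """Illustrative only: build a random U in M_fcn (c x n, columns summing to 1)."""
    rng = np.random.default_rng(rng)
    U = rng.random((c, n))
    U /= U.sum(axis=0, keepdims=True)   # enforce sum_i u_ij = 1 for every column j
    return U

def is_fuzzy_c_partition(U, tol=1e-9):
    """Check the constraints of (2): entries in [0,1], columns sum to 1,
    and no cluster (row) is empty or absorbs all n objects."""
    col_ok = np.allclose(U.sum(axis=0), 1.0, atol=tol)
    range_ok = np.all((U >= 0) & (U <= 1))
    row_sums = U.sum(axis=1)
    row_ok = np.all((row_sums > 0) & (row_sums < U.shape[1]))
    return bool(col_ok and range_ok and row_ok)

U = random_fuzzy_partition(c=3, n=10, rng=0)
assert is_fuzzy_c_partition(U)
```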
Many algorithms have been proposed to cluster VL data, but only a handful of them address the fuzzy clustering problem. Literal schemes simply cluster the entire dataset. In contrast, extended clustering schemes apply a clustering algorithm to a representative (and manageably sized) sample of the full dataset, and then noniteratively extend the sample result to obtain clusters for the remaining data. Algorithms that include their own noniterative mechanisms for extension are referred to as extensible algorithms [15]. Perhaps the best-known method for fuzzy clustering of VL data is the generalized extensible fast FCM (geFFCM) [12]. This algorithm uses statistics-based progressive sampling to produce a reduced dataset that is large enough to capture the overall nature of the data. It then clusters this reduced dataset and noniteratively extends the partition to the full dataset. However, the sampling method used in geFFCM can be inefficient and, in some cases, the data reduction is not sufficient for VL data. Hence, we will adapt geFFCM into a simple random sampling plus extension FCM (rseFCM) algorithm. Other leading algorithms include single-pass FCM (spFCM) [16] and online FCM (oFCM) [17], which are incremental algorithms that compute an approximate FCM solution. The bit-reduced FCM (brFCM) [18] algorithm uses a binning strategy for data reduction. A kernel-based strategy called approximate kernel FCM (akFCM) was developed in [19] and [20]; it relies on a numerical approximation that uses sampled rows of the kernel matrix to estimate the solution to a c-means problem.

In this paper, we compare four leading algorithms that compute fuzzy partitions of VL vector data: rseFCM, spFCM, oFCM, and brFCM. Then, we compare four kernel FCM (kFCM) algorithms that compute fuzzy partitions of VL data: rsekFCM, akFCM, spkFCM, and okFCM. The spkFCM and okFCM algorithms are proposed in this paper as kernelized extensions of the spFCM and oFCM algorithms.

Section II describes FCM algorithms for VL data, and Section III proposes new kernelized extensions of some of these algorithms. The complexity of the algorithms is compared in Section IV, and Section V presents empirical results. Section VI contains our conclusions and some ideas for further research. Next, we describe some of the related research.

A. Background and Related Work

Much research has been done on clustering in VL data. Methods can be roughly categorized into three types of algorithms. 1) Sampling methods compute cluster centers from a smaller sample of (often randomly) selected objects. Popular sampling-based methods include CLARA [7], CURE [21], and the coreset algorithms [22]. 2) Single-pass algorithms sequentially load small groups of the data, clustering these manageable chunks in a single pass, and then combine the results from each chunk. Representative algorithms include incremental clustering [23], [24] and divide-and-conquer approaches [25], [26]. 3) Data transformation algorithms alter the structure of the data itself so that it can be accessed more efficiently; the data often take the form of graph-like structures. Well-known algorithms of this type include BIRCH [27] and CLARANS [28]. Other algorithms in this category include GARDENHD [29] and CLUTO [30], which were designed to address high-dimensional data. While all the algorithms mentioned in this paragraph perform their job well, they all produce only crisp partitions.

Methods that generate fuzzy partitions include the fast FCM (FFCM) developed in [31], where FCM is applied to larger and larger nested samples until there is little change in the solution, and the multistage random FCM proposed in [32], which combines FFCM with a final literal run of FCM on the full dataset. Both of these schemes are more in the spirit of acceleration than scalability, as they both contain a final run on the full dataset. Other related algorithms, also developed for efficiency, include those described in [33] and [34]. In [35], a fast kernel FCM (FKFCM) is proposed, which is designed to speed up the processing of quantized MRI images. This algorithm is similar to our brFCM algorithm in that it uses a weight in the FCM objective function to represent the multiple contributions of objects that have the same quantization (pixel) value. Unlike brFCM, the FKFCM algorithm is not appropriate for use with real-valued feature data.

The algorithms in this paper rely on extension to produce partitions of the full dataset. The objective of extension depends on the size of the data.

Fig. 1. Population X and samples X_VL, X_L, and X_s.

When the dataset is VL, sampling and extension offer a clustering solution (i.e., they make clustering feasible) for cases where it is not otherwise possible with the corresponding literal approach. If the dataset is merely large (L), but still loadable into memory, then an extended scheme may offer an approximate solution that is comparable to the literal solution at a significantly reduced computational cost—in other words, it accelerates the corresponding literal scheme. Therefore, the benefits for the two cases can be summarized as feasibility for VL data and acceleration for L data. Both situations are depicted in Fig. 1, where the dataset to be clustered is either X_L or X_VL. The set X denotes the source population, and X_s represents a sample of either X_VL or X_L. In a nutshell, extended schemes choose a sample X_s or a collection of samples, cluster it (or them), and then extend the results to X_L or X_VL.

We do not specifically address how the sampled datasets are chosen; we simply use uniform random sampling. We believe that random sampling is sufficient for most VL datasets and is certainly the most efficient way to choose a sample. However, progressive sampling might provide improved performance for some datasets. Provost et al. [36] provide a very readable analysis and summary of progressive sampling schemes. Sampling methods for spectral and kernel methods are addressed in [37]-[39], and these papers also support our claim that uniform random sampling is preferred.

Another fundamental difference between X_L and X_VL involves the calculation of approximation error. For L datasets, we can assess the approximation error by measuring the difference between the clustering solutions obtained using the corresponding extended and literal schemes. On the other hand, the only solution generally available for a VL dataset is the one obtained by the extended scheme, for which the approximation error cannot be measured. Thus, our confidence in the accuracy of extended clusters in the unverifiable case (X_VL) is necessarily derived from the verified good behavior that we can observe by conducting various X_L experiments. We do, however, perform a demonstration of the VL methods in Section V-D that compares the performance of the algorithms using a measure we devised specifically for this purpose. Now we move on to descriptions of the FCM algorithms.

II. FUZZY c-MEANS ALGORITHMS

The FCM algorithm is generally defined as the constrained optimization of the squared-error distortion

    J_m(U, V) = \sum_{i=1}^{c} \sum_{j=1}^{n} u_{ij}^m \, \|x_j - v_i\|_A^2    (3)

where U is the (c × n) partition matrix, V = {v_1, ..., v_c} is the set of c cluster centers in R^d, m > 1 is the fuzzification constant, and ‖·‖_A is any inner-product A-induced norm, i.e., \|x\|_A = \sqrt{x^T A x}. We will only use the Euclidean norm (A = I) in our examples, but there are many cases in which another norm-inducing matrix, e.g., A = S^{-1}, the inverse of the sample covariance matrix, has been shown to be effective. The FCM/AO algorithm produces a solution to (3) using alternating optimization (AO) [40]. Other approaches to optimizing the FCM model include genetic algorithms, particle swarm optimization, etc. The FCM/AO approach is by far the most common and is the only algorithm used here. We may ease the notation at times by dropping the "/AO" suffix unless clarity demands it. Algorithm 1 outlines the steps of the literal FCM/AO (LFCM/AO) algorithm.
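For concreteness, the following NumPy sketch gives one plausible reading of the LFCM/AO iteration referenced as Algorithm 1, with the membership update the text calls (4) and the center update it calls (5), under the Euclidean norm. The function name, defaults, and guard values are ours, not the paper's.

```python
import numpy as np

def lfcm(X, c, m=2.0, eps=1e-3, max_iter=300, rng=None):
    """Sketch of literal FCM/AO (LFCM/AO) with the Euclidean norm (A = I).

    X : (n, d) data matrix; c : number of clusters; m > 1 : fuzzifier.
    Returns the (c, n) membership matrix U and the (c, d) centers V.
    """
    rng = np.random.default_rng(rng)
    n = X.shape[0]
    # Initialize with c objects chosen randomly from the data, as in the paper.
    V = X[rng.choice(n, size=c, replace=False)].copy()
    for _ in range(max_iter):
        # Membership update (the role of (4)): u_ij proportional to d_ij^(-2/(m-1)),
        # then each column is normalized to sum to 1.
        d2 = ((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2)  # (c, n) squared distances
        d2 = np.fmax(d2, 1e-12)                                  # guard against divide-by-zero
        U = d2 ** (-1.0 / (m - 1.0))
        U /= U.sum(axis=0, keepdims=True)
        # Center update (the role of (5)): weighted means with weights u_ij^m.
        Um = U ** m
        V_new = (Um @ X) / Um.sum(axis=1, keepdims=True)
        # Termination: max_k ||v_k,new - v_k,old||^2 < eps (eps = 1e-3 in the paper).
        if np.max(((V_new - V) ** 2).sum(axis=1)) < eps:
            return U, V_new
        V = V_new
    return U, V
```

Each iteration costs O(ncd) time and O(nc) memory for U, which is what makes the literal algorithm impractical once X no longer fits in memory.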
There are many ways to initialize LFCM; we choose c objects randomly from the dataset itself to serve as the initial cluster centers, which seems to work well in almost all cases. However, any initialization method that adequately covers the object space and does not produce identical initial centers would work.

The alternating steps of LFCM in (4) and (5) are iterated until the algorithm terminates, where termination is declared when there are only negligible changes in the cluster center locations: more explicitly, when max_{1≤k≤c} { \|v_{k,new} - v_{k,old}\|^2 } < ε, where ε is a predetermined constant. We use ε = 10^{-3} and the Euclidean norm for the termination test in our experiments.

A. Sampling and Noniterative Extension

The most basic, and perhaps most obvious, way to address VL data is to sample the dataset and then use FCM to compute cluster centers of the sampled data. If the data are sufficiently sampled, the error between the cluster center locations produced by clustering the entire dataset and those produced by clustering the sampled data should be small. Algorithm 2 outlines the random sample and extend approach, which we denote as rseFCM. Note that the extension step in Line 3 of rseFCM is equivalent to (4) in FCM. Extension can be used to compute the full fuzzy data partition for any algorithm that produces (or approximates) cluster centers. In Section V, we will apply extension to other VL FCM algorithms in order to measure the relative accuracy of each clustering approach.
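A rough illustration of the sample-and-extend idea follows: draw a uniform random sample, cluster it (here reusing the lfcm sketch above), and extend the resulting centers to every object with the same membership update as (4). The helper names are ours.

```python
import numpy as np

def extend_memberships(X, V, m=2.0):
    """Noniterative extension: memberships of all of X against fixed centers V,
    using the same update as (4). Returns a (c, n) matrix."""
    d2 = ((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2)
    d2 = np.fmax(d2, 1e-12)
    U = d2 ** (-1.0 / (m - 1.0))
    return U / U.sum(axis=0, keepdims=True)

def rse_fcm(X, c, sample_size, m=2.0, rng=None):
    """Sketch of random sampling plus extension FCM (rseFCM, Algorithm 2)."""
    rng = np.random.default_rng(rng)
    idx = rng.choice(X.shape[0], size=sample_size, replace=False)
    _, V = lfcm(X[idx], c, m=m, rng=rng)   # lfcm as sketched earlier
    U = extend_memberships(X, V, m=m)      # extension step (Line 3)
    return U, V
```

A completion step would then recompute V from the full U with (5), as described in the Remark that follows.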

Remark: Once the extension step is completed, so that a partition U on the full dataset is known, we can perform a completion step by using U to (re)compute the cluster centers V with (5). Extension, followed by completion, yields a pair (U, V) that satisfies the first-order necessary conditions to minimize J_m, and if desired, we can compute the value J_m(U, V).

B. Algorithms Based on Weighted Fuzzy c-Means

In LFCM, each object is considered equally important in the clustering solution. The weighted FCM (wFCM) model introduces weights that define the relative importance of each object in the clustering solution. Hence, wFCM is defined as the constrained optimization of

    J_{mw}(U, V) = \sum_{i=1}^{c} \sum_{j=1}^{n} w_j \, u_{ij}^m \, \|x_j - v_i\|_A^2    (6)

where w ∈ R^n, w_j > 0, is a set of predetermined weights that define the influence of each feature vector. Algorithm 3 outlines the wFCM algorithm. As (7) shows, objects with a higher weight w are more influential in defining the location of the cluster centers V. These weights are important in the spFCM, oFCM, and brFCM algorithms, which are outlined next.

1) Single-Pass Fuzzy c-Means: Algorithm 4 outlines the spFCM algorithm. Line 1 of the algorithm sets the weight vector w to the n_s-length vector of 1s. Line 2 calculates the wFCM partition of the first sample set of data, returning the c cluster centers V (note that wFCM in Line 2 is initialized using your chosen initialization method). spFCM then iterates over the remaining subsets of data in X. At each iteration, wFCM is used to cluster an augmented set of data that is composed of the union of the cluster centers from the previous iteration and the sample subset X_l, which we denote as {V ∪ X_l}. Hence, there are (c + n_s) objects clustered at each iteration. Lines 3 and 4 determine the weights that are used in wFCM for the augmented input dataset {V ∪ X_l}. Line 3 calculates the weights for the c (next-step) cluster centers V from the current cluster memberships, and Line 4 creates the (c + n_s)-length weight vector for the next step, composed of the c weights that correspond to the cluster centers and n_s 1s, which correspond to the objects in the sample subset X_l. Line 5 shows that wFCM takes as input the augmented subset of data {V ∪ X_l}, the number of clusters c, and the weight vector w. In addition, wFCM is initialized using the cluster centers V from the previous iteration (shown as the fifth argument in Line 5), which speeds up convergence.
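To make the flow of Algorithms 3 and 4 concrete, the sketch below pairs a minimal weighted-FCM routine (our reading of the wFCM updates, with the weights entering only the center update (7)) with the single-pass loop. The condensed-weight rule used to carry membership mass forward onto the c centers (each center inherits the weighted membership it captured) is our assumption, as are the function names and signatures.

```python
import numpy as np

def wfcm(X, c, w, m=2.0, V_init=None, eps=1e-3, max_iter=300, rng=None):
    """Sketch of weighted FCM (wFCM, Algorithm 3): like LFCM, but the center update
    weights each object by w_j, per (6)-(7). Returns (U, V) with U of shape (c, n)."""
    rng = np.random.default_rng(rng)
    n = X.shape[0]
    V = X[rng.choice(n, size=c, replace=False)].copy() if V_init is None else V_init.copy()
    for _ in range(max_iter):
        d2 = np.fmax(((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2), 1e-12)
        U = d2 ** (-1.0 / (m - 1.0))
        U /= U.sum(axis=0, keepdims=True)          # membership update (same form as (4))
        Um_w = (U ** m) * w[None, :]               # weights enter the center update (7)
        V_new = (Um_w @ X) / Um_w.sum(axis=1, keepdims=True)
        if np.max(((V_new - V) ** 2).sum(axis=1)) < eps:
            return U, V_new
        V = V_new
    return U, V

def sp_fcm(chunks, c, m=2.0):
    """Sketch of single-pass FCM (spFCM, Algorithm 4) over an iterable of (n_s, d) chunks."""
    V, w_centers = None, None
    for X_l in chunks:
        n_s = X_l.shape[0]
        if V is None:
            w = np.ones(n_s)                              # Line 1: first chunk, unit weights
            U, V = wfcm(X_l, c, w, m)                     # Line 2
        else:
            X_aug = np.vstack([V, X_l])                   # augmented data {V union X_l}
            w = np.concatenate([w_centers, np.ones(n_s)]) # Line 4
            U, V = wfcm(X_aug, c, w, m, V_init=V)         # Line 5, warm-started at V
        w_centers = U @ w                                 # Line 3 (assumed condensation rule)
    return V
```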

2) Online Fuzzy c-Means: Algorithm 5 outlines the oFCM algorithm. Unlike spFCM, which computes the new cluster centers by feeding the cluster centers from the previous iteration forward into the data being clustered, oFCM clusters all s subsets of objects separately and then aggregates the s sets of cluster centers at the end. In Line 2 of oFCM, a partition and a set of cluster centers, denoted U_1 and V_1, are produced for the first subset of objects X_1. At each iteration of Line 2, wFCM partitions X_l, producing U_l and V_l. Note that we use the cluster centers from the previous iteration to initialize wFCM to speed up convergence (this feedforward initialization could be omitted when each iteration of Line 2 is performed separately, such as on a distributed architecture). Line 3 computes the weights for each of the c cluster centers in each of the s sets V_l; note that (U_l)_j indicates the jth column of U_l. Finally, wFCM is used in Line 4 to cluster the (cs) centers {V_1 ∪ ... ∪ V_s} using the corresponding weights {w_1, ..., w_s}. The final output is c cluster centers V.

The size of the dataset composed of the cluster centers (which are processed at Line 4) could be large in cases where X is extremely large and must be broken up into many chunks (s is large). If the cluster center data at Line 4 are too large to be processed, they could be processed by another run of oFCM or by a streaming algorithm, such as the one proposed in [41].

3) Bit-Reduced Fuzzy c-Means: brFCM was designed to address the problem of clustering large images. Algorithm 6 outlines the steps of this algorithm. The brFCM algorithm begins by binning the input data X into a reduced set X′, where X′ is the set of bin prototypes (i.e., the bin centers). This reduced set X′ is then clustered using wFCM, where the weights are the numbers of objects in each bin. There are many ways to bin data; e.g., image data can be bit-reduced by removing the least significant bits and binning the identical objects. An easy way to bin the data is to compute the s-bin histogram; the weights are the numbers of pixels in each bin, and the data are the bin centers (the means of the data in each bin). This is the method we use in this paper, as it provides an easy way to directly compare the results with the other VL FCM algorithms: we can specify the number of bins, which is equivalent to specifying the sample size. (A small binning sketch is given at the end of this subsection.)

Note that the spFCM, oFCM, and brFCM algorithms all produce cluster centers V by approximately minimizing J_{mw}; they do not produce the full c × n data partition. However, this partition can be easily computed by the extension step (Line 3) of Algorithm 2, and the completion step, computing a final V with the full U, then yields a necessary pair (U, V) that can be evaluated by J_m if desired.

It was shown in [42] that the spFCM and oFCM algorithms converge; [42, Ths. 1 and 2] can also be used to show that rseFCM converges.
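The histogram-style binning used for brFCM can be sketched as follows for one-dimensional (e.g., grayscale pixel) data: the bin counts become the wFCM weights and the bin means become the reduced dataset. The helper name and the restriction to a single feature are ours; the wfcm routine is the one sketched earlier.

```python
import numpy as np

def histogram_bin(x, n_bins):
    """Sketch of brFCM-style binning for 1-D data x: returns the bin means
    (the reduced dataset X') and the bin counts (the wFCM weights w)."""
    edges = np.linspace(x.min(), x.max(), n_bins + 1)
    which = np.clip(np.digitize(x, edges[1:-1]), 0, n_bins - 1)  # bin index per object
    counts = np.bincount(which, minlength=n_bins)
    sums = np.bincount(which, weights=x, minlength=n_bins)
    keep = counts > 0                                            # drop empty bins
    X_reduced = (sums[keep] / counts[keep]).reshape(-1, 1)       # bin centers (means)
    w = counts[keep].astype(float)
    return X_reduced, w

# Usage sketch: X_red, w = histogram_bin(pixels, n_bins=256); U, V = wfcm(X_red, c, w)
```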
We now move to kernel versions of the algorithms presented so far.

C. Kernel Fuzzy c-Means Algorithms

Consider a nonlinear mapping function φ : x → φ(x) ∈ R^{D_K}, where D_K is the dimensionality of the transformed feature vector. With kernel clustering, we do not need to transform x explicitly; we simply need to represent the dot product φ(x) · φ(x′) = κ(x, x′). The kernel function κ can take many forms, with the polynomial κ(x_i, x_j) = (x_i^T x_j + 1)^p and the radial basis function (RBF) κ(x_i, x_j) = exp(−σ‖x_i − x_j‖^2) being two of the best known. Given a set of n object vectors X, we can thus construct an n × n kernel matrix K = [K_{ij} = κ(x_i, x_j)]_{n×n}. This kernel matrix K represents all pairwise dot products of the feature vectors in the transformed high-dimensional space, which is called the reproducing kernel Hilbert space (RKHS).

Given a kernel function κ, kernel FCM (kFCM) can be generally defined as the constrained minimization of a reformulation—obtained by eliminating V through substitution of (5) into (3)—of the objective in (3):

    J_m(U; κ) = \sum_{k=1}^{c} \frac{\sum_{i=1}^{n} \sum_{j=1}^{n} u_{ik}^m u_{jk}^m \, d_κ(x_i, x_j)}{2 \sum_{l=1}^{n} u_{lk}^m}    (9)

where U ∈ M_{fcn}, m > 1 is the fuzzification parameter, and d_κ(x_i, x_k) = κ(x_i, x_i) + κ(x_k, x_k) − 2κ(x_i, x_k) is the kernel-based distance between the ith and kth feature vectors. Note that in (9) we use a partition matrix that is (n × c) to stay consistent with the existing kernel clustering literature. We will stick to this convention for our discussion of kernel algorithms.

kFCM solves the optimization problem min{J_m(U; κ)} by computing iterated updates of

    u_{ij} = \left( \sum_{k=1}^{c} \left( \frac{d_κ(x_i, v_j)}{d_κ(x_i, v_k)} \right)^{\frac{1}{m-1}} \right)^{-1}, \quad \forall i, j.    (10)

The kernel distance between input datum x_i and cluster center v_j is

    d_κ(x_i, v_j) = \|φ(x_i) − φ(v_j)\|^2    (11)

where, as in LFCM, the cluster centers are linear combinations of the feature vectors

    φ(v_j) = \frac{\sum_{l=1}^{n} u_{lj}^m φ(x_l)}{\sum_{l=1}^{n} u_{lj}^m}.    (12)

Equation (11) cannot be computed directly, but by using the identity K_{ij} = κ(x_i, x_j) = φ(x_i) · φ(x_j), denoting ũ_j = u_j^m / \sum_{l} u_{lj}^m, where u_j^m = (u_{1j}^m, u_{2j}^m, ..., u_{nj}^m)^T, and substituting (12) into (11), we get

    d_κ(x_i, v_j) = \frac{\sum_{l=1}^{n} \sum_{s=1}^{n} u_{lj}^m u_{sj}^m \, φ(x_l) · φ(x_s)}{\left( \sum_{l=1}^{n} u_{lj}^m \right)^2} + φ(x_i) · φ(x_i) − 2 \, \frac{\sum_{l=1}^{n} u_{lj}^m \, φ(x_l) · φ(x_i)}{\sum_{l=1}^{n} u_{lj}^m}
                 = ũ_j^T K ũ_j + e_i^T K e_i − 2 ũ_j^T K e_i
                 = ũ_j^T K ũ_j + K_{ii} − 2 ũ_j^T K_i    (13)

where e_i is the n-length unit vector with the ith element equal to 1 and K_i is the ith column of K. This formulation of kFCM is equivalent to the one proposed in [43] and, furthermore, is identical to relational FCM [44] if the kernel κ(x_i, x_j) = ⟨x_i, x_j⟩ is used [45].
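For reference, the sketch below builds an RBF kernel matrix and evaluates the vectorized form of the kernel distance (13) for all objects and clusters at once; the names are ours.

```python
import numpy as np

def rbf_kernel_matrix(X, sigma=1.0):
    """K_ij = exp(-sigma * ||x_i - x_j||^2) for all pairs of rows of X."""
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sigma * sq)

def kernel_distances(K, U, m=2.0):
    """Vectorized form of (13): returns D with D[i, j] = d_kappa(x_i, v_j).
    U is the (n, c) membership matrix in the kernel (n x c) convention."""
    Um = U ** m
    U_tilde = Um / Um.sum(axis=0, keepdims=True)           # column-normalized u~_j
    quad = np.einsum('lj,ls,sj->j', U_tilde, K, U_tilde)   # u~_j^T K u~_j, one per cluster
    cross = K @ U_tilde                                    # cross[i, j] = u~_j^T K_i
    return quad[None, :] + np.diag(K)[:, None] - 2.0 * cross
```

A full kFCM iteration would alternate (10) with this distance computation; both its time and its memory cost are quadratic in n, which is exactly the problem discussed next.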

Equation (13) exposes the obvious problem that arises when kernel clustering is used with VL data: the complexity of the distance computation is quadratic in the number of objects. Furthermore, the memory required to store K is also quadratic in the number of objects.

1) Approximate Kernel Fuzzy c-Means: The akFCM [20] algorithm approximates a solution to kFCM by using a numerical approximation of the cluster-center-to-object kernel distance in (13), based on the assumption that the cluster centers can be accurately represented as a linear sum of a subset of the feature vectors. First, s feature vectors are drawn randomly. The cluster centers are then approximated by a linear sum of the selected objects

    \hat{φ}(v_j) = \sum_{i=1}^{n} a_{ij} ξ_i φ(x_i) = \sum_{i=1}^{s} α_{ij} φ(x_{(i)})    (14)

where a_{ij} is the weight of the ith feature vector in the jth cluster center, ξ_i is a binary selection variable (ξ_i = 1 if x_i is drawn and ξ_i = 0 if it is not), and α_{ij} = a_{(i)j} are the weights of the chosen vectors φ(x_{(i)}), i = 1, ..., s. Although this notation seems cumbersome, it dramatically simplifies the description of akFCM, as we can ignore the weights a_{ij} for the objects that are not sampled (ξ_i = 0).

After sampling, two kernel submatrices are computed:

    K_{ξξ} = [K_{ij}]_{s \times s}, \quad i, j : ξ_i = 1, ξ_j = 1    (15)
    K_ξ = [(K)_i]_{n \times s}, \quad i : ξ_i = 1    (16)

where (K)_i is the ith column of K. Notice that K_{ξξ} is the s × s square matrix that corresponds to the pairwise elements of K for the ξ-selected objects, and K_ξ is the n × s rectangular matrix that corresponds to the columns of K for the ξ-selected objects; notice also that K_{ξξ} is a submatrix of K_ξ.

Using these two submatrices, the weights α_j (the s-length vector α_j = (α_{1j}, ..., α_{sj})^T) are computed as

    α_j = K_{ξξ}^{-1} K_ξ^T ũ_j.    (17)

See [19] and [20] for a detailed derivation. In practice, K_{ξξ} could be rank deficient (have a number of relatively small eigenvalues); hence, it is advisable to use a pseudoinverse when computing K_{ξξ}^{-1}, which we denote as K_{ξξ}^†. In addition, the calculation of K_{ξξ}^{-1} K_ξ^T only needs to be performed once, at the beginning of the algorithm, with the result stored.

The key element of akFCM is the approximation of (13) using α_j, i.e.,

    \hat{d}_κ(x_i, v_j) = α_j^T K_{ξξ} α_j + K_{ii} − 2 (K_ξ α_j)_i.    (18)

Notice that the full kernel matrix is no longer required to compute the distance \hat{d}_κ(x_i, v_j); the only required kernel matrix elements are the diagonal elements (which for the RBF kernel are all equal to 1) and the s columns of K corresponding to the selected objects. Algorithm 7 outlines the steps of akFCM.

One theoretical advantage of the akFCM algorithm is that its squared-error distortion L_m(U, Ξ) is bounded in expectation: E_Ξ[L_m(U, Ξ)] is bounded by a quantity that depends on the sampling ratio n/s, the kernel matrix K, the memberships, and the literal kFCM distortion L_m(U, 1_n); see [19] and [20] for the precise form of the bound.² Here, E_Ξ is the expectation with respect to the selection variable Ξ = (ξ_1, ..., ξ_n), [Ũ^m]_{ik} = u_{ik}^m / \sum_{j} u_{jk}^m is the (column-)normalized membership matrix, and

    L_m(U, 1_n) = \sum_{i=1}^{n} \sum_{j=1}^{c} u_{ij}^m K_{ii} − \mathrm{tr}\left( (U^m)^T K Ũ^m \right)

is the squared-error distortion of the literal kFCM solution (i.e., Ξ = 1_n, the vector of all 1s). In [19], it was shown that this error, for m = 1, is bounded by cn/s for kernels where K_{ij} ≤ 1, which is an intuitively pleasing result. This bound also holds for m > 1, as u_{ij} ≤ 1.

²See [19] for a detailed derivation of this error for hard kernel c-means. The error presented here for akFCM is a simple extension of that derivation.
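The distance approximation in (15)-(18) can be sketched as follows. Precomputing the pseudoinverse product follows the text; the function name and the assumption that the s sampled columns of K are available as an (n, s) array are ours.

```python
import numpy as np

def akfcm_distances(K_xi, diag_K, U, sample_idx, m=2.0):
    """Sketch of the akFCM distance approximation (18).

    K_xi       : (n, s) sampled columns of K (the matrix K_xi in the text)
    diag_K     : (n,) diagonal of K (all ones for the RBF kernel)
    U          : (n, c) memberships; sample_idx : row indices of the s sampled objects.
    Returns D with D[i, j] approximating d_kappa(x_i, v_j).
    """
    Um = U ** m
    U_tilde = Um / Um.sum(axis=0, keepdims=True)      # normalized memberships u~_j
    K_xixi = K_xi[sample_idx, :]                      # (s, s) submatrix K_xixi of (15)
    P = np.linalg.pinv(K_xixi) @ K_xi.T               # pseudoinverse product, computed once
    A = P @ U_tilde                                   # alpha_j for every cluster: (s, c), per (17)
    quad = np.einsum('sj,st,tj->j', A, K_xixi, A)     # alpha_j^T K_xixi alpha_j
    cross = K_xi @ A                                  # (K_xi alpha_j)_i for all i, j
    return quad[None, :] + diag_K[:, None] - 2.0 * cross
```

Only the s sampled columns of K and its diagonal are touched, so the memory footprint drops from O(n^2) to O(ns).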
Next, we describe novel kernelized extensions of the wFCM, spFCM, and oFCM algorithms.

III. NEW KERNEL FUZZY c-MEANS ALGORITHMS FOR VERY LARGE DATA

A. Weighted Kernel Fuzzy c-Means

The extension of the kFCM model J_m(U; κ) to the weighted kFCM (wkFCM) model J_{mw}(U; κ) follows the same pattern as the extension of (3) to (6). The cluster center φ(v_j) is a weighted sum of the feature vectors, as shown in (12). Now assume that each object φ(x_i) has a different predetermined influence, given by a respective weight w_i. This leads to

    φ(v_j) = \frac{\sum_{l=1}^{n} w_l u_{lj}^m φ(x_l)}{\sum_{l=1}^{n} w_l u_{lj}^m}.    (19)

Substituting (19) into (13) gives

    d_{wκ}(x_i, v_j) = \frac{(w ∘ u_j^m)^T K (w ∘ u_j^m)}{\left( \sum_{l=1}^{n} w_l u_{lj}^m \right)^2} + K_{ii} − \frac{2 \, (w ∘ u_j^m)^T K_i}{\sum_{l=1}^{n} w_l u_{lj}^m}    (20)

where w is the vector of predetermined weights and ∘ indicates the Hadamard (elementwise) product. This leads to the wkFCM algorithm shown in Algorithm 8. Notice that wkFCM also outputs the index of the object closest to each cluster center, which is called the cluster prototype. The vector of indices p is important in the VL data schemes that are now proposed.
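For completeness, here is the weighted analogue of the earlier kernel-distance sketch: it implements (20) and also returns the cluster prototypes (the object nearest each center) mentioned for Algorithm 8. The names are ours.

```python
import numpy as np

def weighted_kernel_distances(K, U, w, m=2.0):
    """Sketch of (20): D[i, j] = d_wkappa(x_i, v_j) for an (n, c) membership matrix U,
    kernel matrix K, and nonnegative object weights w. Also returns the prototypes p."""
    W = (U ** m) * w[:, None]                       # w o u_j^m, one column per cluster
    W /= W.sum(axis=0, keepdims=True)               # normalize by sum_l w_l u_lj^m
    quad = np.einsum('lj,ls,sj->j', W, K, W)        # first term of (20)
    cross = K @ W                                   # (w o u_j^m)^T K_i / normalizer
    D = quad[None, :] + np.diag(K)[:, None] - 2.0 * cross
    prototypes = np.argmin(D, axis=0)               # index of object closest to each center
    return D, prototypes
```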

B. Random Sample and Extend
