
Noname manuscript No.
(will be inserted by the editor)

MapReduce-based Fuzzy C-Means Clustering Algorithm: Implementation and Scalability

Simone A. Ludwig
Department of Computer Science
North Dakota State University
Fargo, ND, USA
E-mail: simone.ludwig@ndsu.edu

Received: date / Accepted: date

Abstract The management and analysis of big data has been identified as one of the most important emerging needs in recent years. This is because of the sheer volume and increasing complexity of data being created or collected. Current clustering algorithms cannot handle big data, and therefore, scalable solutions are necessary. Since fuzzy clustering algorithms have been shown to outperform hard clustering approaches in terms of accuracy, this paper investigates the parallelization and scalability of a common and effective fuzzy clustering algorithm, the Fuzzy C-Means (FCM) algorithm. The algorithm is parallelized using the MapReduce paradigm, outlining how the Map and Reduce primitives are implemented. A validity analysis is conducted in order to show that the implementation works correctly, achieving competitive purity results compared to state-of-the-art clustering algorithms. Furthermore, a scalability analysis is conducted to demonstrate the performance of the parallel FCM implementation with an increasing number of computing nodes.

Keywords MapReduce · Hadoop · Scalability

1 Introduction

Managing scientific data has been identified as one of the most important emerging needs of the scientific community in recent years. This is because of the sheer volume and increasing complexity of data being created or collected, in particular, in the growing field of computational science, where increases in computer performance allow ever more realistic simulations and the potential to automatically explore large parameter spaces. As noted by Bell et al. [1]:

"As simulations and experiments yield ever more data, a fourth paradigm is emerging, consisting of the techniques and technologies needed to perform data intensive science." The question to address is how to effectively generate, manage and analyze the data and the resulting information. The solution requires a comprehensive, end-to-end approach that encompasses all stages from the initial data acquisition to its final analysis.

Data mining is one of the most developed fields in the area of artificial intelligence and encompasses a relatively broad field that deals with the automatic knowledge discovery from databases. The rapid growth of data collected in various fields, together with its potential usefulness, requires efficient tools to extract and utilize the potentially gathered knowledge [2].

One of the important data mining tasks is classification, which is an effective method that is used in many different areas. The main idea behind classification is the construction of a model (classifier) that assigns items in a collection to target classes, with the goal to accurately predict the target class for each item in the data [3]. There are many techniques that can be used for classification, such as decision trees, Bayes networks, genetic algorithms, genetic programming, particle swarm optimization, and many others [4]. Another important data mining technique that is used when analyzing data is clustering [5]. The aim of clustering algorithms is to divide a set of unlabeled data objects into different groups called clusters. The cluster membership measure is based on a similarity measure. In order to obtain high quality clusters, the similarity measure between the data objects in the same cluster should be maximized, and the similarity measure between the data objects from different groups should be minimized.

Clustering is the classification of objects into different groups, i.e., the partitioning of data into subsets (clusters), such that data in each subset shares some common features, often proximity according to some defined distance measure. Unlike conventional statistical methods, most clustering algorithms do not rely on the statistical distribution of data, and thus can be usefully applied in situations where little prior knowledge exists [6].

Most sequential classification/clustering algorithms suffer from the problem that they do not scale with larger sizes of data sets, and most of them are computationally expensive, both in terms of time and space. For these reasons, the parallelization of data classification/clustering algorithms is paramount in order to deal with large scale data. To develop a good parallel classification/clustering algorithm that takes big data into consideration, the algorithm should be efficient, scalable and obtain high accuracy solutions.

In order to enable big data to be processed, the parallelization of data mining algorithms is paramount. Parallelization is a process where the computation is broken up into parallel tasks. The work done by each task, often called its grain size, can be as small as a single iteration in a parallel loop or as large as an entire procedure. When an application can be broken up into large parallel tasks, the application is called a coarse grain parallel application. Two common ways to partition computation are task partitioning, in which each task executes a certain function, and data partitioning, in which all tasks execute the same function but on different data. The sketch below illustrates the data partitioning style.
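As a toy illustration of data partitioning (this example is not from the paper), the following Java sketch applies the same summing function to different chunks of an array, one task per chunk; the chunk count and the work done per element are arbitrary choices for the example:

import java.util.concurrent.atomic.DoubleAdder;
import java.util.stream.IntStream;

public class DataPartitioningSketch {
    public static void main(String[] args) {
        double[] data = new double[1_000_000];
        java.util.Arrays.fill(data, 1.0);

        int chunks = 4;                       // one task per chunk (data partitioning)
        int chunkSize = data.length / chunks;
        DoubleAdder total = new DoubleAdder();

        // All tasks run the same function, each on a different slice of the data.
        IntStream.range(0, chunks).parallel().forEach(c -> {
            int from = c * chunkSize;
            int to = (c == chunks - 1) ? data.length : from + chunkSize;
            double partial = 0.0;
            for (int i = from; i < to; i++) {
                partial += data[i];           // per-chunk work
            }
            total.add(partial);               // combine the partial results
        });

        System.out.println("sum = " + total.sum()); // prints 1000000.0
    }
}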

This paper proposes the parallelization of a Fuzzy C-Means (FCM) clustering algorithm. The parallelization methodology used is the divide-and-conquer methodology referred to as MapReduce. The implementation details are explained, outlining how the FCM algorithm can be parallelized. A schematic sketch of such a decomposition is given at the end of this section. Furthermore, a validity analysis is conducted in order to demonstrate the correct functioning of the implementation, measuring the purity and comparing it to state-of-the-art clustering algorithms. Moreover, a scalability analysis is conducted to investigate the performance of the parallel FCM implementation by measuring the speedup for an increasing number of computing nodes.

The remainder of this paper is as follows. Section 2 introduces clustering, and fuzzy clustering in particular. The following section (Section 3) discusses related work in the area of big data processing. In Section 4, the implementation is described in detail. The experimental setup and results are given in Section 5, and Section 6 concludes this work, outlining the findings obtained.
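The paper's actual Map and Reduce design is presented in Section 4; purely as a rough illustration, the Java sketch below shows one plausible way to express a single FCM update step with Hadoop's primitives. It assumes the current centroids are distributed to every node (e.g., via the distributed cache) and that each input line holds one comma-separated data point; it follows the standard FCM update equations rather than the paper's specific implementation.

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

public class FcmSketch {
    static final double M = 2.0;   // fuzzification exponent (assumed value)
    static double[][] centroids;   // loaded in Mapper.setup() from the cache (omitted)

    // Map: for one point z, emit (i, [u_i^m, u_i^m * z]) for every cluster i.
    public static class FcmMapper extends Mapper<LongWritable, Text, IntWritable, Text> {
        @Override
        protected void map(LongWritable key, Text value, Context ctx)
                throws IOException, InterruptedException {
            String[] parts = value.toString().split(",");
            double[] z = new double[parts.length];
            for (int d = 0; d < parts.length; d++) z[d] = Double.parseDouble(parts[d]);

            // Standard FCM membership: u_i = 1 / sum_j (dist_i / dist_j)^(2/(m-1));
            // the coincident-point case (zero distance) is omitted for brevity.
            double[] u = new double[centroids.length];
            for (int i = 0; i < centroids.length; i++) {
                double sum = 0.0;
                for (int j = 0; j < centroids.length; j++) {
                    sum += Math.pow(dist(z, centroids[i]) / dist(z, centroids[j]),
                                    2.0 / (M - 1.0));
                }
                u[i] = 1.0 / sum;
            }
            for (int i = 0; i < centroids.length; i++) {
                double w = Math.pow(u[i], M);
                StringBuilder sb = new StringBuilder(Double.toString(w));
                for (double zd : z) sb.append(',').append(w * zd);
                ctx.write(new IntWritable(i), new Text(sb.toString()));
            }
        }
    }

    // Reduce: new centroid v_i = sum_k u_ik^m z_k / sum_k u_ik^m.
    public static class FcmReducer extends Reducer<IntWritable, Text, IntWritable, Text> {
        @Override
        protected void reduce(IntWritable key, Iterable<Text> values, Context ctx)
                throws IOException, InterruptedException {
            double wSum = 0.0;
            double[] num = null;
            for (Text t : values) {
                String[] parts = t.toString().split(",");
                wSum += Double.parseDouble(parts[0]);
                if (num == null) num = new double[parts.length - 1];
                for (int d = 0; d < num.length; d++)
                    num[d] += Double.parseDouble(parts[d + 1]);
            }
            StringBuilder sb = new StringBuilder();
            for (int d = 0; d < num.length; d++) {
                if (d > 0) sb.append(',');
                sb.append(num[d] / wSum);   // divide to obtain the new centroid coordinate
            }
            ctx.write(key, new Text(sb.toString()));
        }
    }

    static double dist(double[] a, double[] b) {
        double s = 0.0;
        for (int d = 0; d < a.length; d++) s += (a[d] - b[d]) * (a[d] - b[d]);
        return Math.sqrt(s);
    }
}

Emitting the weighted point from the Map step keeps the Reduce step a simple summation, which is the usual way to phrase weighted averages in MapReduce.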

2 Background to Clustering

Clustering can be applied to data that are quantitative (numerical), qualitative (categorical), or a mixture of both. The data usually are observations of some physical process. Each observation consists of n measured variables (features), grouped into an n-dimensional column vector $z_k = [z_{1k}, \dots, z_{nk}]^T$, $z_k \in \mathbb{R}^n$. A set of N observations is denoted by $Z = \{z_k \mid k = 1, 2, \dots, N\}$, and is represented as an $n \times N$ matrix [6]:

\[
Z =
\begin{bmatrix}
z_{11} & z_{12} & \cdots & z_{1N} \\
z_{21} & z_{22} & \cdots & z_{2N} \\
\vdots & \vdots & \ddots & \vdots \\
z_{n1} & z_{n2} & \cdots & z_{nN}
\end{bmatrix}
\tag{1}
\]

There are several definitions of how a cluster can be formulated depending on the objective of clustering. In general, a cluster is a group of objects that are more similar to one another than to members of other clusters [7,8]. The term "similarity" is defined as mathematical similarity and can be defined by a distance norm. Distance can be measured among the data vectors themselves or as a distance from a data vector to some prototypical object (prototype) of the cluster. Since the prototypes are usually not known beforehand, they are determined by the clustering algorithms simultaneously with the partitioning of the data. The prototypes can be vectors of the same dimension as the data objects; however, they can also be defined as "higher-level" geometrical objects such as linear or nonlinear subspaces or functions. The performance of most clustering algorithms is influenced not only by the geometrical shapes and densities of the individual clusters, but also by the spatial relations and distances among the clusters. Clusters can be well-separated, continuously connected or overlapping [6].

Many clustering algorithms have been introduced, and clustering techniques can be categorized depending on whether the subsets of the resulting classification are fuzzy or crisp (hard). Hard clustering methods are based on classical set theory and require that an object either does or does not belong to a cluster. Hard clustering means that the data is partitioned into a specified number of mutually exclusive subsets. Fuzzy clustering methods, however, allow the objects to belong to several clusters simultaneously with different degrees of membership. Fuzzy clustering is seen as more natural than hard clustering since the objects on the boundaries between several classes are not forced to fully belong to one of the classes, but rather are assigned membership degrees between 0 and 1 indicating their partial membership. The concept of fuzzy partition is vital for cluster analysis, and therefore, also for the identification techniques that are based on fuzzy clustering. Fuzzy and possibilistic partitions are seen as a generalization of hard partition, which is formulated in terms of classical subsets [6].

The objective of clustering is to partition the data set Z into c clusters. For now let us assume that c is known based on prior knowledge. Using classical sets, a hard partition of Z can be defined as a family of subsets $\{A_i \mid 1 \le i \le c\} \subset P(Z)$ with the following properties [6,7]:

\[
\bigcup_{i=1}^{c} A_i = Z \tag{2a}
\]
\[
A_i \cap A_j = \emptyset, \quad 1 \le i \ne j \le c \tag{2b}
\]
\[
\emptyset \subset A_i \subset Z, \quad 1 \le i \le c \tag{2c}
\]

Equation 2a denotes that the union of the subsets $A_i$ contains all the data. The subsets have to be disjoint, as stated by Equation 2b, and none of them can be empty nor contain all the data in Z, as given by Equation 2c. In terms of the membership (characteristic) function, a partition can be conveniently represented by the partition matrix $U = [\mu_{ik}]_{c \times N}$, whose ith row contains the values of the membership function of the ith subset $A_i$ of Z. It follows from Equations 2 that the elements of U must satisfy the following conditions [6]:

\[
\mu_{ik} \in \{0, 1\}, \quad 1 \le i \le c, \ 1 \le k \le N \tag{2d}
\]
\[
\sum_{i=1}^{c} \mu_{ik} = 1, \quad 1 \le k \le N \tag{2e}
\]
\[
0 < \sum_{k=1}^{N} \mu_{ik} < N, \quad 1 \le i \le c \tag{2f}
\]

Thus, the space of all possible hard partition matrices for Z, referred to as the hard partitioning space, is defined by:

\[
M_{HC} = \left\{ U \in \mathbb{R}^{c \times N} \;\middle|\; \mu_{ik} \in \{0, 1\}, \forall i, k; \ \sum_{i=1}^{c} \mu_{ik} = 1, \forall k; \ 0 < \sum_{k=1}^{N} \mu_{ik} < N, \forall i \right\} \tag{2g}
\]
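As a small worked example (added here for illustration; not in the original text), take $N = 4$ data points and $c = 2$ clusters. One admissible hard partition matrix is

\[
U =
\begin{bmatrix}
1 & 1 & 0 & 0 \\
0 & 0 & 1 & 1
\end{bmatrix}
\]

Each column contains exactly one 1, so every point belongs to exactly one cluster (Equation 2e), and each row sum lies strictly between 0 and $N$, so no cluster is empty or absorbs all the data (Equation 2f).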

Generalization of the hard partitioning to the fuzzy partitioning follows by allowing $\mu_{ik}$ to take values in $[0, 1]$. The conditions for a fuzzy partition matrix, analogous to the hard clustering equations, are given by [6,9]:

\[
\mu_{ik} \in [0, 1], \quad 1 \le i \le c, \ 1 \le k \le N \tag{3a}
\]
\[
\sum_{i=1}^{c} \mu_{ik} = 1, \quad 1 \le k \le N \tag{3b}
\]
\[
0 < \sum_{k=1}^{N} \mu_{ik} < N, \quad 1 \le i \le c \tag{3c}
\]

The ith row of the fuzzy partition matrix U contains the values of the ith membership function of the fuzzy subset $A_i$ of Z. Equation 3b constrains the sum of each column to 1, and thus the total membership of each $z_k$ in Z equals one. The fuzzy partitioning space for Z is the set [6]:

\[
M_{FC} = \left\{ U \in \mathbb{R}^{c \times N} \;\middle|\; \mu_{ik} \in [0, 1], \forall i, k; \ \sum_{i=1}^{c} \mu_{ik} = 1, \forall k; \ 0 < \sum_{k=1}^{N} \mu_{ik} < N, \forall i \right\} \tag{3d}
\]
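Continuing the same illustrative example (again not from the original text), a valid fuzzy partition matrix for the four points could be

\[
U =
\begin{bmatrix}
0.9 & 0.8 & 0.3 & 0.1 \\
0.1 & 0.2 & 0.7 & 0.9
\end{bmatrix}
\]

Every column still sums to 1 (Equation 3b), but a boundary point such as the third one can now express partial membership in both clusters rather than being forced into a single subset.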

3 Related Work

In this section, first related work in the area of clustering is introduced, and afterwards clustering techniques that make use of parallelization mechanisms applied to big data are described. Given that this paper is concerned with the implementation and evaluation of a parallelized FCM algorithm, the related work outlines other research conducted in this area.

Fuzzy clustering can be categorized into three categories, namely hierarchical fuzzy clustering methods, graph-theoretic fuzzy clustering methods, and objective function-based fuzzy clustering [10].

Hierarchical clustering techniques generate a hierarchy of partitions by means of agglomerative and divisive methods [10], whereby the agglomerative algorithms produce a sequence of clusterings of decreasing number by merging two clusters from the previous level. The divisive algorithms, on the other hand, perform the clustering the other way around. In [11], a hierarchical clustering algorithm in the area of business systems planning is proposed. The best cluster count is determined by a matching approach. In another technique, called fuzzy equivalent relation-based hierarchical clustering, the clustering is managed without needing a predefined count of clusters [12].

Another category, referred to as graph-theoretic fuzzy clustering methods, is based on the notion of connectivity of nodes of a graph representing the data set. In particular, the graph representing the data structure is a fuzzy graph, and different types of connectivity lead to different types of clusters. The fuzzy graph idea was first mentioned in [13], whereby the fuzzy analogues of several basic graph-theoretic concepts such as bridges, cycles, paths, and trees were introduced. Fuzzy graphs were first applied to cluster analysis in [14].

Fuzzy clustering that uses different objective functions has been shown to give a better formulation to clustering. The heavily used Fuzzy C-Means (FCM) clustering model was first introduced in 1974 [15], and was later extended and generalized in [7]. Many variations of the method and model improvements have been suggested since then. The standard formulation is sketched below.
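For reference, and because the surrounding excerpt does not restate it, the standard FCM formulation [7,15] minimizes the objective

\[
J_m(Z; U, V) = \sum_{i=1}^{c} \sum_{k=1}^{N} \mu_{ik}^{m} \, \lVert z_k - v_i \rVert^2, \qquad m > 1
\]

by alternating the prototype and membership updates

\[
v_i = \frac{\sum_{k=1}^{N} \mu_{ik}^{m} z_k}{\sum_{k=1}^{N} \mu_{ik}^{m}}, \qquad
\mu_{ik} = \left( \sum_{j=1}^{c} \left( \frac{\lVert z_k - v_i \rVert}{\lVert z_k - v_j \rVert} \right)^{2/(m-1)} \right)^{-1}
\]

where $V = \{v_1, \dots, v_c\}$ are the cluster prototypes and the fuzzifier $m$ controls how soft the resulting partition is.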

The Gustafson-Kessel (GK) algorithm [16] is a fuzzy clustering method which estimates the local covariance by partitioning the data into subsets that can be fitted with linear submodels. Nevertheless, considering a general structure for the covariance matrix can have a substantial effect on the modeling approach, and thus the Gath-Geva algorithm [17] was proposed to overcome the fitting problem. A fuzzy clustering algorithm where the prototype of each cluster is multi-dimensional and linear was proposed in the Fuzzy C-Varieties (FCV) [18] clustering algorithm.

Replacing the Euclidean distances with other distance measures and enriching the cluster prototypes with further parameters allows for other shapes than only spherical clusters to be discovered. Clusters might be ellipsoidal, linear, manifolds, quadrics, or even have different volumes [10]. The main benefit of fuzzy clustering is its proven ability to handle ambiguous data that share properties of different clusters, by virtue of the different membership degrees that can be assigned to data objects.

The use of fuzzy clustering algorithms, in particular the FCM algorithm, has been successfully applied to the area of digital image processing, and image segmentation [19] in particular, where the technique has been shown to be very efficient. However, the drawback the FCM algorithm still exhibits is a lack of robustness to noise and outliers, especially in the absence of prior knowledge of the noise. A crucial parameter, used to balance between robustness to noise and effectiveness of preserving the details of the image, is manually set based on experience. In addition, the time of segmenting an image depends on the image size, and hence the larger the size of the image, the longer the segmentation time [20].

In [21], a K-means clustering variant is proposed. The approach is based on a prototype-based hybrid approach that speeds up the K-means clustering method. The method first partitions the data set into small clusters (grouplets). Each grouplet is first represented by a prototype, and then the set of prototypes is partitioned into k clusters using the modified K-means method. The modified K-means method avoids empty clusters. Each prototype is replaced by its corresponding set of patterns to derive a partition of the data set. The proposed method has been shown to be much faster in terms of processing time than basic K-means.

Big data clustering has recently received a significant amount of attention. In particular, the aim is to build efficient and effective parallel clustering algorithms. Many of these algorithms use the MapReduce methodology that was proposed by Google [22]. In the following, we review research that has used and applied the MapReduce paradigm to the clustering of big data sets.

A MapReduce design and implementation of an efficient DBSCAN algorithm is introduced in [23]. The proposed algorithm addresses the drawbacks of existing parallel DBSCAN algorithms such as the data balancing and scalability issues. The algorithm tackles these issues using a parallelized implementation that removes the sequential processing bottleneck and thereby improves the algorithm's scalability. Furthermore, the algorithm showed the largest improvement when the data was imbalanced. The evaluation conducted using large scale data sets demonstrates the efficiency and scalability of the algorithm.

A parallel K-means clustering algorithm based on MapReduce was proposed in [24]. The algorithm locates the centroids by calculating the weighted average of the individual cluster points via the Map function. The Reduce function then assigns a new centroid to each data point based on the distance calculations. At the end, a MapReduce iterative refinement technique is applied to locate the final centroids. The authors evaluated the implementation using the measures of speedup, scaleup, and sizeup. The results showed that the proposed algorithm is able to process large data sets of 8 GB effectively using commodity hardware. Such iterative MapReduce algorithms share a common driver structure, sketched below.
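As an illustration of the iterative refinement pattern common to these MapReduce clustering algorithms (a sketch under assumed helpers, not code from any of the cited papers), the Java driver below runs one Hadoop job per iteration and feeds the updated centroids into the next pass until they stop moving. readCentroids() and maxShift() are hypothetical helpers; the job wiring follows the standard Hadoop API, with the Mapper/Reducer configuration abridged.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class ClusteringDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        double epsilon = 1e-4;                               // convergence threshold (assumed)
        Path centroidPath = new Path(args[2]);               // initial centroids (assumed format)
        double[][] centroids = readCentroids(conf, centroidPath);

        for (int iter = 0; iter < 100; iter++) {
            conf.set("centroids.path", centroidPath.toString()); // mappers read current centroids
            Job job = Job.getInstance(conf, "clustering-iteration-" + iter);
            job.setJarByClass(ClusteringDriver.class);
            // setMapperClass/setReducerClass and output key/value classes
            // (e.g., the FCM sketch above) would be set here.
            FileInputFormat.addInputPath(job, new Path(args[0]));
            Path out = new Path(args[1] + "/iter" + iter);
            FileOutputFormat.setOutputPath(job, out);
            if (!job.waitForCompletion(true)) System.exit(1);

            double[][] updated = readCentroids(conf, out);
            boolean converged = maxShift(centroids, updated) < epsilon;
            centroids = updated;
            centroidPath = out;                              // next pass reads the new centroids
            if (converged) break;                            // centroids stopped moving
        }
    }

    // Hypothetical helper: parse centroids from the HDFS output files.
    static double[][] readCentroids(Configuration conf, Path p) { /* omitted */ return new double[0][]; }

    // Hypothetical helper: largest Euclidean movement of any centroid.
    static double maxShift(double[][] a, double[][] b) { /* omitted */ return 0.0; }
}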

In [25], an algorithm for solving the problem of document clustering using the MapReduce-based K-means algorithm is proposed. The algorithm uses the MapReduce paradigm to iterate over the document collection in order to calculate the term frequency and inverse document frequency. The algorithm represents the documents as <key, value> pairs, where the key is the document type and the value is the document text. The authors compare a non-parallelized version of K-means with the parallelized version to show the speedup gain. Furthermore, the experiments of the parallelized K-means algorithm on 50,000 documents showed that the algorithm performs very well in terms of accuracy for this text clustering task while having a reasonable execution time.

Another K-means clustering algorithm using MapReduce, obtained by merging the K-means algorithm with the ensemble learning bagging method, is introduced in [26]. The proposed algorithm addresses the instability and sensitivity to outliers. Ensemble learning is a technique that uses a collection of models to achieve better results than any model in the collection, and bagging is one of the most popular types of ensemble techniques. The evaluation was performed on relatively small data sets (around 5,000 instances), and only 4 nodes were used for the parallelization. The authors show that a speedup is obtained on data sets containing outliers.

A Self-Organizing Map (SOM) was modified to work with large scale data sets by implementing the algorithm using the MapReduce concept to improve the performance of clustering, as shown in [27]. A self-organizing map is an unsupervised neural network that projects high-dimensional data onto a low-dimensional grid and visually represents the topological order of the original data. Unfortunately, the details of the MapReduce implementation are not given, but the experiments that were conducted with a small data set demonstrated the efficiency of the MapReduce-based SOM.

In [28], the authors applied the MapReduce framework to solve co-clustering problems by introducing a framework called DISCO (DIStributed CO-clustering with MapReduce). Unlike clustering, which groups similar rows or columns independently, co-clustering searches for interrelated submatrices of rows and columns. The authors showed that MapReduce is a good solution for co-clustering mining tasks by applying the algorithm to data sets such as collaborative filtering, text mining, etc. The experiments demonstrated that co-clustering with MapReduce can scale well with large data sets using up to 40 nodes.

In [29], a fast clustering algorithm with a constant factor approximation guarantee was proposed. The authors use a sampling technique to reduce the data size first, and then apply Lloyd's algorithm on the remaining data set. A comparison of this algorithm with several sequential and parallel algorithms for the k-median
