“Clustering of Patterns by Using Discriminant Analysis”


Praveen Kumar Pandey, Asst. Professor
Department of Mechanical Engineering, Faculty of Engineering & Technology,
Gurukul Kangari University, Haridwar

1. Introduction

Traditionally, grouping of objects into known classes was done by various methods such as cluster analysis, the membership-roster concept, the common-property concept, feature extraction, error estimation, and the minimum-distance method, on the basis of the similarity of the objects' characteristics. The primary purpose of the discriminant function is to predict the group of an unknown object based on a cutoff value.

Discriminant Analysis is used to distinguish between two or more predefined groups. The analysis identifies those variables that contribute most to the differences between the groups; it is also possible to use Discriminant Analysis as a classification technique that places an unknown case into one of the groups.

Discriminant Analysis works by combining the variables in such a way that the differences between the predefined groups are maximized. Note that group membership must be known before using Discriminant Analysis. The discriminant problem is: how do we best predict or assign an object, whose population identity we do not know, to one of the known populations of interest?

The discriminant function can use several quantitative variables, each of which makes an independent contribution to the overall discrimination. Taking the effect of all variables into consideration, the discriminant function produces the statistical decision rule for classification.

Discriminant function analysis, or DA, is used to classify cases into the values of a categorical dependent variable, usually a dichotomy. If discriminant function analysis is effective for a set of data, the classification table of correct and incorrect estimates will yield a high percentage correct. Multiple discriminant function analysis is used when the dependent variable has three or more categories.

Discriminant function analysis is used to determine which variables discriminate between two or more naturally occurring groups. For example, an educational researcher may want to investigate which variables discriminate between high school graduates who decide (1) to go to college, (2) to attend a trade or professional school, or (3) to seek no further training or education. For that purpose the researcher could collect data on numerous variables prior to the students' graduation. After graduation, most students will naturally fall into one of the three categories. Discriminant Analysis could then be used to determine which variables are the best predictors of the students' subsequent educational choice.

A medical researcher may record different variables relating to patients' backgrounds in order to learn which variables best predict whether a patient is likely to recover completely (group 1), partially (group 2), or not at all (group 3). A biologist could record different characteristics of similar types (groups) of flowers and then perform a discriminant function analysis to determine the set of characteristics that allows the best discrimination between the types.

There are several purposes for DA:

- To classify cases into groups using a discriminant prediction equation.
- To investigate mean differences on the independent variables between groups formed by the dependent variable.
- To determine the percent of variance in the dependent variable explained by the independents.
- To determine the percent of variance in the dependent variable explained by the independents over and above the variance accounted for by control variables, using sequential discriminant analysis.
- To assess the relative importance of the independent variables in classifying the dependent variable.
- To discard variables which are little related to group distinctions.
- To test theory by observing whether cases are classified as predicted.

1.1 BASIC ELEMENTS OF DISCRIMINANT FUNCTION ANALYSIS

1.1.1 DISCRIMINATING VARIABLES: These are the independent variables, also called predictors.

1.1.2 THE CRITERION VARIABLE: This is the dependent variable, also called the grouping variable. It is the object of the classification effort.

1.1.3 DISCRIMINANT FUNCTION

A discriminant function, also called a canonical root, is a latent variable created as a linear combination of the discriminating (independent) variables, such that

L = b1*x1 + b2*x2 + ... + bn*xn + c,

where the b's are discriminant coefficients, the x's are discriminating variables, and c is a constant. This is analogous to multiple regression, but here the b's are discriminant coefficients that maximize the distance between the means of the criterion (dependent) variable. Note that the foregoing assumes the discriminant function is estimated using ordinary least squares, the traditional method.

1.1.4 NUMBER OF DISCRIMINANT FUNCTIONS

There is one discriminant function for 2-group discriminant analysis; for higher-order DA, the number of functions (each with its own cutoff value) is the lesser of (g - 1), where g is the number of categories in the grouping variable, and the number of discriminating variables. Each discriminant function is orthogonal to the others. A dimension is simply one of the discriminant functions when there is more than one, as in multiple discriminant analysis.

1.1.5 EIGENVALUE

The eigenvalue of each discriminant function reflects the relative importance of the dimensions that classify cases of the dependent variable. If there is more than one discriminant function, the first will be the largest and most important, the second next most important in explanatory power, and so on. The eigenvalues assess relative importance because they reflect the percentages of variance explained in the dependent variable, cumulating to 100% across all functions.

1.1.6 THE DISCRIMINANT SCORE

The discriminant score, also called the DA score, is the value resulting from applying the discriminant function formula to the data for a given case.

1.1.7 CUTOFF

If the discriminant score of the function is less than or equal to the cutoff, the case is classed as 0; if above it, the case is classed as 1. When group sizes are equal, the cutoff is the mean of the two group centroids (for two-group DA). If the groups are unequal, the cutoff is the weighted mean. A small sketch of this scoring-and-cutoff rule follows below.
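To make Sections 1.1.3-1.1.7 concrete, here is a minimal Python sketch of applying a discriminant function and classing a case against the cutoff. The coefficients, constant, and centroids are hypothetical values chosen for illustration, not results from this paper.

import numpy as np

def discriminant_score(x, b, c):
    # Apply the discriminant function L = b1*x1 + ... + bn*xn + c to one case.
    return float(np.dot(b, x) + c)

# Hypothetical discriminant coefficients, constant, and group centroids.
b = np.array([0.8, -0.3])
c = 0.1
centroid_0, centroid_1 = -1.0, 1.0

# Equal group sizes: the cutoff is the mean of the two centroids (Section 1.1.7).
cutoff = (centroid_0 + centroid_1) / 2.0

case = np.array([1.2, 0.4])
score = discriminant_score(case, b, c)
label = 0 if score <= cutoff else 1      # classing rule from Section 1.1.7
print("score:", round(score, 3), "cutoff:", cutoff, "class:", label)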

1.2 ASSUMPTIONS

The minimum set of conditions necessary to conduct linear discriminant analysis is:

1. There should be two or more a priori groups or classifications of the population of entities.
2. A sample of entities known to belong to each group exists.
3. Each entity can be described by a set of quantitative variables.
4. The variances of, and relationships among, the variables are the same for each group.

1.3 ANALYSIS OF TWO-GROUP DISCRIMINANT FUNCTION

In the two-group case, discriminant function analysis can also be thought of as (and is analogous to) multiple regression (see Multiple Regression; two-group discriminant analysis is also called Fisher linear discriminant analysis after Fisher, 1936; computationally all of these approaches are analogous). If we code the two groups in the analysis as 1 and 2 and use that variable as the dependent variable in a multiple regression analysis, then we would get results analogous to those obtained via Discriminant Analysis. In general, in the two-group case we fit a linear equation of the type:

Group = a + b1*x1 + b2*x2 + ... + bm*xm

where a is a constant and b1 through bm are regression coefficients (a sketch of this regression coding appears after the data summary below). The interpretation of the results of a two-group problem is straightforward and closely follows the logic of multiple regression: those variables with the largest (standardized) regression coefficients are the ones that contribute most to the prediction of group membership.

1.4 DISCRIMINANT FUNCTIONS FOR MULTIPLE GROUPS

When there are more than two groups, we can estimate more than one discriminant function like the one presented above. For example, when there are three groups, we could estimate (1) a function for discriminating between group 1 and groups 2 and 3 combined, and (2) another function for discriminating between group 2 and group 3. For instance, we could have one function that discriminates between those high school graduates that go to college and those who do not (but rather get a job or go to a professional or trade school), and a second function to discriminate between those graduates that go to a professional or trade school versus those who get a job. The b coefficients in those discriminant functions could then be interpreted as before.

MATHEMATICAL EXAMPLE

In this example a simple data set will be used. The data are from 10 males and 10 females. Three variables were recorded: height (inches), weight (pounds), and age (years).

Data Summary

Variable    Male     Female    Difference
Height      70.3      66.2        4.1
Weight     165.9     130.7       35.2
Age         42.3      33.9        8.4
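Before working the example through, here is a minimal Python sketch of the regression-coding view from Section 1.3. The raw measurements below are hypothetical (the paper reports only the group summaries above); the two groups are coded 1 and 0, and an ordinary least-squares fit yields the constant a and coefficients b.

import numpy as np

# Hypothetical raw cases (height, weight, age); the paper itself reports
# only the per-group means for its 10 males and 10 females.
X = np.array([
    [70.0, 160.0, 41.0],  # male
    [71.0, 172.0, 44.0],  # male
    [69.5, 165.0, 42.0],  # male
    [66.0, 128.0, 33.0],  # female
    [67.0, 134.0, 35.0],  # female
    [65.5, 130.0, 34.0],  # female
])
group = np.array([1, 1, 1, 0, 0, 0])  # code the two groups as 1 and 0

# Fit Group = a + b1*x1 + b2*x2 + b3*x3 by ordinary least squares.
design = np.column_stack([np.ones(len(X)), X])
coef, _, _, _ = np.linalg.lstsq(design, group, rcond=None)
a, b = coef[0], coef[1:]
print("constant a:", a)
print("coefficients b1..b3:", b)

Variables with the largest standardized coefficients contribute most to predicting group membership, exactly as in the multiple-regression reading of Section 1.3.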

We need a method which will maximize the group differences displayed by these three discriminating variables when they are combined into a single discriminating variable. This will be achieved by calculating a discriminant function of the type:

score = w1*height + w2*weight + w3*age

Thus our problem is to find suitable values for the wi.

First, calculate the variance-covariance matrix A:

            Height    Weight      Age
Height        5.21     24.49     10.68
Weight       24.49    207.72     65.56
Age          10.68     65.56    155.17

Let w be a vector containing the unknown weights:

w = [w1  w2  w3]

and let d be a vector of the group differences (as shown above):

d = [4.1  35.2  8.4]

It can be shown that A*w = d. This is a relatively simple matrix-algebra calculation, since the equation can be rewritten as w = A^-1 * d, and we need only find A^-1, the inverse of matrix A, to solve it.

Using standardized variables we find the weights, and hence:

Discriminant score = 0.0029*height + 1.028*weight - 0.096*age

The group centroids (mean scores) are: females -1.25, males +1.23. Consequently a positive score (> 0) indicates a male and a negative score (< 0) indicates a female. The centroids would not be symmetrical about zero if the group sizes differed.
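Below is a minimal NumPy sketch of the w = A^-1 * d step, using the matrix A and difference vector d printed above. Note that the coefficients reported in the text are for standardized variables, so the raw solution of this system differs from them in scale.

import numpy as np

# Variance-covariance matrix A (height, weight, age) and vector of
# group mean differences d, as printed in the worked example.
A = np.array([
    [5.21,   24.49,  10.68],
    [24.49, 207.72,  65.56],
    [10.68,  65.56, 155.17],
])
d = np.array([4.1, 35.2, 8.4])

# Solve A @ w = d; numerically preferable to forming A^-1 explicitly.
w = np.linalg.solve(A, d)
print("raw discriminant weights w:", w)

# Score a case as the linear combination w1*height + w2*weight + w3*age.
male_means = np.array([70.3, 165.9, 42.3])
print("score at the male group means:", float(w @ male_means))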

CONCLUSION

1. The model is capable of grouping the given data into classes simultaneously.
2. It has been observed that for the input data there is no need to do the calculations manually; a computer program is capable of classifying the data into the various groups.
3. The method presented is very useful for classifying objects in various fields such as banking, the share market, traffic control, etc.

SCOPE FOR FUTURE WORK

The procedure and algorithm developed are based on certain assumptions. Based on our work on the pattern recognition system, the work that can be carried out in the future is outlined here. The procedure is very useful for classification, so it can be used in many fields to classify objects into different groups. For example:

- Marketing: help marketers discover distinct groups in their customer bases, and then use this knowledge to develop targeted marketing programs.
- Land use: identification of areas of similar land use in an earth observation database.
- Insurance: identifying groups of motor insurance policy holders with a high average claim cost.

References

1. Choulakian V. and Almhana J., 'An algorithm for nonmetric discriminant analysis', Computational Statistics & Data Analysis, 35, 253-264, 2001.
2. Dai D.Q. and Pong C.Y., 'Regularized discriminant analysis and its application', Journal of the Pattern Recognition Society, 36, 845-847, 2003.
3. Gavin C.C. and Nicola L.C. Talbot, 'Efficient leave-one-out cross-validation of kernel Fisher discriminant analysis', Journal of the Pattern Recognition Society, 36, 2585-2592, 2003.
4. Gonzalez R.C. and Thomason M.G., 'Syntactic Pattern Recognition: An Introduction', Addison-Wesley Publishing Company, 1978.
5. Gupta S.C. and Kapoor V.K., 'Fundamentals of Mathematical Statistics', Sultan Chand & Sons, 1999.
6. Lotikar R. and Kothari R., 'Adaptive linear dimensionality reduction for classification', Journal of the Pattern Recognition Society, 33, 185-189, 2000.
7. Ordowski M. and Gerard G.L. Meyer, 'Geometric linear discriminant analysis for pattern recognition', Journal of the Pattern Recognition Society, 37, 421-428, 2004.
