HoroPCA: Hyperbolic Dimensionality Reduction Via Horospherical Projections


Ines Chami*, Albert Gu*, Dat Nguyen*, Christopher Ré
*Equal contribution. Stanford University, CA, USA. Correspondence to: Ines Chami <chami@cs.stanford.edu>.
Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s).

Abstract

This paper studies Principal Component Analysis (PCA) for data lying in hyperbolic spaces. Given directions, PCA relies on: (1) a parameterization of subspaces spanned by these directions, (2) a method of projection onto subspaces that preserves information in these directions, and (3) an objective to optimize, namely the variance explained by projections. We generalize each of these concepts to the hyperbolic space and propose HoroPCA, a method for hyperbolic dimensionality reduction. By focusing on the core problem of extracting principal directions, HoroPCA theoretically better preserves information in the original data such as distances, compared to previous generalizations of PCA. Empirically, we validate that HoroPCA outperforms existing dimensionality reduction methods, significantly reducing error in distance preservation. As a data whitening method, it improves downstream classification by up to 3.9% compared to methods that don't use whitening. Finally, we show that HoroPCA can be used to visualize hyperbolic data in two dimensions.

1. Introduction

Learning representations of data in hyperbolic spaces has recently attracted significant interest in Machine Learning (ML) (Nickel & Kiela, 2017; Sala et al., 2018) due to their ability to represent hierarchical data with high fidelity in low dimensions (Sarkar, 2011). Many real-world datasets exhibit hierarchical structures, and hyperbolic embeddings have led to state-of-the-art results in applications such as question answering (Tay et al., 2018), node classification (Chami et al., 2019), link prediction (Balazevic et al., 2019; Chami et al., 2020b) and word embeddings (Tifrea et al., 2018). These developments motivate the need for algorithms that operate in hyperbolic spaces, such as nearest neighbor search (Krauthgamer & Lee, 2006; Wu & Charikar, 2020), hierarchical clustering (Monath et al., 2019; Chami et al., 2020a), or dimensionality reduction, which is the focus of this work.

Euclidean Principal Component Analysis (PCA) is a fundamental dimensionality reduction technique which seeks directions that best explain the original data. PCA is an important primitive in data analysis with many uses, such as (i) dimensionality reduction (e.g. for memory efficiency), (ii) data whitening and pre-processing for downstream tasks, and (iii) data visualization.

Here, we seek a generalization of PCA to hyperbolic geometry. Given a core notion of directions, PCA involves the following ingredients:

1. A nested sequence of affine subspaces (flags) spanned by a set of directions.
2. A projection method which maps points to these subspaces while preserving information (e.g. dot products) along each direction.
3. A variance objective to help choose the best directions.

The PCA algorithm is then defined by combining these primitives: given a dataset, it chooses directions that maximize the variance of projections onto a subspace spanned by those directions, so that the resulting sequence of directions optimally explains the data. Crucially, the algorithm only depends on the directions of the affine subspaces and not their locations in space (Fig. 1a). Thus, in practice we can assume that they all go through the origin (and hence become linear subspaces), which greatly simplifies computations.
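To ground the analogy before moving to hyperbolic space, here is a minimal NumPy sketch of Euclidean PCA organized around the three ingredients above (directions, a projection that keeps coordinates along them, and the variance of the projections). The function names are ours, chosen only for illustration.

```python
import numpy as np

def pca_directions(X, K):
    """Top-K principal directions (unit vectors) of the rows of X (shape n x d)."""
    Xc = X - X.mean(axis=0)                    # center the data
    cov = Xc.T @ Xc / len(Xc)                  # d x d covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)     # eigenvalues in ascending order
    return eigvecs[:, ::-1][:, :K]             # columns = top-K directions

def project(X, W):
    """Orthogonal projection of X onto the affine subspace through the mean spanned by W."""
    mu = X.mean(axis=0)
    return (X - mu) @ W @ W.T + mu

X = np.random.randn(200, 5) @ np.diag([3.0, 2.0, 1.0, 0.5, 0.1])
W = pca_directions(X, K=2)
X_proj = project(X, W)
print(np.var(X_proj, axis=0).sum())            # variance explained by the top-2 directions
```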

Figure 1: (a) Euclidean projections; (b) hyperbolic geodesic projections; (c) hyperbolic horospherical projections. Given data points (black dots), Euclidean and horospherical projections preserve distance information across different subspaces (black lines) pointing towards the same direction or point at infinity, while geodesic projections do not. (a): Distances between points, and therefore the explained variance, are invariant to translations along orthogonal directions (red lines). (b): Geodesic projections do not preserve this property: distances between green projections are not the same as distances between blue projections. (c): Horospherical projections project data by sliding it along the complementary direction defined by horospheres (red circles), and there exists an isometric mapping between the blue and green projections.

Generalizing PCA to manifolds is a challenging problem that has been studied for decades, starting with Principal Geodesic Analysis (PGA) (Fletcher et al., 2004), which parameterizes subspaces using tangent vectors at the mean of the data and maximizes distances from projections to the mean to find optimal directions. More recently, the Barycentric Subspace Analysis (BSA) method (Pennec, 2018) was introduced; it finds a more general parameterization of nested sequences of submanifolds by minimizing the unexplained variance. However, both PGA and BSA map points onto submanifolds using closest-point or geodesic projections, which do not attempt to preserve information along principal directions; for example, they cannot isometrically preserve hyperbolic distances between arbitrary points, and they shrink all path lengths by an exponential factor (Proposition 3.5).

Fundamentally, all previous methods look for subspaces rather than directions that explain the data, and can perhaps be better understood as principal subspace analysis rather than principal component analysis. Like PCA, they assume that all optimal subspaces go through a chosen base point, but unlike in the Euclidean setting, this assumption is now unjustified: "translating" the submanifolds does not preserve distances between projections (Fig. 1b). Furthermore, the dependence on the base point is sensitive: as noted above, the shrink factor of the projection depends exponentially on the distances between the subspaces and the data. Thus, having to choose a base point increases the number of necessary parameters and reduces stability.

Here, we propose HoroPCA, a dimensionality reduction method for data defined in hyperbolic spaces which better preserves the properties of Euclidean PCA. We show how to interpret directions using points at infinity (or ideal points), which then allows us to generalize core properties of PCA:

1. To generalize the notion of affine subspace, we propose parameterizing geodesic subspaces as the sets spanned by these ideal points. This yields multiple viable nested subspaces (flags) (Section 3.1).
2. To maximally preserve information in the original data, we propose a new projection method that uses horospheres, a generalization of complementary directions for hyperbolic space. In contrast with geodesic projections, these projections exactly preserve information – specifically, distances to ideal points – along each direction. Consequently, they preserve distances between points much better than geodesic projections (Section 3.2).
3. Finally, we introduce a simple generalization of explained variance that is a function of distances only and can be computed in hyperbolic space (Section 3.3).

Combining these notions, we propose an algorithm that seeks a sequence of principal components that best explain variations in hyperbolic data. We show that this formulation retains the location-independence property of PCA: translating target submanifolds along orthogonal directions (horospheres) preserves projected distances (Fig. 1c). In particular, the algorithm's objective depends only on the directions and not the locations of the submanifolds (Section 4).

We empirically validate HoroPCA on real datasets and for three standard PCA applications. First, (i) we show that it yields much lower distortion and higher explained variance than existing methods, reducing average distortion by up to 77%. Second, (ii) we validate that it can be used for data pre-processing, improving downstream classification by up to 3.8% in Average Precision score compared to methods that don't use whitening. Finally, (iii) we show that the low-dimensional representations learned by HoroPCA can be visualized to qualitatively interpret hyperbolic data.

2. Background

We first review some basic notions from hyperbolic geometry; a more in-depth treatment is available in standard texts (Lee, 2013). We discuss the generalization of coordinates and directions in hyperbolic space and then review geodesic projections. We finally describe generalizations of the notions of mean and variance to non-Euclidean spaces.

2.1. The Poincaré Model of Hyperbolic Space

Hyperbolic geometry is a Riemannian geometry with constant negative curvature $-1$, where curvature measures deviation from flat Euclidean geometry. For easier visualization, we work with the $d$-dimensional Poincaré model of hyperbolic space: $\mathbb{H}^d = \{x \in \mathbb{R}^d : \|x\| < 1\}$, where $\|\cdot\|$ is the Euclidean norm.

In this model, the Riemannian distance can be computed in Cartesian coordinates by:

$$d_H(x, y) = \cosh^{-1}\left(1 + 2\,\frac{\|x - y\|^2}{(1 - \|x\|^2)(1 - \|y\|^2)}\right). \quad (1)$$

Geodesics. Shortest paths in hyperbolic space are called geodesics. In the Poincaré model, they are represented by straight segments going through the origin and circular arcs perpendicular to the boundary of the unit ball (Fig. 2).

Geodesic submanifolds. A submanifold $M \subseteq \mathbb{H}^d$ is called (totally) geodesic if for every $x, y \in M$, the geodesic line connecting $x$ and $y$ belongs to $M$. This generalizes the notion of affine subspaces in Euclidean spaces. In the Poincaré model, geodesic submanifolds are represented by linear subspaces going through the origin and spherical caps perpendicular to the boundary of the unit ball.

2.2. Directions in Hyperbolic Space

The notions of directions, and of coordinates in a given direction, can be generalized to hyperbolic spaces as follows.

Ideal points. As with parallel rays in Euclidean spaces, geodesic rays in $\mathbb{H}^d$ that stay close to each other can be viewed as sharing a common endpoint at infinity, also called an ideal point. Intuitively, ideal points represent directions along which points in $\mathbb{H}^d$ can move toward infinity. The set of ideal points, called the boundary at infinity of $\mathbb{H}^d$, is represented by the unit sphere $S^{d-1} = \{\|x\| = 1\}$ in the Poincaré model. We abuse notation and say that a geodesic submanifold $M \subseteq \mathbb{H}^d$ contains an ideal point $p$ if the boundary of $M$ in $S^{d-1}$ contains $p$.

Busemann coordinates. In Euclidean spaces, each direction can be represented by a unit vector $w$. The coordinate of a point $x$ in the direction of $w$ is simply the dot product $w \cdot x$. In hyperbolic geometry, directions can be represented by ideal points, but dot products are not well-defined. Instead, we take a ray-based perspective: note that in Euclidean spaces, if we shoot a ray in the direction of $w$ from the origin, the coordinate $w \cdot x$ can be viewed as the normalized distance to infinity in the direction of that ray. In other words, as a point $y = tw$ ($t \geq 0$) moves toward infinity in the direction of $w$:

$$w \cdot x = \lim_{t \to \infty} \left( d(0, tw) - d(x, tw) \right).$$

This approach generalizes to other geometries: given a unit-speed geodesic ray $\gamma(t)$, the Busemann function $B_\gamma(x)$ of $\gamma$ is defined as:¹

$$B_\gamma(x) = \lim_{t \to \infty} \left( d(x, \gamma(t)) - t \right).$$

Up to an additive constant, this function only depends on the endpoint at infinity of the geodesic ray, and not on the starting point $\gamma(0)$. Thus, given an ideal point $p$, we define the Busemann function $B_p(x)$ of $p$ to be the Busemann function of the geodesic ray that starts from the origin of the unit ball model and has endpoint $p$. Intuitively, it represents the coordinate of $x$ in the direction of $p$. In the Poincaré model, there is a closed formula:

$$B_p(x) = \ln \frac{\|p - x\|^2}{1 - \|x\|^2}.$$

Table 1: Analogies of components and their corresponding coordinates, in both Euclidean and hyperbolic space.

    Euclidean:   component = unit vector $w$,    coordinate = dot product $x \cdot w$
    Hyperbolic:  component = ideal point $p$,    coordinate = Busemann function $B_p(x)$

Horospheres. The level sets of Busemann functions $B_p(x)$ are called horospheres centered at $p$. In this sense, they resemble spheres, which are level sets of distance functions. However, intrinsically as Riemannian manifolds, horospheres have curvature zero and thus also exhibit many properties of planes in Euclidean spaces. Every geodesic with endpoint $p$ is orthogonal to every horosphere centered at $p$. Given two horospheres with the same center, every orthogonal geodesic segment connecting them has the same length. In this sense, concentric horospheres resemble parallel planes in Euclidean spaces. In the Poincaré model, horospheres are Euclidean spheres that touch the boundary sphere $S^{d-1}$ at their ideal centers (Fig. 2). Given an ideal point $p$ and a point $x$ in $\mathbb{H}^d$, there is a unique horosphere $S(p, x)$ passing through $x$ and centered at $p$.

¹ Note that compared to the above formula, the sign convention is flipped for historical reasons.
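Both closed-form expressions above are straightforward to implement. The following NumPy sketch (ours, for illustration only) evaluates the Poincaré distance of Eq. (1) and the Busemann coordinate $B_p(x)$ for an ideal point $p$ on the boundary sphere.

```python
import numpy as np

def poincare_dist(x, y):
    """Hyperbolic distance between x and y in the Poincare ball model (Eq. 1)."""
    sq = np.sum((x - y) ** 2)
    denom = (1.0 - np.sum(x ** 2)) * (1.0 - np.sum(y ** 2))
    return np.arccosh(1.0 + 2.0 * sq / denom)

def busemann(p, x):
    """Busemann coordinate B_p(x) of x toward the ideal point p (with ||p|| = 1)."""
    return np.log(np.sum((p - x) ** 2) / (1.0 - np.sum(x ** 2)))

p = np.array([1.0, 0.0])               # ideal point on the boundary circle
x = np.array([0.3, 0.2])
print(poincare_dist(np.zeros(2), x))   # distance from the origin
print(busemann(p, x))                  # coordinate of x in the direction of p
```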
2.3. Geodesic Projections

PCA uses orthogonal projections to project data onto subspaces. Orthogonal projections are usually generalized to other geometries as closest-point projections. Given a target submanifold $M$, each point $x$ in the ambient space is mapped to the closest point to it in $M$:

$$\pi^G_M(x) = \arg\min_{y \in M} d_H(x, y).$$

One can view $\pi^G_M(\cdot)$ as the map that pushes each point $x$ along an orthogonal geodesic until it hits $M$. For this reason, it is also called geodesic projection. In the Poincaré model, these projections can be computed in closed form (see Appendix C).

2.4. Manifold Statistics

PCA relies on data statistics which do not generalize easily to hyperbolic geometry. One approach to generalize the arithmetic mean is to notice that it is the minimizer of the sum of squared distances to the inputs.

Motivated by this, the Fréchet mean (Fréchet, 1948) of a set of points $S$ in a Riemannian manifold $(M, d_M)$ is defined as:

$$\mu_M(S) := \arg\min_{y \in M} \sum_{x \in S} d_M(x, y)^2.$$

This definition only depends on the intrinsic distance of the manifold. For hyperbolic spaces, since squared distance functions are convex, $\mu(S)$ always exists and is unique.² Analogously, the Fréchet variance is defined as:

$$\sigma_M^2(S) := \frac{1}{|S|} \sum_{x \in S} d_M(x, \mu(S))^2. \quad (2)$$

We refer to (Huckemann & Eltzner, 2020) for a discussion of different intrinsic statistics in non-Euclidean spaces, and a study of their asymptotic properties.

² For more general geometries, existence and uniqueness hold if the data is well-localized (Kendall, 1990).

Figure 2: Hyperbolic geodesics (black lines) going through an ideal point (in red), and horospheres (red circles) centered at that same ideal point. The hyperbolic lengths of geodesic segments between two horospheres are equal.

3. Generalizing PCA to the Hyperbolic Space

We now describe our approach to generalize PCA to hyperbolic spaces. The starting point of HoroPCA is to pick $1 \leq K \leq d$ ideal points $p_1, \ldots, p_K \in S^{d-1}$ to represent $K$ directions in hyperbolic space (Section 2.2). Then, we generalize the core concepts of Euclidean PCA. In Section 3.1, we show how to generalize flags. In Section 3.2, we show how to project points onto the lower-dimensional submanifold spanned by a given set of directions, while preserving information along each direction. In Section 3.3, we introduce a variance objective to optimize and show that it is a function of the directions only.

3.1. Hyperbolic Flags

In Euclidean spaces, one can take the linear spans of more and more components to define a nested sequence of linear subspaces, called a flag. To generalize this to hyperbolic spaces, we first need to adapt the notion of linear/affine spans. Recall that geodesic submanifolds are generalizations of affine subspaces in Euclidean spaces.

Definition 3.1. Given a set of points $S$ (that could be inside $\mathbb{H}^d$ or on the boundary sphere $S^{d-1}$), the smallest geodesic submanifold of $\mathbb{H}^d$ that contains $S$ is called the geodesic hull of $S$ and denoted $GH(S)$.

Thus, given $K$ ideal points $p_1, p_2, \ldots, p_K$ and a base point $b \in \mathbb{H}^d$, we can define a nested sequence of geodesic submanifolds $GH(b, p_1) \subseteq GH(b, p_1, p_2) \subseteq \cdots \subseteq GH(b, p_1, \ldots, p_K)$. This will be our notion of flags.

Remark 3.2. The base point $b$ is only needed here for technical reasons, just like an origin $o$ is needed to define linear spans in Euclidean spaces. We will see next that it does not affect the projection results or objectives (Theorem 4.1).

Remark 3.3. We assume that none of $b, p_1, \ldots, p_K$ is in the geodesic hull of the other $K$ points. This is analogous to linear independence in Euclidean spaces.

3.2. Projections via Horospheres

In Euclidean PCA, points are projected to the subspaces spanned by the given directions in a way that preserves coordinates in those directions. We seek a projection method in hyperbolic spaces with a similar property.

Recall that coordinates are generalized by Busemann functions (Table 1), and that horospheres are level sets of Busemann functions. Thus, we propose a projection that preserves coordinates by moving points along horospheres. It turns out that this projection method also preserves distances better than the traditional geodesic projection.

As a toy example, we first show how the projection is defined in the $K = 1$ case (i.e. projecting onto a geodesic) and why it tends to preserve distances well. We will then show how to use $K > 1$ ideal points simultaneously.

3.2.1. Projecting onto K = 1 Directions

For $K = 1$, we have one ideal point $p$ and base point $b$, and the geodesic hull $GH(b, p)$ is just a geodesic $\gamma$. Our goal is to map every $x \in \mathbb{H}^d$ to a point $\pi^H_{b,p}(x)$ on $\gamma$ that has the same Busemann coordinate in the direction of $p$:

$$B_p(x) = B_p(\pi^H_{b,p}(x)).$$

Since level sets of $B_p(x)$ are horospheres centered at $p$, the above equation simply says that $\pi^H_{b,p}(x)$ belongs to the horosphere $S(p, x)$ centered at $p$ and passing through $x$. Thus, we define:

$$\pi^H_{b,p}(x) := \gamma \cap S(p, x). \quad (3)$$
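As a concrete illustration of Eq. (3), the following sketch computes the one-direction projection when the base point is the origin $o$ (which Section 4 shows can be assumed without loss of generality). In that case $GH(o, p)$ is the diameter toward $p$, and solving $B_p(s\,p) = B_p(x)$ for $s$ gives the closed form $s = -\tanh(B_p(x)/2)$; this derivation and the helper names are ours, for illustration, and not the authors' reference implementation.

```python
import numpy as np

def busemann(p, x):
    """Busemann coordinate B_p(x) for an ideal point p with ||p|| = 1."""
    return np.log(np.sum((p - x) ** 2) / (1.0 - np.sum(x ** 2)))

def horo_project_1d(p, x):
    """Horospherical projection of x onto the geodesic GH(o, p), taking the base point b = o.

    On the diameter {s * p : -1 < s < 1}, B_p(s * p) = log((1 - s) / (1 + s)), so matching the
    Busemann coordinate of x gives s = -tanh(B_p(x) / 2).
    """
    s = -np.tanh(busemann(p, x) / 2.0)
    return s * p

p = np.array([0.0, 1.0])
x = np.array([0.4, 0.3])
x_proj = horo_project_1d(p, x)
print(busemann(p, x), busemann(p, x_proj))   # the two coordinates agree, as required by Eq. (3)
```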

Another important property that $\pi^H_{b,p}(\cdot)$ shares with orthogonal projections in Euclidean spaces is that it preserves distances along a direction – lengths of geodesic segments that point to $p$ are preserved after projection (Fig. 3):

Proposition 3.4. For any $x \in \mathbb{H}^d$, if $y \in GH(x, p)$ then:

$$d_H(\pi^H_{b,p}(x), \pi^H_{b,p}(y)) = d_H(x, y).$$

Proof. This follows from the remark in Section 2.2 about horospheres: every geodesic going through $p$ is orthogonal to every horosphere centered at $p$, and every orthogonal geodesic segment connecting concentric horospheres has the same length (Fig. 2). In this case, the segments from $x$ to $y$ and from $\pi^H_{b,p}(x)$ to $\pi^H_{b,p}(y)$ are two such segments, connecting $S(p, x)$ and $S(p, y)$.

Figure 3: $x'$, $y'$ are horospherical (green) projections of $x$, $y$. Proposition 3.4 shows $d_H(x', y') = d_H(x, y)$. The distance between the two geodesic (blue) projections is smaller.

3.2.2. Projecting onto K > 1 Directions

We now generalize the above construction to projections onto higher-dimensional submanifolds. We describe the main ideas here; Appendix A contains more details, including an illustration in the case $K = 2$ (Fig. 5).

Fix a base point $b \in \mathbb{H}^d$ and $K > 1$ ideal points $\{p_1, \ldots, p_K\}$. We want to define a map from $\mathbb{H}^d$ to $M := GH(b, p_1, \ldots, p_K)$ that preserves the Busemann coordinates in the directions of $p_1, \ldots, p_K$, i.e.:

$$B_{p_j}(x) = B_{p_j}\left(\pi^H_{b,p_1,\ldots,p_K}(x)\right) \quad \text{for every } j = 1, \ldots, K.$$

As before, the idea is to take the intersection with the horospheres centered at the $p_j$'s and passing through $x$:

$$\pi^H_{b,p_1,\ldots,p_K}: \mathbb{H}^d \to M, \qquad x \mapsto M \cap S(p_1, x) \cap \cdots \cap S(p_K, x).$$

It turns out that this intersection generally consists of two points instead of one. When that happens, one of them will be strictly closer to the base point $b$, and we define $\pi^H_{b,p_1,\ldots,p_K}(x)$ to be that point.

As with Proposition 3.4, $\pi^H_{b,p_1,\ldots,p_K}(\cdot)$ preserves distances along $K$-dimensional manifolds (Corollary A.10). In contrast, geodesic projections in hyperbolic spaces never preserve distances (except between points already in the target):

Proposition 3.5. Let $M \subseteq \mathbb{H}^d$ be a geodesic submanifold. Then every geodesic segment of distance at least $r$ from $M$ gets at least $\cosh(r)$ times shorter under the geodesic projection $\pi^G_M(\cdot)$ to $M$:

$$\mathrm{length}(\pi^G_M(I)) \leq \frac{1}{\cosh(r)}\,\mathrm{length}(I).$$

In particular, the shrink factor grows exponentially as the segment $I$ moves away from $M$. The proof is in Appendix B.

Computation. Interestingly, horospherical projections can be computed without actually computing the horospheres. The key idea is that if we let $P = GH(p_1, \ldots, p_K)$ be the geodesic hull of the horospheres' centers, then the intersection $S(p_1, x) \cap \cdots \cap S(p_K, x)$ is simply the orbit of $x$ under the rotations around $P$. (This is true for the same reason that spheres whose centers lie on the same axis must intersect along a circle around that axis.) Thus, $\pi^H_{b,p_1,\ldots,p_K}(\cdot)$ can be viewed as the map that rotates $x$ around $P$ until it hits $M$. It follows that it can be computed by:

1. Find the geodesic projection $c = \pi^G_P(x)$ of $x$ onto $P$.
2. Find the geodesic $\alpha$ on $M$ that is orthogonal to $P$ at $c$.
3. Among the two points on $\alpha$ whose distance to $c$ equals $d_H(x, c)$, return the one closer to $b$.

The detailed computations and a proof that this recovers horospherical projections are provided in Appendix A.

3.3. Intrinsic Variance Objective

In Euclidean PCA, directions are chosen to maximally preserve information from the original data. In particular, PCA chooses directions that maximize the Euclidean variance of projected data. To generalize this to hyperbolic geometry, we define an analog of variance that is intrinsic, i.e. dependent only on the distances between data points. As we will see in Section 4, having an intrinsic objective helps make our algorithm location (or base point) independent.

The usual notion of Euclidean variance is the sum of squared distances to the mean of the projected data points. Generalizing this is challenging because non-Euclidean spaces do not have a canonical choice of mean. Previous works have generalized variance either by using the unexplained variance or the Fréchet variance. The former is the sum of squared residual distances to the projections, and thus avoids computing a mean; however, it is not intrinsic. The latter is intrinsic (Fletcher et al., 2004) but involves finding the Fréchet mean, which is not necessarily a canonical notion of mean and can only be computed by gradient descent.

Our approach uses the observation that in Euclidean space:

$$\sigma^2(S) = \frac{1}{n}\sum_{x \in S}\|x - \mu(S)\|^2 = \frac{1}{2n^2}\sum_{x,y \in S}\|x - y\|^2.$$
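For completeness, the pairwise form of the Euclidean variance follows by expanding the squared norms; this is the identity that the hyperbolic objective below imitates:

$$\frac{1}{2n^2}\sum_{x,y\in S}\|x-y\|^2 = \frac{1}{2n^2}\sum_{x,y\in S}\left(\|x\|^2 - 2\,x\cdot y + \|y\|^2\right) = \frac{1}{n}\sum_{x\in S}\|x\|^2 - \Big\|\frac{1}{n}\sum_{x\in S}x\Big\|^2 = \frac{1}{n}\sum_{x\in S}\|x-\mu(S)\|^2.$$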

Thus, we propose the following generalization of variance:

$$\sigma_H^2(S) = \frac{1}{2n^2}\sum_{x,y \in S} d_H(x, y)^2. \quad (4)$$

This function agrees with the usual variance in Euclidean space, while being a function of distances only. Thus it is well defined in non-Euclidean spaces, is easily computed, and, as we show next, has the desired invariance due to the isometry properties of horospherical projections.

4. HoroPCA

Section 3 formulated several simple primitives – including directions, flags, projections, and variance – in a way that is generalizable to hyperbolic geometry. We now revisit standard PCA, showing how it has a simple definition that combines these primitives using optimization. This directly leads to the full HoroPCA algorithm by simply using the hyperbolic analogs of these primitives.

Euclidean PCA. Given a dataset $S$ and a target dimension $K$, Euclidean PCA greedily finds a sequence of principal components $p_1, \ldots, p_K$ that maximizes the variance of orthogonal projections $\pi^E_{o,p_1,\ldots,p_k}(\cdot)$ onto the linear subspaces spanned by these components:³

$$p_1 = \arg\max_{\|p\|=1} \sigma^2(\pi^E_{o,p}(S)) \quad \text{and} \quad p_{k+1} = \arg\max_{\|p\|=1} \sigma^2(\pi^E_{o,p_1,\ldots,p_k,p}(S)).$$

Thus, for each $1 \leq k \leq K$, $\{p_1, \ldots, p_k\}$ is the optimal set of directions of dimension $k$.

³ Here $o$ denotes the origin, and $\pi^E_{o,p_1,\ldots,p_k}(\cdot)$ denotes the projection onto the affine span of $\{o, p_1, \ldots, p_k\}$, which is equivalent to the linear span of $\{p_1, \ldots, p_k\}$.

HoroPCA. Because we have generalized the notions of flag, projection, and variance to hyperbolic geometry, the HoroPCA algorithm can be defined in the same fashion. Given a dataset $S$ in $\mathbb{H}^d$ and a base point $b \in \mathbb{H}^d$, we seek a sequence of $K$ directions that maximizes the variance of horosphere-projected data:

$$p_1 = \arg\max_{p \in S^{d-1}} \sigma_H^2(\pi^H_{b,p}(S)) \quad \text{and} \quad p_{k+1} = \arg\max_{p \in S^{d-1}} \sigma_H^2(\pi^H_{b,p_1,\ldots,p_k,p}(S)). \quad (5)$$

Base point independence. Finally, we show that algorithm (5) always returns the same result regardless of the choice of base point $b \in \mathbb{H}^d$. Since our variance objective only depends on the distances between projected data points, it suffices to show that these distances do not depend on $b$.

Theorem 4.1. For any $b, b'$ and any $x, y \in \mathbb{H}^d$, the two projected distances $d_H(\pi^H_{b,p_1,\ldots,p_K}(x), \pi^H_{b,p_1,\ldots,p_K}(y))$ and $d_H(\pi^H_{b',p_1,\ldots,p_K}(x), \pi^H_{b',p_1,\ldots,p_K}(y))$ are equal.

The proof is included in Appendix A. Thus, HoroPCA retains the location-independence property of Euclidean PCA: only the directions of target subspaces matter; their exact locations do not (Fig. 1). Therefore, just like in the Euclidean setting, we can assume without loss of generality that $b$ is the origin $o$ of the Poincaré model. This:

1. alleviates the need to use $d$ extra parameters to search for an appropriate base point, and
2. simplifies computations, since in the Poincaré model, geodesic submanifolds that go through the origin are simply linear subspaces, which are easier to deal with.

After computing the principal directions, which span the target $M = GH(o, p_1, \ldots, p_K)$, the reduced-dimensionality data can be found by applying a Euclidean rotation that sends $M$ to $\mathbb{H}^K$, which also preserves hyperbolic distances.
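As a toy illustration of objective (5) in the single-component case, the sketch below (ours, with the base point taken to be the origin and reusing the closed-form one-direction projection from the earlier sketch) scores randomly sampled candidate ideal points by the intrinsic variance of Eq. (4) of their projections and keeps the best one. A practical implementation would optimize the directions directly (e.g. with gradient-based methods) rather than rely on random search.

```python
import numpy as np

def poincare_dist(x, y):
    """Hyperbolic distance in the Poincare ball (Eq. 1)."""
    sq = np.sum((x - y) ** 2)
    return np.arccosh(1.0 + 2.0 * sq / ((1.0 - np.sum(x ** 2)) * (1.0 - np.sum(y ** 2))))

def busemann(p, x):
    """Busemann coordinate B_p(x) for an ideal point p with ||p|| = 1."""
    return np.log(np.sum((p - x) ** 2) / (1.0 - np.sum(x ** 2)))

def horo_project_1d(p, x):
    """K = 1 horospherical projection onto GH(o, p), as in the earlier sketch."""
    return -np.tanh(busemann(p, x) / 2.0) * p

def intrinsic_variance(points):
    """Eq. (4): (1 / 2n^2) times the sum of squared pairwise hyperbolic distances."""
    n = len(points)
    return sum(poincare_dist(x, y) ** 2 for x in points for y in points) / (2.0 * n ** 2)

def horo_pca_1d(points, n_candidates=500, seed=0):
    """Pick the single ideal point whose K = 1 projections have maximal intrinsic variance."""
    rng = np.random.default_rng(seed)
    best_p, best_var = None, -np.inf
    for _ in range(n_candidates):
        p = rng.normal(size=points[0].shape)
        p /= np.linalg.norm(p)                 # candidate ideal point on the boundary sphere
        var = intrinsic_variance([horo_project_1d(p, x) for x in points])
        if var > best_var:
            best_p, best_var = p, var
    return best_p, best_var

points = [np.array([0.5, 0.1]), np.array([0.2, -0.4]), np.array([-0.3, 0.3]), np.array([0.0, 0.6])]
p1, var1 = horo_pca_1d(points)
print(p1, var1)
```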
5. Experiments

We now validate the empirical benefits of HoroPCA on three PCA uses. First, for dimensionality reduction, HoroPCA preserves information (distances and variance) better than previous methods, which are sensitive to base point choices and distort distances more (Section 5.2). Next, we validate that our notion of hyperbolic coordinates captures variation in the data and can be used for whitening in classification tasks (Section 5.3). Finally, we visualize the representations learned by HoroPCA in two dimensions (Section 5.4).

5.1. Experimental Setup

Baselines. We compare HoroPCA to several dimensionality reduction methods, including: (1) Euclidean PCA, which should perform poorly on hyperbolic data; (2) exact PGA; (3) Tangent PCA (tPCA), which approximates PGA by moving the data to the tangent space at the Fréchet mean and then solving Euclidean PCA; (4) BSA; (5) Hyperbolic Multi-Dimensional Scaling (hMDS) (Sala et al., 2018), which takes a distance matrix as input and recovers a configuration of points that best approximates these distances; and (6) a hyperbolic autoencoder (hAE) trained with gradient descent (Ganea et al., 2018; Hinton & Salakhutdinov, 2006). To demonstrate their dependence on base points, we also include two baselines that perturb the base point in PGA and BSA. We open-source our implementation⁴ and refer to Appendix C for implementation details on all baselines and HoroPCA.

⁴ https://github.com/HazyResearch/HoroPCA

Table 2: Dimensionality reduction results on 10-dimensional hyperbolic embeddings reduced to two dimensions. Results are averaged over 5 runs for non-deterministic methods. Best in bold and second best underlined.

              Balanced Tree              Phylo Tree                 Diseases                   CS Ph.D.
              distortion  variance       distortion  variance       distortion  variance       distortion  variance
              (↓)         (↑)            (↓)         (↑)            (↓)         (↑)            (↓)         (↑)
  PCA         0.84        0.34           0.94        0.40           0.90        0.26           0.84        1.68
  tPCA        0.70        1.16           0.63        14.34          0.63        3.92           0.56        11.09
  PGA         0.63±0.07   2.11±0.47      0.64±0.01   15.29±0.51     0.66±0.02   3.16±0.39      0.73±0.02   6.14±0.60
  PGA-Noise   0.87±0.08   0.29±0.30      0.64±0.02   15.08±0.77     0.88±0.04   0.53±0.19      0.79±0.03   4.58±0.64
  BSA         0.50±0.00   3.02±0.01      0.61±0.03   18.60±1.16     0.52±0.02   5.95±0.25      0.70±0.01   8.15±0.96
  BSA-Noise   0.74±0.12   1.06±0.67      0.68±0.02   13.71±0.72     0.80±0.11   1.62±1.30      0.79±0.02   4.41±0.59
  hAE         0.26±0.00   6.91±0.00      0.32±0.04   45.87±3.52     0.18±0.00   14.23±0.06     0.37±0.02   22.12±2.47
  hMDS        0.22        7.54           0.74        40.51          0.21        15.05          0.83        19.93
  HoroPCA     0.19±0.00   7.15±0.00      0.13±0.01   69.16±1.96     0.15±0.01   15.46±0.19     0.16±0.02   36.79±0.70

Datasets. For dimensionality reduction experiments, we consider standard hierarchical datasets previously used to evaluate the benefits of hyperbolic embeddings. More specifically, we use the datasets in (Sala et al., 2018), including a fully balanced tree, a phylogenetic tree, a biological graph of disease relationships, and a graph of Computer Science (CS) Ph.D. advisor-advisee relationships. These datasets have respectively 40, 344, 516, and 1025 nodes, and we use the code from (Gu et al., 2018) to embed them in the Poincaré ball. For data whitening experiments, we reproduce the experimental setup from (Cho et al., 2019) and use the Polbooks, Football, and Polblogs datasets, which have 105, 115, and 1224 nodes, respectively. These real-world networks are embedded in two dimensions using Chamberlain et al. (2017)'s embedding method.

Evaluation metrics. To measure distance preservation after projection, we use average distortion. If $\pi(\cdot)$ denotes a mapping from high- to low-dimensional representations, the average distortion of a dataset $S$ is computed as:

$$\frac{1}{|S|^2} \sum_{x \neq y \in S} \frac{\left| d_H(\pi(x), \pi(y)) - d_H(x, y) \right|}{d_H(x, y)}.$$

We also measure the Fréchet variance in Eq. (2), which is the analogue of the objective that Euclidean PCA optimizes.⁵ Note that the mean in Eq. (2) cannot be computed in closed form, and we therefore compute it with gradient descent.

⁵ All mentioned PCA methods, including HoroPCA, optimize for some form of variance but not Fréchet variance or distortion.

5.2. Dimensionality Reduction

We report metrics for the reduction of 10-dimensional embeddings to two dimensions in Table 2, and refer to Appendix C for additional results, such as more component and dimension configurations. All results suggest that HoroPCA better preserves the information contained in the high-dimensional representations.

On distance preservation, HoroPCA outperforms all methods, with significant improvements on larger datasets. This supports our theoretical result that horospherical projections better preserve distances than geodesic projections. Furthermore, HoroPCA also outperform
