Comparing Apache Ignite And Cassandra For HTAP Applications


Comparing Apache Ignite and Cassandra for HTAP Applications
Denis Magda
Apache Ignite PMC Chair
GridGain Product Management
2018 GridGain Systems, Inc. GridGain Company Confidential

Agenda
- Hybrid Transactional/Analytical Processing
- Ignite vs. Cassandra
  - Denormalized Architecture or Collocated Processing?
  - Row-level Isolation or Distributed Transactions?
  - Caching or In-Memory Store?
- Q&A

Hybrid Transaction/Analytical Processing

“HTAP is an emerging application architecture that "breaks the wall" between transaction processing and analytics. It enables more informed and "in business real time" decision making.” (Gartner Inc., 2014)

Ignite vs. Cassandra: Denormalized Architecture or Collocated Processing?

Cassandra Path: Denormalized Architecture
- Denormalization Strategy
  - Increasing Performance in Favor of Data Redundancy
- Pros
  - Performance
- Cons
  - Query-Driven Architecture
  - Easy to Start and Hard to Evolve

Let’s Take an Example

Start With a Question!
Q1: What are the car models produced by a vendor within a particular time frame (newest first)?

Run Queries
Q1: What are the car models produced by a vendor within a particular time frame (newest first)?
Extra Queries Supported Out of the Box

Somewhere in Production
Q2: What is the number of cars of a specific model produced by a vendor?

Let’s reuse existing table!
Q2: What is the number of cars of a specific model produced by a vendor?

Let’s reuse existing table!
Q2: What is the number of cars of a specific model produced by a vendor?
InvalidRequest: code=2200 [Invalid query] message="PRIMARY KEY column "car model" cannot be restricted (preceding column "production year" is not restricted)"
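The error above follows from how Cassandra orders clustering columns inside a partition: a clustering column can only be restricted when the columns before it in the primary key are restricted too. The sketch below is a toy model of that ordering rule in plain Python, not Cassandra's query planner. The schema (partition key `vendor`, clustering columns `production_year` then `car_model`) and the function name `validate_where` are assumptions made for illustration; the slides do not show the table definition, and the real rule has more cases (range restrictions, ALLOW FILTERING) than this covers.

```python
def validate_where(partition_key, clustering_columns, restricted):
    """Toy check of the clustering-column ordering rule (hypothetical helper)."""
    restricted = set(restricted)
    missing = [c for c in partition_key if c not in restricted]
    if missing:
        raise ValueError(f"partition key column(s) {missing} must be restricted")
    first_unrestricted = None
    for col in clustering_columns:
        if col in restricted:
            # A restricted column after a gap is what triggers the error.
            if first_unrestricted is not None:
                raise ValueError(
                    f'PRIMARY KEY column "{col}" cannot be restricted '
                    f'(preceding column "{first_unrestricted}" is not restricted)'
                )
        elif first_unrestricted is None:
            first_unrestricted = col
    return True


# Assumed schema: PRIMARY KEY ((vendor), production_year, car_model)
PARTITION_KEY = ["vendor"]
CLUSTERING = ["production_year", "car_model"]

# Q1 restricts vendor and production_year: allowed.
q1_ok = validate_where(PARTITION_KEY, CLUSTERING, ["vendor", "production_year"])

# Q2 restricts vendor and car_model, skipping production_year: rejected.
try:
    validate_where(PARTITION_KEY, CLUSTERING, ["vendor", "car_model"])
    q2_error = None
except ValueError as exc:
    q2_error = str(exc)

print(q1_ok)
print(q2_error)
```

Q1 passes because it restricts the clustering columns in order; Q2 fails because it skips `production_year`.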

Solution: Create New Table (one table for Q1, another for Q2)
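A minimal sketch of what the one-table-per-query approach means for the application, using plain Python dictionaries in place of Cassandra tables. All names here (`cars_by_vendor_year`, `car_count_by_vendor_model`, `record_car`, the sample vendors and models) are hypothetical. The point it illustrates is that every write has to update each query-specific copy of the data.

```python
from collections import defaultdict

# One structure per query, as in a query-driven (denormalized) model.
cars_by_vendor_year = defaultdict(list)        # serves Q1
car_count_by_vendor_model = defaultdict(int)   # serves Q2


def record_car(vendor, model, year):
    """Every write goes to both tables; this is the data redundancy cost."""
    cars_by_vendor_year[vendor].append((year, model))
    car_count_by_vendor_model[(vendor, model)] += 1


def q1_models(vendor, year_from, year_to):
    """Q1: models produced by a vendor within a time frame, newest first."""
    rows = [r for r in cars_by_vendor_year.get(vendor, []) if year_from <= r[0] <= year_to]
    return [model for _year, model in sorted(rows, reverse=True)]


def q2_count(vendor, model):
    """Q2: number of cars of a specific model produced by a vendor."""
    return car_count_by_vendor_model.get((vendor, model), 0)


record_car("Acme", "Falcon", 2015)
record_car("Acme", "Falcon", 2017)
record_car("Acme", "Comet", 2016)
record_car("Zen", "Lotus", 2016)

print(q1_models("Acme", 2016, 2017))
print(q2_count("Acme", "Falcon"))
```

A third question in production would mean a third structure and a third write path, which is the "easy to start and hard to evolve" trade-off.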

Ignite Path: Affinity Collocation
- Related Data Collocation
  - Countries and Cities, Vendors and Cars
  - On a Single Cluster Node
- Collocated Processing
  - Efficient SQL JOINs
  - Map-Reduce for Fast Data
  - Machine Learning w/o ETL
  - Reduced Network Traffic - Better Performance

Collocated Data Distribution

Collocated JOINs
1. Initial Query
2. Query execution over local data
3. Reduce multiple results into one
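The three steps above can be sketched in plain Python. This is a single-process toy, not Ignite's API: the "cluster" is a list of dictionaries, and `node_for`, `build_cluster`, `local_join_count`, and `cars_per_vendor` are hypothetical names. The assumption it demonstrates is that cars are placed by their vendor's id (the affinity key), so each node can join its cars with its vendors without touching another node.

```python
import zlib
from collections import Counter

NODES = 3  # assumed cluster size for the sketch


def node_for(affinity_key):
    """Map an affinity key to a node; crc32 keeps the mapping deterministic."""
    return zlib.crc32(affinity_key.encode("utf-8")) % NODES


def build_cluster(vendors, cars):
    cluster = [{"vendors": {}, "cars": []} for _ in range(NODES)]
    for vendor_id, name in vendors:
        cluster[node_for(vendor_id)]["vendors"][vendor_id] = name
    for model, vendor_id in cars:
        # Cars are placed by vendor_id, so they land next to their vendor.
        cluster[node_for(vendor_id)]["cars"].append((model, vendor_id))
    return cluster


def local_join_count(node):
    """Step 2: join cars with vendors using only this node's data."""
    counts = Counter()
    for _model, vendor_id in node["cars"]:
        counts[node["vendors"][vendor_id]] += 1
    return counts


def cars_per_vendor(cluster):
    """Step 1: send the query to every node. Step 3: reduce the partial results."""
    total = Counter()
    for node in cluster:
        total.update(local_join_count(node))
    return dict(total)


cluster = build_cluster(
    vendors=[("v1", "Acme"), ("v2", "Zen")],
    cars=[("Falcon", "v1"), ("Comet", "v1"), ("Lotus", "v2")],
)
result = cars_per_vendor(cluster)
print(result)
```

Because the join key and the placement key are the same, the lookup `node["vendors"][vendor_id]` never misses; without collocation it would require a network hop or a data shuffle.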

Machine Learning: Genetic Algorithms
- Biological Evolution Simulation
- Chromosome and Genes
[Diagram: Ignite Cluster, Collocated Computation. Fitness Calculation (F), Crossover (C), and Mutation (M) are split into F1/C1/M1 and F2/C2/M2, each set running next to one node's Durable Memory.]
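For readers unfamiliar with the three operations named on this slide, here is a minimal single-process genetic algorithm in plain Python. It is not Ignite's ML API, and it does not distribute anything; in the collocated setup the slide describes, each node would run these steps over the chromosomes it stores. The toy objective (maximize the number of 1 bits in a chromosome) and all parameter values are assumptions chosen for brevity.

```python
import random


def fitness(chromosome):
    """Fitness Calculation: count the 1 genes (toy objective)."""
    return sum(chromosome)


def crossover(a, b, rng):
    """Crossover: take the head of one parent and the tail of the other."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def mutate(chromosome, rng, rate=0.05):
    """Mutation: flip each gene with a small probability."""
    return [1 - g if rng.random() < rate else g for g in chromosome]


def evolve(pop_size=20, genes=16, generations=30, seed=42):
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(genes)] for _ in range(pop_size)]
    best_history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        best_history.append(fitness(population[0]))
        parents = population[: pop_size // 2]
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - 1)
        ]
        # Keep the best chromosome unchanged so the best fitness never drops.
        population = [population[0]] + children
    return max(population, key=fitness), best_history


best, history = evolve()
print(fitness(best), history[0], history[-1])
```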

Ignite vs. Cassandra: Row-level Isolation or Distributed Transactions?

Cassandra Path: Lightweight Transactions
- Cassandra is an Eventually Consistent DB
  - With Tunable Consistency
- Lightweight Transactions
  - a.k.a. Compare and Set Transactions
  - Row-Level Isolation
- Applicability
  - Atomic and linearizable updates of a record
  - Prevent duplications on inserts
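The two applicability cases above correspond to conditional writes: insert only if the row is absent, and update only if the row still holds an expected value. The sketch below shows those compare-and-set semantics on a single in-process table guarded by a lock. It is only an analogy: Cassandra reaches the same per-row guarantee across replicas with a consensus protocol, which this toy does not model. The class `ToyTable` and its method names are hypothetical.

```python
import threading


class ToyTable:
    """Single-process stand-in for one table with row-level conditional writes."""

    def __init__(self):
        self._rows = {}
        self._lock = threading.Lock()

    def insert_if_not_exists(self, key, value):
        """Prevents duplicate inserts: only the first writer for a key wins."""
        with self._lock:
            if key in self._rows:
                return False
            self._rows[key] = value
            return True

    def update_if(self, key, expected, new_value):
        """Compare and set: apply the update only if the current value matches."""
        with self._lock:
            if self._rows.get(key) != expected:
                return False
            self._rows[key] = new_value
            return True

    def get(self, key):
        with self._lock:
            return self._rows.get(key)


users = ToyTable()
first = users.insert_if_not_exists("alice", "alice@old.example")
second = users.insert_if_not_exists("alice", "impostor@example")
stale = users.update_if("alice", "wrong@example", "x@example")
fresh = users.update_if("alice", "alice@old.example", "alice@new.example")
print(first, second, stale, fresh, users.get("alice"))
```

Note the scope: each operation protects one row. Nothing here coordinates a change that spans two rows or two tables, which is the gap distributed transactions address.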

Ignite Path: ACID Transactions
- Distributed ACID Transactions
  - Pessimistic/Optimistic
- 2 Phase Commit
  - From RAM to disk
- Deadlock-free Transactions
- Applicability
  - No limitations. Classic Transactions.
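As a rough illustration of the two-phase commit idea named above, the following plain-Python sketch coordinates writes across several participants: phase one asks every participant to prepare, and phase two commits only if all of them agreed. It is a textbook-style toy, not Ignite's implementation; it omits locking, failure recovery, and the RAM-to-disk persistence the slide mentions. `Participant` and `two_phase_commit` are hypothetical names.

```python
class Participant:
    """One node holding part of the data (hypothetical toy class)."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit  # simulates a node that votes "no"
        self.data = {}
        self._staged = None

    def prepare(self, writes):
        if not self.can_commit:
            return False
        self._staged = dict(writes)  # stage the writes, do not apply yet
        return True

    def commit(self):
        self.data.update(self._staged or {})
        self._staged = None

    def rollback(self):
        self._staged = None


def two_phase_commit(work):
    """work is a list of (participant, writes) pairs."""
    prepared = []
    # Phase 1: every participant must vote yes.
    for participant, writes in work:
        if participant.prepare(writes):
            prepared.append(participant)
        else:
            for p in prepared:
                p.rollback()
            return False
    # Phase 2: all voted yes, so apply everywhere.
    for participant, _writes in work:
        participant.commit()
    return True


node_a, node_b = Participant("a"), Participant("b")
ok = two_phase_commit([(node_a, {"acct-1": 50}), (node_b, {"acct-2": 150})])

node_c = Participant("c", can_commit=False)
failed = two_phase_commit([(node_a, {"acct-1": 0}), (node_c, {"acct-3": 200})])
print(ok, failed, node_a.data, node_b.data, node_c.data)
```

In the failed run, `node_a` had already staged its write, yet its stored value is unchanged after the rollback; that all-or-nothing outcome across nodes is what separates this from row-level isolation.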

Ignite vs. Cassandra: Caching or In-Memory Store?

Caching Cassandra
- Off-heap Options
  - Page Cache
  - Row Cache
  - Bloom Filters
- Java Heap Options
  - Key Cache
  - DSE In-Memory

Ignite Memory-Centric Storage
- Automatic Defragmentation
- Predictable memory consumption
- Fully Transactional (Write-Ahead Log)
- Instantaneous Restarts
- Off-heap: Removes noticeable GC pauses
- Stores Superset of Data
[Diagram: three Server Nodes, each with Durable Memory]

YCSB: Average Load

YCSB: Ramping Up Load

Caching Cassandra With Ignite
- No rip-and-replace
  - Keep Cassandra
- Automatic Read/Write-Through
  - Key-Value Only
- Distributed SQL
  - Over Ignite Data
- ACID Transactions
  - Ignite layer
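The read/write-through pattern mentioned above can be sketched with a small key-value cache sitting in front of a backing store. This is a plain-Python illustration of the pattern only, not Ignite's cache store integration: the backing store is an ordinary dictionary standing in for Cassandra, and `ReadWriteThroughCache` and its `store_reads` counter are hypothetical.

```python
class ReadWriteThroughCache:
    """Key-value cache that loads misses from, and forwards writes to, a store."""

    def __init__(self, store):
        self._store = store      # stand-in for the underlying database
        self._memory = {}
        self.store_reads = 0     # counts how often the store had to be read

    def get(self, key):
        if key not in self._memory:
            # Read-through: a miss is loaded from the store and kept in memory.
            self.store_reads += 1
            value = self._store.get(key)
            if value is None:
                return None
            self._memory[key] = value
        return self._memory[key]

    def put(self, key, value):
        # Write-through: the store is updated together with the cache.
        self._store[key] = value
        self._memory[key] = value


backing_store = {"car:1": "Falcon"}
cache = ReadWriteThroughCache(backing_store)

first_read = cache.get("car:1")    # miss, loaded from the store
second_read = cache.get("car:1")   # hit, served from memory
cache.put("car:2", "Comet")        # lands in both the cache and the store
print(first_read, second_read, cache.store_reads, backing_store)
```

The application talks only to the cache, which is why the existing database can stay in place underneath it.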

Summary: Ignite or Cassandra?
- Simplified Architecture
  - Denormalization vs Affinity Collocation
- Data Consistency and Transactions
  - Ignite: ACID Transactions
  - Cassandra: Row-level Isolation
- In-Memory Store and Performance
  - Ignite: read intensive and mixed workloads
  - Cassandra: write intensive workloads (big load)

Any Questions?
Thank you for joining us.
Follow the gridgain.com #apacheignite #gridgain #dmagda

