Practical Distributed Systems - Storage - Part 2


Data storage in distributed systems - part II
Piotr Jaczewski
RTB House
Practical Distributed Systems, 2022

What will this lecture be about?
In this part of the course we will focus on NoSQL databases and their usage in distributed systems. We will also briefly talk about data formats and their features from the perspective of a distributed system.
We will cover the topics of:
- Data models in NoSQL storages.
- Implementation details of selected NoSQL storages.
- Data formats and schema evolution in distributed systems.

NoSQL Databases
Some of the common features of NoSQL databases:
- Designed to easily scale horizontally.
- Usually don't use strict schemas.
- Concentrated around data aggregates.
- Don't use SQL, but some have their own query languages.
- Mostly offer only limited transactional capabilities (e.g. no multi-object transactions), a consequence of running in a clustered environment.
- Provide various options for data consistency.

CAP Theorem
Source: hazelcast.com

CAP Theorem - critique
A great article by Martin Kleppmann.

NoSQL Data Models

Key Value Model
- May be viewed as a generalization of a hash table with put/get/remove operations.
- Data type agnostic - interpreting the stored value is the responsibility of client applications.
- Some implementations include built-in data types such as maps, sets, and counters.
- No or limited querying capabilities.
- Offer great performance.
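A minimal sketch of the put/get/remove interface over an in-memory Python dict (the class and method names are illustrative, not from any particular product):

```python
class KeyValueStore:
    """Toy in-memory key-value store: a thin wrapper over a hash table."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # The store is type agnostic: the value is opaque to us.
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def remove(self, key):
        self._data.pop(key, None)

store = KeyValueStore()
store.put("user:42", b'{"name": "Alice"}')  # the client decides how to encode values
print(store.get("user:42"))
store.remove("user:42")
```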

Document Model
- Can be considered a subtype of key-value databases.
- Has some awareness of the data stored.
- The document format is usually JSON, BSON, XML, etc.
- Documents don't have to share the same schema within a table/collection.
- Slightly improved querying capabilities.
- Support for secondary indexes.
- Allow partial document updates.
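A hedged sketch of the flexible schema and partial updates using pymongo (the database, collection, and field names are made up for illustration):

```python
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")  # assumed local instance
users = client.shop.users  # database/collection names are illustrative

# Documents in one collection need not share a schema.
users.insert_one({"_id": 1, "name": "Alice", "email": "alice@example.com"})
users.insert_one({"_id": 2, "name": "Bob", "loyalty_points": 120})

# Partial document update: only the named fields are touched.
users.update_one({"_id": 2}, {"$set": {"email": "bob@example.com"}})
```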

Wide-Column
- Another variation of the key-value model.
- No relations between tables.
- Maps keys to rows; rows consist of groups of columns.
- Groups of columns are called column families.
- Usually each row may have a varying number of columns.
- Some implementations feature an SQL-ish query language.
- Nonexistent columns do not take storage space.
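Conceptually, a wide-column table can be pictured as a nested map, keyed first by row key, then by column family, then by column qualifier (a sketch of the model, not any vendor's actual storage format):

```python
# row key -> column family -> column qualifier -> value
table = {
    "user#42": {
        "profile": {"name": "Alice", "city": "Warsaw"},
        "visits":  {"2022-01-03": 2, "2022-01-05": 1},
    },
    # A different row may have a different set of columns;
    # absent columns simply take no storage space.
    "user#43": {
        "profile": {"name": "Bob"},
    },
}
```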

Source: scnsoft.com

Source: scnsoft.com

Graph model databases
- Focused on the relationships between data entities.
- Store both entities and the edges between them.
- Both entities and edges can have their own custom properties.
- Support querying and traversing object graphs.
- Traversing the graph is very fast.
- Suited to specific graph-related scenarios.

Source: neo4j.com

MongoDB
- Document oriented database - documents in JSON.
- Support for large data sets.
- Supports searching by fields, range queries, and regular expressions.
- Supports indexing/secondary indexes.
- Dedicated clients/REST API.
- Mature and production ready.
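A hedged pymongo sketch of these query capabilities (collection, fields, and values are assumptions for illustration):

```python
from pymongo import ASCENDING, MongoClient

client = MongoClient("mongodb://localhost:27017")  # assumed local instance
products = client.shop.products  # names are illustrative

# Secondary index on a regular field.
products.create_index([("price", ASCENDING)])

# Search by field, by range, and by regular expression.
by_field = products.find({"category": "books"})
by_range = products.find({"price": {"$gte": 10, "$lt": 50}})
by_regex = products.find({"name": {"$regex": "^distrib", "$options": "i"}})
```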

MongoDB Architecture

MongoDB Sharding
Methods of sharding:
- Range based sharding - may result in shard imbalance.
- Hash based sharding - more even value distribution.
- Tag-aware sharding - explicitly determine the groups of shards on which a range of documents will reside.
- The Config Server periodically assesses the balance of shards across the cluster.
- A rebalance operation moves chunks between shards.
- Chunks contain adjacent values of shard keys.
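A sketch of enabling hash based sharding through pymongo's admin command interface (the router host, database, collection, and shard key are assumptions):

```python
from pymongo import MongoClient

client = MongoClient("mongodb://mongos-host:27017")  # connect via a mongos router; host assumed

# Enable sharding for a database, then shard a collection on a hashed key
# for a more even distribution of documents across shards.
client.admin.command("enableSharding", "analytics")
client.admin.command(
    "shardCollection", "analytics.events", key={"user_id": "hashed"}
)
```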

MongoDB cluster replication
- Sharding is combined with replication.
- Each shard is replicated across a replica set.
- The master accepts writes, which are then applied to replicas.
- The master node is determined by an election.
- To become a primary, a node must be able to contact more than half of the replica set.
- The election is based on a priority set by the administrator and the timestamp of the last operation.

MongoDB Concurrency/Consistency
- Supports ACID transactions on multiple documents (since version 4.0; across shards since 4.2).
- Pessimistic concurrency control at the global, database, and collection levels.
- Optimistic concurrency control at the document level (WiredTiger storage engine).
- Consistency is tunable:
- Write concern - the client may require a write to be acknowledged only by the primary, or also by a specified number of replicas - strong consistency.
- Read preference - the client may specify whether a read request is routed to the primary or to a secondary replica.
- Read concern - the client may choose to read only replicated data that is durable, or to read the newest data that may not yet be replicated and thus can be lost.
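A hedged pymongo sketch of the three knobs (hosts, replica set name, and collection are assumptions):

```python
from pymongo import MongoClient, ReadPreference
from pymongo.read_concern import ReadConcern
from pymongo.write_concern import WriteConcern

client = MongoClient("mongodb://h1,h2,h3/?replicaSet=rs0")  # hosts/set name assumed
coll = client.shop.orders  # names are illustrative

# Strong consistency: acks from a majority of replicas, read only durable data.
strong = coll.with_options(
    write_concern=WriteConcern(w="majority"),
    read_concern=ReadConcern("majority"),
)

# Latency-oriented: read possibly stale data from a secondary.
fast_reads = coll.with_options(read_preference=ReadPreference.SECONDARY_PREFERRED)
```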

MongoDB Usage Considerations
Reasons to use:
- When a strict schema is a problem.
- CRUD applications, Web APIs, Content Management Systems.
- Straightforward architecture.
- Rather easy maintenance and configuration.
Reasons not to use:
- Be careful with relationships between documents - no constraints.
- The lack of a rigid schema is not always your friend - custom versioning patterns must be implemented by the application.

HBase
- Wide-Column oriented.
- The data model is strictly based on the original Google BigTable specification.
- Provides random-access database services on top of HDFS.
- Does not bother with data redundancy or disk failures - these are handled by HDFS.
- Can be easily accessed via MapReduce jobs on Hadoop.
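A hedged example of basic access through the happybase client (Thrift-based; the host, table, and column names are assumptions, and the table is assumed to exist with the given column families):

```python
import happybase

conn = happybase.Connection("hbase-thrift-host")  # host assumed
table = conn.table("users")  # table name is illustrative

# Columns are addressed as b"family:qualifier".
table.put(b"user#42", {b"profile:name": b"Alice", b"visits:2022-01-03": b"2"})

row = table.row(b"user#42")
print(row[b"profile:name"])  # b'Alice'
```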

HBase Architecture

HBase Storage Architecture

HBase RegionServer
- Regions are equivalent to range based shards.
- The HBase Master evaluates the balance of regions across all RegionServers.
- Regions can be split when they become too large, and can be relocated to other RegionServers by the HBase Master.
- RegionServers are co-located with the Hadoop DataNodes for good data locality.
- Data locality can be broken by RegionServer rebalances and failovers.
- Data locality is usually restored when the underlying HFiles are compacted.

HBase RegionServer

HBase Concurrency/Consistency
- No multi-object transactions; only atomicity of operations at the row level.
- Row-level locking for every update, even when a mutation crosses multiple Column Families.
- Reads are not blocked by write operations - a concurrent read will see the previous version from before the update.
- Scans do not exhibit snapshot isolation; however, all writes committed before the scan started will be visible, and writes committed after it started may be seen as well.
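Row-level atomicity illustrated with happybase (host and names assumed): a single put touching two column families of one row is applied under the row lock, either fully or not at all, while puts to different rows carry no such guarantee.

```python
import happybase

table = happybase.Connection("hbase-thrift-host").table("users")  # names assumed

# One mutation, one row, two column families: applied atomically.
table.put(b"user#42", {
    b"profile:email": b"alice@example.com",
    b"stats:last_login": b"2022-05-01",
})
```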

HBase Concurrency/Consistency
- Secondary replicas of regions provide availability for read operations.
- Until failover is done, the affected region is available only for reads.
- Thus secondary RegionServers are read-only.
- Secondary RegionServers follow the primary and see only committed updates.
- Secondary RegionServers do not make their own copy of the HFiles - no storage overhead; the data is kept in the BlockCache or read from the primary's HFiles.
- A replica RegionServer's memory state can be refreshed from the primary HFiles at an interval - higher chance of stale reads.
- A replica RegionServer's memory state can be asynchronously updated via WAL replication - lower chance of stale reads.
- Reads from replica RegionServers can also be allowed via Timeline Consistency.

HBase Timeline Consistency

HBase Usage Considerations
Reasons to use:
- If a true, BigTable-style wide-column data model is required.
- If MapReduce jobs must be run on the data.
- If there is an existing Hadoop/HDFS cluster.
- If there are billions of potential rows.
Reasons not to use:
- Complex multi-element architecture.
- Painful operations and maintenance.
- High performance requires a lot of memory for the BlockCache.
- A myriad of dependencies for client libraries.

Cassandra
- Wide-column oriented (implicitly).
- The clustering model is based on concepts derived from Amazon Dynamo.
- Linear scalability: you can expand or shrink the cluster horizontally whenever needed, using commodity hardware, with no downtime.
- Each node in the cluster can act as a cluster coordinator and perform all operations.
- Leaderless architecture; uses a gossip protocol to track the cluster state.

Cassandra Consistent Hashing
- Cassandra distributes data throughout the cluster using a consistent hashing technique.
- Each node is allocated a range of hash values, and data is placed on a node if the primary key's hash lies within the node's range.
- If the number of ranges equals the number of nodes, then the addition or removal of a node requires a lot of data movement and can leave the cluster imbalanced.
- So we introduce many more ranges, mapped to virtual nodes.
- Virtual nodes are mapped to physical nodes, so that the addition/removal of a node moves only a few ranges and leaves the cluster balanced (see the sketch below).
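A toy consistent-hash ring with virtual nodes (a sketch of the technique, not Cassandra's implementation; hash function and vnode count are arbitrary choices):

```python
import bisect
import hashlib

class ConsistentHashRing:
    """Toy consistent-hash ring with virtual nodes."""

    def __init__(self, vnodes_per_node=64):
        self.vnodes = vnodes_per_node
        self._ring = []  # sorted list of (hash, physical node) pairs

    def _hash(self, key):
        return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")

    def add_node(self, node):
        # Each physical node owns many small ranges via its virtual nodes.
        for i in range(self.vnodes):
            bisect.insort(self._ring, (self._hash(f"{node}#{i}"), node))

    def remove_node(self, node):
        self._ring = [(h, n) for (h, n) in self._ring if n != node]

    def node_for(self, key):
        # The key belongs to the first virtual node clockwise from its hash.
        h = self._hash(key)
        idx = bisect.bisect(self._ring, (h, chr(0x10FFFF)))
        return self._ring[idx % len(self._ring)][1]

ring = ConsistentHashRing()
for n in ("node-a", "node-b", "node-c"):
    ring.add_node(n)
print(ring.node_for("user#42"))  # adding/removing a node moves only a few ranges
```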

Cassandra Consistent Hashing

Cassandra Replication

Cassandra Reads/Writes

Cassandra Consistency
- Follows the Amazon Dynamo model, with tunable consistency for writes and reads.
Write consistency levels:
- ALL - all replicas must acknowledge the write.
- ONE/TWO/THREE - the specified number of nodes must acknowledge the write.
- QUORUM - a majority of replica nodes must acknowledge the write.
- ANY - any node can acknowledge, even if the node is not responsible for storing the particular data.
Write consistency levels in a multi-DC scenario:
- LOCAL QUORUM - a majority of replica nodes in the local DC must acknowledge the write.
- EACH QUORUM - a majority of replica nodes in each clustered DC must acknowledge the write.

Cassandra Consistency
Read consistency levels:
- ALL - all replica nodes are polled for the data.
- ONE/TWO/THREE - reads are polled from the specified number of replica nodes.
- QUORUM - the read completes after a majority of nodes have returned the data.
- LOCAL ONE/LOCAL QUORUM/EACH QUORUM - analogous levels for a multi-DC setup.
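A hedged example of tuning write and read consistency per statement with the DataStax Python driver (contact points, keyspace, and table are assumptions):

```python
from cassandra import ConsistencyLevel
from cassandra.cluster import Cluster
from cassandra.query import SimpleStatement

cluster = Cluster(["127.0.0.1"])  # contact points assumed
session = cluster.connect("shop")  # keyspace name is illustrative

# Write acknowledged by a majority of replicas...
write = SimpleStatement(
    "INSERT INTO users (id, name) VALUES (%s, %s)",
    consistency_level=ConsistencyLevel.QUORUM,
)
session.execute(write, (42, "Alice"))

# ...and read from a majority: QUORUM writes + QUORUM reads overlap in at
# least one replica, so the read is guaranteed to observe the write.
read = SimpleStatement(
    "SELECT name FROM users WHERE id = %s",
    consistency_level=ConsistencyLevel.QUORUM,
)
print(session.execute(read, (42,)).one().name)
```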

Cassandra Consistency Levels

| Write \ Read | ONE | QUORUM | ALL |
|---|---|---|---|
| ONE | High performance and availability, lowest consistency. | Fast writes with high availability, moderate consistency. | Fast writes with high availability; slow reads with consistency and low availability. |
| QUORUM | Fast and highly available reads with moderate consistency. | Medium performance, high availability and strict consistency. | Slow reads with low availability and strict consistency. |
| ALL | Slow writes with low availability; fast and consistent reads. | Slow writes with low availability; consistent, available reads of medium performance. | Strict consistency, lowest performance and availability. |

Cassandra Consistency Repair
- If the write consistency level is not set to ALL, inconsistencies may appear due to node downtimes, network partitions, etc.
- Hinted handoffs - a technique where a node stores an update for a temporarily unavailable replica node; when the failed node is restored, it receives the update.
- Write consistency level ANY will write a hinted handoff even if all replicas are down.
- Hinted handoffs are deleted after some time.
- Read repair - if hinted handoffs were deleted, a normal read operation may be used to repair inconsistent replicas.
- After returning the value to the client, the coordinator node writes the correct data to the inconsistent replica.
- Anti-entropy repair - compares all nodes and writes the most recent data to fix replicas.

Cassandra Concurrency
- The unit of modification is a single column in a row.
- Multiple clients can update separate columns in a row without conflict.
- Conflicting writes are resolved using timestamps - "Last Write Wins".
- Support for "lightweight", "optimistic" transactions, limited to a single operation.
- Compare-and-set - the operation checks the value and, if the value is as expected, updates it; otherwise the operation needs to be retried (see the sketch below).
- Transactions are implemented by a quorum-based transaction protocol (Paxos): f-distributed-systems/paxos.html
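A hedged example of a Cassandra lightweight transaction via the Python driver (contact point, keyspace, table, and columns are assumptions); `was_applied` on the result reports the compare-and-set outcome:

```python
from cassandra.cluster import Cluster

session = Cluster(["127.0.0.1"]).connect("shop")  # contact point/keyspace assumed

# Compare-and-set: update only if the current value matches the expectation.
result = session.execute(
    "UPDATE accounts SET balance = %s WHERE id = %s IF balance = %s",
    (90, 42, 100),
)
if not result.was_applied:
    # Someone else changed the balance first - re-read and retry.
    print("CAS failed, retry with the current value")
```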

Cassandra Log-Structured Merge Tree
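The LSM write path can be sketched as follows: writes go to an in-memory memtable (plus a commit log, omitted here); when the memtable fills up, it is flushed as an immutable sorted file (an SSTable); reads check the memtable first, then the SSTables from newest to oldest. A toy single-threaded model, with no commit log or compaction:

```python
class TinyLSMTree:
    """Toy LSM tree: memtable + immutable sorted runs ("SSTables")."""

    def __init__(self, memtable_limit=4):
        self.memtable = {}
        self.sstables = []  # newest last; each is a sorted list of (key, value)
        self.limit = memtable_limit

    def put(self, key, value):
        self.memtable[key] = value  # in-memory write; disk I/O is deferred
        if len(self.memtable) >= self.limit:
            self._flush()

    def _flush(self):
        # Flush the memtable as an immutable, sorted run.
        self.sstables.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        if key in self.memtable:
            return self.memtable[key]
        for sstable in reversed(self.sstables):  # the newest run wins
            for k, v in sstable:
                if k == key:
                    return v
        return None

tree = TinyLSMTree()
for i in range(6):
    tree.put(f"k{i}", i)
print(tree.get("k1"), len(tree.sstables))  # 1 1  (the first four keys were flushed)
```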

Cassandra Usage Considerations
Reasons to use:
- Applicable to most data scenarios.
- Huge datasets, accessed by "almost" SQL (no aggregate functions, no joins).
- Easy horizontal scaling, cross-DC replication.
- Leaderless architecture - increased availability.
Reasons not to use:
- Disk space consumption - it is difficult to tune SSTable compaction properly in data-intensive scenarios.
- Runs on the JVM - garbage collection, etc. may affect performance.
- Relatively complex - bugs?

Aerospike
- Very fast data access by key.
- Hybrid storage - RAM, block devices, PMEM (Persistent Memory).
- Can store data on raw SSD/NVMe block devices, bypassing the usual filesystem layer.
- In-memory indexes are preserved in a shared memory segment (for fast recovery).
- Relatively simple single-master-per-partition replication scheme.
- Client-tunable consistency policies.
- Transactions are limited to a single record and are CAS based.

Aerospike Distribution and Data Model
- Data is always distributed into 4096 partitions, evenly spread across the nodes.
- The data model is straightforward (see the sketch below).
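A hedged sketch using the aerospike Python client: a record is addressed by a (namespace, set, user key) tuple and holds named bins (the host, namespace, set, and bin names below are assumptions):

```python
import aerospike

client = aerospike.client({"hosts": [("127.0.0.1", 3000)]}).connect()  # host assumed

# A record is addressed by (namespace, set, user key) and holds named bins.
key = ("test", "users", "user42")
client.put(key, {"name": "Alice", "visits": 1})

(key, meta, bins) = client.get(key)
print(bins["visits"], meta["gen"])  # meta['gen'] is the record generation, used for CAS
```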

Aerospike Usage Scenario

Aerospike Usage Considerations
Reasons to use:
- Low latency access to data.
- Support for high-concurrency writes.
- Easy cluster management.
Reasons not to use:
- The Community version is severely limited (number of nodes, amount of data).
- Frequent scans - due to the hash-based distribution model, scans are heavy and involve all nodes.

What to store in a NoSQL Database?
- Unstructured data (images, text, binary files).
- Structured data in text document formats: JSON, XML.
- Structured data in binary formats: BSON (Binary JSON), ProtocolBuffers, Apache Avro.
What we are aiming for is forward/backward compatibility between schema versions. We want to support schema evolution.

Avro vs Protocol Buffers
Protocol Buffers:
- Support for schema evolution via field tags (order numbers).
- Field tags cannot change, and any change of a field's type must be compatible.
- Field tags must be written into the serialized message.
- Prevalent in various Google ecosystem tools.
Avro:
- Must know the writer's schema to support schema evolution.
- More concise binary format (no field tags).
- Wider support in various Apache Big Data tools.
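A hedged sketch of Avro schema evolution with the fastavro library: the payload carries no field tags, so the reader must know the writer's schema and resolves it against its own (the schemas below are illustrative):

```python
import io

from fastavro import schemaless_reader, schemaless_writer

# Writer's (old) schema and reader's (new) schema: a field was added with a default.
writer_schema = {
    "type": "record", "name": "User",
    "fields": [{"name": "name", "type": "string"}],
}
reader_schema = {
    "type": "record", "name": "User",
    "fields": [
        {"name": "name", "type": "string"},
        {"name": "email", "type": "string", "default": ""},  # new field, defaulted
    ],
}

# The binary payload is just the values, which is why the reader
# needs the writer's schema to decode it.
buf = io.BytesIO()
schemaless_writer(buf, writer_schema, {"name": "Alice"})
buf.seek(0)

record = schemaless_reader(buf, writer_schema, reader_schema)
print(record)  # {'name': 'Alice', 'email': ''}
```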

Schema Registry Pattern
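The pattern in a nutshell: writers register each schema with a central registry and prefix every message with the schema's id; readers use that id to fetch the writer's schema and decode. A toy in-memory sketch (the registry class, framing, and all names are hypothetical, not any specific registry's API):

```python
import io
import struct

from fastavro import schemaless_reader, schemaless_writer

class SchemaRegistry:
    """Toy in-memory stand-in for a schema registry service."""

    def __init__(self):
        self._schemas = {}

    def register(self, schema):
        schema_id = len(self._schemas) + 1
        self._schemas[schema_id] = schema
        return schema_id

    def fetch(self, schema_id):
        return self._schemas[schema_id]

registry = SchemaRegistry()
schema = {"type": "record", "name": "User",
          "fields": [{"name": "name", "type": "string"}]}
schema_id = registry.register(schema)

# Producer: frame the message as <4-byte schema id><avro payload>.
buf = io.BytesIO()
schemaless_writer(buf, schema, {"name": "Alice"})
message = struct.pack(">I", schema_id) + buf.getvalue()

# Consumer: read the id, fetch the writer schema, then decode.
(read_id,) = struct.unpack(">I", message[:4])
writer_schema = registry.fetch(read_id)
print(schemaless_reader(io.BytesIO(message[4:]), writer_schema))
```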

Summary
We have discussed:
- The available data models for NoSQL databases.
- The implementation details of the MongoDB, Apache HBase, Apache Cassandra, and Aerospike databases.
- Data formats, schema evolution, and the schema registry pattern.
