Accelerating NoSQL

2y ago

18 Views

2 Downloads

924.48 KB

43 Pages

Last View : 1m ago

Last Download : 3m ago

Upload by : Azalea Piercy

Report this link

Download PDF

Transcription

Accelerating NoSQLRunning Voldemort on HailDBSunny GleasonMarch 11, 2011

whoami Sunny Gleason, human passion: distributed systems engineering previous.Ning : custom social networksAmazon.com : infra & web services now.building cloud infrastructure

whereami twitter : twitter.com/sunnygleason github : github.com/sunnygleason linkedin : linkedin.com/in/sunnygleason

what’s in this presentation? NoSQL Roundup Voldemort who? HailDB wha? Results & Next Steps Special Bonus Material

NoSQL “Not Only” SQL What’s the point? Proponent: “reaching next level of scale” Cynic: “cloud is hype, ops nightmare”

what does it gain? Higher performance, scalability, availability More robust fault-tolerance Simplified systems design Easier operations

what does it lose? Reduced / simplified programming model No ad-hoc queries, no joins, no txns Not ACID: Atomicity / Consistency /Isolation / Durability Operations / management is still evolving Challenging to quantify health of system Fewer domain experts

NoSQL MapKV Stores(volatile)KV RiakCouchDB,MongoDBCassandra,BigTable,HBaseNeo4J

NoSQL MapKV Stores(volatile)KV umnStoreGraphStoreDynamo,Voldemort,Riak

motivation database on 1 box : ok database with master/slave replication : ok database on cluster : tricky database on SAN : time bomb

performanceComplexity Sharding FusionIO SSDMySQLVoldemort ClusterMemcached1K10K100KAggregate Operations / Sec1M

dynamo case study Amazon : high read throughput, alwaysaccessible writes Shopping cart application ‘Glitches’ ok, duplicate or missing item Data loss or unavailability is unacceptable Solution: K-V schema plus smart routing &data placement

key-value storage Essentially, a gigantic hash table Typically assign byte[] values to byte[] keys Plus versioning mixed in to handle failuresand conflicts Yes, you *can* do range partitioning; inpractice, avoid it because of hot spots

k-v: durable vs. volatile RAM is ridiculous speed (ns), not durable Disk is persistent and slow (3-7ms) RAID eases the pain a bit (4-8x throughput) SSD is providing good promise (100-300us) FusionIO is redefining the space (30-100us)

dynamo clones Voldemort : from LinkedIn, dynamoimplementation in Java (default: BDB-JE) Riak : from Basho, dynamo implementationin Erlang (default: embedded InnoDB)

Voldemort Developed at LinkedIn Scalable Key-Value Storage Based on Amazon Dynamo model High Read Throughput Always Writable

Voldemort features Consistent Hashing Quorum settings : R, W, N Auto-sharding & rebalancing Pluggable storage engines

Consistent Hashing* Arrange keys around ring* Compute token in ringusing hash function* Determine nodes responsiblefor token using live set

R/W/N N : maximum number of nodes to queryfor an operation R : read quorum W : write quorum Can adjust ‘quorum’ to balance throughputand fault-tolerance

setting up Voldemort 1Step 1: Download the codeDownload either a recent stable release or, for those who like to live moredangerously, the up-to-the-minute build from the build server.Step 2: Start single node cluster bin/voldemort-server.sh config/single node cluster /tmp/voldemort.log &Step 3: Start commandline test client and do some operations bin/voldemort-shell.sh test tcp://localhost:6666Established connection to test via tcp://localhost:6666 put "hello" "world" get "hello"version(0:1): "world" delete "hello" get "hello"null exitk k thx bye.

setting up Voldemort 2 For a cluster, use cloud startup scripts Works with Amazon EC2 See sting-Infrastructure

Voldemort client libraries Java, Scala, Clojure Ruby Python C

storage engines BDB-JE (Oracle Sleepycat, the original) Krati (LinkedIn, pretty new) HailDB (new!) MySQL (old / dated)

BDB-JE Log-Structured B-Tree Fast Storage When Mostly Cached Configured without fsync() by default -writes are batched and flushed periodically

Krati Fast Hash-Oriented Storage Uses memory-mapped files for speed Configured without fsync() by default -writes are batched and flushed periodically

HailDB Fork of MySQL InnoDB plugin(contributors : Oracle, Google, Facebook,Percona) Higher stability for large data sets Fast crash recovery External from Java heap (ease GC pain) apt-get install haildb (from launchpad PPA) Use “flush-once-per-second” mode

HailDB, Java & VoldemortVoldemort ClientVoldemort Nodev-storage-innog414-haildbJNAHailDB(log, buffer pool,tablespace)Voldemort NodeVoldemort Node

HailDB & Java g414-haildb : where the magic happens uses JNA: Java Native Access dynamic binding to libhaildb shared library auto-generated from .h file (w/ JNAerator) Pointer classes & other shenanigans

HailDB schemakey VARBINARY(200)version VARBINARY(200)value BLOBPRIMARY KEY( key, version)

implementation gotchas InnoDB API-level usage is unclear Synchronization & locking is unclear Therefore. I learned to love reading C Error handling is *nasty* Installation a bit of a pain

experimental setup OS X: 8-Core Xeon, 32GB RAM, 200GBOWC SSD Faban Benchmark : PUT 64-byte key, 1024byte value Scenarios:1, 2, 4, 8 threads 512M Java Heap

Perf: BDB Put

Perf: Krati Put

Perf: HailDB Put

future work Improve Packaging / Installation Schema refinements & perf enhancements Online backup/export with XtraBackup JNI Bindings

schema refinements Build upon Nokia work on fast k-v schema 8-byte ‘long’ key hash vs. full key bytes Smart use of secondary indexes Native representation of vector clocks Delayed / soft deletion Expect 40-50% performance boost

InnoDB tuning Skinny columns, skinny rows! (esp. Primary Key) Varchar enum ‘bad’, int or smallint ‘good’ fixed-width rows allows in-place updates Use covering indexes strategically More data per page means faster index scans,more efficient buffer pool utilization You only get so many trx’s on given CPU/RAMconfiguration - benchmark this!

refined schemaid BIGINT (auto increment)key hash BIGINTkey VARBINARY(200)version VARBINARY(200)value BLOBPRIMARY KEY( id)KEY( key hash)

online backup hot backup of data to other machine /destination test Percona Xtrabackup with HailDB next step: backup/export to Hadoop/HDFS(similar to Cloudera Sqoop tool)

JNI bindings JNI can get 2-5x perf boost vs. JNA . at the expense of nasty code Will go for schema optimizations andInnoDB tuning tips *first*

resources github.com/voldemort/voldemortfreenode #voldemort on/g414-haildb jna.dev.java.net

more resources Amazon Dynamo Faban / XFaban HailDB Drizzle PBXT

Thank You!

Riak Memcached, Redis Column Store Cassandra, BigTable, HBase Graph Store Document Store CouchDB, MongoDB Neo4J. NoSQL Map NoSQL Key-Value Store KV Stores (durable) KV Stores (vo

Related Documents:

NoSQL in the Enterprise - accorsi.net

towards NoSQL databases is the high cost of legacy RDBMS vendors versus NoSQL software. In general, NoSQL software is a fraction of what vendors such as IBM and Oracle charge for their databases. What Constitutes an Enterprise NoSQL Solution? What should a technology leader or decision-maker look for in a NoSQL offering that deﬁnes it as truly

12 Views

1y ago

Learn MongoDB in 1 Day

Chapter 2: NoSQL Tutorial: Learn NoSQL Features, Types, What is, Advantages What is NoSQL? NoSQL is a non-relational DMS, that does not require a fixed schema, avoids joins, and is easy to scale. NoSQL database is used for distributed data stores with humongous data storage needs. No

26 Views

2y ago

A Study for Integrating SQL and NoSQL Databases

1. SQL Interface to RDB and NoSQL Database. To access both RDB and NoSQL databases, we provide a general SQL interface. It consists of a SQL query parser and Apache Phoenix to connect HBase as a NoSQL database to a SQL translator and a MySQL JDBC driver to an RDB connector. The application does not need to change the queries or manage NoSQL .

14 Views

1y ago

Embrace the Base: Oracle NoSQL Database

Oracle NoSQL Database Hands on Workshop Lab Exercise 1 - Start Oracle NoSQL Database instance and access data from Formatter classes In this exercise, you will start an Oracle NoSQL Database instance that has movie data preloaded. KVLite will be used as the Oracle NoSQL Database Instance. A very brief introduction to KVLite follows:

8 Views

1y ago

When, Where and Why to Use NoSQL - Aerospike

NoSQL database. A NoSQL database can be used to solve new problems that require: Scalability - A NoSQL database can scale horizontally to the scale required by big data. Applications can run in parallel on a cloud-based cluster comprising of dozens, hundreds, or even thousands of commodity servers. The NoSQL scale-out architecture

6 Views

1y ago

FIVE YEARS OF NOSQL CONSIDERED

MongoDB! Riak! Couchbase! Voldemort! Neo4J Titan!for!HBase! DOCUMENT COLUMN GEO GRAPH SEARCH OBJECT . NOSQL PROJECTS WITH BACKING . NOSQL COMPANIES MONGODB!!! CASSANDRA!! RIAK!! COUCH*! NEO4J ELASTICSEARCH! NOSQL INVESTMENTS 73 39 26

14 Views

2y ago

On the State of NoSQL Benchmarks

family of NoSQL storage systems (e.g. column, document stores). Most notably is the Yahoo! Cloud Serving Bench-mark (YCSB), which has become the de-facto standard for cross-family comparison of NoSQL databases [8]. In the remainder of this section we compare benchmark initiatives per NoSQL category, starting with key-value benchmarks and YCSB.

14 Views

2y ago

Goal of the presentation is to give an introduction of NoSQL databases ...

1. A paradigm shift from the traditional data model. SQL databases enforce a strict schema, whereas NoSQL databases has a week notion of schema. At the core all NoSQL databases are key/value systems, the difference is whether the database understands the value or not. Different type of NoSQL databases have different properties. We'll see four major

20 Views

1y ago

Recent Views

IN THIS ISSUE CAR WASH INSIGHT Recent, Notable M&A Transactions .

9/8/2022 Club Car Wash Sites of Tidal Wave Express Car Wash 8 8/29/2022 Take 5 Car Wash Soft Touch Car Wash, Auto Oasis Car Wash, Clearwater Car Wash and Birdie's Car Wash 5 8/25/2022 WhiteWater Express Geaux Clean Car Wash 7 8/19/2022 ModWash Home Team Car Wash 3 8/18/2022 Splash In ECO Car Wash (Wills Group) Blue Hen Car Wash 2

8m ago

100 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

ESSENTIAL PLAN - Discovery

Car insurance only Car and home insurance Car insurance only Car and home insurance 12.5% 25% 5% 10% YOUR FUEL CASH BACK PERCENTAGE GET TO THE HIGHEST CASH BACK PERCENTAGE Add at least R250 000 of home insurance (household contents, buildings or both) Take your car to Tiger Wheel & Tyre and pass the Annual MultiPoint check

1y ago

269 Views

CAR INSURANCE EVERYTHING EXPLAINED - RSA Insurance Group

CAR INSURANCE 93013821.indd 1 15/03/2018 10:46. 2 WELCOME TO µ CAR INSURANCE Thank you for choosing µ to protect you and your car. This booklet is intended to help you check your cover and to reassure you that µ will give you the protection you need for the year ahead. First of all, to help you understand your car insurance policy we want to .

1y ago

274 Views

Describe types and purposes of insurance.

D.O. CAPS Consumer Skills: Insurance—10E 3 Your car - The car you drive can also affect your insurance rates. Insurance companies place certain kinds of cars in special risk categories. You should ask your insurance agent before making a car purchase to make sure you aren't getting a car that will cost you extra for your liability insurance.

1y ago

233 Views

Money Online Price Comparison - WordPress

you to compare car insurance quotes. You'll notice at the top of the screen is a warning regarding telling the truth when completing any form of car insurance quote as something withheld, which later becomes known, can void an insurance claim. 7 The process of completing a car insurance price comparison is broken down into 4

1y ago

174 Views

Contours Options Infant Car Seat Adapter Instruction Sheet

your Infant Car Seat, as described in the instruction manual provided by the Infant Car Seat manufacturer. † WHEN USING ONLY ONE INFANT CAR SEAT ADAPTER OR TWO FOR TWINS, THE FOLLOWING INFANT CAR SEATS CAN BE USED: † If your Infant Car Seat is not one of the models listed above, DO NOT use your infant car seat with this car seat adapter.

2y ago

564 Views

Microsoft Advertising Travel Update

last minute cruise deals -58.50% Car Rental Queries WoW Change car rental -43.80% rental cars -46.30% car rentals -40.60% cheap car rentals -48.00% car rentals cheapest rates -52.20% rent a car- 40.30% cheap rental cars -45.60% rental car -41.80% car rental deals -49.30% rental cars lowest price -53.90% Flight Queries WoW Change cheap flights .

1y ago

337 Views

Design and development of lift for an automatic car parking system

1. Stacker type car parking system 2. Puzzle type car parking system 3. Level type car parking system 4. Chess type car parking system 5. Rotary type car parking system 6. Tower type car parking system But lift is used only in tower type car parking system. Objectives:-

6m ago

172 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Car Insurance This booklet covers:Car Rapid Bonus Business

Car Insurance This booklet covers:Car Rapid Bonus Business RAC Direct Insurance is a trading name of London and Edinburgh Insurance Company Limited. Registered in England No 924430. Registered Office: 8 Surrey Street, Norwich NR1 3NG. Member of the Aviva Group. Authorised and regulated by the Financial Services Authority. RAC052(V27)-1971-06.06 .

1y ago

218 Views

Root Insurance (ROOT) - Citron Research

Root Insurance (ROOT) Leveling the Playing Field of Car Insurance What every trader needs to know about one of the mostheavily shorted stocks in the market Traditional Credit-Based Car Insurance PerpetuatesEconomic and Racial Inequalities as one in three American cannot affordessentials because of car insurance premiums

1y ago

209 Views

-xglfldo:Dwfk Xjxvw Wkurxjk)2,

Affordable Care Act - insurance comparison, cheapest insurance, cheap health insurance NJ, cheapest insurance company Priority One High Volume - Washington state health insurance plans, affordable health insurance The best performing ad copy included those that made specific reference to finding "health insurance" for

1y ago

259 Views

The Pricing of Group Life Insurance Schemes - Actuaries

Thus, in comparison to individual life insurance, group life insurance is more cost-effective per thousand of rupees insurance cover. 2. General Characteristics of Group Life Insurance Group life insurance, within certain restrictions and conditions, provides insurance to members of a group without requiring evidence of insurability. There is a .

1y ago

173 Views

NK-ID 0192-8365-3702-0D3E - Car-O-Liner

CAR-O-DATA. 4. The vast majority of vehicles on the road today can be found in Car-O-Liner's database. Your . Car-O-Tronic. is delivered with a 14-day trial . Car-O-Data Vision2. subscription. Car-O-Data. is available with different subscription periods and database. 4. Check all options with our distributors. SOFTWARE PART. NO. Vision2 X1 .

3y ago

321 Views

Accelerating NoSQL

It looks like you're using an ad-blocker