The data in the commit log is flushed to disk periodically, every 10 seconds by default (controlled by commitlog_sync_period_in_ms). If all replicas crash within those 10 seconds, will I lose all of that data? Does this mean that, theoretically, a Cassandra cluster can lose data?
Cassandra is configured to lose 10 seconds of data by default?
Asked by Aliaksandr Kazlou
If a node crashed right before updating the commit log on disk, then yes, you could lose up to ten seconds of data.
If you keep multiple replicas, by using a replication factor higher than 1 or by running multiple data centers, then much of the lost data will still exist on other nodes and will be restored to the crashed node when it is repaired.
Also, the commit log may be written in less than ten seconds if the write volume is high enough to hit size limits before the ten seconds are up.
If you want more durability than this (at the cost of higher latency), you can change the commitlog_sync setting from periodic to batch. In batch mode it uses the commitlog_sync_batch_window_in_ms setting to control how often batches of writes are written to disk, and writes are not acked until they have been written to disk.
The ten second default for periodic mode is designed for spinning disks: they are slow enough that blocking acks while waiting for commit log writes causes a performance hit. For this reason, if you use batch mode, a dedicated disk for the commit log is recommended, so that the write head doesn't need to do any seeks and the added latency stays as low as possible.
If you are using SSDs, then you can use more aggressive timing, since the latency is greatly reduced compared to a spinning disk.
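As a rough sketch, the two modes would look something like this in cassandra.yaml. The setting names are the ones mentioned above; the batch window value of 2 ms is only an illustrative assumption, not a tuned recommendation, and newer Cassandra releases may name or handle these options differently, so check the cassandra.yaml shipped with your version.

```yaml
# Default behaviour: writes are acked immediately and the commit log
# is synced to disk every 10 seconds.
commitlog_sync: periodic
commitlog_sync_period_in_ms: 10000

# More durable alternative (use instead of the two lines above):
# writes are not acked until the commit log has been synced to disk.
# The 2 ms window is an assumed example value.
# commitlog_sync: batch
# commitlog_sync_batch_window_in_ms: 2
```

Only one commitlog_sync mode can be active at a time, and a change to it typically requires restarting the node to take effect.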