Wednesday, January 19, 2022

Apache Cassandra 4.0 finally arrives

Although it took a long time, Cassandra 4.0 has finally arrived.

It took quite a while. The Apache Software Foundation released Cassandra 4.0 beta over a decade ago. Developers were eager for this NoSQL database. It took longer than most expected. Then at the 11th hour, a nasty bug was found, which further delayed Cassandra’s release for a few days. Apache Cassandra 4.0 finally arrived, and is now ready to rip through your petabytes.

SEE: Electronic Data Disposal Policy(TechRepublic Premium).

It’s high-time you met this open-source, NoSQL NoSQL database. Like all NoSQL databases, Cassandra’s designed to analyze huge—I opened by saying petabytes remembe—amounts of semi-structured data. Cassandra’s goal is to store massive amounts of data, with more than a million writes per second, and be able to quickly access this data in an scalable and reliable way.

Cassandra is the database of record used by some of the most critical applications around the globe, including those from companies like Apple, DataStax or Netflix. It stores data that can be used for everything, from finance to healthcare, and everything in between. Therefore, it must be as accurate and reliable as possible. Cassandra Project Management Committee voted: “The overarching objective of the 4.0 version is that Cassandra 4.0 would be in a state where major users could run it in production when it was cut.”

Cassandra’s crew custom-built data correction tools to make this possible. These tools covered: 

  • Property-based/fuzz testing
  • Replay testing
  • Upgrade/diff Testing
  • Performance
  • Fault injection
  • Unit/test coverage expansion

This was not an easy task, and it took much longer than expected. The developers also encountered many hiccups on the way. However, they now believe the code is complete and ready for use. Cassandra 4.0 has been used by many large businesses. This was the original idea.

Cassandra will not be taking too long to release its next version, according to the future. Cassandra 3.0 was released in 2006, while 4.0 beta was released in 2013. However, Cassandra plans to release its next version within six months. Six months will elapse between dot releases and twelve between major releases. 

SEE: Snowflake data platform platform (free PDF) (TechRepublic)

Cassandra 4 has many improvements. This includes Java 11 long term support as well as Java 8. Java Development Kit 11 is not supported for production.

Cassandra now includes Audit Logging. You can also set configurable heap memory limits and disk space limits to avoid out-of-memory issues. All database activity is recorded per node as file records to a local filesystem directory. 

Cassandra now supports live query logging. You can again set configurable heap memory and disk space limits to prevent out-of–memory errors. It can be used for live traffic capture and replay, as well as debugging queries traffic and migration.

Cassandra’s Zero Copy streaming allows you to stream data five times faster between clusters. This means that users in the real world will have five times faster recovery times when there are issues. This will result in a lower total cost of ownership as you’ll use less cloud, server, and network resources. 

Cassandra’s programmers have promised that this version will be the most stable ever. I believe them. They spend a lot of effort to improve Cassandra’s performance and stability. We’ll soon find out if my faith has been justified. There are no problems with the loads Cassandra handles every day for each of its customers.

Also, see

    Leave a Reply