database - How to validate success of cassandra version upgrade and cross datacenter backup -


here production cassandra cluster 1 datacenter of 3 hosts. version 1.0.7. want upgrade 1.0.7 2.1.8 , add cassandra data center 3 hosts of version 2.1.8.

i have experimented on test cluster , can upgrade cluster without errors. still worry there data loss or modified. want design quick method validate following 2 points.

  1. are there data losses or damages when cluster upgraded 1.0.7 2.1.8?

  2. i add data center in cluster , alter keyspace strategy networktopologystrategy 2 replicas each data center. how validate 2 data centers holding same replicas?

there 10g rows in current clusters. tedious match rows. there better way validate points above? or can trust cassandra itself.

i'm not sure it's practical (or necessary) in cases check every row of data.

i'd before , after checks of things this:

  1. spot check selected subset of rows. if of them correct, of them are.
  2. compare data sizes before , after upgrade make sure in same ballpark.
  3. monitor upgrade process errors (which you're doing).
  4. run full repairs on nodes after upgrade , see if there unusual amount of data movement suggesting nodes not populated.

Comments

Popular posts from this blog

c - Bitwise operation with (signed) enum value -

xslt - Unnest parent nodes by child node -

YouTubePlayerFragment cannot be cast to android.support.v4.app.Fragment -