src/main/asciidoc/_chapters/compression.adoc

   1 ////
   2 /**
   3  *
   4  * Licensed to the Apache Software Foundation (ASF) under one
   5  * or more contributor license agreements.  See the NOTICE file
   6  * distributed with this work for additional information
   7  * regarding copyright ownership.  The ASF licenses this file
   8  * to you under the Apache License, Version 2.0 (the
   9  * "License"); you may not use this file except in compliance
  10  * with the License.  You may obtain a copy of the License at
  11  *
  12  *     http://www.apache.org/licenses/LICENSE-2.0
  13  *
  14  * Unless required by applicable law or agreed to in writing, software
  15  * distributed under the License is distributed on an "AS IS" BASIS,
  16  * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17  * See the License for the specific language governing permissions and
  18  * limitations under the License.
  19  */
  20 ////
  21
  22 [appendix]
  23 [[compression]]
  24 == Compression and Data Block Encoding In HBase(((Compression,Data BlockEncoding)))
  25 :doctype: book
  26 :numbered:
  27 :toc: left
  28 :icons: font
  29 :experimental:
  30
  31 NOTE: Codecs mentioned in this section are for encoding and decoding data blocks or row keys.
  32 For information about replication codecs, see <<cluster.replication.preserving.tags,cluster.replication.preserving.tags>>.
  33
  34 Some of the information in this section is pulled from a link:http://search-hadoop.com/m/lL12B1PFVhp1/v=threaded[discussion] on the HBase Development mailing list.
  35
  36 HBase supports several different compression algorithms which can be enabled on a ColumnFamily.
  37 Data block encoding attempts to limit duplication of information in keys, taking advantage of some of the fundamental designs and patterns of HBase, such as sorted row keys and the schema of a given table.
  38 Compressors reduce the size of large, opaque byte arrays in cells, and can significantly reduce the storage space needed to store uncompressed data.
  39
  40 Compressors and data block encoding can be used together on the same ColumnFamily.
  41
  42 .Changes Take Effect Upon Compaction
  43 If you change compression or encoding for a ColumnFamily, the changes take effect during compaction.
  44
  45 Some codecs take advantage of capabilities built into Java, such as GZip compression. Others rely on native libraries. Native libraries may be available as part of Hadoop, such as LZ4. In this case, HBase only needs access to the appropriate shared library.
  46
  47 Other codecs, such as Google Snappy, need to be installed first.
  48 Some codecs are licensed in ways that conflict with HBase's license and cannot be shipped as part of HBase.
  49
  50 This section discusses common codecs that are used and tested with HBase.
  51 No matter what codec you use, be sure to test that it is installed correctly and is available on all nodes in your cluster.
  52 Extra operational steps may be necessary to be sure that codecs are available on newly-deployed nodes.
  53 You can use the <<compression.test,compression.test>> utility to check that a given codec is correctly installed.
  54
  55 To configure HBase to use a compressor, see <<compressor.install,compressor.install>>.
  56 To enable a compressor for a ColumnFamily, see <<changing.compression,changing.compression>>.
  57 To enable data block encoding for a ColumnFamily, see <<data.block.encoding.enable,data.block.encoding.enable>>.
  58
  59 .Block Compressors
  60 * none
  61 * Snappy
  62 * LZO
  63 * LZ4
  64 * GZ
  65
  66 .Data Block Encoding Types
  67 Prefix::
  68   Often, keys are very similar. Specifically, keys often share a common prefix and only differ near the end. For instance, one key might be `RowKey:Family:Qualifier0` and the next key might be `RowKey:Family:Qualifier1`.
  69   +
  70 In Prefix encoding, an extra column is added which holds the length of the prefix shared between the current key and the previous key.
  71 Assuming the first key here is totally different from the key before, its prefix length is 0.
  72 +
  73 The second key's prefix length is `23`, since they have the first 23 characters in common.
  74 +
  75 Obviously if the keys tend to have nothing in common, Prefix will not provide much benefit.
  76 +
  77 The following image shows a hypothetical ColumnFamily with no data block encoding.
  78 +
  79 .ColumnFamily with No Encoding
  80 image::data_block_no_encoding.png[]
  81 +
  82 Here is the same data with prefix data encoding.
  83 +
  84 .ColumnFamily with Prefix Encoding
  85 image::data_block_prefix_encoding.png[]
  86
  87 Diff::
  88   Diff encoding expands upon Prefix encoding.
  89   Instead of considering the key sequentially as a monolithic series of bytes, each key field is split so that each part of the key can be compressed more efficiently.
  90 +
  91 Two new fields are added: timestamp and type.
  92 +
  93 If the ColumnFamily is the same as the previous row, it is omitted from the current row.
  94 +
  95 If the key length, value length or type are the same as the previous row, the field is omitted.
  96 +
  97 In addition, for increased compression, the timestamp is stored as a Diff from the previous row's timestamp, rather than being stored in full.
  98 Given the two row keys in the Prefix example, and given an exact match on timestamp and the same type, neither the value length, or type needs to be stored for the second row, and the timestamp value for the second row is just 0, rather than a full timestamp.
  99 +
 100 Diff encoding is disabled by default because writing and scanning are slower but more data is cached.
 101 +
 102 This image shows the same ColumnFamily from the previous images, with Diff encoding.
 103 +
 104 .ColumnFamily with Diff Encoding
 105 image::data_block_diff_encoding.png[]
 106
 107 Fast Diff::
 108   Fast Diff works similar to Diff, but uses a faster implementation. It also adds another field which stores a single bit to track whether the data itself is the same as the previous row. If it is, the data is not stored again.
 109 +
 110 Fast Diff is the recommended codec to use if you have long keys or many columns.
 111 +
 112 The data format is nearly identical to Diff encoding, so there is not an image to illustrate it.
 113
 114
 115 Prefix Tree::
 116   Prefix tree encoding was introduced as an experimental feature in HBase 0.96.
 117   It provides similar memory savings to the Prefix, Diff, and Fast Diff encoder, but provides faster random access at a cost of slower encoding speed.
 118   It was removed in hbase-2.0.0. It was a good idea but little uptake. If interested in reviving this effort, write the hbase dev list.
 119
 120 [[data.block.encoding.types]]
 121 === Which Compressor or Data Block Encoder To Use
 122
 123 The compression or codec type to use depends on the characteristics of your data. Choosing the wrong type could cause your data to take more space rather than less, and can have performance implications.
 124
 125 In general, you need to weigh your options between smaller size and faster compression/decompression. Following are some general guidelines, expanded from a discussion at link:http://search-hadoop.com/m/lL12B1PFVhp1[Documenting Guidance on compression and codecs].
 126
 127 * If you have long keys (compared to the values) or many columns, use a prefix encoder.
 128   FAST_DIFF is recommended, as more testing is needed for Prefix Tree encoding.
 129 * If the values are large (and not precompressed, such as images), use a data block compressor.
 130 * Use GZIP for [firstterm]_cold data_, which is accessed infrequently.
 131   GZIP compression uses more CPU resources than Snappy or LZO, but provides a higher compression ratio.
 132 * Use Snappy or LZO for [firstterm]_hot data_, which is accessed frequently.
 133   Snappy and LZO use fewer CPU resources than GZIP, but do not provide as high of a compression ratio.
 134 * In most cases, enabling Snappy or LZO by default is a good choice, because they have a low performance overhead and provide space savings.
 135 * Before Snappy became available by Google in 2011, LZO was the default.
 136   Snappy has similar qualities as LZO but has been shown to perform better.
 137
 138 [[hadoop.native.lib]]
 139 === Making use of Hadoop Native Libraries in HBase
 140
 141 The Hadoop shared library has a bunch of facility including compression libraries and fast crc'ing -- hardware crc'ing if your chipset supports it.
 142 To make this facility available to HBase, do the following. HBase/Hadoop will fall back to use alternatives if it cannot find the native library
 143 versions -- or fail outright if you asking for an explicit compressor and there is no alternative available.
 144
 145 First make sure of your Hadoop. Fix this message if you are seeing it starting Hadoop processes:
 146 ----
 147 16/02/09 22:40:24 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
 148 ----
 149 It means is not properly pointing at its native libraries or the native libs were compiled for another platform.
 150 Fix this first.
 151
 152 Then if you see the following in your HBase logs, you know that HBase was unable to locate the Hadoop native libraries:
 153 [source]
 154 ----
 155 2014-08-07 09:26:20,139 WARN  [main] util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
 156 ----
 157 If the libraries loaded successfully, the WARN message does not show. Usually this means you are good to go but read on.
 158
 159 Let's presume your Hadoop shipped with a native library that suits the platform you are running HBase on.
 160 To check if the Hadoop native library is available to HBase, run the following tool (available in  Hadoop 2.1 and greater):
 161 [source]
 162 ----
 163 $ ./bin/hbase --config ~/conf_hbase org.apache.hadoop.util.NativeLibraryChecker
 164 2014-08-26 13:15:38,717 WARN  [main] util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
 165 Native library checking:
 166 hadoop: false
 167 zlib:   false
 168 snappy: false
 169 lz4:    false
 170 bzip2:  false
 171 2014-08-26 13:15:38,863 INFO  [main] util.ExitUtil: Exiting with status 1
 172 ----
 173 Above shows that the native hadoop library is not available in HBase context.
 174
 175 The above NativeLibraryChecker tool may come back saying all is hunky-dory
 176 -- i.e. all libs show 'true', that they are available -- but follow the below
 177 presecription anyways to ensure the native libs are available in HBase context,
 178 when it goes to use them.
 179
 180 To fix the above, either copy the Hadoop native libraries local or symlink to them if the Hadoop and HBase stalls are adjacent in the filesystem.
 181 You could also point at their location by setting the `LD_LIBRARY_PATH` environment variable in your hbase-env.sh.
 182
 183 Where the JVM looks to find native libraries is "system dependent" (See `java.lang.System#loadLibrary(name)`). On linux, by default, is going to look in _lib/native/PLATFORM_ where `PLATFORM`      is the label for the platform your HBase is installed on.
 184 On a local linux machine, it seems to be the concatenation of the java properties `os.name` and `os.arch` followed by whether 32 or 64 bit.
 185 HBase on startup prints out all of the java system properties so find the os.name and os.arch in the log.
 186 For example:
 187 [source]
 188 ----
 189 ...
 190 2014-08-06 15:27:22,853 INFO  [main] zookeeper.ZooKeeper: Client environment:os.name=Linux
 191 2014-08-06 15:27:22,853 INFO  [main] zookeeper.ZooKeeper: Client environment:os.arch=amd64
 192 ...
 193 ----
 194 So in this case, the PLATFORM string is `Linux-amd64-64`.
 195 Copying the Hadoop native libraries or symlinking at _lib/native/Linux-amd64-64_     will ensure they are found.
 196 Rolling restart after you have made this change.
 197
 198 Here is an example of how you would set up the symlinks.
 199 Let the hadoop and hbase installs be in your home directory. Assume your hadoop native libs
 200 are at ~/hadoop/lib/native. Assume you are on a Linux-amd64-64 platform. In this case,
 201 you would do the following to link the hadoop native lib so hbase could find them.
 202 ----
 203 ...
 204 $ mkdir -p ~/hbaseLinux-amd64-64 -> /home/stack/hadoop/lib/native/lib/native/
 205 $ cd ~/hbase/lib/native/
 206 $ ln -s ~/hadoop/lib/native Linux-amd64-64
 207 $ ls -la
 208 # Linux-amd64-64 -> /home/USER/hadoop/lib/native
 209 ...
 210 ----
 211
 212 If you see PureJavaCrc32C in a stack track or if you see something like the below in a perf trace, then native is not working; you are using the java CRC functions rather than native:
 213 ----
 214   5.02%  perf-53601.map      [.] Lorg/apache/hadoop/util/PureJavaCrc32C;.update
 215 ----
 216 See link:https://issues.apache.org/jira/browse/HBASE-11927[HBASE-11927 Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)],
 217 for more on native checksumming support. See in particular the release note for how to check if your hardware to see if your processor has support for hardware CRCs.
 218 Or checkout the Apache link:https://blogs.apache.org/hbase/entry/saving_cpu_using_native_hadoop[Checksums in HBase] blog post.
 219
 220 Here is example of how to point at the Hadoop libs with `LD_LIBRARY_PATH`      environment variable:
 221 [source]
 222 ----
 223 $ LD_LIBRARY_PATH=~/hadoop-2.5.0-SNAPSHOT/lib/native ./bin/hbase --config ~/conf_hbase org.apache.hadoop.util.NativeLibraryChecker
 224 2014-08-26 13:42:49,332 INFO  [main] bzip2.Bzip2Factory: Successfully loaded & initialized native-bzip2 library system-native
 225 2014-08-26 13:42:49,337 INFO  [main] zlib.ZlibFactory: Successfully loaded & initialized native-zlib library
 226 Native library checking:
 227 hadoop: true /home/stack/hadoop-2.5.0-SNAPSHOT/lib/native/libhadoop.so.1.0.0
 228 zlib:   true /lib64/libz.so.1
 229 snappy: true /usr/lib64/libsnappy.so.1
 230 lz4:    true revision:99
 231 bzip2:  true /lib64/libbz2.so.1
 232 ----
 233 Set in _hbase-env.sh_ the LD_LIBRARY_PATH environment variable when starting your HBase.
 234
 235 === Compressor Configuration, Installation, and Use
 236
 237 [[compressor.install]]
 238 ==== Configure HBase For Compressors
 239
 240 Before HBase can use a given compressor, its libraries need to be available.
 241 Due to licensing issues, only GZ compression is available to HBase (via native Java libraries) in a default installation.
 242 Other compression libraries are available via the shared library bundled with your hadoop.
 243 The hadoop native library needs to be findable when HBase starts.
 244 See
 245
 246 .Compressor Support On the Master
 247
 248 A new configuration setting was introduced in HBase 0.95, to check the Master to determine which data block encoders are installed and configured on it, and assume that the entire cluster is configured the same.
 249 This option, `hbase.master.check.compression`, defaults to `true`.
 250 This prevents the situation described in link:https://issues.apache.org/jira/browse/HBASE-6370[HBASE-6370], where a table is created or modified to support a codec that a region server does not support, leading to failures that take a long time to occur and are difficult to debug.
 251
 252 If `hbase.master.check.compression` is enabled, libraries for all desired compressors need to be installed and configured on the Master, even if the Master does not run a region server.
 253
 254 .Install GZ Support Via Native Libraries
 255
 256 HBase uses Java's built-in GZip support unless the native Hadoop libraries are available on the CLASSPATH.
 257 The recommended way to add libraries to the CLASSPATH is to set the environment variable `HBASE_LIBRARY_PATH` for the user running HBase.
 258 If native libraries are not available and Java's GZIP is used, `Got brand-new compressor` reports will be present in the logs.
 259 See <<brand.new.compressor,brand.new.compressor>>).
 260
 261 [[lzo.compression]]
 262 .Install LZO Support
 263
 264 HBase cannot ship with LZO because of incompatibility between HBase, which uses an Apache Software License (ASL) and LZO, which uses a GPL license.
 265 See the link:https://github.com/twitter/hadoop-lzo/blob/master/README.md[Hadoop-LZO at Twitter] for information on configuring LZO support for HBase.
 266
 267 If you depend upon LZO compression, consider configuring your RegionServers to fail to start if LZO is not available.
 268 See <<hbase.regionserver.codecs,hbase.regionserver.codecs>>.
 269
 270 [[lz4.compression]]
 271 .Configure LZ4 Support
 272
 273 LZ4 support is bundled with Hadoop.
 274 Make sure the hadoop shared library (libhadoop.so) is accessible when you start HBase.
 275 After configuring your platform (see <<hadoop.native.lib,hadoop.native.lib>>), you can make a symbolic link from HBase to the native Hadoop libraries.
 276 This assumes the two software installs are colocated.
 277 For example, if my 'platform' is Linux-amd64-64:
 278 [source,bourne]
 279 ----
 280 $ cd $HBASE_HOME
 281 $ mkdir lib/native
 282 $ ln -s $HADOOP_HOME/lib/native lib/native/Linux-amd64-64
 283 ----
 284 Use the compression tool to check that LZ4 is installed on all nodes.
 285 Start up (or restart) HBase.
 286 Afterward, you can create and alter tables to enable LZ4 as a compression codec.:
 287 ----
 288 hbase(main):003:0> alter 'TestTable', {NAME => 'info', COMPRESSION => 'LZ4'}
 289 ----
 290
 291 [[snappy.compression.installation]]
 292 .Install Snappy Support
 293
 294 HBase does not ship with Snappy support because of licensing issues.
 295 You can install Snappy binaries (for instance, by using +yum install snappy+ on CentOS) or build Snappy from source.
 296 After installing Snappy, search for the shared library, which will be called _libsnappy.so.X_ where X is a number.
 297 If you built from source, copy the shared library to a known location on your system, such as _/opt/snappy/lib/_.
 298
 299 In addition to the Snappy library, HBase also needs access to the Hadoop shared library, which will be called something like _libhadoop.so.X.Y_, where X and Y are both numbers.
 300 Make note of the location of the Hadoop library, or copy it to the same location as the Snappy library.
 301
 302 [NOTE]
 303 ====
 304 The Snappy and Hadoop libraries need to be available on each node of your cluster.
 305 See <<compression.test,compression.test>> to find out how to test that this is the case.
 306
 307 See <<hbase.regionserver.codecs,hbase.regionserver.codecs>> to configure your RegionServers to fail to start if a given compressor is not available.
 308 ====
 309
 310 Each of these library locations need to be added to the environment variable `HBASE_LIBRARY_PATH` for the operating system user that runs HBase.
 311 You need to restart the RegionServer for the changes to take effect.
 312
 313 [[compression.test]]
 314 .CompressionTest
 315
 316 You can use the CompressionTest tool to verify that your compressor is available to HBase:
 317
 318 ----
 319
 320  $ hbase org.apache.hadoop.hbase.util.CompressionTest hdfs://host/path/to/hbase snappy
 321 ----
 322
 323 [[hbase.regionserver.codecs]]
 324 .Enforce Compression Settings On a RegionServer
 325
 326 You can configure a RegionServer so that it will fail to restart if compression is configured incorrectly, by adding the option hbase.regionserver.codecs to the _hbase-site.xml_, and setting its value to a comma-separated list of codecs that need to be available.
 327 For example, if you set this property to `lzo,gz`, the RegionServer would fail to start if both compressors were not available.
 328 This would prevent a new server from being added to the cluster without having codecs configured properly.
 329
 330 [[changing.compression]]
 331 ==== Enable Compression On a ColumnFamily
 332
 333 To enable compression for a ColumnFamily, use an `alter` command.
 334 You do not need to re-create the table or copy data.
 335 If you are changing codecs, be sure the old codec is still available until all the old StoreFiles have been compacted.
 336
 337 .Enabling Compression on a ColumnFamily of an Existing Table using HBaseShell
 338 ====
 339 ----
 340
 341 hbase> disable 'test'
 342 hbase> alter 'test', {NAME => 'cf', COMPRESSION => 'GZ'}
 343 hbase> enable 'test'
 344 ----
 345 ====
 346
 347 .Creating a New Table with Compression On a ColumnFamily
 348 ====
 349 ----
 350
 351 hbase> create 'test2', { NAME => 'cf2', COMPRESSION => 'SNAPPY' }
 352 ----
 353 ====
 354
 355 .Verifying a ColumnFamily's Compression Settings
 356 ====
 357 ----
 358
 359 hbase> describe 'test'
 360 DESCRIPTION                                          ENABLED
 361  'test', {NAME => 'cf', DATA_BLOCK_ENCODING => 'NONE false
 362  ', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0',
 363  VERSIONS => '1', COMPRESSION => 'GZ', MIN_VERSIONS
 364  => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS => 'fa
 365  lse', BLOCKSIZE => '65536', IN_MEMORY => 'false', B
 366  LOCKCACHE => 'true'}
 367 1 row(s) in 0.1070 seconds
 368 ----
 369 ====
 370
 371 ==== Testing Compression Performance
 372
 373 HBase includes a tool called LoadTestTool which provides mechanisms to test your compression performance.
 374 You must specify either `-write` or `-update-read` as your first parameter, and if you do not specify another parameter, usage advice is printed for each option.
 375
 376 .+LoadTestTool+ Usage
 377 ====
 378 ----
 379
 380 $ bin/hbase org.apache.hadoop.hbase.util.LoadTestTool -h
 381 usage: bin/hbase org.apache.hadoop.hbase.util.LoadTestTool <options>
 382 Options:
 383  -batchupdate                 Whether to use batch as opposed to separate
 384                               updates for every column in a row
 385  -bloom <arg>                 Bloom filter type, one of [NONE, ROW, ROWCOL]
 386  -compression <arg>           Compression type, one of [LZO, GZ, NONE, SNAPPY,
 387                               LZ4]
 388  -data_block_encoding <arg>   Encoding algorithm (e.g. prefix compression) to
 389                               use for data blocks in the test column family, one
 390                               of [NONE, PREFIX, DIFF, FAST_DIFF, PREFIX_TREE].
 391  -encryption <arg>            Enables transparent encryption on the test table,
 392                               one of [AES]
 393  -generator <arg>             The class which generates load for the tool. Any
 394                               args for this class can be passed as colon
 395                               separated after class name
 396  -h,--help                    Show usage
 397  -in_memory                   Tries to keep the HFiles of the CF inmemory as far
 398                               as possible.  Not guaranteed that reads are always
 399                               served from inmemory
 400  -init_only                   Initialize the test table only, don't do any
 401                               loading
 402  -key_window <arg>            The 'key window' to maintain between reads and
 403                               writes for concurrent write/read workload. The
 404                               default is 0.
 405  -max_read_errors <arg>       The maximum number of read errors to tolerate
 406                               before terminating all reader threads. The default
 407                               is 10.
 408  -multiput                    Whether to use multi-puts as opposed to separate
 409                               puts for every column in a row
 410  -num_keys <arg>              The number of keys to read/write
 411  -num_tables <arg>            A positive integer number. When a number n is
 412                               speicfied, load test tool  will load n table
 413                               parallely. -tn parameter value becomes table name
 414                               prefix. Each table name is in format
 415                               <tn>_1...<tn>_n
 416  -read <arg>                  <verify_percent>[:<#threads=20>]
 417  -regions_per_server <arg>    A positive integer number. When a number n is
 418                               specified, load test tool will create the test
 419                               table with n regions per server
 420  -skip_init                   Skip the initialization; assume test table already
 421                               exists
 422  -start_key <arg>             The first key to read/write (a 0-based index). The
 423                               default value is 0.
 424  -tn <arg>                    The name of the table to read or write
 425  -update <arg>                <update_percent>[:<#threads=20>][:<#whether to
 426                               ignore nonce collisions=0>]
 427  -write <arg>                 <avg_cols_per_key>:<avg_data_size>[:<#threads=20>]
 428  -zk <arg>                    ZK quorum as comma-separated host names without
 429                               port numbers
 430  -zk_root <arg>               name of parent znode in zookeeper
 431 ----
 432 ====
 433
 434 .Example Usage of LoadTestTool
 435 ====
 436 ----
 437
 438 $ hbase org.apache.hadoop.hbase.util.LoadTestTool -write 1:10:100 -num_keys 1000000
 439           -read 100:30 -num_tables 1 -data_block_encoding NONE -tn load_test_tool_NONE
 440 ----
 441 ====
 442
 443 [[data.block.encoding.enable]]
 444 == Enable Data Block Encoding
 445
 446 Codecs are built into HBase so no extra configuration is needed.
 447 Codecs are enabled on a table by setting the `DATA_BLOCK_ENCODING` property.
 448 Disable the table before altering its DATA_BLOCK_ENCODING setting.
 449 Following is an example using HBase Shell:
 450
 451 .Enable Data Block Encoding On a Table
 452 ====
 453 ----
 454
 455 hbase>  disable 'test'
 456 hbase> alter 'test', { NAME => 'cf', DATA_BLOCK_ENCODING => 'FAST_DIFF' }
 457 Updating all regions with the new schema...
 458 0/1 regions updated.
 459 1/1 regions updated.
 460 Done.
 461 0 row(s) in 2.2820 seconds
 462 hbase> enable 'test'
 463 0 row(s) in 0.1580 seconds
 464 ----
 465 ====
 466
 467 .Verifying a ColumnFamily's Data Block Encoding
 468 ====
 469 ----
 470
 471 hbase> describe 'test'
 472 DESCRIPTION                                          ENABLED
 473  'test', {NAME => 'cf', DATA_BLOCK_ENCODING => 'FAST true
 474  _DIFF', BLOOMFILTER => 'ROW', REPLICATION_SCOPE =>
 475  '0', VERSIONS => '1', COMPRESSION => 'GZ', MIN_VERS
 476  IONS => '0', TTL => 'FOREVER', KEEP_DELETED_CELLS =
 477  > 'false', BLOCKSIZE => '65536', IN_MEMORY => 'fals
 478  e', BLOCKCACHE => 'true'}
 479 1 row(s) in 0.0650 seconds
 480 ----
 481 ====
 482
 483 :numbered:
 484
 485 ifdef::backend-docbook[]
 486 [index]
 487 == Index
 488 // Generated automatically by the DocBook toolchain.
 489 endif::backend-docbook[]