RELEASENOTES.md

   1 # RELEASENOTES
   2
   3 <!---
   4 # Licensed to the Apache Software Foundation (ASF) under one
   5 # or more contributor license agreements.  See the NOTICE file
   6 # distributed with this work for additional information
   7 # regarding copyright ownership.  The ASF licenses this file
   8 # to you under the Apache License, Version 2.0 (the
   9 # "License"); you may not use this file except in compliance
  10 # with the License.  You may obtain a copy of the License at
  11 #
  12 #     http://www.apache.org/licenses/LICENSE-2.0
  13 #
  14 # Unless required by applicable law or agreed to in writing, software
  15 # distributed under the License is distributed on an "AS IS" BASIS,
  16 # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  17 # See the License for the specific language governing permissions and
  18 # limitations under the License.
  19
  20 # Be careful doing manual edits in this file. Do not change format
  21 # of release header or remove the below marker. This file is generated.
  22 # DO NOT REMOVE THIS MARKER; FOR INTERPOLATING CHANGES!-->
  23 # HBASE  2.4.8 Release Notes
  24
  25 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  26
  27
  28 ---
  29
  30 * [HBASE-26362](https://issues.apache.org/jira/browse/HBASE-26362) | *Major* | **Upload mvn site artifacts for nightly build to nightlies**
  31
  32 Now we will upload the site artifacts to nightlies for nightly build as well as pre commit build.
  33
  34
  35 ---
  36
  37 * [HBASE-26329](https://issues.apache.org/jira/browse/HBASE-26329) | *Major* | **Upgrade commons-io to 2.11.0**
  38
  39 Upgraded commons-io to 2.11.0.
  40
  41
  42 ---
  43
  44 * [HBASE-26186](https://issues.apache.org/jira/browse/HBASE-26186) | *Major* | **jenkins script for caching artifacts should verify cached file before relying on it**
  45
  46 Add a '--verify-tar-gz' option to cache-apache-project-artifact.sh for verifying whether the cached file can be parsed as a gzipped tarball.
  47 Use this option in our nightly job to avoid failures on broken cached hadoop tarballs.
  48
  49
  50 ---
  51
  52 * [HBASE-26339](https://issues.apache.org/jira/browse/HBASE-26339) | *Major* | **SshPublisher will skip uploading artifacts if the build is failure**
  53
  54 Now we will mark build as unstable instead of failure when the yetus script returns error. This is used to solve the problem that the SshPublisher jenkins plugin will skip uploading artifacts if the build is marked as failure. In fact, the test output will be more important when there are UT failures.
  55
  56
  57
  58 # HBASE  2.4.7 Release Notes
  59
  60 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  61
  62
  63 ---
  64
  65 * [HBASE-26274](https://issues.apache.org/jira/browse/HBASE-26274) | *Major* | **Create an option to reintroduce BlockCache to mapreduce job**
  66
  67 Introduce \`hfile.onheap.block.cache.fixed.size\` and default to disable. When using ClientSideRegionScanner, it will be enabled with a fixed size for caching INDEX/LEAF\_INDEX block when a client, e.g. snapshot scanner, scans the entire HFile and does not need to seek/reseek to index block multiple times.
  68
  69
  70 ---
  71
  72 * [HBASE-26270](https://issues.apache.org/jira/browse/HBASE-26270) | *Minor* | **Provide getConfiguration method for Region and Store interface**
  73
  74 Provide 'getReadOnlyConfiguration' method for Store and Region interface
  75
  76
  77 ---
  78
  79 * [HBASE-26273](https://issues.apache.org/jira/browse/HBASE-26273) | *Major* | **TableSnapshotInputFormat/TableSnapshotInputFormatImpl should use ReadType.STREAM for scanning HFiles**
  80
  81 HBase's MapReduce API which can operate over HBase snapshots will now default to using ReadType.STREAM instead of ReadType.DEFAULT (which is PREAD) as a result of this change. HBase developers expect that STREAM will perform significantly better for average Snapshot-based batch jobs. Users can restore the previous functionality (using PREAD) by updating their code to explicitly set a value of \`ReadType.PREAD\` on the \`Scan\` object they provide to TableSnapshotInputFormat, or by setting the configuration property "hbase.TableSnapshotInputFormat.scanner.readtype" to "PREAD" in hbase-site.xml.
  82
  83
  84 ---
  85
  86 * [HBASE-26276](https://issues.apache.org/jira/browse/HBASE-26276) | *Major* | **Allow HashTable/SyncTable to perform rawScan when comparing cells**
  87
  88 Added --rawScan option to HashTable job, which allows HashTable/SyncTable to perform raw scans. If this property is omitted, it defaults to false. When used together with --versions set to a high value, SyncTable will fabricate delete markers to all old versions still hanging (not cleaned yet by major compaction), avoiding the inconsistencies reported in HBASE-21596.
  89
  90
  91
  92 # HBASE  2.4.6 Release Notes
  93
  94 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
  95
  96
  97 ---
  98
  99 * [HBASE-26204](https://issues.apache.org/jira/browse/HBASE-26204) | *Major* | **VerifyReplication should obtain token for peerQuorumAddress too**
 100
 101 VerifyReplication obtains tokens even if the peer quorum parameter is used. VerifyReplication with peer quorum can be used for secure clusters also.
 102
 103
 104 ---
 105
 106 * [HBASE-24652](https://issues.apache.org/jira/browse/HBASE-24652) | *Minor* | **master-status UI make date type fields sortable**
 107
 108 Makes RegionServer 'Start time' sortable in the Master UI
 109
 110
 111 ---
 112
 113 * [HBASE-26200](https://issues.apache.org/jira/browse/HBASE-26200) | *Major* | **Undo 'HBASE-25165 Change 'State time' in UI so sorts (#2508)' in favor of HBASE-24652**
 114
 115 Undid showing RegionServer 'Start time' in ISO-8601 format. Revert.
 116
 117
 118 ---
 119
 120 * [HBASE-6908](https://issues.apache.org/jira/browse/HBASE-6908) | *Major* | **Pluggable Call BlockingQueue for HBaseServer**
 121
 122 Can pass in a FQCN to load as the call queue implementation.
 123
 124 Standardized arguments to the constructor are the max queue length, the PriorityFunction, and the Configuration.
 125
 126 PluggableBlockingQueue abstract class provided to help guide the correct constructor signature.
 127
 128 Hard fails with PluggableRpcQueueNotFound if the class fails to load as a BlockingQueue\<CallRunner\>
 129
 130 Upstreaming on behalf of Hubspot, we are interested in defining our own custom RPC queue and don't want to get involved in necessarily upstreaming internal requirements/iterations.
 131
 132
 133 ---
 134
 135 * [HBASE-26196](https://issues.apache.org/jira/browse/HBASE-26196) | *Major* | **Support configuration override for remote cluster of HFileOutputFormat locality sensitive**
 136
 137 Allow any configuration for the remote cluster in HFileOutputFormat2 that could be useful the different configuration from the job's configuration is necessary to connect the remote cluster, for instance, non-secure vs secure.
 138
 139
 140 ---
 141
 142 * [HBASE-26160](https://issues.apache.org/jira/browse/HBASE-26160) | *Minor* | **Configurable disallowlist for live editing of loglevels**
 143
 144 Adds a new hbase.ui.logLevels.readonly.loggers config which takes a comma-separated list of logger names. Similar to log4j configurations, the logger names can be prefixes or a full logger name. The log level of read only loggers cannot be changed via the logLevel UI or setlevel CLI. This is useful for securing sensitive loggers, such as the SecurityLogger used for audit logs.
 145
 146
 147 ---
 148
 149 * [HBASE-26154](https://issues.apache.org/jira/browse/HBASE-26154) | *Minor* | **Provide exception metric for quota exceeded and throttling**
 150
 151 Adds "exceptions.quotaExceeded" and "exceptions.rpcThrottling" to HBase server and Thrift server metrics.
 152
 153
 154 ---
 155
 156 * [HBASE-26146](https://issues.apache.org/jira/browse/HBASE-26146) | *Minor* | **Allow custom opts for hbck in hbase bin**
 157
 158 Adds HBASE\_HBCK\_OPTS environment variable to bin/hbase for passing extra options to hbck/hbck2. Defaults to HBASE\_SERVER\_JAAS\_OPTS if specified, or HBASE\_REGIONSERVER\_OPTS.
 159
 160
 161
 162 # HBASE  2.4.5 Release Notes
 163
 164 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 165
 166
 167 ---
 168
 169 * [HBASE-26088](https://issues.apache.org/jira/browse/HBASE-26088) | *Critical* | **conn.getBufferedMutator(tableName) leaks thread executors and other problems**
 170
 171 The API doc for Connection#getBufferedMutator(TableName) and Connection#getBufferedMutator(BufferedMutatorParams) mentioned that when user dont pass a ThreadPool to be used, we use the ThreadPool in the Connection.  But in reality, we were creating new ThreadPool in such cases.
 172
 173 We are keeping the behaviour of code as is but corrected the Javadoc and also a bug of not closing this new pool while Closing the BufferedMutator.
 174
 175
 176 ---
 177
 178 * [HBASE-25986](https://issues.apache.org/jira/browse/HBASE-25986) | *Minor* | **Expose the NORMALIZARION\_ENABLED table descriptor through a property in hbase-site**
 179
 180 New config: hbase.table.normalization.enabled
 181
 182 Default value: false
 183
 184 Description: This config is used to set default behaviour of normalizer at table level. To override this at table level one can set NORMALIZATION\_ENABLED at table descriptor level and that property will be honored. Of course, this property at table level can only work if normalizer is enabled at cluster level using "normalizer\_switch true" command.
 185
 186
 187 ---
 188
 189 * [HBASE-22923](https://issues.apache.org/jira/browse/HBASE-22923) | *Major* | **hbase:meta is assigned to localhost when we downgrade the hbase version**
 190
 191 Introduced new config: hbase.min.version.move.system.tables
 192
 193 When the operator uses this configuration option, any version between
 194 the current cluster version and the value of "hbase.min.version.move.system.tables"
 195 does not trigger any auto-region movement. Auto-region movement here
 196 refers to auto-migration of system table regions to newer server versions.
 197 It is assumed that the configured range of versions does not require special
 198 handling of moving system table regions to higher versioned RegionServer.
 199 This auto-migration is done by AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 200 Example: Let's assume the cluster is on version 1.4.0 and we have
 201 set "hbase.min.version.move.system.tables" as "2.0.0". Now if we upgrade
 202 one RegionServer on 1.4.0 cluster to 1.6.0 (\< 2.0.0), then AssignmentManager will
 203 not move hbase:meta, hbase:namespace and other system table regions
 204 to newly brought up RegionServer 1.6.0 as part of auto-migration.
 205 However, if we upgrade one RegionServer on 1.4.0 cluster to 2.2.0 (\> 2.0.0),
 206 then AssignmentManager will move all system table regions to newly brought
 207 up RegionServer 2.2.0 as part of auto-migration done by
 208 AssignmentManager#checkIfShouldMoveSystemRegionAsync().
 209
 210 Overall, assuming we have system RSGroup where we keep HBase system tables, if we use
 211 config "hbase.min.version.move.system.tables" with value x.y.z then while upgrading cluster to
 212 version greater than or equal to x.y.z, the first RegionServer that we upgrade must
 213 belong to system RSGroup only.
 214
 215
 216 ---
 217
 218 * [HBASE-25902](https://issues.apache.org/jira/browse/HBASE-25902) | *Critical* | **Add missing CFs in meta during HBase 1 to 2.3+ Upgrade**
 219
 220 While upgrading cluster from 1.x to 2.3+ versions, after the active master is done setting it's status as 'Initialized', it attempts to add 'table' and 'repl\_barrier' CFs in meta. Once CFs are added successfully, master is aborted with PleaseRestartMasterException because master has missed certain initialization events (e.g ClusterSchemaService is not initialized and tableStateManager fails to migrate table states from ZK to meta due to missing CFs). Subsequent active master initialization is expected to be smooth.
 221 In the presence of multi masters, when one of them becomes active for the first time after upgrading to HBase 2.3+, it is aborted after fixing CFs in meta and one of the other backup masters will take over and become active soon. Hence, overall this is expected to be smooth upgrade if we have backup masters configured. If not, operator is expected to restart same master again manually.
 222
 223
 224 ---
 225
 226 * [HBASE-25877](https://issues.apache.org/jira/browse/HBASE-25877) | *Major* | **Add access  check for compactionSwitch**
 227
 228 Now calling RSRpcService.compactionSwitch, i.e, Admin.compactionSwitch at client side, requires ADMIN permission.
 229 This is an incompatible change but it is also a bug, as we should not allow any users to disable compaction on a regionserver, so we apply this to all active branches.
 230
 231
 232 ---
 233
 234 * [HBASE-25984](https://issues.apache.org/jira/browse/HBASE-25984) | *Critical* | **FSHLog WAL lockup with sync future reuse [RS deadlock]**
 235
 236 Fixes a WAL lockup issue due to premature reuse of the sync futures by the WAL consumers. The lockup causes the WAL system to hang resulting in blocked appends and syncs thus holding up the RPC handlers from progressing. Only workaround without this fix is to force abort the region server.
 237
 238
 239 ---
 240
 241 * [HBASE-25993](https://issues.apache.org/jira/browse/HBASE-25993) | *Major* | **Make excluded SSL cipher suites configurable for all Web UIs**
 242
 243 Add "ssl.server.exclude.cipher.list" configuration to excluded cipher suites for the http server started by the InfoServer.
 244
 245
 246 ---
 247
 248 * [HBASE-25969](https://issues.apache.org/jira/browse/HBASE-25969) | *Major* | **Cleanup netty-all transitive includes**
 249
 250 We have an (old) netty-all in our produced artifacts. It is transitively included from hadoop. It is needed by MiniMRCluster referenced from a few MR tests in hbase. This commit adds netty-all excludes everywhere else but where tests will fail unless the transitive is allowed through. TODO: move MR and/or MR tests out of hbase core.
 251
 252
 253
 254 # HBASE  2.4.4 Release Notes
 255
 256 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 257
 258
 259 ---
 260
 261 * [HBASE-25963](https://issues.apache.org/jira/browse/HBASE-25963) | *Major* | **HBaseCluster should be marked as IA.Public**
 262
 263 Change HBaseCluster to IA.Public as its sub class MiniHBaseCluster is IA.Public.
 264
 265
 266
 267 # HBASE  2.4.3 Release Notes
 268
 269 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 270
 271
 272 ---
 273
 274 * [HBASE-25766](https://issues.apache.org/jira/browse/HBASE-25766) | *Major* | **Introduce RegionSplitRestriction that restricts the pattern of the split point**
 275
 276 After HBASE-25766, we can specify a split restriction, "KeyPrefix" or "DelimitedKeyPrefix", to a table with the "hbase.regionserver.region.split\_restriction.type" property. The "KeyPrefix" split restriction groups rows by a prefix of the row-key. And the "DelimitedKeyPrefix" split restriction groups rows by a prefix of the row-key with a delimiter.
 277
 278 For example:
 279 \`\`\`
 280 # Create a table with a "KeyPrefix" split restriction, where the prefix length is 2 bytes
 281 hbase\> create 'tbl1', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'KeyPrefix', 'hbase.regionserver.region.split\_restriction.prefix\_length' =\> '2'}}
 282
 283 # Create a table with a "DelimitedKeyPrefix" split restriction, where the delimiter is a comma (,)
 284 hbase\> create 'tbl2', 'fam', {CONFIGURATION =\> {'hbase.regionserver.region.split\_restriction.type' =\> 'DelimitedKeyPrefix', 'hbase.regionserver.region.split\_restriction.delimiter' =\> ','}}
 285 \`\`\`
 286
 287 Instead of specifying a split restriction to a table directly, we can also set the properties in hbase-site.xml. In this case, the specified split restriction is applied for all the tables.
 288
 289 Note that the split restriction is also applied to a user-specified split point so that we don't allow users to break the restriction, which is different behavior from the existing KeyPrefixRegionSplitPolicy and DelimitedKeyPrefixRegionSplitPolicy.
 290
 291
 292 ---
 293
 294 * [HBASE-25775](https://issues.apache.org/jira/browse/HBASE-25775) | *Major* | **Use a special balancer to deal with maintenance mode**
 295
 296 Introduced a MaintenanceLoadBalancer to be used only under maintenance mode. Typically you should not use it as your balancer implementation.
 297
 298
 299 ---
 300
 301 * [HBASE-25767](https://issues.apache.org/jira/browse/HBASE-25767) | *Major* | **CandidateGenerator.getRandomIterationOrder is too slow on large cluster**
 302
 303 In the actual implementation classes of CandidateGenerator, now we just random select a start point and then iterate sequentially, instead of using the old way, where we will create a big array to hold all the integers in [0, num\_regions\_in\_cluster), shuffle the array, and then iterate on the array.
 304 The new implementation is 'random' enough as every time we just select one candidate. The problem for the old implementation is that, it will create an array every time when we want to get a candidate, if we have tens of thousands regions, we will create an array with tens of thousands length everytime, which causes big GC pressure and slow down the balancer execution.
 305
 306
 307 ---
 308
 309 * [HBASE-25734](https://issues.apache.org/jira/browse/HBASE-25734) | *Minor* | **Backport HBASE-24305 to branch-2.4**
 310
 311 The following method was added to ServerName
 312
 313 - #valueOf(Address, long)
 314
 315
 316 ---
 317
 318 * [HBASE-25199](https://issues.apache.org/jira/browse/HBASE-25199) | *Minor* | **Remove HStore#getStoreHomedir**
 319
 320 Moved the following methods from HStore to HRegionFileSystem
 321
 322 - #getStoreHomedir(Path, RegionInfo, byte[])
 323 - #getStoreHomedir(Path, String, byte[])
 324
 325
 326 ---
 327
 328 * [HBASE-25685](https://issues.apache.org/jira/browse/HBASE-25685) | *Major* | **asyncprofiler2.0 no longer supports svg; wants html**
 329
 330 If asyncprofiler 1.x, all is good. If asyncprofiler 2.x and it is hbase-2.3.x or hbase-2.4.x, add '?output=html' to get flamegraphs from the profiler.
 331
 332 Otherwise, if hbase-2.5+ and asyncprofiler2, all works. If asyncprofiler1 and hbase-2.5+, you may have to add '?output=svg' to the query.
 333
 334
 335 ---
 336
 337 * [HBASE-25518](https://issues.apache.org/jira/browse/HBASE-25518) | *Major* | **Support separate child regions to different region servers**
 338
 339 Config key for enable/disable automatically separate child regions to different region servers in the procedure of split regions. One child will be kept to the server where parent region is on, and the other child will be assigned to a random server.
 340
 341 hbase.master.auto.separate.child.regions.after.split.enabled
 342
 343 Default setting is false/off.
 344
 345
 346 ---
 347
 348 * [HBASE-25374](https://issues.apache.org/jira/browse/HBASE-25374) | *Minor* | **Make REST Client connection and socket time out configurable**
 349
 350 Configuration parameter to set rest client connection timeout
 351
 352 "hbase.rest.client.conn.timeout" Default is 2 \* 1000
 353
 354 "hbase.rest.client.socket.timeout" Default of 30 \* 1000
 355
 356
 357 ---
 358
 359 * [HBASE-25587](https://issues.apache.org/jira/browse/HBASE-25587) | *Major* | **[hbck2] Schedule SCP for all unknown servers**
 360
 361 Adds scheduleSCPsForUnknownServers to Hbck Service.
 362
 363
 364 ---
 365
 366 * [HBASE-25636](https://issues.apache.org/jira/browse/HBASE-25636) | *Minor* | **Expose HBCK report as metrics**
 367
 368 Expose HBCK repost results in metrics, includes: "orphanRegionsOnRS", "orphanRegionsOnFS", "inconsistentRegions", "holes", "overlaps", "unknownServerRegions" and "emptyRegionInfoRegions".
 369
 370
 371 ---
 372
 373 * [HBASE-24305](https://issues.apache.org/jira/browse/HBASE-24305) | *Minor* | **Handle deprecations in ServerName**
 374
 375 The following methods were removed or made private from ServerName (due to HBASE-17624):
 376
 377 - getHostNameMinusDomain(String): Was made private without a replacement.
 378 - parseHostname(String): Use #valueOf(String) instead.
 379 - parsePort(String): Use #valueOf(String) instead.
 380 - parseStartcode(String): Use #valueOf(String) instead.
 381 - getServerName(String, int, long): Was made private. Use #valueOf(String, int, long) instead.
 382 - getServerName(String, long): Use #valueOf(String, long) instead.
 383 - getHostAndPort(): Use #getAddress() instead.
 384 - getServerStartcodeFromServerName(String): Use instance of ServerName to pull out start code)
 385 - getServerNameLessStartCode(String): Use #getAddress() instead.
 386
 387
 388
 389 # HBASE  2.4.2 Release Notes
 390
 391 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 392
 393
 394 ---
 395
 396 * [HBASE-25492](https://issues.apache.org/jira/browse/HBASE-25492) | *Major* | **Create table with rsgroup info in branch-2**
 397
 398 HBASE-25492 added a new interface in TableDescriptor which allows user to define RSGroup name while creating or modifying a table.
 399
 400
 401 ---
 402
 403 * [HBASE-25460](https://issues.apache.org/jira/browse/HBASE-25460) | *Major* | **Expose drainingServers as cluster metric**
 404
 405 Exposed new jmx metrics: "draininigRegionServers" and "numDrainingRegionServers" to provide "comma separated names for regionservers that are put in draining mode" and "num of such regionservers" respectively.
 406
 407
 408 ---
 409
 410 * [HBASE-25615](https://issues.apache.org/jira/browse/HBASE-25615) | *Major* | **Upgrade java version in pre commit docker file**
 411
 412 jdk8u232-b09 -\> jdk8u282-b08
 413 jdk-11.0.6\_10 -\> jdk-11.0.10\_9
 414
 415
 416 ---
 417
 418 * [HBASE-23887](https://issues.apache.org/jira/browse/HBASE-23887) | *Major* | **New L1 cache : AdaptiveLRU**
 419
 420 Introduced new L1 cache: AdaptiveLRU. This is supposed to provide better performance than default LRU cache.
 421 Set config key "hfile.block.cache.policy" to "AdaptiveLRU" in hbase-site in order to start using this new cache.
 422
 423
 424
 425 # HBASE  2.4.1 Release Notes
 426
 427 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 428
 429
 430 ---
 431
 432 * [HBASE-25449](https://issues.apache.org/jira/browse/HBASE-25449) | *Major* | **'dfs.client.read.shortcircuit' should not be set in hbase-default.xml**
 433
 434 The presence of HDFS short-circuit read configuration properties in hbase-default.xml inadvertently causes short-circuit reads to not happen inside of RegionServers, despite short-circuit reads being enabled in hdfs-site.xml.
 435
 436
 437 ---
 438
 439 * [HBASE-25333](https://issues.apache.org/jira/browse/HBASE-25333) | *Major* | **Add maven enforcer rule to ban VisibleForTesting imports**
 440
 441 Ban the imports of guava VisiableForTesting, which means you should not use this annotation in HBase any more.
 442 For IA.Public and IA.LimitedPrivate classes, typically you should not expose any test related fields/methods there, and if you want to hide something, use IA.Private on the specific fields/methods.
 443 For IA.Private classes, if you want to expose something only for tests, use the RestrictedApi annotation from error prone, where it could cause a compilation error if someone break the rule in the future.
 444
 445
 446 ---
 447
 448 * [HBASE-25441](https://issues.apache.org/jira/browse/HBASE-25441) | *Critical* | **add security check for some APIs in RSRpcServices**
 449
 450 RsRpcServices APIs that can be accessed only through Admin rights:
 451 - stopServer
 452 - updateFavoredNodes
 453 - updateConfiguration
 454 - clearRegionBlockCache
 455 - clearSlowLogsResponses
 456
 457
 458 ---
 459
 460 * [HBASE-25432](https://issues.apache.org/jira/browse/HBASE-25432) | *Blocker* | **we should add security checks for setTableStateInMeta and fixMeta**
 461
 462 setTableStateInMeta and fixMeta can be accessed only through Admin rights
 463
 464
 465 ---
 466
 467 * [HBASE-25318](https://issues.apache.org/jira/browse/HBASE-25318) | *Minor* | **Configure where IntegrationTestImportTsv generates HFiles**
 468
 469 Added IntegrationTestImportTsv.generatedHFileFolder configuration property to override the default location in IntegrationTestImportTsv. Useful for running the integration test when HDFS Transparent Encryption is enabled.
 470
 471
 472 ---
 473
 474 * [HBASE-25456](https://issues.apache.org/jira/browse/HBASE-25456) | *Critical* | **setRegionStateInMeta need security check**
 475
 476 setRegionStateInMeta can be accessed only through Admin rights
 477
 478
 479
 480 # HBASE  2.4.0 Release Notes
 481
 482 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
 483
 484
 485 ---
 486
 487 * [HBASE-25127](https://issues.apache.org/jira/browse/HBASE-25127) | *Major* | **Enhance PerformanceEvaluation to profile meta replica performance.**
 488
 489 Three new commands are added to PE:
 490
 491 metaWrite, metaRandomRead and cleanMeta.
 492
 493 Usage example:
 494 hbase pe  --rows=100000 metaWrite  1
 495 hbase pe  --nomapreduce --rows=100000 metaRandomRead  32
 496 hbase pe  --rows=100000 cleanMeta 1
 497
 498 metaWrite and cleanMeta should be run with only 1 thread and the same number of rows so all the rows inserted will be cleaned up properly.
 499
 500 metaRandomRead can be run with multiple threads. The rows option should set to within the range of rows inserted by metaWrite
 501
 502
 503 ---
 504
 505 * [HBASE-25237](https://issues.apache.org/jira/browse/HBASE-25237) | *Major* | **'hbase master stop' shuts down the cluster, not the master only**
 506
 507 \`hbase master stop\` should shutdown only master by default.
 508 1. Help added to \`hbase master stop\`:
 509 To stop cluster, use \`stop-hbase.sh\` or \`hbase master stop --shutDownCluster\`
 510
 511 2. Help added to \`stop-hbase.sh\`:
 512 stop-hbase.sh can only be used for shutting down entire cluster. To shut down (HMaster\|HRegionServer) use hbase-daemon.sh stop (master\|regionserver)
 513
 514
 515 ---
 516
 517 * [HBASE-25242](https://issues.apache.org/jira/browse/HBASE-25242) | *Critical* | **Add Increment/Append support to RowMutations**
 518
 519 After HBASE-25242, we can add Increment/Append operations to RowMutations and perform those operations atomically in a single row.
 520 HBASE-25242 includes an API change where the mutateRow() API returns a Result object to get the result of the Increment/Append operations.
 521
 522
 523 ---
 524
 525 * [HBASE-25263](https://issues.apache.org/jira/browse/HBASE-25263) | *Major* | **Change encryption key generation algorithm used in the HBase shell**
 526
 527 Since the backward-compatible change we introduced in HBASE-25263,  we use the more secure PBKDF2WithHmacSHA384  key generation algorithm (instead of PBKDF2WithHmacSHA1) to generate a secret key for HFile / WalFile encryption, when the user is defining a string encryption key in the hbase shell.
 528
 529
 530 ---
 531
 532 * [HBASE-24268](https://issues.apache.org/jira/browse/HBASE-24268) | *Minor* | **REST and Thrift server do not handle the "doAs" parameter case insensitively**
 533
 534 This change allows the REST and Thrift servers to handle the "doAs" parameter case-insensitively, which is deemed as correct per the "specification" provided by the Hadoop community.
 535
 536
 537 ---
 538
 539 * [HBASE-25278](https://issues.apache.org/jira/browse/HBASE-25278) | *Minor* | **Add option to toggle CACHE\_BLOCKS in count.rb**
 540
 541 A new option, CACHE\_BLOCKS, was added to the \`count\` shell command which will force the data for a table to be loaded into the block cache. By default, the \`count\` command will not cache any blocks. This option can serve as a means to for a table's data to be loaded into block cache on demand. See the help message on the count shell command for usage details.
 542
 543
 544 ---
 545
 546 * [HBASE-18070](https://issues.apache.org/jira/browse/HBASE-18070) | *Critical* | **Enable memstore replication for meta replica**
 547
 548 "Async WAL Replication" [1] was added by HBASE-11183 "Timeline Consistent region replicas - Phase 2 design" but only for user-space tables. This feature adds "Async WAL Replication" for the hbase:meta table.  It also adds a client 'LoadBalance' mode that has reads go to replicas first and to the primary only on fail so as to shed read load from the primary to alleviate \*hotspotting\* on the hbase:meta Region.
 549
 550 Configuration is as it was for the user-space 'Async WAL Replication'. See [2] and [3] for details on how to enable.
 551
 552 1. http://hbase.apache.org/book.html#async.wal.replication
 553 2. http://hbase.apache.org/book.html#async.wal.replication.meta
 554 3. http://hbase.apache.org/book.html#\_async\_wal\_replication\_for\_meta\_table\_as\_of\_hbase\_2\_4\_0
 555
 556
 557 ---
 558
 559 * [HBASE-25126](https://issues.apache.org/jira/browse/HBASE-25126) | *Major* | **Add load balance logic in hbase-client to distribute read load over meta replica regions.**
 560
 561 See parent issue, HBASE-18070, release notes for how to enable.
 562
 563
 564 ---
 565
 566 * [HBASE-25026](https://issues.apache.org/jira/browse/HBASE-25026) | *Minor* | **Create a metric to track full region scans RPCs**
 567
 568 Adds a new metric where we collect the number of full region scan requests at the RPC layer. This will be collected under "name" : "Hadoop:service=HBase,name=RegionServer,sub=Server"
 569
 570
 571 ---
 572
 573 * [HBASE-25253](https://issues.apache.org/jira/browse/HBASE-25253) | *Major* | **Deprecated master carrys regions related methods and configs**
 574
 575 Since 2.4.0, deprecated all master carrys regions related methods(LoadBalancer,BaseLoadBalancer,ZNodeClearer) and configs(hbase.balancer.tablesOnMaster, hbase.balancer.tablesOnMaster.systemTablesOnly), they will be removed in 3.0.0.
 576
 577
 578 ---
 579
 580 * [HBASE-20598](https://issues.apache.org/jira/browse/HBASE-20598) | *Major* | **Upgrade to JRuby 9.2**
 581
 582 <!-- markdown -->
 583 The HBase shell now relies on JRuby 9.2. This is a new major version change for JRuby. The most significant change is Ruby compatibility changed from Ruby 2.3 to Ruby 2.5. For more detailed changes please see [the JRuby release announcement for the start of the 9.2 series](https://www.jruby.org/2018/05/24/jruby-9-2-0-0.html) as well as the [general release announcement page for updates since that version](https://www.jruby.org/news).
 584
 585 The runtime dependency versions present on the server side classpath for the Joni (now 2.1.31) and JCodings (now 1.0.55) libraries have also been updated to match those found in the JRuby version shipped with HBase. These version changes are maintenance releases and should be backwards compatible when updated in tandem.
 586
 587
 588 ---
 589
 590 * [HBASE-25181](https://issues.apache.org/jira/browse/HBASE-25181) | *Major* | **Add options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys.**
 591
 592 <!-- markdown -->
 593 This change adds options for disabling column family encryption and choosing hash algorithm for wrapped encryption keys. Changes are done such that defaults will keep the same behavior prior to this issue.
 594
 595 Prior to this change HBase always used the MD5 hash algorithm to store a hash for encryption keys. This hash is needed to verify the secret key of the subject. (e.g. making sure that the same secrey key is used during encrypted HFile read and write). The MD5 algorithm is considered weak, and can not be used in some (e.g. FIPS compliant) clusters. Having a configurable hash enables us to use newer and more secure hash algorithms like SHA-384 or SHA-512 (which are FIPS compliant).
 596
 597 The hash is set via the configuration option `hbase.crypto.key.hash.algorithm`. It should be set to a JDK `MessageDigest` algorithm like "MD5", "SHA-256" or "SHA-384". The default is "MD5" for backward compatibility.
 598
 599 Alternatively, clusters which rely on an encryption at rest mechanism outside of HBase (e.g. those offered by HDFS) and wish to ensure HBase's encryption at rest system is inactive can set `hbase.crypto.enabled` to `false`.
 600
 601
 602 ---
 603
 604 * [HBASE-25238](https://issues.apache.org/jira/browse/HBASE-25238) | *Critical* | **Upgrading HBase from 2.2.0 to 2.3.x fails because of “Message missing required fields: state”**
 605
 606 Fixes master procedure store migration issues going from 2.0.x to 2.2.x and/or 2.3.x. Also fixes failed heartbeat parse during rolling upgrade from 2.0.x. to 2.3.x.
 607
 608
 609 ---
 610
 611 * [HBASE-25234](https://issues.apache.org/jira/browse/HBASE-25234) | *Major* | **[Upgrade]Incompatibility in reading RS report from 2.1 RS when Master is upgraded to a version containing HBASE-21406**
 612
 613 Fixes so auto-migration of master procedure store works again going from 2.0.x =\> 2.2+. Also make it so heartbeats work when rolling upgrading from 2.0.x =\> 2.3+.
 614
 615
 616 ---
 617
 618 * [HBASE-25212](https://issues.apache.org/jira/browse/HBASE-25212) | *Major* | **Optionally abort requests in progress after deciding a region should close**
 619
 620 If hbase.regionserver.close.wait.abort is set to true, interrupt RPC handler threads holding the region close lock.
 621
 622 Until requests in progress can be aborted, wait on the region close lock for a configurable interval (specified by hbase.regionserver.close.wait.time.ms, default 60000 (1 minute)). If we have failed to acquire the close lock after this interval elapses, if allowed (also specified by hbase.regionserver.close.wait.abort), abort the regionserver.
 623
 624 We will attempt to interrupt any running handlers every hbase.regionserver.close.wait.interval.ms (default 10000 (10 seconds)) until either the close lock is acquired or we reach the maximum wait time.
 625
 626
 627 ---
 628
 629 * [HBASE-25167](https://issues.apache.org/jira/browse/HBASE-25167) | *Major* | **Normalizer support for hot config reloading**
 630
 631 <!-- markdown -->
 632 This patch adds [dynamic configuration](https://hbase.apache.org/book.html#dyn_config) support for the following configuration keys related to the normalizer:
 633 * hbase.normalizer.throughput.max_bytes_per_sec
 634 * hbase.normalizer.split.enabled
 635 * hbase.normalizer.merge.enabled
 636 * hbase.normalizer.min.region.count
 637 * hbase.normalizer.merge.min_region_age.days
 638 * hbase.normalizer.merge.min_region_size.mb
 639
 640
 641 ---
 642
 643 * [HBASE-25224](https://issues.apache.org/jira/browse/HBASE-25224) | *Major* | **Maximize sleep for checking meta and namespace regions availability**
 644
 645 Changed the max sleep time during meta and namespace regions availability check to be 60 sec. Previously there was no such cap
 646
 647
 648 ---
 649
 650 * [HBASE-24628](https://issues.apache.org/jira/browse/HBASE-24628) | *Major* | **Region normalizer now respects a rate limit**
 651
 652 <!-- markdown -->
 653 Introduces a new configuration, `hbase.normalizer.throughput.max_bytes_per_sec`, for specifying a limit on the throughput of actions executed by the normalizer. Note that while this configuration value is in bytes, the minimum honored valued is `1,000,000`, or `1m`. Supports values configured using the human-readable suffixes honored by [`Configuration.getLongBytes`](https://hadoop.apache.org/docs/current/api/org/apache/hadoop/conf/Configuration.html#getLongBytes-java.lang.String-long-)
 654
 655
 656 ---
 657
 658 * [HBASE-14067](https://issues.apache.org/jira/browse/HBASE-14067) | *Major* | **bundle ruby files for hbase shell into a jar.**
 659
 660 <!-- markdown -->
 661 The `hbase-shell` artifact now contains the ruby files that implement the hbase shell. There should be no downstream impact for users of the shell that rely on the `hbase shell` command.
 662
 663 Folks that wish to include the HBase ruby classes defined for the shell in their own JRuby scripts should add the `hbase-shell.jar` file to their classpath rather than add `${HBASE_HOME}/lib/ruby` to their load paths.
 664
 665
 666 ---
 667
 668 * [HBASE-24875](https://issues.apache.org/jira/browse/HBASE-24875) | *Major* | **Remove the force param for unassign since it dose not take effect any more**
 669
 670 <!-- markdown -->
 671 The "force" flag to various unassign commands (java api, shell, etc) has been ignored since HBase 2. As of this change the methods that take it are now deprecated. Downstream users should stop passing/using this flag.
 672
 673 The Admin and AsyncAdmin Java APIs will have the deprecated version of the unassign method with a force flag removed in HBase 4. Callers can safely continue to use the deprecated API until then; the internal implementation just calls the new method.
 674
 675 The MasterObserver coprocessor API deprecates the `preUnassign` and `postUnassign` methods that include the force parameter and replaces them with versions that omit this parameter. The deprecated methods will be removed from the API in HBase 3. Until then downstream coprocessor implementations can safely continue to *just* implement the deprecated method if they wish; the replacement methods provide a default implementation that calls the deprecated method with force set to `false`.
 676
 677
 678 ---
 679
 680 * [HBASE-25099](https://issues.apache.org/jira/browse/HBASE-25099) | *Major* | **Change meta replica count by altering meta table descriptor**
 681
 682 Now you can change the region replication config for meta table by altering meta table.
 683 The old "hbase.meta.replica.count" is deprecated and will be removed in 4.0.0. But if it is set, we will still honor it, which means, when master restart, if we find out that the value of 'hbase.meta.replica.count' is different with the region replication config of meta table, we will schedule an alter table operation to change the region replication config to the value you configured for 'hbase.meta.replica.count'.
 684
 685
 686 ---
 687
 688 * [HBASE-23834](https://issues.apache.org/jira/browse/HBASE-23834) | *Major* | **HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch**
 689
 690 Use shaded json and jersey in HBase.
 691 Ban the imports of unshaded json and jersey in code.
 692
 693
 694 ---
 695
 696 * [HBASE-25163](https://issues.apache.org/jira/browse/HBASE-25163) | *Major* | **Increase the timeout value for nightly jobs**
 697
 698 Increase timeout value for nightly jobs to 16 hours since the new build machines are dedicated to hbase project, so we are allowed to use it all the time.
 699
 700
 701 ---
 702
 703 * [HBASE-22976](https://issues.apache.org/jira/browse/HBASE-22976) | *Major* | **[HBCK2] Add RecoveredEditsPlayer**
 704
 705 WALPlayer can replay the content of recovered.edits directories.
 706
 707 Side-effect is that WAL filename timestamp is now factored when setting start/end times for WALInputFormat; i.e. wal.start.time and wal.end.time values on a job context. Previous we looked at wal.end.time only. Now we consider wal.start.time too. If a file has a name outside of wal.start.time\<-\>wal.end.time, it'll be by-passed. This change-in-behavior will make it easier on operator crafting timestamp filters processing WALs.
 708
 709
 710 ---
 711
 712 * [HBASE-25165](https://issues.apache.org/jira/browse/HBASE-25165) | *Minor* | **Change 'State time' in UI so sorts**
 713
 714 Start time on the Master UI is now displayed using ISO8601 format instead of java Date#toString().
 715
 716
 717 ---
 718
 719 * [HBASE-25124](https://issues.apache.org/jira/browse/HBASE-25124) | *Major* | **Support changing region replica count without disabling table**
 720
 721 Now you do not need to disable a table before changing its 'region replication' property.
 722 If you are decreasing the replica count, the excess region replicas will be closed before reopening other replicas.
 723 If you are increasing the replica count, the new region replicas will be opened after reopening the existing replicas.
 724
 725
 726 ---
 727
 728 * [HBASE-25154](https://issues.apache.org/jira/browse/HBASE-25154) | *Major* | **Set java.io.tmpdir to project build directory to avoid writing std\*deferred files to /tmp**
 729
 730 Change the java.io.tmpdir to project.build.directory in surefire-maven-plugin, to avoid writing std\*deferred files to /tmp which may blow up the /tmp disk on our jenkins build node.
 731
 732
 733 ---
 734
 735 * [HBASE-25055](https://issues.apache.org/jira/browse/HBASE-25055) | *Major* | **Add ReplicationSource for meta WALs; add enable/disable when hbase:meta assigned to RS**
 736
 737 Set hbase.region.replica.replication.catalog.enabled to enable async WAL Replication for hbase:meta region replicas. Its off by default.
 738
 739 Defaults to the RegionReadReplicaEndpoint.class shipping edits -- set hbase.region.replica.catalog.replication to target a different endpoint implementation.
 740
 741
 742 ---
 743
 744 * [HBASE-25109](https://issues.apache.org/jira/browse/HBASE-25109) | *Major* | **Add MR Counters to WALPlayer; currently hard to tell if it is doing anything**
 745
 746 Adds a WALPlayer to MR Counter output:
 747
 748         org.apache.hadoop.hbase.mapreduce.WALPlayer$Counter
 749                 CELLS\_READ=89574
 750                 CELLS\_WRITTEN=89572
 751                 DELETES=64
 752                 PUTS=5305
 753                 WALEDITS=4375
 754
 755
 756 ---
 757
 758 * [HBASE-24896](https://issues.apache.org/jira/browse/HBASE-24896) | *Major* | **'Stuck' in static initialization creating RegionInfo instance**
 759
 760 1. Untangle RegionInfo, RegionInfoBuilder, and MutableRegionInfo static
 761 initializations.
 762 2. Undo static initializing references from RegionInfo to RegionInfoBuilder.
 763 3. Mark RegionInfo#UNDEFINED IA.Private and deprecated;
 764 it is for internal use only and likely to be removed in HBase4. (sub-task HBASE-24918)
 765 4. Move MutableRegionInfo from inner-class of
 766 RegionInfoBuilder to be (package private) standalone. (sub-task HBASE-24918)
 767
 768
 769 ---
 770
 771 * [HBASE-24956](https://issues.apache.org/jira/browse/HBASE-24956) | *Major* | **ConnectionManager#locateRegionInMeta waits for user region lock indefinitely.**
 772
 773 <!-- markdown -->
 774
 775 Without this fix there are situations in which locateRegionInMeta() on a client is not bound by a timeout. This happens because of a global lock whose acquisition was not under any lock scope. This affects client facing API calls that rely on this method to locate a table region in meta. This fix brings the lock acquisition under the scope of "hbase.client.meta.operation.timeout" and that guarantees a bounded wait time.
 776
 777
 778 ---
 779
 780 * [HBASE-24764](https://issues.apache.org/jira/browse/HBASE-24764) | *Minor* | **Add support of adding base peer configs via hbase-site.xml for all replication peers.**
 781
 782 <!-- markdown -->
 783
 784 Adds a new configuration parameter "hbase.replication.peer.base.config" which accepts a semi-colon separated key=CSV pairs (example: k1=v1;k2=v2_1,v3...). When this configuration is set on the server side, these kv pairs are added to every peer configuration if not already set. Peer specific configuration overrides have precedence over the above default configuration. This is useful in cases when some configuration has to be set for all the peers by default and one does not want to add to every peer definition.
 785
 786
 787 ---
 788
 789 * [HBASE-24994](https://issues.apache.org/jira/browse/HBASE-24994) | *Minor* | **Add hedgedReadOpsInCurThread metric**
 790
 791 Expose Hadoop hedgedReadOpsInCurThread metric to HBase.
 792 This metric counts the number of times the hedged reads service executor rejected a read task, falling back to the current thread.
 793 This will help determine the proper size of the thread pool (dfs.client.hedged.read.threadpool.size).
 794
 795
 796 ---
 797
 798 * [HBASE-24776](https://issues.apache.org/jira/browse/HBASE-24776) | *Major* | **[hbtop] Support Batch mode**
 799
 800 HBASE-24776 added the following command line parameters to hbtop:
 801 \| Argument \| Description \|
 802 \|---\|---\|
 803 \| -n,--numberOfIterations \<arg\> \| The number of iterations \|
 804 \| -O,--outputFieldNames \| Print each of the available field names on a separate line, then quit \|
 805 \| -f,--fields \<arg\> \| Show only the given fields. Specify comma separated fields to show multiple fields \|
 806 \| -s,--sortField \<arg\> \| The initial sort field. You can prepend a \`+' or \`-' to the field name to also override the sort direction. A leading \`+' will force sorting high to low, whereas a \`-' will ensure a low to high ordering \|
 807 \| -i,--filters \<arg\> \| The initial filters. Specify comma separated filters to set multiple filters \|
 808 \| -b,--batchMode \| Starts hbtop in Batch mode, which could be useful for sending output from hbtop to other programs or to a file. In this mode, hbtop will not accept input and runs until the iterations limit you've set with the \`-n' command-line option or until killed \|
 809
 810
 811 ---
 812
 813 * [HBASE-24602](https://issues.apache.org/jira/browse/HBASE-24602) | *Major* | **Add Increment and Append support to CheckAndMutate**
 814
 815 Summary of the change of HBASE-24602:
 816 - Add \`build(Increment)\` and \`build(Append)\` methods to the \`Builder\` class of the \`CheckAndMutate\` class. After this change, we can perform checkAndIncrement/Append operations as follows:
 817 \`\`\`
 818 // Build a CheckAndMutate object with a Increment object
 819 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
 820   .ifEquals(family, qualifier, value)
 821   .build(increment);
 822
 823 // Perform a CheckAndIncrement operation
 824 CheckAndMutateResult checkAndMutateResult = table.checkAndMutate(checkAndMutate);
 825
 826 // Get whether or not the CheckAndIncrement operation is successful
 827 boolean success = checkAndMutateResult.isSuccess();
 828
 829 // Get the result of the increment operation
 830 Result result = checkAndMutateResult.getResult();
 831 \`\`\`
 832 - After this change, \`HRegion.batchMutate()\` is used for increment/append operations.
 833 - As the side effect of the above change, the following coprocessor methods of RegionObserver are called when increment/append operations are performed:
 834   - preBatchMutate()
 835   - postBatchMutate()
 836   - postBatchMutateIndispensably()
 837
 838
 839 ---
 840
 841 * [HBASE-24694](https://issues.apache.org/jira/browse/HBASE-24694) | *Major* | **Support flush a single column family of table**
 842
 843 Adds option for the flush command to flush all stores from the specified column family only, among all regions of the given table (stores from other column families on this table would not get flushed).
 844
 845
 846 ---
 847
 848 * [HBASE-24625](https://issues.apache.org/jira/browse/HBASE-24625) | *Critical* | **AsyncFSWAL.getLogFileSizeIfBeingWritten does not return the expected synced file length.**
 849
 850 We add a method getSyncedLength in  WALProvider.WriterBase interface for  WALFileLengthProvider used for replication, considering the case if we use  AsyncFSWAL,we write to 3 DNs concurrently,according to the visibility guarantee of HDFS, the data will be available immediately
 851 when arriving at DN since all the DNs will be considered as the last one in pipeline.This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency.The method WriterBase#getLength may return length which just in hdfs client buffer and not successfully synced to HDFS, so we use this method WriterBase#getSyncedLength to return the length successfully synced to HDFS and replication thread could only read writing WAL file limited by this length.
 852 see also HBASE-14004 and this document for more details:
 853 https://docs.google.com/document/d/11AyWtGhItQs6vsLRIx32PwTxmBY3libXwGXI25obVEY/edit#
 854
 855 Before this patch, replication may read uncommitted data and replicate it to the slave cluster and cause data inconsistency between master and slave cluster, we could use FSHLog instead of AsyncFSWAL  to reduce probability of inconsistency without this patch applied.
 856
 857
 858 ---
 859
 860 * [HBASE-24779](https://issues.apache.org/jira/browse/HBASE-24779) | *Minor* | **Improve insight into replication WAL readers hung on checkQuota**
 861
 862 New metrics are exposed, on the global source, for replication which indicate the "WAL entry buffer" that was introduced in HBASE-15995. When this usage reaches the limit, that RegionServer will cease to read more data for the sake of trying to replicate it. This usage (and limit) is local to each RegionServer is shared across all peers being handled by that RegionServer.
 863
 864
 865 ---
 866
 867 * [HBASE-24404](https://issues.apache.org/jira/browse/HBASE-24404) | *Major* | **Support flush a single column family of region**
 868
 869 This adds an extra "flush" command option that allows for specifying an individual family to have its store flushed.
 870
 871 Usage:
 872 flush 'REGIONNAME','FAMILYNAME'
 873 flush 'ENCODED\_REGIONNAME','FAMILYNAME'
 874
 875
 876 ---
 877
 878 * [HBASE-24805](https://issues.apache.org/jira/browse/HBASE-24805) | *Major* | **HBaseTestingUtility.getConnection should be threadsafe**
 879
 880 <!-- markdown -->
 881 Users of `HBaseTestingUtility` can now safely call the `getConnection` method from multiple threads.
 882
 883 As a consequence of refactoring to improve the thread safety of the HBase testing classes, the protected `conf` member of the  `HBaseCommonTestingUtility` class has been marked final. Downstream users who extend from the class hierarchy rooted at this class will need to pass the Configuration instance they want used to their super constructor rather than overwriting the instance variable.
 884
 885
 886 ---
 887
 888 * [HBASE-24767](https://issues.apache.org/jira/browse/HBASE-24767) | *Major* | **Change default to false for HBASE-15519 per-user metrics**
 889
 890 Disables per-user metrics. They were enabled by default for the first time in hbase-2.3.0 but they need some work before they can be on all the time (See HBASE-15519)
 891
 892
 893 ---
 894
 895 * [HBASE-24704](https://issues.apache.org/jira/browse/HBASE-24704) | *Major* | **Make the Table Schema easier to view even there are multiple families**
 896
 897 Improve the layout of column family from vertical to horizontal in table UI.
 898
 899
 900 ---
 901
 902 * [HBASE-11686](https://issues.apache.org/jira/browse/HBASE-11686) | *Minor* | **Shell code should create a binding / irb workspace instead of polluting the root namespace**
 903
 904 In shell, all HBase constants and commands have been moved out of the top-level and into an IRB Workspace. Piped stdin and scripts passed by name to the shell will be evaluated within this workspace. If you absolutely need the top-level definitions, use the new compatibility flag, ie. hbase shell --top-level-defs or hbase shell --top-level-defs script2run.rb.
 905
 906
 907 ---
 908
 909 * [HBASE-24632](https://issues.apache.org/jira/browse/HBASE-24632) | *Major* | **Enable procedure-based log splitting as default in hbase3**
 910
 911 Enables procedure-based distributed WAL splitting as default (HBASE-20610). To use 'classic' zk-coordinated splitting instead, set 'hbase.split.wal.zk.coordinated' to 'true'.
 912
 913
 914 ---
 915
 916 * [HBASE-24698](https://issues.apache.org/jira/browse/HBASE-24698) | *Major* | **Turn OFF Canary WebUI as default**
 917
 918 Flips default for 'HBASE-23994 Add WebUI to Canary' The UI defaulted to on at port 16050. This JIRA changes it so new UI is off by default.
 919
 920 To enable the UI, set property 'hbase.canary.info.port' to the port you want the UI to use.
 921
 922
 923 ---
 924
 925 * [HBASE-24650](https://issues.apache.org/jira/browse/HBASE-24650) | *Major* | **Change the return types of the new checkAndMutate methods introduced in HBASE-8458**
 926
 927 HBASE-24650 introduced CheckAndMutateResult class and changed the return type of checkAndMutate methods to this class in order to support CheckAndMutate with Increment/Append. CheckAndMutateResult class has two fields, one is \*success\* that indicates whether the operation is successful or not, and the other one is \*result\* that's the result of the operation and is used for  CheckAndMutate with Increment/Append.
 928
 929 The new APIs for the Table interface:
 930 \`\`\`
 931 /\*\*
 932  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 933  \* it performs the specified action.
 934  \*
 935  \* @param checkAndMutate The CheckAndMutate object.
 936  \* @return A CheckAndMutateResult object that represents the result for the CheckAndMutate.
 937  \* @throws IOException if a remote or network exception occurs.
 938  \*/
 939 default CheckAndMutateResult checkAndMutate(CheckAndMutate checkAndMutate) throws IOException {
 940   return checkAndMutate(Collections.singletonList(checkAndMutate)).get(0);
 941 }
 942
 943 /\*\*
 944  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 945  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 946  \* atomically (and thus, each may fail independently of others).
 947  \*
 948  \* @param checkAndMutates The list of CheckAndMutate.
 949  \* @return A list of CheckAndMutateResult objects that represents the result for each
 950  \*   CheckAndMutate.
 951  \* @throws IOException if a remote or network exception occurs.
 952  \*/
 953 default List\<CheckAndMutateResult\> checkAndMutate(List\<CheckAndMutate\> checkAndMutates)
 954   throws IOException {
 955   throw new NotImplementedException("Add an implementation!");
 956 }
 957 {code}
 958
 959 The new APIs for the AsyncTable interface:
 960 {code}
 961 /\*\*
 962  \* checkAndMutate that atomically checks if a row matches the specified condition. If it does,
 963  \* it performs the specified action.
 964  \*
 965  \* @param checkAndMutate The CheckAndMutate object.
 966  \* @return A {@link CompletableFuture}s that represent the result for the CheckAndMutate.
 967  \*/
 968 CompletableFuture\<CheckAndMutateResult\> checkAndMutate(CheckAndMutate checkAndMutate);
 969
 970 /\*\*
 971  \* Batch version of checkAndMutate. The specified CheckAndMutates are batched only in the sense
 972  \* that they are sent to a RS in one RPC, but each CheckAndMutate operation is still executed
 973  \* atomically (and thus, each may fail independently of others).
 974  \*
 975  \* @param checkAndMutates The list of CheckAndMutate.
 976  \* @return A list of {@link CompletableFuture}s that represent the result for each
 977  \*   CheckAndMutate.
 978  \*/
 979 List\<CompletableFuture\<CheckAndMutateResult\>\> checkAndMutate(
 980   List\<CheckAndMutate\> checkAndMutates);
 981
 982 /\*\*
 983  \* A simple version of batch checkAndMutate. It will fail if there are any failures.
 984  \*
 985  \* @param checkAndMutates The list of rows to apply.
 986  \* @return A {@link CompletableFuture} that wrapper the result list.
 987  \*/
 988 default CompletableFuture\<List\<CheckAndMutateResult\>\> checkAndMutateAll(
 989   List\<CheckAndMutate\> checkAndMutates) {
 990   return allOf(checkAndMutate(checkAndMutates));
 991 }
 992 \`\`\`
 993
 994
 995 ---
 996
 997 * [HBASE-24671](https://issues.apache.org/jira/browse/HBASE-24671) | *Major* | **Add excludefile and designatedfile options to graceful\_stop.sh**
 998
 999 Add excludefile and designatedfile options to graceful\_stop.sh.
1000
1001 Designated file with \<hostname:port\> per line as unload targets.
1002
1003 Exclude file should have \<hostname:port\> per line. We do not unload regions to hostnames given in exclude file.
1004
1005 Here is a simple example using graceful\_stop.sh with designatedfile option:
1006 ./bin/graceful\_stop.sh --maxthreads 4 --designatedfile /path/designatedfile hostname
1007 The usage of the excludefile option is the same as the above.
1008
1009
1010 ---
1011
1012 * [HBASE-24560](https://issues.apache.org/jira/browse/HBASE-24560) | *Major* | **Add a new option of designatedfile in RegionMover**
1013
1014 Add a new option "designatedfile" in RegionMover.
1015
1016 If designated file is present with some contents, we will unload regions to hostnames provided in designated file.
1017
1018 Designated file should have 'host:port' per line.
1019
1020
1021 ---
1022
1023 * [HBASE-24289](https://issues.apache.org/jira/browse/HBASE-24289) | *Major* | **Heterogeneous Storage for Date Tiered Compaction**
1024
1025 Enhance DateTieredCompaction to support HDFS storage policy within one class family.
1026 # First you need enable DTCP.
1027 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
1028 hbase.hstore.compaction.compaction.policy=org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
1029 ## Parameters for Date Tiered Compaction:
1030 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
1031 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
1032 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
1033 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
1034
1035 # Then enable HDTCP(Heterogeneous Date Tiered Compaction) as follow example configurations:
1036 hbase.hstore.compaction.date.tiered.storage.policy.enable=true
1037 hbase.hstore.compaction.date.tiered.hot.window.age.millis=3600000
1038 hbase.hstore.compaction.date.tiered.hot.window.storage.policy=ALL\_SSD
1039 hbase.hstore.compaction.date.tiered.warm.window.age.millis=20600000
1040 hbase.hstore.compaction.date.tiered.warm.window.storage.policy=ONE\_SSD
1041 hbase.hstore.compaction.date.tiered.cold.window.storage.policy=HOT
1042 ## It is better to enable WAL and flushing HFile storage policy with HDTCP. You can tune follow settings as well:
1043 hbase.wal.storage.policy=ALL\_SSD
1044 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ALL\_SSD'}}
1045
1046 # Disable HDTCP as follow:
1047 hbase.hstore.compaction.date.tiered.storage.policy.enable=false
1048
1049
1050 ---
1051
1052 * [HBASE-24648](https://issues.apache.org/jira/browse/HBASE-24648) | *Major* | **Remove the legacy 'forceSplit' related code at region server side**
1053
1054 Add a canSplit method to RegionSplitPolicy to determine whether we can split a region. Usually it is not related to RegionSplitPolicy so in the default implementation, it will test whether region is available and does not have reference file, but in DisabledRegionSplitPolicy, we will always return false.
1055
1056
1057 ---
1058
1059 * [HBASE-24382](https://issues.apache.org/jira/browse/HBASE-24382) | *Major* | **Flush partial stores of region filtered by seqId when archive wal due to too many wals**
1060
1061 Change the flush level from region to store when there are too many wals, benefit from this we can reduce unnessary flush tasks and small hfiles.
1062
1063
1064 ---
1065
1066 * [HBASE-24038](https://issues.apache.org/jira/browse/HBASE-24038) | *Major* | **Add a metric to show the locality of ssd in table.jsp**
1067
1068 Add a metric to show the locality of ssd in table.jsp, and move the locality related metrics to a new tab named localities.
1069
1070
1071 ---
1072
1073 * [HBASE-8458](https://issues.apache.org/jira/browse/HBASE-8458) | *Major* | **Support for batch version of checkAndMutate()**
1074
1075 HBASE-8458 introduced CheckAndMutate class that's used to perform CheckAndMutate operations. Use the builder class to instantiate a CheckAndMutate object. This builder class is fluent style APIs, the code are like:
1076 \`\`\`
1077 // A CheckAndMutate operation where do the specified action if the column (specified by the
1078 family and the qualifier) of the row equals to the specified value
1079 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1080   .ifEquals(family, qualifier, value)
1081   .build(put);
1082
1083 // A CheckAndMutate operation where do the specified action if the column (specified by the
1084 // family and the qualifier) of the row doesn't exist
1085 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1086   .ifNotExists(family, qualifier)
1087   .build(put);
1088
1089 // A CheckAndMutate operation where do the specified action if the row matches the filter
1090 CheckAndMutate checkAndMutate = CheckAndMutate.newBuilder(row)
1091   .ifMatches(filter)
1092   .build(delete);
1093 \`\`\`
1094
1095 And This added new checkAndMutate APIs to the Table and AsyncTable interfaces, and deprecated the old checkAndMutate APIs. The example code for the new APIs are as follows:
1096 \`\`\`
1097 Table table = ...;
1098
1099 CheckAndMutate checkAndMutate = ...;
1100
1101 // Perform the checkAndMutate operation
1102 boolean success = table.checkAndMutate(checkAndMutate);
1103
1104 CheckAndMutate checkAndMutate1 = ...;
1105 CheckAndMutate checkAndMutate2 = ...;
1106
1107 // Batch version
1108 List\<Boolean\> successList = table.checkAndMutate(Arrays.asList(checkAndMutate1, checkAndMutate2));
1109 \`\`\`
1110
1111 This also has Protocol Buffers level changes. Old clients without this patch will work against new servers with this patch. However, new clients will break against old servers without this patch for checkAndMutate with RM and mutateRow. So, for rolling upgrade, we will need to upgrade servers first, and then roll out the new clients.
1112
1113
1114 ---
1115
1116 * [HBASE-24471](https://issues.apache.org/jira/browse/HBASE-24471) | *Major* | **The way we bootstrap meta table is confusing**
1117
1118 Move all the meta initialization code in MasterFileSystem and HRegionServer to InitMetaProcedure. Add a new step for InitMetaProcedure called INIT\_META\_WRITE\_FS\_LAYOUT to place the moved code.
1119
1120 This is an incompatible change, but should not have much impact. InitMetaProcedure will only be executed once when bootstraping a fresh new cluster, so typically this will not effect rolling upgrading. And even if you hit this problem, as long as InitMetaProcedure has not been finished, we can make sure that there is no user data in the cluster, you can just clean up the cluster and try again. There will be no data loss.
1121
1122
1123 ---
1124
1125 * [HBASE-24017](https://issues.apache.org/jira/browse/HBASE-24017) | *Major* | **Turn down flakey rerun rate on all but hot branches**
1126
1127 Changed master, branch-2, and branch-2.1 to twice a day.
1128 Left branch-2.3, branch-2.2, and branch-1 at every 4 hours.
1129 Changed branch-1.4 and branch-1.3 to @daily (1.3 was running every hour).
1130
1131
1132
1133 # HBASE  2.3.0 Release Notes
1134
1135 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
1136
1137
1138 ---
1139
1140 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
1141
1142 <!-- markdown -->
1143
1144 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
1145
1146
1147 ---
1148
1149 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
1150
1151 <!-- markdown -->
1152 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
1153 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
1154
1155
1156 ---
1157
1158 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
1159
1160 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
1161 The metric is now collected under the mbean for Tables and under the mbean for regions.
1162 Under table mbean ie.-
1163 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
1164 The new metrics will be listed as
1165 {code}
1166     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1167  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
1168 {code}
1169 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
1170 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
1171 {code}
1172
1173 The same one under the region ie.
1174 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
1175 comes as
1176 {code}
1177    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
1178     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
1179 {code}
1180 where
1181 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
1182 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
1183 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
1184
1185
1186 ---
1187
1188 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
1189
1190 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
1191
1192 $hbase rowcounter -h
1193
1194 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
1195 Options:
1196     --starttime=\<arg\>       starting time filter to start counting rows from.
1197     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
1198     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
1199     --expectedCount=\<arg\>   expected number of rows to be count.
1200 For performance, consider the following configuration properties:
1201 -Dhbase.client.scanner.caching=100
1202 -Dmapreduce.map.speculative=false
1203
1204
1205 ---
1206
1207 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
1208
1209 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
1210
1211
1212 ---
1213
1214 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
1215
1216 Adds being able to edit hbase:meta table schema. For example,
1217
1218 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
1219 Updating all regions with the new schema...
1220 All regions updated.
1221 Done.
1222 Took 1.2138 seconds
1223
1224 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
1225
1226
1227 ---
1228
1229 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
1230
1231 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
1232
1233
1234 ---
1235
1236 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
1237
1238 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
1239
1240
1241 ---
1242
1243 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
1244
1245 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
1246
1247
1248 ---
1249
1250 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
1251
1252 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
1253
1254
1255 ---
1256
1257 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
1258
1259 <!-- markdown -->
1260 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
1261 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
1262 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
1263 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
1264 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
1265 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
1266
1267
1268 ---
1269
1270 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
1271
1272 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
1273
1274 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
1275
1276 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
1277
1278 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
1279
1280
1281 ---
1282
1283 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
1284
1285 Added new metric to differentiate sink startup time from last OP applied time.
1286
1287 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
1288
1289 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
1290
1291 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
1292
1293 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
1294
1295
1296 ---
1297
1298 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
1299
1300 <!-- markdown -->
1301 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
1302
1303 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
1304
1305
1306 ---
1307
1308 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
1309
1310 Add backoff. Avoid retrying every 100ms.
1311
1312
1313 ---
1314
1315 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
1316
1317 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
1318
1319 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
1320
1321
1322 ---
1323
1324 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
1325
1326 Introduced a general 'local region' at master side to store the procedure data, etc.
1327
1328 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
1329
1330 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
1331
1332
1333 ---
1334
1335 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
1336
1337 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
1338
1339
1340 ---
1341
1342 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
1343
1344 Config key: hbase.regionserver.slowlog.systable.enabled
1345 Default value: false
1346
1347 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
1348 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
1349
1350 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
1351
1352 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
1353
1354  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
1355  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
1356  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
1357  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
1358                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
1359                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
1360                                                              rics: false
1361  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
1362  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
1363  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
1364  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
1365  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
1366  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
1367  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
1368  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
1369
1370
1371 ---
1372
1373 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
1374
1375 <!-- markdown -->
1376 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
1377
1378
1379 ---
1380
1381 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
1382
1383 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
1384
1385 The request log is disabled by default in conf/log4j.properties by the following lines:
1386
1387 # Disable request log by default, you can enable this by changing the appender
1388 log4j.category.http.requests=INFO,NullAppender
1389 log4j.additivity.http.requests=false
1390
1391 Change the 'NullAppender' to what ever you want if you want to enable request log.
1392
1393 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
1394
1395
1396 ---
1397
1398 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
1399
1400 Use a empty string to represent no column specified for deleteall in shell mode.
1401 useage:
1402 deleteall 'test','r1','',12345
1403 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
1404
1405
1406 ---
1407
1408 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
1409
1410 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
1411
1412
1413 ---
1414
1415 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
1416
1417 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
1418
1419
1420 ---
1421
1422 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
1423
1424 Moved to hbase-thirdparty 3.3.0.
1425
1426
1427 ---
1428
1429 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
1430
1431 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
1432
1433 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
1434
1435 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
1436
1437
1438 ---
1439
1440 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
1441
1442 <!-- markdown -->
1443 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
1444
1445 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
1446
1447
1448 ---
1449
1450 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
1451
1452 New Config: hbase.rpc.rows.size.threshold.reject
1453 -----------------------------------------------------------------------
1454
1455 Default value: false
1456 Description:
1457 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
1458
1459
1460 ---
1461
1462 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
1463
1464 StochasticLoadBalancer functional improvement:
1465
1466 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
1467
1468
1469 ---
1470
1471 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
1472
1473 user or admin can now use
1474 hbase shell \> rename\_rsgroup 'oldname', 'newname'
1475 to rename rsgroup.
1476
1477
1478 ---
1479
1480 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
1481
1482 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
1483
1484 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
1485
1486
1487 ---
1488
1489 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
1490
1491 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
1492
1493
1494 ---
1495
1496 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
1497
1498 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
1499
1500
1501 ---
1502
1503 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
1504
1505 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
1506
1507
1508 ---
1509
1510 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
1511
1512 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
1513
1514
1515 ---
1516
1517 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
1518
1519 <!-- markdown -->
1520 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
1521
1522
1523 ---
1524
1525 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
1526
1527 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
1528
1529 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
1530
1531 For running tests locally, to go faster, up fork count.
1532
1533 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
1534
1535 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
1536
1537
1538 ---
1539
1540 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
1541
1542 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
1543
1544
1545 ---
1546
1547 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
1548
1549 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
1550
1551
1552 ---
1553
1554 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
1555
1556 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
1557
1558
1559 ---
1560
1561 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
1562
1563 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
1564
1565
1566 ---
1567
1568 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
1569
1570 <!-- markdown -->
1571 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
1572
1573 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
1574
1575
1576 ---
1577
1578 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
1579
1580 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
1581
1582
1583 ---
1584
1585 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
1586
1587 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
1588
1589
1590 ---
1591
1592 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
1593
1594 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
1595
1596
1597 ---
1598
1599 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
1600
1601 ColumnFamilyDescriptor new builder API:
1602
1603     /\*\*
1604      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
1605      \* of versions(versionAfterInterval) after that interval elapses.
1606      \*
1607      \* @param retentionInterval Retain all versions for this interval
1608      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
1609      \*/
1610     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
1611         final int retentionInterval, final int versionAfterInterval)
1612
1613
1614 ---
1615
1616 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
1617
1618 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
1619
1620
1621 ---
1622
1623 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
1624
1625 Expose file system level read metrics for RegionServer.
1626
1627 If the HBase RS runs on top of HDFS, calculate the aggregation of
1628 ReadStatistics of each HdfsFileInputStream. These metrics include:
1629 (1) total number of bytes read from HDFS.
1630 (2) total number of bytes read from local DataNode.
1631 (3) total number of bytes read locally through short-circuit read.
1632 (4) total number of bytes read locally through zero-copy read.
1633
1634 Because HDFS ReadStatistics is calculated per input stream, it is not
1635 feasible to update the aggregated number in real time. Instead, the
1636 metrics are updated when an input stream is closed.
1637
1638
1639 ---
1640
1641 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
1642
1643 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
1644
1645 Here is a simple example of script:
1646 {code}
1647 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
1648 #!/bin/bash
1649 namespace=$1
1650 tablename=$2
1651 if [[ $namespace == test ]]; then
1652   echo test
1653 elif [[ $tablename == \*foo\* ]]; then
1654   echo other
1655 else
1656   echo default
1657 fi
1658 {code}
1659
1660
1661 ---
1662
1663 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
1664
1665 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
1666
1667
1668 ---
1669
1670 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
1671
1672 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
1673
1674
1675 ---
1676
1677 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
1678
1679 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
1680
1681 User used to see....
1682
1683   column=table:state, timestamp=1583967620343 .....
1684
1685 ... but now sees:
1686
1687   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
1688
1689
1690 ---
1691
1692 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
1693
1694 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
1695
1696
1697 ---
1698
1699 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
1700
1701 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
1702
1703 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
1704
1705
1706 ---
1707
1708 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
1709
1710 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
1711
1712 New Admin APIs:
1713 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
1714       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
1715
1716 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
1717       throws IOException;
1718
1719 Configs:
1720
1721 1. hbase.regionserver.slowlog.ringbuffer.size:
1722 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
1723
1724 Default
1725 256
1726
1727 2. hbase.regionserver.slowlog.buffer.enabled:
1728 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
1729
1730 Default
1731 false
1732
1733
1734 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
1735
1736
1737 ---
1738
1739 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
1740
1741 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
1742
1743
1744 ---
1745
1746 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
1747
1748 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
1749
1750 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
1751
1752 This is a fluent style API, the code is like:
1753
1754 For Table interface:
1755 {code}
1756 table.checkAndMutate(row, filter).thenPut(put);
1757 {code}
1758
1759 For AsyncTable interface:
1760 {code}
1761 table.checkAndMutate(row, filter).thenPut(put)
1762     .thenAccept(succ -\> {
1763       if (succ) {
1764         System.out.println("Check and put succeeded");
1765       } else {
1766         System.out.println("Check and put failed");
1767       }
1768     });
1769 {code}
1770
1771
1772 ---
1773
1774 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
1775
1776 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
1777
1778
1779 ---
1780
1781 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
1782
1783 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
1784
1785
1786 ---
1787
1788 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
1789
1790     Adds shell command regioninfo:
1791
1792       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
1793       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
1794       Took 0.4737 seconds
1795
1796
1797 ---
1798
1799 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
1800
1801 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
1802
1803 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
1804
1805
1806 ---
1807
1808 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
1809
1810 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
1811
1812 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
1813 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
1814
1815 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
1816
1817
1818 ---
1819
1820 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
1821
1822 <!-- markdown -->
1823 Enables master based registry as the default registry used by clients to fetch connection metadata.
1824 Refer to the section "Master Registry" in the client documentation for more details and advantages
1825 of this implementation over the default Zookeeper based registry.
1826
1827 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
1828
1829 Where to set this: HBase client configuration (hbase-site.xml)
1830
1831 Possible values:
1832 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
1833 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
1834
1835 Notes on defaults:
1836
1837 - For v3.0.0 and later, MasterRegistry is the default registry
1838 - For all releases in 2.x line, ZK based registry is the default.
1839
1840 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
1841
1842 ```
1843 <property>
1844   <name>hbase.client.registry.impl</name>
1845   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
1846 </property>
1847 ```
1848
1849
1850 ---
1851
1852 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
1853
1854 caffeine: 2.6.2 =\> 2.8.1
1855 commons-codec: 1.10 =\> 1.13
1856 commons-io: 2.5 =\> 2.6
1857 disrupter: 3.3.6 =\> 3.4.2
1858 httpcore: 4.4.6 =\> 4.4.13
1859 jackson: 2.9.10 =\> 2.10.1
1860 jackson.databind: 2.9.10.1 =\> 2.10.1
1861 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
1862 protobuf.plugin: 0.5.0 =\> 0.6.1
1863 zookeeper: 3.4.10 =\> 3.4.14
1864 slf4j: 1.7.25 =\> 1.7.30
1865 rat: 0.12 =\> 0.13
1866 asciidoctor: 1.5.5 =\> 1.5.8
1867 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
1868 error-prone: 2.3.3 =\> 2.3.4
1869
1870
1871 ---
1872
1873 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
1874
1875 - Reverts a binary incompatible binary change for ByteRangeUtils
1876 - Usage of reflection inside CommonFSUtils removed
1877
1878
1879 ---
1880
1881 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
1882
1883 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
1884
1885 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
1886
1887
1888 ---
1889
1890 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
1891
1892 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
1893
1894
1895 ---
1896
1897 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
1898
1899 Add a new config to hbase-default.xml
1900
1901   \<property\>
1902     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
1903     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
1904     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
1905     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
1906     called in order, so put the cleaner that prunes the most files in front. To
1907     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
1908     and add the fully qualified class name here. Always add the above
1909     default hfile cleaners in the list as they will be overwritten in
1910     hbase-site.xml.\</description\>
1911   \</property\>
1912
1913 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
1914
1915
1916 ---
1917
1918 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
1919
1920 Updated parent pom to Apache version 22.
1921
1922
1923 ---
1924
1925 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
1926
1927 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
1928
1929 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
1930
1931
1932 ---
1933
1934 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
1935
1936 Add a new feature to improve MTTR which have 3 steps to failover:
1937 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
1938 2. Open region.
1939 3. Bulkload the recovered.hfiles for every column family.
1940
1941 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
1942
1943 Config hbase.wal.split.to.hfile to true to enable this featue.
1944
1945
1946 ---
1947
1948 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
1949
1950 Changed the logging in hbase-zookeeper to use built-in formatting
1951
1952
1953 ---
1954
1955 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
1956
1957 From the PR:
1958
1959 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
1960
1961 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
1962
1963
1964 ---
1965
1966 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
1967
1968 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
1969
1970
1971 ---
1972
1973 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
1974
1975 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
1976
1977
1978 ---
1979
1980 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
1981
1982 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
1983
1984
1985 ---
1986
1987 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
1988
1989 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
1990
1991 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
1992
1993
1994 ---
1995
1996 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
1997
1998 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
1999
2000
2001 ---
2002
2003 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
2004
2005 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
2006 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
2007
2008 Fixed this bug as part of this Jira.
2009 Updated description for corresponding configs:
2010
2011 1. hbase.master.regions.recovery.check.interval :
2012
2013 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2014
2015 2. hbase.regions.recovery.store.file.ref.count :
2016
2017 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2018
2019
2020 ---
2021
2022 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
2023
2024 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
2025
2026
2027 ---
2028
2029 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
2030
2031 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
2032
2033
2034 ---
2035
2036 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
2037
2038 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
2039
2040
2041 ---
2042
2043 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
2044
2045 Bumped surefire plugin to 3.0.0-M4
2046
2047
2048 ---
2049
2050 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
2051
2052 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
2053
2054
2055 ---
2056
2057 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
2058
2059 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
2060 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
2061 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
2062 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
2063 From the shell this can be enabled by using the option per Column Family also by using the below format
2064 {code}
2065 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
2066 {code}
2067
2068
2069 ---
2070
2071 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
2072
2073 <!-- markdown -->
2074
2075 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
2076
2077 ```
2078 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
2079     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
2080 ```
2081
2082 See javadocs of the class `MobRefReporter` for more details.
2083
2084 the reference guide has added some information about MOB internals and troubleshooting.
2085
2086
2087 ---
2088
2089 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
2090
2091 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
2092
2093
2094 ---
2095
2096 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
2097
2098 Fixed unbalanced braces in string representation within HBase shell
2099
2100
2101 ---
2102
2103 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
2104
2105 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
2106 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
2107
2108
2109 ---
2110
2111 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
2112
2113 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
2114
2115
2116 ---
2117
2118 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
2119
2120 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
2121
2122 1. RowFilter
2123 2. ValueFilter
2124 3. QualifierFilter
2125 4. FamilyFilter
2126 5. ColumnValueFilter
2127
2128
2129 ---
2130
2131 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
2132
2133 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
2134
2135
2136 ---
2137
2138 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
2139
2140 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
2141
2142
2143 ---
2144
2145 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
2146
2147 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
2148
2149
2150 ---
2151
2152 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
2153
2154 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
2155
2156
2157 ---
2158
2159 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
2160
2161 <!-- markdown -->
2162 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
2163
2164 Such messages will happen at most once per five minutes.
2165
2166
2167 ---
2168
2169 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
2170
2171 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
2172
2173
2174 ---
2175
2176 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
2177
2178 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
2179
2180
2181 ---
2182
2183 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
2184
2185 <!-- markdown -->
2186
2187 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
2188
2189   - CVE-2019-16942
2190   - CVE-2019-16943
2191
2192 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
2193
2194
2195 ---
2196
2197 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
2198
2199 <!-- markdown -->
2200
2201 The MOB compaction process in the HBase Master now logs more about its activity.
2202
2203 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
2204
2205 Caveats:
2206 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
2207 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
2208 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
2209 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
2210
2211
2212 ---
2213
2214 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
2215
2216 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
2217
2218
2219 ---
2220
2221 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
2222
2223 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
2224
2225 Configs:
2226
2227 1. hbase.master.regions.recovery.check.interval :
2228
2229 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
2230
2231 2. hbase.regions.recovery.store.file.ref.count :
2232
2233 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
2234
2235
2236 ---
2237
2238 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
2239
2240 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
2241
2242 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
2243
2244
2245 ---
2246
2247 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
2248
2249 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
2250
2251
2252 ---
2253
2254 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
2255
2256 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
2257
2258
2259 ---
2260
2261 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
2262
2263 <!-- markdown -->
2264 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
2265
2266 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
2267
2268
2269 ---
2270
2271 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
2272
2273 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
2274
2275
2276 ---
2277
2278 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
2279
2280 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
2281
2282
2283 ---
2284
2285 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
2286
2287 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
2288
2289 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
2290
2291
2292 ---
2293
2294 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
2295
2296 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
2297 \<property\>
2298     \<name\>hbase.bucketcache.ioengine\</name\>
2299     \<value\> pmem:///path in persistent memory \</value\>
2300   \</property\>
2301
2302
2303 ---
2304
2305 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
2306
2307 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
2308 hbase\> snapshot\_cleanup\_switch false
2309
2310 We can re-enable it using:
2311 hbase\> snapshot\_cleanup\_switch true
2312
2313 We can query whether snapshot auto cleanup is enabled for cluster using:
2314 hbase\> snapshot\_cleanup\_enabled
2315
2316
2317 ---
2318
2319 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
2320
2321 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
2322
2323
2324 ---
2325
2326 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
2327
2328 This issue adds via its subtasks:
2329
2330  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
2331  \*\* Master thought this region opened, but no regionserver reported it.
2332  \*\* Master thought this region opened on Server1, but regionserver reported Server2
2333  \*\* More than one regionservers reported opened this region
2334  Both chores can be triggered from the shell to regenerate ‘new’ reports.
2335  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
2336  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
2337  \* Offline replace of hbase.version and hbase.id
2338  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
2339  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
2340  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
2341  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
2342  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
2343  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
2344
2345 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
2346
2347
2348 ---
2349
2350 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
2351
2352 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
2353
2354
2355 ---
2356
2357 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
2358
2359 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
2360
2361
2362 ---
2363
2364 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
2365
2366 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
2367
2368 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
2369
2370 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
2371
2372
2373 ---
2374
2375 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
2376
2377 <!-- markdown -->
2378 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
2379
2380
2381 ---
2382
2383 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
2384
2385 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
2386
2387
2388 ---
2389
2390 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
2391
2392 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
2393
2394
2395 ---
2396
2397 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
2398
2399 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
2400 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
2401
2402
2403 ---
2404
2405 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
2406
2407 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
2408 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
2409 \* TimeRange#until: Represents the time interval [0, maxStamp)
2410 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
2411
2412
2413 ---
2414
2415 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
2416
2417 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
2418 {code}
2419 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
2420 {code}
2421
2422
2423 ---
2424
2425 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
2426
2427 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
2428
2429
2430 ---
2431
2432 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
2433
2434 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
2435
2436 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
2437
2438
2439 ---
2440
2441 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
2442
2443 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
2444
2445
2446 ---
2447
2448 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
2449
2450 New shaded artifact for testing: hbase-shaded-testing-util.
2451
2452
2453 ---
2454
2455 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
2456
2457 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
2458 1. Check HDFS configuration
2459 2. Add master coprocessor:
2460     hbase.coprocessor.master.classes=
2461     “org.apache.hadoop.hbase.security.access.AccessController,
2462 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
2463 3. Enable this feature:
2464     hbase.acl.sync.to.hdfs.enable=true
2465 4. Modify table scheme to enable this feature for a table:
2466     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
2467
2468
2469 ---
2470
2471 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
2472
2473 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
2474
2475 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
2476
2477 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
2478 java.lang.ArrayIndexOutOfBoundsException: 18056
2479         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
2480         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
2481         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
2482         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
2483         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
2484         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
2485         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
2486         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
2487         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
2488
2489 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
2490
2491 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
2492
2493 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
2494
2495
2496 ---
2497
2498 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
2499
2500 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
2501
2502
2503 ---
2504
2505 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
2506
2507 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
2508
2509
2510 ---
2511
2512 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
2513
2514 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
2515
2516 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
2517
2518
2519 ---
2520
2521 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
2522
2523 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
2524
2525
2526 ---
2527
2528 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
2529
2530 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
2531 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
2532
2533
2534 ---
2535
2536 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
2537
2538 1. Add a new chore thread in master to do hbck checking
2539 2. Add a new web ui "HBCK Report" page to display checking results.
2540
2541 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
2542
2543 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
2544
2545
2546 ---
2547
2548 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
2549
2550 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
2551 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
2552
2553
2554 ---
2555
2556 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
2557
2558 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
2559
2560 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
2561
2562 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
2563
2564
2565 ---
2566
2567 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
2568
2569 Add a new master web UI to show the potentially problematic opened regions. There are three case:
2570 1. Master thought this region opened, but no regionserver reported it.
2571 2. Master thought this region opened on Server1, but regionserver reported Server2
2572 3. More than one regionservers reported opened this region
2573
2574
2575 ---
2576
2577 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
2578
2579 Feature: Take a Snapshot With TTL for auto-cleanup
2580
2581 Attribute:
2582 1. TTL
2583      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
2584
2585 Configs:
2586 1. Default Snapshot TTL:
2587      - FOREVER by default
2588      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
2589
2590 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
2591      - hbase.master.cleaner.snapshot.disable: "true"
2592     With this config, HMaster needs restart just like any other hbase-site config.
2593
2594
2595 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
2596
2597
2598 ---
2599
2600 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
2601
2602 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
2603
2604
2605 ---
2606
2607 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
2608
2609 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
2610
2611 This tool is deprecated in 2.x and will be removed in 3.0.
2612
2613
2614 ---
2615
2616 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
2617
2618 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
2619
2620
2621 ---
2622
2623 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
2624
2625 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
2626
2627 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
2628
2629 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
2630
2631
2632 ---
2633
2634 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
2635
2636 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
2637 To use this feature, please make sure the HDFS config is set:
2638 dfs.namenode.acls.enabled=true
2639 fs.permissions.umask-mode=027
2640
2641 and set the HBase config:
2642 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
2643 hbase.user.scan.snapshot.enable=true
2644
2645
2646 ---
2647
2648 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
2649
2650 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2651
2652 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
2653
2654
2655 ---
2656
2657 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
2658
2659 <!-- markdown -->
2660
2661 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
2662
2663
2664 ---
2665
2666 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
2667
2668 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
2669
2670
2671 ---
2672
2673 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
2674
2675 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
2676
2677
2678 ---
2679
2680 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
2681
2682 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
2683
2684
2685 ---
2686
2687 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
2688
2689 The HBase "source checksum" now uses SHA512 instead of MD5.
2690
2691
2692 ---
2693
2694 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
2695
2696 <!-- markdown -->
2697
2698 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
2699
2700 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
2701
2702
2703 ---
2704
2705 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
2706
2707 The access method was used to the HttpServerFunctionalTest class as a common place.
2708
2709
2710 ---
2711
2712 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
2713
2714 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
2715
2716
2717 ---
2718
2719 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
2720
2721 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
2722
2723
2724 ---
2725
2726 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
2727
2728 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
2729
2730
2731 ---
2732
2733 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
2734
2735 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
2736
2737
2738 ---
2739
2740 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
2741
2742 Support get\|set LogLevel in secure(kerberized) environment.
2743
2744
2745 ---
2746
2747 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
2748
2749 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
2750
2751
2752 ---
2753
2754 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
2755
2756 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
2757
2758
2759 ---
2760
2761 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
2762
2763 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
2764
2765
2766 ---
2767
2768 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
2769
2770 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
2771
2772
2773 ---
2774
2775 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
2776
2777 Updated metrics core from 3.2.1 to 3.2.6.
2778
2779
2780 ---
2781
2782 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
2783
2784 The rubocop definition for the maximum method length was set to 75.
2785
2786
2787 ---
2788
2789 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
2790
2791 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
2792
2793
2794 ---
2795
2796 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
2797
2798 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
2799
2800
2801 ---
2802
2803 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
2804
2805 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
2806
2807 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
2808
2809 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
2810
2811 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
2812
2813 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
2814 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
2815
2816 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
2817 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
2818 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
2819
2820
2821 ---
2822
2823 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
2824
2825 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
2826
2827 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
2828
2829
2830 ---
2831
2832 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
2833
2834 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
2835
2836
2837 ---
2838
2839 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
2840
2841 <!-- markdown -->
2842 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
2843
2844 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
2845
2846
2847 ---
2848
2849 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
2850
2851 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
2852
2853 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
2854
2855
2856 ---
2857
2858 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
2859
2860 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
2861
2862
2863 ---
2864
2865 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
2866
2867 <!-- markdown -->
2868
2869 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
2870
2871 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
2872
2873
2874 ---
2875
2876 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
2877
2878 Add below method in Table interface:
2879
2880 RegionLocator getRegionLocator() throws IOException;
2881
2882 Add below methods in AsyncTable interface:
2883
2884 AsyncTableRegionLocator getRegionLocator();
2885 CompletableFuture\<TableDescriptor\> getDescriptor();
2886
2887
2888 ---
2889
2890 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
2891
2892 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
2893
2894 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
2895
2896 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
2897
2898
2899 ---
2900
2901 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
2902
2903 Introduced
2904
2905 Future\<Void\> createTableAsync(TableDescriptor);
2906
2907
2908 ---
2909
2910 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
2911
2912 Introduced these methods:
2913 void move(byte[]);
2914 void move(byte[], ServerName);
2915 Future\<Void\> splitRegionAsync(byte[]);
2916
2917 These methods are deprecated:
2918 void move(byte[], byte[])
2919
2920
2921 ---
2922
2923 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
2924
2925 Add a new jenkins file for running pre commit check for GitHub PR.
2926
2927
2928 ---
2929
2930 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
2931
2932 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
2933
2934
2935 ---
2936
2937 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
2938
2939 When insufficient permissions, you now get:
2940
2941 HTTP/1.1 403 Forbidden
2942
2943 on the HTTP side, and in the message
2944
2945 Forbidden
2946 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
2947 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
2948 and the rest of the ADE stack
2949
2950
2951 ---
2952
2953 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
2954
2955 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
2956
2957
2958 ---
2959
2960 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
2961
2962 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
2963
2964
2965 ---
2966
2967 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
2968
2969 <!-- markdown -->
2970 Fixed awkward dependency issue that prevented site building.
2971
2972 #### note specific to HBase 2.1.4
2973 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
2974 ```
2975 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
2976 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
2977         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
2978         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
2979         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
2980         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
2981         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
2982         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
2983         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
2984         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
2985         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
2986         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
2987         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
2988         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
2989         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
2990         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
2991         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
2992         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
2993         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
2994         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
2995         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
2996         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
2997         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
2998         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
2999         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
3000         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
3001         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
3002         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
3003 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
3004         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
3005         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
3006         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
3007         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
3008         ... 26 more
3009
3010 ```
3011
3012 Workaround via any _one_ of the following:
3013 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
3014 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
3015 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
3016 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
3017 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
3018
3019
3020 ---
3021
3022 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
3023
3024 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
3025
3026
3027 ---
3028
3029 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
3030
3031 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
3032
3033
3034 ---
3035
3036 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
3037
3038 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
3039
3040 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
3041
3042
3043 ---
3044
3045 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
3046
3047 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
3048
3049
3050 ---
3051
3052 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
3053
3054 <!-- markdown -->
3055
3056 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
3057
3058 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
3059
3060
3061 ---
3062
3063 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
3064
3065 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
3066
3067
3068 ---
3069
3070 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
3071
3072 Add a cloneSnapshotAsync method with restoreAcl parameter.
3073 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
3074 Make snapshotAsync method returns a Future\<Void\>.
3075 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
3076 Use default methods to reduce the code base for implementation classes.
3077
3078
3079 ---
3080
3081 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
3082
3083 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
3084
3085
3086 ---
3087
3088 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
3089
3090 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
3091 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
3092
3093 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
3094
3095 For example:
3096 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
3097
3098
3099 ---
3100
3101 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
3102
3103 Adds below flush, split, and compaction metrics
3104
3105  +  // split related metrics
3106  +  private MutableFastCounter splitRequest;
3107  +  private MutableFastCounter splitSuccess;
3108  +  private MetricHistogram splitTimeHisto;
3109  +
3110  +  // flush related metrics
3111  +  private MetricHistogram flushTimeHisto;
3112  +  private MetricHistogram flushMemstoreSizeHisto;
3113  +  private MetricHistogram flushOutputSizeHisto;
3114  +  private MutableFastCounter flushedMemstoreBytes;
3115  +  private MutableFastCounter flushedOutputBytes;
3116  +
3117  +  // compaction related metrics
3118  +  private MetricHistogram compactionTimeHisto;
3119  +  private MetricHistogram compactionInputFileCountHisto;
3120  +  private MetricHistogram compactionInputSizeHisto;
3121  +  private MetricHistogram compactionOutputFileCountHisto;
3122  +  private MetricHistogram compactionOutputSizeHisto;
3123  +  private MutableFastCounter compactedInputBytes;
3124  +  private MutableFastCounter compactedOutputBytes;
3125  +
3126  +  private MetricHistogram majorCompactionTimeHisto;
3127  +  private MetricHistogram majorCompactionInputFileCountHisto;
3128  +  private MetricHistogram majorCompactionInputSizeHisto;
3129  +  private MetricHistogram majorCompactionOutputFileCountHisto;
3130  +  private MetricHistogram majorCompactionOutputSizeHisto;
3131  +  private MutableFastCounter majorCompactedInputBytes;
3132  +  private MutableFastCounter majorCompactedOutputBytes;
3133
3134
3135 ---
3136
3137 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
3138
3139 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
3140
3141
3142 ---
3143
3144 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
3145
3146 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
3147 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
3148
3149
3150 ---
3151
3152 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
3153
3154 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
3155
3156 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
3157
3158
3159 ---
3160
3161 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
3162
3163 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
3164 Shell commands are as follows:
3165 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3166
3167 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
3168 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
3169 Shell commands are as follows:
3170 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
3171 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
3172 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
3173
3174
3175 ---
3176
3177 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
3178
3179 Change spotbugs version to 3.1.11.
3180
3181
3182 ---
3183
3184 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
3185
3186 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
3187
3188 It also introduces additional info for each recovery queue, which was not accounted by this command before.
3189
3190 The new output for "status 'replication'" command is explained in details below:
3191 a) Source started, target stopped, no edits arrived on source yet:
3192 ...
3193  SOURCE: PeerID=1
3194          Normal Queue: 1
3195            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3196 ...
3197 b) Source started, target stopped, add edit on source:
3198 ...
3199 Normal Queue: 1
3200            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
3201 ...
3202 c) Source started, target stopped, edit added on source, restart source:
3203 ...
3204 SOURCE: PeerID=1
3205          Normal Queue: 1
3206            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3207          Recovered Queue: 1-hbase01.home,16020,1542784524057
3208            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
3209 ...
3210 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
3211 ...
3212 SOURCE: PeerID=1
3213          Normal Queue: 1
3214            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
3215          Recovered Queue: 1-hbase01.home,16020,1542782758742
3216            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
3217 ...
3218 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
3219 ...
3220        SOURCE: PeerID=1
3221          Normal Queue: 1
3222            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
3223 ...
3224 f) Source started, target stopped, add edit on source, restart source, restart target:
3225 ...
3226 SOURCE: PeerID=1
3227          Normal Queue: 1
3228            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
3229 ...
3230
3231
3232 ---
3233
3234 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
3235
3236 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
3237
3238
3239 ---
3240
3241 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
3242
3243 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
3244 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
3245 disable\_exceed\_throttle\_quota
3246 There are two limits when enable exceed throttle quota:
3247 1. Must set at least one read and one write region server throttle quota;
3248 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
3249
3250
3251 ---
3252
3253 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
3254
3255 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
3256
3257
3258 ---
3259
3260 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
3261
3262 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
3263
3264
3265 ---
3266
3267 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
3268
3269 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
3270
3271
3272 ---
3273
3274 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
3275
3276 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
3277
3278 hbase\> help 'scan'
3279
3280
3281 ---
3282
3283 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
3284
3285 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
3286
3287 For example:
3288 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
3289
3290
3291 ---
3292
3293 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
3294
3295 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
3296 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
3297
3298
3299 ---
3300
3301 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
3302
3303 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
3304
3305
3306 ---
3307
3308 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
3309
3310 Make StoppedRpcClientException extend DoNotRetryIOException.
3311
3312
3313 ---
3314
3315 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
3316
3317 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
3318 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
3319
3320
3321 ---
3322
3323 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
3324
3325 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
3326
3327 The effect releases are:
3328 2.1.x: 2.1.2 and below
3329 2.0.x: 2.0.4 and below
3330 1.x: 1.4.x and below
3331
3332 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
3333
3334
3335 ---
3336
3337 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
3338
3339 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
3340
3341
3342
3343 # HBASE  2.3.0 Release Notes
3344
3345 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
3346
3347
3348 ---
3349
3350 * [HBASE-24603](https://issues.apache.org/jira/browse/HBASE-24603) | *Critical* | **Zookeeper sync() call is async**
3351
3352 <!-- markdown -->
3353
3354 Fixes a couple of bugs in ZooKeeper interaction. Firstly, zk sync() call that is used to sync the lagging followers with leader so that the client sees a consistent snapshot state was actually asynchronous under the hood. We make it synchronous for correctness. Second, zookeeper events are now processed in a separate thread rather than doing it in the thread context of zookeeper client connection. This decoupling frees up client connection quickly and avoids deadlocks.
3355
3356
3357 ---
3358
3359 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
3360
3361 <!-- markdown -->
3362 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
3363 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
3364
3365
3366 ---
3367
3368 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
3369
3370 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
3371 The metric is now collected under the mbean for Tables and under the mbean for regions.
3372 Under table mbean ie.-
3373 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
3374 The new metrics will be listed as
3375 {code}
3376     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3377  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
3378 {code}
3379 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
3380 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
3381 {code}
3382
3383 The same one under the region ie.
3384 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
3385 comes as
3386 {code}
3387    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
3388     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
3389 {code}
3390 where
3391 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
3392 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
3393 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
3394
3395
3396 ---
3397
3398 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
3399
3400 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
3401
3402 $hbase rowcounter -h
3403
3404 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
3405 Options:
3406     --starttime=\<arg\>       starting time filter to start counting rows from.
3407     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
3408     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
3409     --expectedCount=\<arg\>   expected number of rows to be count.
3410 For performance, consider the following configuration properties:
3411 -Dhbase.client.scanner.caching=100
3412 -Dmapreduce.map.speculative=false
3413
3414
3415 ---
3416
3417 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
3418
3419 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
3420
3421
3422 ---
3423
3424 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
3425
3426 Adds being able to edit hbase:meta table schema. For example,
3427
3428 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
3429 Updating all regions with the new schema...
3430 All regions updated.
3431 Done.
3432 Took 1.2138 seconds
3433
3434 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
3435
3436
3437 ---
3438
3439 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
3440
3441 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
3442
3443
3444 ---
3445
3446 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
3447
3448 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
3449
3450
3451 ---
3452
3453 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
3454
3455 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
3456
3457
3458 ---
3459
3460 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
3461
3462 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
3463
3464
3465 ---
3466
3467 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
3468
3469 <!-- markdown -->
3470 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
3471 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
3472 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
3473 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
3474 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
3475 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
3476
3477
3478 ---
3479
3480 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
3481
3482 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
3483
3484 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
3485
3486 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
3487
3488 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
3489
3490
3491 ---
3492
3493 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
3494
3495 Added new metric to differentiate sink startup time from last OP applied time.
3496
3497 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
3498
3499 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
3500
3501 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
3502
3503 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
3504
3505
3506 ---
3507
3508 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
3509
3510 <!-- markdown -->
3511 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
3512
3513 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
3514
3515
3516 ---
3517
3518 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
3519
3520 Add backoff. Avoid retrying every 100ms.
3521
3522
3523 ---
3524
3525 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
3526
3527 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
3528
3529 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
3530
3531
3532 ---
3533
3534 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
3535
3536 Introduced a general 'local region' at master side to store the procedure data, etc.
3537
3538 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
3539
3540 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
3541
3542
3543 ---
3544
3545 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
3546
3547 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
3548
3549
3550 ---
3551
3552 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
3553
3554 Config key: hbase.regionserver.slowlog.systable.enabled
3555 Default value: false
3556
3557 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
3558 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
3559
3560 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
3561
3562 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
3563
3564  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
3565  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
3566  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
3567  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
3568                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
3569                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
3570                                                              rics: false
3571  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
3572  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
3573  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
3574  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
3575  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
3576  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
3577  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
3578  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
3579
3580
3581 ---
3582
3583 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
3584
3585 <!-- markdown -->
3586 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
3587
3588
3589 ---
3590
3591 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
3592
3593 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
3594
3595 The request log is disabled by default in conf/log4j.properties by the following lines:
3596
3597 # Disable request log by default, you can enable this by changing the appender
3598 log4j.category.http.requests=INFO,NullAppender
3599 log4j.additivity.http.requests=false
3600
3601 Change the 'NullAppender' to what ever you want if you want to enable request log.
3602
3603 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
3604
3605
3606 ---
3607
3608 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
3609
3610 Use a empty string to represent no column specified for deleteall in shell mode.
3611 useage:
3612 deleteall 'test','r1','',12345
3613 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
3614
3615
3616 ---
3617
3618 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
3619
3620 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
3621
3622
3623 ---
3624
3625 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
3626
3627 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
3628
3629
3630 ---
3631
3632 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
3633
3634 Moved to hbase-thirdparty 3.3.0.
3635
3636
3637 ---
3638
3639 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
3640
3641 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
3642
3643 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
3644
3645 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
3646
3647
3648 ---
3649
3650 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
3651
3652 <!-- markdown -->
3653 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
3654
3655 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
3656
3657
3658 ---
3659
3660 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
3661
3662 New Config: hbase.rpc.rows.size.threshold.reject
3663 -----------------------------------------------------------------------
3664
3665 Default value: false
3666 Description:
3667 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
3668
3669
3670 ---
3671
3672 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
3673
3674 StochasticLoadBalancer functional improvement:
3675
3676 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
3677
3678
3679 ---
3680
3681 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
3682
3683 user or admin can now use
3684 hbase shell \> rename\_rsgroup 'oldname', 'newname'
3685 to rename rsgroup.
3686
3687
3688 ---
3689
3690 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
3691
3692 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
3693
3694 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
3695
3696
3697 ---
3698
3699 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
3700
3701 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
3702
3703
3704 ---
3705
3706 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
3707
3708 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
3709
3710
3711 ---
3712
3713 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
3714
3715 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
3716
3717
3718 ---
3719
3720 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
3721
3722 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
3723
3724
3725 ---
3726
3727 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
3728
3729 <!-- markdown -->
3730 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
3731
3732
3733 ---
3734
3735 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
3736
3737 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
3738
3739 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
3740
3741 For running tests locally, to go faster, up fork count.
3742
3743 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
3744
3745 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
3746
3747
3748 ---
3749
3750 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
3751
3752 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
3753
3754
3755 ---
3756
3757 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
3758
3759 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
3760
3761
3762 ---
3763
3764 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
3765
3766 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
3767
3768
3769 ---
3770
3771 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
3772
3773 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
3774
3775
3776 ---
3777
3778 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
3779
3780 <!-- markdown -->
3781 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
3782
3783 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
3784
3785
3786 ---
3787
3788 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
3789
3790 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
3791
3792
3793 ---
3794
3795 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
3796
3797 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
3798
3799
3800 ---
3801
3802 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
3803
3804 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
3805
3806
3807 ---
3808
3809 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
3810
3811 ColumnFamilyDescriptor new builder API:
3812
3813     /\*\*
3814      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
3815      \* of versions(versionAfterInterval) after that interval elapses.
3816      \*
3817      \* @param retentionInterval Retain all versions for this interval
3818      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
3819      \*/
3820     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
3821         final int retentionInterval, final int versionAfterInterval)
3822
3823
3824 ---
3825
3826 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
3827
3828 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
3829
3830
3831 ---
3832
3833 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
3834
3835 Expose file system level read metrics for RegionServer.
3836
3837 If the HBase RS runs on top of HDFS, calculate the aggregation of
3838 ReadStatistics of each HdfsFileInputStream. These metrics include:
3839 (1) total number of bytes read from HDFS.
3840 (2) total number of bytes read from local DataNode.
3841 (3) total number of bytes read locally through short-circuit read.
3842 (4) total number of bytes read locally through zero-copy read.
3843
3844 Because HDFS ReadStatistics is calculated per input stream, it is not
3845 feasible to update the aggregated number in real time. Instead, the
3846 metrics are updated when an input stream is closed.
3847
3848
3849 ---
3850
3851 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
3852
3853 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
3854
3855 Here is a simple example of script:
3856 {code}
3857 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
3858 #!/bin/bash
3859 namespace=$1
3860 tablename=$2
3861 if [[ $namespace == test ]]; then
3862   echo test
3863 elif [[ $tablename == \*foo\* ]]; then
3864   echo other
3865 else
3866   echo default
3867 fi
3868 {code}
3869
3870
3871 ---
3872
3873 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
3874
3875 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
3876
3877
3878 ---
3879
3880 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
3881
3882 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
3883
3884
3885 ---
3886
3887 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
3888
3889 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
3890
3891 User used to see....
3892
3893   column=table:state, timestamp=1583967620343 .....
3894
3895 ... but now sees:
3896
3897   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
3898
3899
3900 ---
3901
3902 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
3903
3904 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
3905
3906
3907 ---
3908
3909 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
3910
3911 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
3912
3913 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
3914
3915
3916 ---
3917
3918 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
3919
3920 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
3921
3922 New Admin APIs:
3923 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
3924       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
3925
3926 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
3927       throws IOException;
3928
3929 Configs:
3930
3931 1. hbase.regionserver.slowlog.ringbuffer.size:
3932 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
3933
3934 Default
3935 256
3936
3937 2. hbase.regionserver.slowlog.buffer.enabled:
3938 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
3939
3940 Default
3941 false
3942
3943
3944 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
3945
3946
3947 ---
3948
3949 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
3950
3951 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
3952
3953
3954 ---
3955
3956 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
3957
3958 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
3959
3960 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
3961
3962 This is a fluent style API, the code is like:
3963
3964 For Table interface:
3965 {code}
3966 table.checkAndMutate(row, filter).thenPut(put);
3967 {code}
3968
3969 For AsyncTable interface:
3970 {code}
3971 table.checkAndMutate(row, filter).thenPut(put)
3972     .thenAccept(succ -\> {
3973       if (succ) {
3974         System.out.println("Check and put succeeded");
3975       } else {
3976         System.out.println("Check and put failed");
3977       }
3978     });
3979 {code}
3980
3981
3982 ---
3983
3984 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
3985
3986 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
3987
3988
3989 ---
3990
3991 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
3992
3993 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
3994
3995
3996 ---
3997
3998 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
3999
4000     Adds shell command regioninfo:
4001
4002       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
4003       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
4004       Took 0.4737 seconds
4005
4006
4007 ---
4008
4009 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
4010
4011 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
4012
4013 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
4014
4015
4016 ---
4017
4018 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
4019
4020 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
4021
4022 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
4023 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
4024
4025 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
4026
4027
4028 ---
4029
4030 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
4031
4032 <!-- markdown -->
4033 Enables master based registry as the default registry used by clients to fetch connection metadata.
4034 Refer to the section "Master Registry" in the client documentation for more details and advantages
4035 of this implementation over the default Zookeeper based registry.
4036
4037 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
4038
4039 Where to set this: HBase client configuration (hbase-site.xml)
4040
4041 Possible values:
4042 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
4043 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
4044
4045 Notes on defaults:
4046
4047 - For v3.0.0 and later, MasterRegistry is the default registry
4048 - For all releases in 2.x line, ZK based registry is the default.
4049
4050 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
4051
4052 ```
4053 <property>
4054   <name>hbase.client.registry.impl</name>
4055   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
4056 </property>
4057 ```
4058
4059
4060 ---
4061
4062 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
4063
4064 caffeine: 2.6.2 =\> 2.8.1
4065 commons-codec: 1.10 =\> 1.13
4066 commons-io: 2.5 =\> 2.6
4067 disrupter: 3.3.6 =\> 3.4.2
4068 httpcore: 4.4.6 =\> 4.4.13
4069 jackson: 2.9.10 =\> 2.10.1
4070 jackson.databind: 2.9.10.1 =\> 2.10.1
4071 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
4072 protobuf.plugin: 0.5.0 =\> 0.6.1
4073 zookeeper: 3.4.10 =\> 3.4.14
4074 slf4j: 1.7.25 =\> 1.7.30
4075 rat: 0.12 =\> 0.13
4076 asciidoctor: 1.5.5 =\> 1.5.8
4077 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
4078 error-prone: 2.3.3 =\> 2.3.4
4079
4080
4081 ---
4082
4083 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
4084
4085 - Reverts a binary incompatible binary change for ByteRangeUtils
4086 - Usage of reflection inside CommonFSUtils removed
4087
4088
4089 ---
4090
4091 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
4092
4093 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
4094
4095 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
4096
4097
4098 ---
4099
4100 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
4101
4102 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
4103
4104
4105 ---
4106
4107 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
4108
4109 Add a new config to hbase-default.xml
4110
4111   \<property\>
4112     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
4113     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
4114     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
4115     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
4116     called in order, so put the cleaner that prunes the most files in front. To
4117     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
4118     and add the fully qualified class name here. Always add the above
4119     default hfile cleaners in the list as they will be overwritten in
4120     hbase-site.xml.\</description\>
4121   \</property\>
4122
4123 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
4124
4125
4126 ---
4127
4128 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
4129
4130 Updated parent pom to Apache version 22.
4131
4132
4133 ---
4134
4135 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
4136
4137 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
4138
4139 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
4140
4141
4142 ---
4143
4144 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
4145
4146 Add a new feature to improve MTTR which have 3 steps to failover:
4147 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
4148 2. Open region.
4149 3. Bulkload the recovered.hfiles for every column family.
4150
4151 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
4152
4153 Config hbase.wal.split.to.hfile to true to enable this featue.
4154
4155
4156 ---
4157
4158 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
4159
4160 Changed the logging in hbase-zookeeper to use built-in formatting
4161
4162
4163 ---
4164
4165 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
4166
4167 From the PR:
4168
4169 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
4170
4171 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
4172
4173
4174 ---
4175
4176 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
4177
4178 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
4179
4180
4181 ---
4182
4183 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
4184
4185 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
4186
4187
4188 ---
4189
4190 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
4191
4192 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
4193
4194
4195 ---
4196
4197 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
4198
4199 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
4200
4201 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
4202
4203
4204 ---
4205
4206 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
4207
4208 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
4209
4210
4211 ---
4212
4213 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
4214
4215 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
4216 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
4217
4218 Fixed this bug as part of this Jira.
4219 Updated description for corresponding configs:
4220
4221 1. hbase.master.regions.recovery.check.interval :
4222
4223 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4224
4225 2. hbase.regions.recovery.store.file.ref.count :
4226
4227 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4228
4229
4230 ---
4231
4232 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
4233
4234 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
4235
4236
4237 ---
4238
4239 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
4240
4241 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
4242
4243
4244 ---
4245
4246 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
4247
4248 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
4249
4250
4251 ---
4252
4253 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
4254
4255 Bumped surefire plugin to 3.0.0-M4
4256
4257
4258 ---
4259
4260 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
4261
4262 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
4263
4264
4265 ---
4266
4267 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
4268
4269 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
4270 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
4271 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
4272 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
4273 From the shell this can be enabled by using the option per Column Family also by using the below format
4274 {code}
4275 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
4276 {code}
4277
4278
4279 ---
4280
4281 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
4282
4283 <!-- markdown -->
4284
4285 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
4286
4287 ```
4288 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
4289     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
4290 ```
4291
4292 See javadocs of the class `MobRefReporter` for more details.
4293
4294 the reference guide has added some information about MOB internals and troubleshooting.
4295
4296
4297 ---
4298
4299 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
4300
4301 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
4302
4303
4304 ---
4305
4306 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
4307
4308 Fixed unbalanced braces in string representation within HBase shell
4309
4310
4311 ---
4312
4313 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
4314
4315 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
4316 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
4317
4318
4319 ---
4320
4321 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
4322
4323 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
4324
4325
4326 ---
4327
4328 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
4329
4330 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
4331
4332 1. RowFilter
4333 2. ValueFilter
4334 3. QualifierFilter
4335 4. FamilyFilter
4336 5. ColumnValueFilter
4337
4338
4339 ---
4340
4341 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
4342
4343 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
4344
4345
4346 ---
4347
4348 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
4349
4350 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
4351
4352
4353 ---
4354
4355 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
4356
4357 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
4358
4359
4360 ---
4361
4362 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
4363
4364 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
4365
4366
4367 ---
4368
4369 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
4370
4371 <!-- markdown -->
4372 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
4373
4374 Such messages will happen at most once per five minutes.
4375
4376
4377 ---
4378
4379 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
4380
4381 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
4382
4383
4384 ---
4385
4386 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
4387
4388 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
4389
4390
4391 ---
4392
4393 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
4394
4395 <!-- markdown -->
4396
4397 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
4398
4399   - CVE-2019-16942
4400   - CVE-2019-16943
4401
4402 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
4403
4404
4405 ---
4406
4407 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
4408
4409 <!-- markdown -->
4410
4411 The MOB compaction process in the HBase Master now logs more about its activity.
4412
4413 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
4414
4415 Caveats:
4416 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
4417 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
4418 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
4419 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
4420
4421
4422 ---
4423
4424 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
4425
4426 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
4427
4428
4429 ---
4430
4431 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
4432
4433 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
4434
4435 Configs:
4436
4437 1. hbase.master.regions.recovery.check.interval :
4438
4439 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
4440
4441 2. hbase.regions.recovery.store.file.ref.count :
4442
4443 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
4444
4445
4446 ---
4447
4448 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
4449
4450 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
4451
4452 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
4453
4454
4455 ---
4456
4457 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
4458
4459 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
4460
4461
4462 ---
4463
4464 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
4465
4466 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
4467
4468
4469 ---
4470
4471 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
4472
4473 <!-- markdown -->
4474 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
4475
4476 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
4477
4478
4479 ---
4480
4481 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
4482
4483 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
4484
4485
4486 ---
4487
4488 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
4489
4490 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
4491
4492
4493 ---
4494
4495 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
4496
4497 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
4498
4499 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
4500
4501
4502 ---
4503
4504 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
4505
4506 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
4507 \<property\>
4508     \<name\>hbase.bucketcache.ioengine\</name\>
4509     \<value\> pmem:///path in persistent memory \</value\>
4510   \</property\>
4511
4512
4513 ---
4514
4515 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
4516
4517 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
4518 hbase\> snapshot\_cleanup\_switch false
4519
4520 We can re-enable it using:
4521 hbase\> snapshot\_cleanup\_switch true
4522
4523 We can query whether snapshot auto cleanup is enabled for cluster using:
4524 hbase\> snapshot\_cleanup\_enabled
4525
4526
4527 ---
4528
4529 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
4530
4531 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
4532
4533
4534 ---
4535
4536 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
4537
4538 This issue adds via its subtasks:
4539
4540  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
4541  \*\* Master thought this region opened, but no regionserver reported it.
4542  \*\* Master thought this region opened on Server1, but regionserver reported Server2
4543  \*\* More than one regionservers reported opened this region
4544  Both chores can be triggered from the shell to regenerate ‘new’ reports.
4545  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
4546  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
4547  \* Offline replace of hbase.version and hbase.id
4548  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
4549  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
4550  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
4551  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
4552  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
4553  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
4554
4555 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
4556
4557
4558 ---
4559
4560 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
4561
4562 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
4563
4564
4565 ---
4566
4567 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
4568
4569 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
4570
4571
4572 ---
4573
4574 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
4575
4576 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
4577
4578 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
4579
4580 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
4581
4582
4583 ---
4584
4585 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
4586
4587 <!-- markdown -->
4588 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
4589
4590
4591 ---
4592
4593 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
4594
4595 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
4596
4597
4598 ---
4599
4600 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
4601
4602 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
4603
4604
4605 ---
4606
4607 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
4608
4609 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
4610 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
4611
4612
4613 ---
4614
4615 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
4616
4617 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
4618 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
4619 \* TimeRange#until: Represents the time interval [0, maxStamp)
4620 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
4621
4622
4623 ---
4624
4625 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
4626
4627 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
4628 {code}
4629 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
4630 {code}
4631
4632
4633 ---
4634
4635 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
4636
4637 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
4638
4639
4640 ---
4641
4642 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
4643
4644 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
4645
4646 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
4647
4648
4649 ---
4650
4651 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
4652
4653 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
4654
4655
4656 ---
4657
4658 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
4659
4660 New shaded artifact for testing: hbase-shaded-testing-util.
4661
4662
4663 ---
4664
4665 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
4666
4667 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
4668 1. Check HDFS configuration
4669 2. Add master coprocessor:
4670     hbase.coprocessor.master.classes=
4671     “org.apache.hadoop.hbase.security.access.AccessController,
4672 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
4673 3. Enable this feature:
4674     hbase.acl.sync.to.hdfs.enable=true
4675 4. Modify table scheme to enable this feature for a table:
4676     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
4677
4678
4679 ---
4680
4681 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
4682
4683 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
4684
4685 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
4686
4687 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
4688 java.lang.ArrayIndexOutOfBoundsException: 18056
4689         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
4690         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
4691         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
4692         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
4693         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
4694         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
4695         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
4696         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
4697         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
4698
4699 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
4700
4701 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
4702
4703 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
4704
4705
4706 ---
4707
4708 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
4709
4710 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
4711
4712
4713 ---
4714
4715 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
4716
4717 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
4718
4719
4720 ---
4721
4722 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
4723
4724 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
4725
4726 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
4727
4728
4729 ---
4730
4731 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
4732
4733 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
4734
4735
4736 ---
4737
4738 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
4739
4740 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
4741 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
4742
4743
4744 ---
4745
4746 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
4747
4748 1. Add a new chore thread in master to do hbck checking
4749 2. Add a new web ui "HBCK Report" page to display checking results.
4750
4751 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
4752
4753 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
4754
4755
4756 ---
4757
4758 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
4759
4760 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
4761 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
4762
4763
4764 ---
4765
4766 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
4767
4768 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
4769
4770 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
4771
4772 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
4773
4774
4775 ---
4776
4777 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
4778
4779 Add a new master web UI to show the potentially problematic opened regions. There are three case:
4780 1. Master thought this region opened, but no regionserver reported it.
4781 2. Master thought this region opened on Server1, but regionserver reported Server2
4782 3. More than one regionservers reported opened this region
4783
4784
4785 ---
4786
4787 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
4788
4789 Feature: Take a Snapshot With TTL for auto-cleanup
4790
4791 Attribute:
4792 1. TTL
4793      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
4794
4795 Configs:
4796 1. Default Snapshot TTL:
4797      - FOREVER by default
4798      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
4799
4800 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
4801      - hbase.master.cleaner.snapshot.disable: "true"
4802     With this config, HMaster needs restart just like any other hbase-site config.
4803
4804
4805 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
4806
4807
4808 ---
4809
4810 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
4811
4812 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
4813
4814
4815 ---
4816
4817 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
4818
4819 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
4820
4821 This tool is deprecated in 2.x and will be removed in 3.0.
4822
4823
4824 ---
4825
4826 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
4827
4828 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
4829
4830
4831 ---
4832
4833 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
4834
4835 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
4836
4837 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
4838
4839 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
4840
4841
4842 ---
4843
4844 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
4845
4846 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
4847 To use this feature, please make sure the HDFS config is set:
4848 dfs.namenode.acls.enabled=true
4849 fs.permissions.umask-mode=027
4850
4851 and set the HBase config:
4852 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
4853 hbase.user.scan.snapshot.enable=true
4854
4855
4856 ---
4857
4858 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
4859
4860 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4861
4862 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
4863
4864
4865 ---
4866
4867 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
4868
4869 <!-- markdown -->
4870
4871 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
4872
4873
4874 ---
4875
4876 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
4877
4878 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
4879
4880
4881 ---
4882
4883 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
4884
4885 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
4886
4887
4888 ---
4889
4890 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
4891
4892 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
4893
4894
4895 ---
4896
4897 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
4898
4899 The HBase "source checksum" now uses SHA512 instead of MD5.
4900
4901
4902 ---
4903
4904 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
4905
4906 <!-- markdown -->
4907
4908 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
4909
4910 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
4911
4912
4913 ---
4914
4915 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
4916
4917 The access method was used to the HttpServerFunctionalTest class as a common place.
4918
4919
4920 ---
4921
4922 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
4923
4924 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
4925
4926
4927 ---
4928
4929 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
4930
4931 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
4932
4933
4934 ---
4935
4936 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
4937
4938 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
4939
4940
4941 ---
4942
4943 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
4944
4945 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
4946
4947
4948 ---
4949
4950 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
4951
4952 Support get\|set LogLevel in secure(kerberized) environment.
4953
4954
4955 ---
4956
4957 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
4958
4959 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
4960
4961
4962 ---
4963
4964 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
4965
4966 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
4967
4968
4969 ---
4970
4971 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
4972
4973 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
4974
4975
4976 ---
4977
4978 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
4979
4980 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
4981
4982
4983 ---
4984
4985 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
4986
4987 Updated metrics core from 3.2.1 to 3.2.6.
4988
4989
4990 ---
4991
4992 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
4993
4994 The rubocop definition for the maximum method length was set to 75.
4995
4996
4997 ---
4998
4999 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
5000
5001 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
5002
5003
5004 ---
5005
5006 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
5007
5008 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
5009
5010
5011 ---
5012
5013 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
5014
5015 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
5016
5017 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
5018
5019 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
5020
5021 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
5022
5023 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
5024 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
5025
5026 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
5027 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
5028 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
5029
5030
5031 ---
5032
5033 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
5034
5035 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
5036
5037 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
5038
5039
5040 ---
5041
5042 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
5043
5044 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
5045
5046
5047 ---
5048
5049 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
5050
5051 <!-- markdown -->
5052 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
5053
5054 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
5055
5056
5057 ---
5058
5059 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
5060
5061 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
5062
5063 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
5064
5065
5066 ---
5067
5068 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
5069
5070 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
5071
5072
5073 ---
5074
5075 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
5076
5077 <!-- markdown -->
5078
5079 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
5080
5081 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
5082
5083
5084 ---
5085
5086 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
5087
5088 Add below method in Table interface:
5089
5090 RegionLocator getRegionLocator() throws IOException;
5091
5092 Add below methods in AsyncTable interface:
5093
5094 AsyncTableRegionLocator getRegionLocator();
5095 CompletableFuture\<TableDescriptor\> getDescriptor();
5096
5097
5098 ---
5099
5100 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
5101
5102 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
5103
5104 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
5105
5106 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
5107
5108
5109 ---
5110
5111 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
5112
5113 Introduced
5114
5115 Future\<Void\> createTableAsync(TableDescriptor);
5116
5117
5118 ---
5119
5120 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
5121
5122 Introduced these methods:
5123 void move(byte[]);
5124 void move(byte[], ServerName);
5125 Future\<Void\> splitRegionAsync(byte[]);
5126
5127 These methods are deprecated:
5128 void move(byte[], byte[])
5129
5130
5131 ---
5132
5133 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
5134
5135 Add a new jenkins file for running pre commit check for GitHub PR.
5136
5137
5138 ---
5139
5140 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
5141
5142 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
5143
5144
5145 ---
5146
5147 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
5148
5149 When insufficient permissions, you now get:
5150
5151 HTTP/1.1 403 Forbidden
5152
5153 on the HTTP side, and in the message
5154
5155 Forbidden
5156 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
5157 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
5158 and the rest of the ADE stack
5159
5160
5161 ---
5162
5163 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
5164
5165 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
5166
5167
5168 ---
5169
5170 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
5171
5172 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
5173
5174
5175 ---
5176
5177 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
5178
5179 <!-- markdown -->
5180 Fixed awkward dependency issue that prevented site building.
5181
5182 #### note specific to HBase 2.1.4
5183 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
5184 ```
5185 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
5186 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
5187         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
5188         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
5189         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
5190         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
5191         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
5192         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
5193         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
5194         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
5195         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
5196         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
5197         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
5198         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
5199         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
5200         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
5201         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
5202         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
5203         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
5204         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
5205         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
5206         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
5207         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
5208         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
5209         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
5210         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
5211         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
5212         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
5213 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
5214         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
5215         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
5216         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
5217         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
5218         ... 26 more
5219
5220 ```
5221
5222 Workaround via any _one_ of the following:
5223 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
5224 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
5225 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
5226 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
5227 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
5228
5229
5230 ---
5231
5232 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
5233
5234 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
5235
5236
5237 ---
5238
5239 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
5240
5241 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
5242
5243
5244 ---
5245
5246 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
5247
5248 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
5249
5250 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
5251
5252
5253 ---
5254
5255 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
5256
5257 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
5258
5259
5260 ---
5261
5262 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
5263
5264 <!-- markdown -->
5265
5266 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
5267
5268 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
5269
5270
5271 ---
5272
5273 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
5274
5275 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
5276
5277
5278 ---
5279
5280 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
5281
5282 Add a cloneSnapshotAsync method with restoreAcl parameter.
5283 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
5284 Make snapshotAsync method returns a Future\<Void\>.
5285 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
5286 Use default methods to reduce the code base for implementation classes.
5287
5288
5289 ---
5290
5291 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
5292
5293 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
5294
5295
5296 ---
5297
5298 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
5299
5300 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
5301 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
5302
5303 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
5304
5305 For example:
5306 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
5307
5308
5309 ---
5310
5311 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
5312
5313 Adds below flush, split, and compaction metrics
5314
5315  +  // split related metrics
5316  +  private MutableFastCounter splitRequest;
5317  +  private MutableFastCounter splitSuccess;
5318  +  private MetricHistogram splitTimeHisto;
5319  +
5320  +  // flush related metrics
5321  +  private MetricHistogram flushTimeHisto;
5322  +  private MetricHistogram flushMemstoreSizeHisto;
5323  +  private MetricHistogram flushOutputSizeHisto;
5324  +  private MutableFastCounter flushedMemstoreBytes;
5325  +  private MutableFastCounter flushedOutputBytes;
5326  +
5327  +  // compaction related metrics
5328  +  private MetricHistogram compactionTimeHisto;
5329  +  private MetricHistogram compactionInputFileCountHisto;
5330  +  private MetricHistogram compactionInputSizeHisto;
5331  +  private MetricHistogram compactionOutputFileCountHisto;
5332  +  private MetricHistogram compactionOutputSizeHisto;
5333  +  private MutableFastCounter compactedInputBytes;
5334  +  private MutableFastCounter compactedOutputBytes;
5335  +
5336  +  private MetricHistogram majorCompactionTimeHisto;
5337  +  private MetricHistogram majorCompactionInputFileCountHisto;
5338  +  private MetricHistogram majorCompactionInputSizeHisto;
5339  +  private MetricHistogram majorCompactionOutputFileCountHisto;
5340  +  private MetricHistogram majorCompactionOutputSizeHisto;
5341  +  private MutableFastCounter majorCompactedInputBytes;
5342  +  private MutableFastCounter majorCompactedOutputBytes;
5343
5344
5345 ---
5346
5347 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
5348
5349 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
5350
5351
5352 ---
5353
5354 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
5355
5356 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
5357 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
5358
5359
5360 ---
5361
5362 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
5363
5364 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
5365
5366 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
5367
5368
5369 ---
5370
5371 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
5372
5373 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
5374 Shell commands are as follows:
5375 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5376
5377 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
5378 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
5379 Shell commands are as follows:
5380 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
5381 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
5382 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
5383
5384
5385 ---
5386
5387 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
5388
5389 Change spotbugs version to 3.1.11.
5390
5391
5392 ---
5393
5394 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
5395
5396 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
5397
5398 It also introduces additional info for each recovery queue, which was not accounted by this command before.
5399
5400 The new output for "status 'replication'" command is explained in details below:
5401 a) Source started, target stopped, no edits arrived on source yet:
5402 ...
5403  SOURCE: PeerID=1
5404          Normal Queue: 1
5405            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5406 ...
5407 b) Source started, target stopped, add edit on source:
5408 ...
5409 Normal Queue: 1
5410            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
5411 ...
5412 c) Source started, target stopped, edit added on source, restart source:
5413 ...
5414 SOURCE: PeerID=1
5415          Normal Queue: 1
5416            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5417          Recovered Queue: 1-hbase01.home,16020,1542784524057
5418            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
5419 ...
5420 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
5421 ...
5422 SOURCE: PeerID=1
5423          Normal Queue: 1
5424            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
5425          Recovered Queue: 1-hbase01.home,16020,1542782758742
5426            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
5427 ...
5428 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
5429 ...
5430        SOURCE: PeerID=1
5431          Normal Queue: 1
5432            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
5433 ...
5434 f) Source started, target stopped, add edit on source, restart source, restart target:
5435 ...
5436 SOURCE: PeerID=1
5437          Normal Queue: 1
5438            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
5439 ...
5440
5441
5442 ---
5443
5444 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
5445
5446 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
5447
5448
5449 ---
5450
5451 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
5452
5453 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
5454 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
5455 disable\_exceed\_throttle\_quota
5456 There are two limits when enable exceed throttle quota:
5457 1. Must set at least one read and one write region server throttle quota;
5458 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
5459
5460
5461 ---
5462
5463 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
5464
5465 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
5466
5467
5468 ---
5469
5470 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
5471
5472 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
5473
5474
5475 ---
5476
5477 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
5478
5479 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
5480
5481
5482 ---
5483
5484 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
5485
5486 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
5487
5488 hbase\> help 'scan'
5489
5490
5491 ---
5492
5493 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
5494
5495 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
5496
5497 For example:
5498 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
5499
5500
5501 ---
5502
5503 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
5504
5505 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
5506 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
5507
5508
5509 ---
5510
5511 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
5512
5513 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
5514
5515
5516 ---
5517
5518 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
5519
5520 Make StoppedRpcClientException extend DoNotRetryIOException.
5521
5522
5523 ---
5524
5525 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
5526
5527 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
5528 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
5529
5530
5531 ---
5532
5533 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
5534
5535 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
5536
5537 The effect releases are:
5538 2.1.x: 2.1.2 and below
5539 2.0.x: 2.0.4 and below
5540 1.x: 1.4.x and below
5541
5542 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
5543
5544
5545 ---
5546
5547 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
5548
5549 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
5550
5551
5552
5553 # HBASE  2.3.0 Release Notes
5554
5555 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
5556
5557
5558 ---
5559
5560 * [HBASE-24631](https://issues.apache.org/jira/browse/HBASE-24631) | *Major* | **Loosen Dockerfile pinned package versions of the "debian-revision"**
5561
5562 <!-- markdown -->
5563 Update our package version numbers throughout the Dockerfiles to be pinned to their epic:upstream-version components only. Previously we'd specify the full debian package version number, including the debian-revision. This lead to instability as debian packaging details changed.
5564 See also [man deb-version](http://manpages.ubuntu.com/manpages/xenial/en/man5/deb-version.5.html)
5565
5566
5567 ---
5568
5569 * [HBASE-24205](https://issues.apache.org/jira/browse/HBASE-24205) | *Major* | **Create metric to know the number of reads that happens from memstore**
5570
5571 Adds a new metric where we collect the number of read requests (tracked per row) whether the row was fetched completely from memstore or it was pulled from files  and memstore.
5572 The metric is now collected under the mbean for Tables and under the mbean for regions.
5573 Under table mbean ie.-
5574 'name": "Hadoop:service=HBase,name=RegionServer,sub=Tables'
5575 The new metrics will be listed as
5576 {code}
5577     "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5578  "Namespace\_default\_table\_t3\_columnfamily\_f1\_metric\_mixedRowReadsCount": 1,
5579 {code}
5580 Where the format is Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_memstoreOnlyRowReadsCount
5581 Namespace\_\<namespacename\>\_table\_\<tableName\>\_columnfamily\_\<columnfamilyname\>\_metric\_mixedRowReadsCount
5582 {code}
5583
5584 The same one under the region ie.
5585 "name": "Hadoop:service=HBase,name=RegionServer,sub=Regions",
5586 comes as
5587 {code}
5588    "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_memstoreOnlyRowReadsCount": 5,
5589     "Namespace\_default\_table\_t3\_region\_75a7846f4ac4a2805071a855f7d0dbdc\_store\_f1\_metric\_mixedRowReadsCount": 1,
5590 {code}
5591 where
5592 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_memstoreOnlyRowReadsCount
5593 Namespace\_\<namespacename\_table\_\<tableName\>\_region\_\<regionName\>\_store\_\<storeName\>\_metric\_mixedRowReadsCount
5594 This is also an aggregate against every store the number of reads that happened purely from the memstore or it was a  mixed read that happened from memstore and file.
5595
5596
5597 ---
5598
5599 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
5600
5601 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
5602
5603 $hbase rowcounter -h
5604
5605 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
5606 Options:
5607     --starttime=\<arg\>       starting time filter to start counting rows from.
5608     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
5609     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
5610     --expectedCount=\<arg\>   expected number of rows to be count.
5611 For performance, consider the following configuration properties:
5612 -Dhbase.client.scanner.caching=100
5613 -Dmapreduce.map.speculative=false
5614
5615
5616 ---
5617
5618 * [HBASE-24217](https://issues.apache.org/jira/browse/HBASE-24217) | *Major* | **Add hadoop 3.2.x support**
5619
5620 CI coverage has been extended to include Hadoop 3.2.x for HBase 2.2+.
5621
5622
5623 ---
5624
5625 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
5626
5627 Adds being able to edit hbase:meta table schema. For example,
5628
5629 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
5630 Updating all regions with the new schema...
5631 All regions updated.
5632 Done.
5633 Took 1.2138 seconds
5634
5635 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
5636
5637
5638 ---
5639
5640 * [HBASE-15161](https://issues.apache.org/jira/browse/HBASE-15161) | *Major* | **Umbrella: Miscellaneous improvements from production usage**
5641
5642 This ticket summarizes significant improvements and expansion to the metrics surface area. Interested users should review the individual sub-tasks.
5643
5644
5645 ---
5646
5647 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
5648
5649 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
5650
5651
5652 ---
5653
5654 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
5655
5656 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
5657
5658
5659 ---
5660
5661 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
5662
5663 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
5664
5665
5666 ---
5667
5668 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
5669
5670 <!-- markdown -->
5671 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
5672 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
5673 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
5674 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
5675 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
5676 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
5677
5678
5679 ---
5680
5681 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
5682
5683 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
5684
5685 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
5686
5687 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
5688
5689 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
5690
5691
5692 ---
5693
5694 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
5695
5696 Added new metric to differentiate sink startup time from last OP applied time.
5697
5698 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
5699
5700 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
5701
5702 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
5703
5704 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
5705
5706
5707 ---
5708
5709 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
5710
5711 <!-- markdown -->
5712 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
5713
5714 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
5715
5716
5717 ---
5718
5719 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
5720
5721 Add backoff. Avoid retrying every 100ms.
5722
5723
5724 ---
5725
5726 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
5727
5728 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
5729
5730 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
5731
5732
5733 ---
5734
5735 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
5736
5737 Introduced a general 'local region' at master side to store the procedure data, etc.
5738
5739 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
5740
5741 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
5742
5743
5744 ---
5745
5746 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
5747
5748 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
5749
5750
5751 ---
5752
5753 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
5754
5755 Config key: hbase.regionserver.slowlog.systable.enabled
5756 Default value: false
5757
5758 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
5759 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
5760
5761 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
5762
5763 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
5764
5765  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
5766  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
5767  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
5768  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
5769                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
5770                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
5771                                                              rics: false
5772  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
5773  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
5774  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
5775  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
5776  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
5777  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
5778  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
5779  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
5780
5781
5782 ---
5783
5784 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
5785
5786 <!-- markdown -->
5787 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
5788
5789
5790 ---
5791
5792 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
5793
5794 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
5795
5796 The request log is disabled by default in conf/log4j.properties by the following lines:
5797
5798 # Disable request log by default, you can enable this by changing the appender
5799 log4j.category.http.requests=INFO,NullAppender
5800 log4j.additivity.http.requests=false
5801
5802 Change the 'NullAppender' to what ever you want if you want to enable request log.
5803
5804 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
5805
5806
5807 ---
5808
5809 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
5810
5811 Use a empty string to represent no column specified for deleteall in shell mode.
5812 useage:
5813 deleteall 'test','r1','',12345
5814 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
5815
5816
5817 ---
5818
5819 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
5820
5821 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
5822
5823
5824 ---
5825
5826 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
5827
5828 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
5829
5830
5831 ---
5832
5833 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
5834
5835 Moved to hbase-thirdparty 3.3.0.
5836
5837
5838 ---
5839
5840 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
5841
5842 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
5843
5844 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
5845
5846 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
5847
5848
5849 ---
5850
5851 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
5852
5853 <!-- markdown -->
5854 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
5855
5856 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
5857
5858
5859 ---
5860
5861 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
5862
5863 New Config: hbase.rpc.rows.size.threshold.reject
5864 -----------------------------------------------------------------------
5865
5866 Default value: false
5867 Description:
5868 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
5869
5870
5871 ---
5872
5873 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
5874
5875 StochasticLoadBalancer functional improvement:
5876
5877 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
5878
5879
5880 ---
5881
5882 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
5883
5884 user or admin can now use
5885 hbase shell \> rename\_rsgroup 'oldname', 'newname'
5886 to rename rsgroup.
5887
5888
5889 ---
5890
5891 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
5892
5893 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
5894
5895 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
5896
5897
5898 ---
5899
5900 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
5901
5902 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
5903
5904
5905 ---
5906
5907 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
5908
5909 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
5910
5911
5912 ---
5913
5914 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
5915
5916 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
5917
5918
5919 ---
5920
5921 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
5922
5923 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
5924
5925
5926 ---
5927
5928 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
5929
5930 <!-- markdown -->
5931 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
5932
5933
5934 ---
5935
5936 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
5937
5938 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
5939
5940 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
5941
5942 For running tests locally, to go faster, up fork count.
5943
5944 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
5945
5946 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
5947
5948
5949 ---
5950
5951 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
5952
5953 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
5954
5955
5956 ---
5957
5958 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
5959
5960 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
5961
5962
5963 ---
5964
5965 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
5966
5967 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
5968
5969
5970 ---
5971
5972 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
5973
5974 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
5975
5976
5977 ---
5978
5979 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
5980
5981 <!-- markdown -->
5982 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
5983
5984 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
5985
5986
5987 ---
5988
5989 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
5990
5991 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
5992
5993
5994 ---
5995
5996 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
5997
5998 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
5999
6000
6001 ---
6002
6003 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
6004
6005 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
6006
6007
6008 ---
6009
6010 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
6011
6012 ColumnFamilyDescriptor new builder API:
6013
6014     /\*\*
6015      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
6016      \* of versions(versionAfterInterval) after that interval elapses.
6017      \*
6018      \* @param retentionInterval Retain all versions for this interval
6019      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
6020      \*/
6021     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
6022         final int retentionInterval, final int versionAfterInterval)
6023
6024
6025 ---
6026
6027 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
6028
6029 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
6030
6031
6032 ---
6033
6034 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
6035
6036 Expose file system level read metrics for RegionServer.
6037
6038 If the HBase RS runs on top of HDFS, calculate the aggregation of
6039 ReadStatistics of each HdfsFileInputStream. These metrics include:
6040 (1) total number of bytes read from HDFS.
6041 (2) total number of bytes read from local DataNode.
6042 (3) total number of bytes read locally through short-circuit read.
6043 (4) total number of bytes read locally through zero-copy read.
6044
6045 Because HDFS ReadStatistics is calculated per input stream, it is not
6046 feasible to update the aggregated number in real time. Instead, the
6047 metrics are updated when an input stream is closed.
6048
6049
6050 ---
6051
6052 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
6053
6054 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
6055
6056 Here is a simple example of script:
6057 {code}
6058 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
6059 #!/bin/bash
6060 namespace=$1
6061 tablename=$2
6062 if [[ $namespace == test ]]; then
6063   echo test
6064 elif [[ $tablename == \*foo\* ]]; then
6065   echo other
6066 else
6067   echo default
6068 fi
6069 {code}
6070
6071
6072 ---
6073
6074 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
6075
6076 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
6077
6078
6079 ---
6080
6081 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
6082
6083 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
6084
6085
6086 ---
6087
6088 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
6089
6090 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
6091
6092 User used to see....
6093
6094   column=table:state, timestamp=1583967620343 .....
6095
6096 ... but now sees:
6097
6098   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
6099
6100
6101 ---
6102
6103 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
6104
6105 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
6106
6107
6108 ---
6109
6110 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
6111
6112 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
6113
6114 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
6115
6116
6117 ---
6118
6119 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
6120
6121 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
6122
6123 New Admin APIs:
6124 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
6125       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
6126
6127 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
6128       throws IOException;
6129
6130 Configs:
6131
6132 1. hbase.regionserver.slowlog.ringbuffer.size:
6133 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
6134
6135 Default
6136 256
6137
6138 2. hbase.regionserver.slowlog.buffer.enabled:
6139 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
6140
6141 Default
6142 false
6143
6144
6145 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
6146
6147
6148 ---
6149
6150 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
6151
6152 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
6153
6154
6155 ---
6156
6157 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
6158
6159 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
6160
6161 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
6162
6163 This is a fluent style API, the code is like:
6164
6165 For Table interface:
6166 {code}
6167 table.checkAndMutate(row, filter).thenPut(put);
6168 {code}
6169
6170 For AsyncTable interface:
6171 {code}
6172 table.checkAndMutate(row, filter).thenPut(put)
6173     .thenAccept(succ -\> {
6174       if (succ) {
6175         System.out.println("Check and put succeeded");
6176       } else {
6177         System.out.println("Check and put failed");
6178       }
6179     });
6180 {code}
6181
6182
6183 ---
6184
6185 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
6186
6187 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
6188
6189
6190 ---
6191
6192 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
6193
6194 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
6195
6196
6197 ---
6198
6199 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
6200
6201     Adds shell command regioninfo:
6202
6203       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
6204       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
6205       Took 0.4737 seconds
6206
6207
6208 ---
6209
6210 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
6211
6212 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
6213
6214 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
6215
6216
6217 ---
6218
6219 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
6220
6221 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
6222
6223 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
6224 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
6225
6226 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
6227
6228
6229 ---
6230
6231 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
6232
6233 <!-- markdown -->
6234 Enables master based registry as the default registry used by clients to fetch connection metadata.
6235 Refer to the section "Master Registry" in the client documentation for more details and advantages
6236 of this implementation over the default Zookeeper based registry.
6237
6238 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
6239
6240 Where to set this: HBase client configuration (hbase-site.xml)
6241
6242 Possible values:
6243 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
6244 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
6245
6246 Notes on defaults:
6247
6248 - For v3.0.0 and later, MasterRegistry is the default registry
6249 - For all releases in 2.x line, ZK based registry is the default.
6250
6251 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
6252
6253 ```
6254 <property>
6255   <name>hbase.client.registry.impl</name>
6256   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
6257 </property>
6258 ```
6259
6260
6261 ---
6262
6263 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
6264
6265 caffeine: 2.6.2 =\> 2.8.1
6266 commons-codec: 1.10 =\> 1.13
6267 commons-io: 2.5 =\> 2.6
6268 disrupter: 3.3.6 =\> 3.4.2
6269 httpcore: 4.4.6 =\> 4.4.13
6270 jackson: 2.9.10 =\> 2.10.1
6271 jackson.databind: 2.9.10.1 =\> 2.10.1
6272 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
6273 protobuf.plugin: 0.5.0 =\> 0.6.1
6274 zookeeper: 3.4.10 =\> 3.4.14
6275 slf4j: 1.7.25 =\> 1.7.30
6276 rat: 0.12 =\> 0.13
6277 asciidoctor: 1.5.5 =\> 1.5.8
6278 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
6279 error-prone: 2.3.3 =\> 2.3.4
6280
6281
6282 ---
6283
6284 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
6285
6286 - Reverts a binary incompatible binary change for ByteRangeUtils
6287 - Usage of reflection inside CommonFSUtils removed
6288
6289
6290 ---
6291
6292 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
6293
6294 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
6295
6296 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
6297
6298
6299 ---
6300
6301 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
6302
6303 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
6304
6305
6306 ---
6307
6308 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
6309
6310 Add a new config to hbase-default.xml
6311
6312   \<property\>
6313     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
6314     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
6315     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
6316     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
6317     called in order, so put the cleaner that prunes the most files in front. To
6318     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
6319     and add the fully qualified class name here. Always add the above
6320     default hfile cleaners in the list as they will be overwritten in
6321     hbase-site.xml.\</description\>
6322   \</property\>
6323
6324 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
6325
6326
6327 ---
6328
6329 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
6330
6331 Updated parent pom to Apache version 22.
6332
6333
6334 ---
6335
6336 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
6337
6338 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
6339
6340 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
6341
6342
6343 ---
6344
6345 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
6346
6347 Add a new feature to improve MTTR which have 3 steps to failover:
6348 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
6349 2. Open region.
6350 3. Bulkload the recovered.hfiles for every column family.
6351
6352 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
6353
6354 Config hbase.wal.split.to.hfile to true to enable this featue.
6355
6356
6357 ---
6358
6359 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
6360
6361 Changed the logging in hbase-zookeeper to use built-in formatting
6362
6363
6364 ---
6365
6366 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
6367
6368 From the PR:
6369
6370 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
6371
6372 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
6373
6374
6375 ---
6376
6377 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
6378
6379 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
6380
6381
6382 ---
6383
6384 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
6385
6386 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
6387
6388
6389 ---
6390
6391 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
6392
6393 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
6394
6395
6396 ---
6397
6398 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
6399
6400 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
6401
6402 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
6403
6404
6405 ---
6406
6407 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
6408
6409 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
6410
6411
6412 ---
6413
6414 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
6415
6416 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
6417 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
6418
6419 Fixed this bug as part of this Jira.
6420 Updated description for corresponding configs:
6421
6422 1. hbase.master.regions.recovery.check.interval :
6423
6424 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6425
6426 2. hbase.regions.recovery.store.file.ref.count :
6427
6428 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6429
6430
6431 ---
6432
6433 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
6434
6435 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
6436
6437
6438 ---
6439
6440 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
6441
6442 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
6443
6444
6445 ---
6446
6447 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
6448
6449 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
6450
6451
6452 ---
6453
6454 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
6455
6456 Bumped surefire plugin to 3.0.0-M4
6457
6458
6459 ---
6460
6461 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
6462
6463 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
6464
6465
6466 ---
6467
6468 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
6469
6470 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
6471 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
6472 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
6473 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
6474 From the shell this can be enabled by using the option per Column Family also by using the below format
6475 {code}
6476 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
6477 {code}
6478
6479
6480 ---
6481
6482 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
6483
6484 <!-- markdown -->
6485
6486 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
6487
6488 ```
6489 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
6490     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
6491 ```
6492
6493 See javadocs of the class `MobRefReporter` for more details.
6494
6495 the reference guide has added some information about MOB internals and troubleshooting.
6496
6497
6498 ---
6499
6500 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
6501
6502 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
6503
6504
6505 ---
6506
6507 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
6508
6509 Fixed unbalanced braces in string representation within HBase shell
6510
6511
6512 ---
6513
6514 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
6515
6516 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
6517 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
6518
6519
6520 ---
6521
6522 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
6523
6524 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
6525
6526
6527 ---
6528
6529 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
6530
6531 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
6532
6533 1. RowFilter
6534 2. ValueFilter
6535 3. QualifierFilter
6536 4. FamilyFilter
6537 5. ColumnValueFilter
6538
6539
6540 ---
6541
6542 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
6543
6544 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
6545
6546
6547 ---
6548
6549 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
6550
6551 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
6552
6553
6554 ---
6555
6556 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
6557
6558 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
6559
6560
6561 ---
6562
6563 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
6564
6565 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
6566
6567
6568 ---
6569
6570 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
6571
6572 <!-- markdown -->
6573 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
6574
6575 Such messages will happen at most once per five minutes.
6576
6577
6578 ---
6579
6580 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
6581
6582 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
6583
6584
6585 ---
6586
6587 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
6588
6589 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
6590
6591
6592 ---
6593
6594 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
6595
6596 <!-- markdown -->
6597
6598 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
6599
6600   - CVE-2019-16942
6601   - CVE-2019-16943
6602
6603 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
6604
6605
6606 ---
6607
6608 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
6609
6610 <!-- markdown -->
6611
6612 The MOB compaction process in the HBase Master now logs more about its activity.
6613
6614 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
6615
6616 Caveats:
6617 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
6618 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
6619 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
6620 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
6621
6622
6623 ---
6624
6625 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
6626
6627 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
6628
6629
6630 ---
6631
6632 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
6633
6634 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
6635
6636 Configs:
6637
6638 1. hbase.master.regions.recovery.check.interval :
6639
6640 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
6641
6642 2. hbase.regions.recovery.store.file.ref.count :
6643
6644 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
6645
6646
6647 ---
6648
6649 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
6650
6651 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
6652
6653 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
6654
6655
6656 ---
6657
6658 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
6659
6660 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
6661
6662
6663 ---
6664
6665 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
6666
6667 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
6668
6669
6670 ---
6671
6672 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
6673
6674 <!-- markdown -->
6675 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
6676
6677 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
6678
6679
6680 ---
6681
6682 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
6683
6684 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
6685
6686
6687 ---
6688
6689 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
6690
6691 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
6692
6693
6694 ---
6695
6696 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
6697
6698 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
6699
6700 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
6701
6702
6703 ---
6704
6705 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
6706
6707 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
6708 \<property\>
6709     \<name\>hbase.bucketcache.ioengine\</name\>
6710     \<value\> pmem:///path in persistent memory \</value\>
6711   \</property\>
6712
6713
6714 ---
6715
6716 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
6717
6718 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
6719 hbase\> snapshot\_cleanup\_switch false
6720
6721 We can re-enable it using:
6722 hbase\> snapshot\_cleanup\_switch true
6723
6724 We can query whether snapshot auto cleanup is enabled for cluster using:
6725 hbase\> snapshot\_cleanup\_enabled
6726
6727
6728 ---
6729
6730 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
6731
6732 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
6733
6734
6735 ---
6736
6737 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
6738
6739 This issue adds via its subtasks:
6740
6741  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
6742  \*\* Master thought this region opened, but no regionserver reported it.
6743  \*\* Master thought this region opened on Server1, but regionserver reported Server2
6744  \*\* More than one regionservers reported opened this region
6745  Both chores can be triggered from the shell to regenerate ‘new’ reports.
6746  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
6747  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
6748  \* Offline replace of hbase.version and hbase.id
6749  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
6750  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
6751  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
6752  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
6753  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
6754  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
6755
6756 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
6757
6758
6759 ---
6760
6761 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
6762
6763 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
6764
6765
6766 ---
6767
6768 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
6769
6770 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
6771
6772
6773 ---
6774
6775 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
6776
6777 Before this issue, read path was 100% offheap when block is in the BucketCache. But if a cache miss, then the RS needs to read the block via an on-heap API which causes high young-GC pressure.
6778
6779 This issue adds reading the block via offheap even if reading the block from filesystem directly.  It requires hadoop version(\>=2.9.3) but can also work with older hadoop versions (all works but we continue to read block onheap). It also requires HBASE-21946 which is not yet in place as of this writing/hbase-2.3.0.
6780
6781 We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex
6782
6783
6784 ---
6785
6786 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
6787
6788 <!-- markdown -->
6789 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
6790
6791
6792 ---
6793
6794 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
6795
6796 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
6797
6798
6799 ---
6800
6801 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
6802
6803 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
6804
6805
6806 ---
6807
6808 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
6809
6810 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
6811 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
6812
6813
6814 ---
6815
6816 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
6817
6818 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
6819 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
6820 \* TimeRange#until: Represents the time interval [0, maxStamp)
6821 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
6822
6823
6824 ---
6825
6826 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
6827
6828 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
6829 {code}
6830 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
6831 {code}
6832
6833
6834 ---
6835
6836 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
6837
6838 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
6839
6840
6841 ---
6842
6843 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
6844
6845 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
6846
6847 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
6848
6849
6850 ---
6851
6852 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
6853
6854 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
6855
6856
6857 ---
6858
6859 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
6860
6861 New shaded artifact for testing: hbase-shaded-testing-util.
6862
6863
6864 ---
6865
6866 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
6867
6868 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
6869 1. Check HDFS configuration
6870 2. Add master coprocessor:
6871     hbase.coprocessor.master.classes=
6872     “org.apache.hadoop.hbase.security.access.AccessController,
6873 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
6874 3. Enable this feature:
6875     hbase.acl.sync.to.hdfs.enable=true
6876 4. Modify table scheme to enable this feature for a table:
6877     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
6878
6879
6880 ---
6881
6882 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
6883
6884 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
6885
6886 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
6887
6888 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
6889 java.lang.ArrayIndexOutOfBoundsException: 18056
6890         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
6891         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
6892         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
6893         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
6894         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
6895         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
6896         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
6897         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
6898         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
6899
6900 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
6901
6902 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
6903
6904 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
6905
6906
6907 ---
6908
6909 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
6910
6911 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
6912
6913
6914 ---
6915
6916 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
6917
6918 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
6919
6920
6921 ---
6922
6923 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
6924
6925 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
6926
6927 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
6928
6929
6930 ---
6931
6932 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
6933
6934 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
6935
6936
6937 ---
6938
6939 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
6940
6941 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
6942 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
6943
6944
6945 ---
6946
6947 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
6948
6949 1. Add a new chore thread in master to do hbck checking
6950 2. Add a new web ui "HBCK Report" page to display checking results.
6951
6952 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
6953
6954 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
6955
6956
6957 ---
6958
6959 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
6960
6961 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
6962 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
6963
6964
6965 ---
6966
6967 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
6968
6969 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
6970
6971 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
6972
6973 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
6974
6975
6976 ---
6977
6978 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
6979
6980 Add a new master web UI to show the potentially problematic opened regions. There are three case:
6981 1. Master thought this region opened, but no regionserver reported it.
6982 2. Master thought this region opened on Server1, but regionserver reported Server2
6983 3. More than one regionservers reported opened this region
6984
6985
6986 ---
6987
6988 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
6989
6990 Feature: Take a Snapshot With TTL for auto-cleanup
6991
6992 Attribute:
6993 1. TTL
6994      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
6995
6996 Configs:
6997 1. Default Snapshot TTL:
6998      - FOREVER by default
6999      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
7000
7001 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
7002      - hbase.master.cleaner.snapshot.disable: "true"
7003     With this config, HMaster needs restart just like any other hbase-site config.
7004
7005
7006 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
7007
7008
7009 ---
7010
7011 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
7012
7013 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
7014
7015
7016 ---
7017
7018 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
7019
7020 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
7021
7022 This tool is deprecated in 2.x and will be removed in 3.0.
7023
7024
7025 ---
7026
7027 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
7028
7029 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
7030
7031
7032 ---
7033
7034 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
7035
7036 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
7037
7038 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
7039
7040 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
7041
7042
7043 ---
7044
7045 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
7046
7047 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
7048 To use this feature, please make sure the HDFS config is set:
7049 dfs.namenode.acls.enabled=true
7050 fs.permissions.umask-mode=027
7051
7052 and set the HBase config:
7053 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
7054 hbase.user.scan.snapshot.enable=true
7055
7056
7057 ---
7058
7059 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
7060
7061 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
7062
7063 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
7064
7065
7066 ---
7067
7068 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
7069
7070 <!-- markdown -->
7071
7072 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
7073
7074
7075 ---
7076
7077 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
7078
7079 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
7080
7081
7082 ---
7083
7084 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
7085
7086 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
7087
7088
7089 ---
7090
7091 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
7092
7093 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
7094
7095
7096 ---
7097
7098 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
7099
7100 The HBase "source checksum" now uses SHA512 instead of MD5.
7101
7102
7103 ---
7104
7105 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
7106
7107 <!-- markdown -->
7108
7109 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
7110
7111 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
7112
7113
7114 ---
7115
7116 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
7117
7118 The access method was used to the HttpServerFunctionalTest class as a common place.
7119
7120
7121 ---
7122
7123 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
7124
7125 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
7126
7127
7128 ---
7129
7130 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
7131
7132 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
7133
7134
7135 ---
7136
7137 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
7138
7139 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
7140
7141
7142 ---
7143
7144 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
7145
7146 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
7147
7148
7149 ---
7150
7151 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
7152
7153 Support get\|set LogLevel in secure(kerberized) environment.
7154
7155
7156 ---
7157
7158 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
7159
7160 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
7161
7162
7163 ---
7164
7165 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
7166
7167 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
7168
7169
7170 ---
7171
7172 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
7173
7174 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
7175
7176
7177 ---
7178
7179 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
7180
7181 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
7182
7183
7184 ---
7185
7186 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
7187
7188 Updated metrics core from 3.2.1 to 3.2.6.
7189
7190
7191 ---
7192
7193 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
7194
7195 The rubocop definition for the maximum method length was set to 75.
7196
7197
7198 ---
7199
7200 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
7201
7202 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
7203
7204
7205 ---
7206
7207 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
7208
7209 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
7210
7211
7212 ---
7213
7214 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
7215
7216 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
7217
7218 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
7219
7220 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
7221
7222 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
7223
7224 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
7225 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
7226
7227 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
7228 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
7229 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
7230
7231
7232 ---
7233
7234 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
7235
7236 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
7237
7238 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
7239
7240
7241 ---
7242
7243 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
7244
7245 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
7246
7247
7248 ---
7249
7250 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
7251
7252 <!-- markdown -->
7253 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
7254
7255 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
7256
7257
7258 ---
7259
7260 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
7261
7262 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
7263
7264 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
7265
7266
7267 ---
7268
7269 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
7270
7271 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
7272
7273
7274 ---
7275
7276 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
7277
7278 <!-- markdown -->
7279
7280 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
7281
7282 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
7283
7284
7285 ---
7286
7287 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
7288
7289 Add below method in Table interface:
7290
7291 RegionLocator getRegionLocator() throws IOException;
7292
7293 Add below methods in AsyncTable interface:
7294
7295 AsyncTableRegionLocator getRegionLocator();
7296 CompletableFuture\<TableDescriptor\> getDescriptor();
7297
7298
7299 ---
7300
7301 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
7302
7303 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
7304
7305 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
7306
7307 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
7308
7309
7310 ---
7311
7312 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
7313
7314 Introduced
7315
7316 Future\<Void\> createTableAsync(TableDescriptor);
7317
7318
7319 ---
7320
7321 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
7322
7323 Introduced these methods:
7324 void move(byte[]);
7325 void move(byte[], ServerName);
7326 Future\<Void\> splitRegionAsync(byte[]);
7327
7328 These methods are deprecated:
7329 void move(byte[], byte[])
7330
7331
7332 ---
7333
7334 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
7335
7336 Add a new jenkins file for running pre commit check for GitHub PR.
7337
7338
7339 ---
7340
7341 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
7342
7343 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
7344
7345
7346 ---
7347
7348 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
7349
7350 When insufficient permissions, you now get:
7351
7352 HTTP/1.1 403 Forbidden
7353
7354 on the HTTP side, and in the message
7355
7356 Forbidden
7357 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
7358 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
7359 and the rest of the ADE stack
7360
7361
7362 ---
7363
7364 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
7365
7366 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
7367
7368
7369 ---
7370
7371 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
7372
7373 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
7374
7375
7376 ---
7377
7378 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
7379
7380 <!-- markdown -->
7381 Fixed awkward dependency issue that prevented site building.
7382
7383 #### note specific to HBase 2.1.4
7384 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
7385 ```
7386 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
7387 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
7388         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
7389         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
7390         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
7391         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
7392         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
7393         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
7394         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
7395         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
7396         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
7397         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
7398         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
7399         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
7400         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
7401         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
7402         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
7403         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
7404         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
7405         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
7406         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
7407         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
7408         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
7409         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
7410         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
7411         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
7412         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
7413         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
7414 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
7415         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
7416         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
7417         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
7418         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
7419         ... 26 more
7420
7421 ```
7422
7423 Workaround via any _one_ of the following:
7424 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
7425 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
7426 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
7427 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
7428 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
7429
7430
7431 ---
7432
7433 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
7434
7435 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
7436
7437
7438 ---
7439
7440 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
7441
7442 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
7443
7444
7445 ---
7446
7447 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
7448
7449 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
7450
7451 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
7452
7453
7454 ---
7455
7456 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
7457
7458 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
7459
7460
7461 ---
7462
7463 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
7464
7465 <!-- markdown -->
7466
7467 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
7468
7469 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
7470
7471
7472 ---
7473
7474 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
7475
7476 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
7477
7478
7479 ---
7480
7481 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
7482
7483 Add a cloneSnapshotAsync method with restoreAcl parameter.
7484 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
7485 Make snapshotAsync method returns a Future\<Void\>.
7486 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
7487 Use default methods to reduce the code base for implementation classes.
7488
7489
7490 ---
7491
7492 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
7493
7494 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
7495
7496
7497 ---
7498
7499 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
7500
7501 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
7502 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
7503
7504 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
7505
7506 For example:
7507 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
7508
7509
7510 ---
7511
7512 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
7513
7514 Adds below flush, split, and compaction metrics
7515
7516  +  // split related metrics
7517  +  private MutableFastCounter splitRequest;
7518  +  private MutableFastCounter splitSuccess;
7519  +  private MetricHistogram splitTimeHisto;
7520  +
7521  +  // flush related metrics
7522  +  private MetricHistogram flushTimeHisto;
7523  +  private MetricHistogram flushMemstoreSizeHisto;
7524  +  private MetricHistogram flushOutputSizeHisto;
7525  +  private MutableFastCounter flushedMemstoreBytes;
7526  +  private MutableFastCounter flushedOutputBytes;
7527  +
7528  +  // compaction related metrics
7529  +  private MetricHistogram compactionTimeHisto;
7530  +  private MetricHistogram compactionInputFileCountHisto;
7531  +  private MetricHistogram compactionInputSizeHisto;
7532  +  private MetricHistogram compactionOutputFileCountHisto;
7533  +  private MetricHistogram compactionOutputSizeHisto;
7534  +  private MutableFastCounter compactedInputBytes;
7535  +  private MutableFastCounter compactedOutputBytes;
7536  +
7537  +  private MetricHistogram majorCompactionTimeHisto;
7538  +  private MetricHistogram majorCompactionInputFileCountHisto;
7539  +  private MetricHistogram majorCompactionInputSizeHisto;
7540  +  private MetricHistogram majorCompactionOutputFileCountHisto;
7541  +  private MetricHistogram majorCompactionOutputSizeHisto;
7542  +  private MutableFastCounter majorCompactedInputBytes;
7543  +  private MutableFastCounter majorCompactedOutputBytes;
7544
7545
7546 ---
7547
7548 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
7549
7550 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
7551
7552
7553 ---
7554
7555 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
7556
7557 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
7558 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
7559
7560
7561 ---
7562
7563 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
7564
7565 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
7566
7567 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
7568
7569
7570 ---
7571
7572 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
7573
7574 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
7575 Shell commands are as follows:
7576 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7577
7578 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
7579 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
7580 Shell commands are as follows:
7581 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
7582 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
7583 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
7584
7585
7586 ---
7587
7588 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
7589
7590 Change spotbugs version to 3.1.11.
7591
7592
7593 ---
7594
7595 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
7596
7597 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
7598
7599 It also introduces additional info for each recovery queue, which was not accounted by this command before.
7600
7601 The new output for "status 'replication'" command is explained in details below:
7602 a) Source started, target stopped, no edits arrived on source yet:
7603 ...
7604  SOURCE: PeerID=1
7605          Normal Queue: 1
7606            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7607 ...
7608 b) Source started, target stopped, add edit on source:
7609 ...
7610 Normal Queue: 1
7611            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
7612 ...
7613 c) Source started, target stopped, edit added on source, restart source:
7614 ...
7615 SOURCE: PeerID=1
7616          Normal Queue: 1
7617            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7618          Recovered Queue: 1-hbase01.home,16020,1542784524057
7619            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
7620 ...
7621 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
7622 ...
7623 SOURCE: PeerID=1
7624          Normal Queue: 1
7625            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
7626          Recovered Queue: 1-hbase01.home,16020,1542782758742
7627            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
7628 ...
7629 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
7630 ...
7631        SOURCE: PeerID=1
7632          Normal Queue: 1
7633            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
7634 ...
7635 f) Source started, target stopped, add edit on source, restart source, restart target:
7636 ...
7637 SOURCE: PeerID=1
7638          Normal Queue: 1
7639            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
7640 ...
7641
7642
7643 ---
7644
7645 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
7646
7647 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
7648
7649
7650 ---
7651
7652 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
7653
7654 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
7655 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
7656 disable\_exceed\_throttle\_quota
7657 There are two limits when enable exceed throttle quota:
7658 1. Must set at least one read and one write region server throttle quota;
7659 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
7660
7661
7662 ---
7663
7664 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
7665
7666 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
7667
7668
7669 ---
7670
7671 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
7672
7673 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
7674
7675
7676 ---
7677
7678 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
7679
7680 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
7681
7682
7683 ---
7684
7685 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
7686
7687 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
7688
7689 hbase\> help 'scan'
7690
7691
7692 ---
7693
7694 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
7695
7696 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
7697
7698 For example:
7699 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
7700
7701
7702 ---
7703
7704 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
7705
7706 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
7707 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
7708
7709
7710 ---
7711
7712 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
7713
7714 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
7715
7716
7717 ---
7718
7719 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
7720
7721 Make StoppedRpcClientException extend DoNotRetryIOException.
7722
7723
7724 ---
7725
7726 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
7727
7728 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
7729 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
7730
7731
7732 ---
7733
7734 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
7735
7736 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
7737
7738 The effect releases are:
7739 2.1.x: 2.1.2 and below
7740 2.0.x: 2.0.4 and below
7741 1.x: 1.4.x and below
7742
7743 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
7744
7745
7746 ---
7747
7748 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
7749
7750 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
7751
7752
7753
7754 # HBASE  2.3.0 Release Notes
7755
7756 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
7757
7758
7759 ---
7760
7761 * [HBASE-24545](https://issues.apache.org/jira/browse/HBASE-24545) | *Major* | **Add backoff to SCP check on WAL split completion**
7762
7763 Adds backoff in ServerCrashProcedure wait on WAL split to complete if large backlog of files to split (Its possible to avoid SCP blocking, waiting on WALs to split if you use procedure-based splitting --  set 'hbase.split.wal.zk.coordinated' to false to enable procedure based wal splitting.)
7764
7765
7766 ---
7767
7768 * [HBASE-24524](https://issues.apache.org/jira/browse/HBASE-24524) | *Minor* | **SyncTable logging improvements**
7769
7770 Notice this has changed log level for mismatching row keys, originally those were being logged at INFO level, now it's logged at DEBUG level. This is consistent with the logging of mismatching cells. Also, for missing row keys, it now logs row key values in human readable format, making it more meaningful for operators troubleshooting mismatches.
7771
7772
7773 ---
7774
7775 * [HBASE-24359](https://issues.apache.org/jira/browse/HBASE-24359) | *Major* | **Optionally ignore edits for deleted CFs for replication.**
7776
7777 Introduce a new config hbase.replication.drop.on.deleted.columnfamily, default is false. When config to true, the replication will drop the edits for columnfamily that has been deleted from the replication source and target.
7778
7779
7780 ---
7781
7782 * [HBASE-24418](https://issues.apache.org/jira/browse/HBASE-24418) | *Major* | **Consolidate Normalizer implementations**
7783
7784 <!-- markdown -->
7785 This change extends the Normalizer with a handful of new configurations. The configuration points supported are:
7786 * `hbase.normalizer.split.enabled` Whether to split a region as part of normalization. Default: `true`.
7787 * `hbase.normalizer.merge.enabled` Whether to merge a region as part of normalization. Default `true`.
7788 * `hbase.normalizer.min.region.count` The minimum number of regions in a table to consider it for merge normalization. Default: 3.
7789 * `hbase.normalizer.merge.min_region_age.days` The minimum age for a region to be considered for a merge, in days. Default: 3.
7790 * `hbase.normalizer.merge.min_region_size.mb` The minimum size for a region to be considered for a merge, in whole MBs. Default: 1.
7791
7792
7793 ---
7794
7795 * [HBASE-24309](https://issues.apache.org/jira/browse/HBASE-24309) | *Major* | **Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly**
7796
7797 Add a hbase-logging module, put the log4j related code in this module only so other modules do not need to depend on log4j at compile scope. See the comments of Log4jUtils and InternalLog4jUtils for more details.
7798
7799 Add a log4j.properties to the test jar of hbase-logging module, so for other sub modules we just need to depend on the test jar of hbase-logging module at test scope to output the log to console, without placing a log4j.properties in the test resources as they all (almost) have the same content. And this test module will not be included in the assembly tarball so it will not mess up the binary distribution.
7800
7801 Ban direct commons-logging dependency, and ban commons-logging and log4j imports in non-test code, to avoid mess up the downstream users logging framework. In hbase-logging module we do need to use log4j classes and the trick is to use full class name.
7802
7803 Add jcl-over-slf4j and jul-to-slf4j dependencies, as some of our dependencies use jcl or jul as logging framework, we should also redirect their log message to slf4j.
7804
7805
7806 ---
7807
7808 * [HBASE-21406](https://issues.apache.org/jira/browse/HBASE-21406) | *Minor* | **"status 'replication'" should not show SINK if the cluster does not act as sink**
7809
7810 Added new metric to differentiate sink startup time from last OP applied time.
7811
7812 Original behaviour was to always set startup time to TimestampsOfLastAppliedOp, and always show it on "status 'replication'" command, regardless if the sink ever applied any OP.
7813
7814 This was confusing, specially for scenarios where cluster was just acting as source, the output could lead to wrong interpretations about sink not applying edits or replication being stuck.
7815
7816 With the new metric, we now compare the two metrics values, assuming that if both are the same, there's never been any OP shipped to the given sink, so output would reflect it more clearly, to something as for example:
7817
7818 SINK: TimeStampStarted=Thu Dec 06 23:59:47 GMT 2018, Waiting for OPs...
7819
7820
7821 ---
7822
7823 * [HBASE-24132](https://issues.apache.org/jira/browse/HBASE-24132) | *Major* | **Upgrade to Apache ZooKeeper 3.5.7**
7824
7825 <!-- markdown -->
7826 HBase ships ZooKeeper 3.5.x. Was the EOL'd 3.4.x. 3.5.x client can talk to 3.4.x ensemble.
7827
7828 The ZooKeeper project has built a [FAQ](https://cwiki.apache.org/confluence/display/ZOOKEEPER/Upgrade+FAQ) that documents known issues and work-arounds when upgrading existing deployments.
7829
7830
7831 ---
7832
7833 * [HBASE-22287](https://issues.apache.org/jira/browse/HBASE-22287) | *Major* | **inifinite retries on failed server in RSProcedureDispatcher**
7834
7835 Add backoff. Avoid retrying every 100ms.
7836
7837
7838 ---
7839
7840 * [HBASE-24425](https://issues.apache.org/jira/browse/HBASE-24425) | *Major* | **Run hbck\_chore\_run and catalogjanitor\_run on draw of 'HBCK Report' page**
7841
7842 Runs 'catalogjanitor\_run' and 'hbck\_chore\_run' inline with the loading of the 'HBCK Report' page.
7843
7844 Pass '?cache=true' to skip inline invocation of 'catalogjanitor\_run' and 'hbck\_chore\_run' drawing the page.
7845
7846
7847 ---
7848
7849 * [HBASE-24408](https://issues.apache.org/jira/browse/HBASE-24408) | *Blocker* | **Introduce a general 'local region' to store data on master**
7850
7851 Introduced a general 'local region' at master side to store the procedure data, etc.
7852
7853 The hfile of this region will be stored on the root fs while the wal will be stored on the wal fs. This issue supercedes part of the code for HBASE-23326, as now we store the data in 'MasterData' directory instead of 'MasterProcs'.
7854
7855 The old hfiles will be moved to the global hfile archived directory with the suffix $-masterlocalhfile-$. The wal files will be moved to the global old wal directory with the suffix $masterlocalwal$. The TimeToLiveMasterLocalStoreHFileCleaner and TimeToLiveMasterLocalStoreWALCleaner are configured by default for cleaning the old hfiles and wal files, and the default TTLs are both 7 days.
7856
7857
7858 ---
7859
7860 * [HBASE-24115](https://issues.apache.org/jira/browse/HBASE-24115) | *Major* | **Relocate test-only REST "client" from src/ to test/ and mark Private**
7861
7862 Relocate test-only REST RemoteHTable and RemoteAdmin from src/ to test/. And mark them as InterfaceAudience.Private.
7863
7864
7865 ---
7866
7867 * [HBASE-23938](https://issues.apache.org/jira/browse/HBASE-23938) | *Major* | **Replicate slow/large RPC calls to HDFS**
7868
7869 Config key: hbase.regionserver.slowlog.systable.enabled
7870 Default value: false
7871
7872 This config can be enabled if hbase.regionserver.slowlog.buffer.enabled is already enabled. While hbase.regionserver.slowlog.buffer.enabled ensures that any slow/large RPC logs with complete details are written to ring buffer available at each RegionServer, hbase.regionserver.slowlog.systable.enabled would ensure that all such logs are also persisted in new system table hbase:slowlog.
7873 Operator can scan hbase:slowlog with filters to retrieve specific attribute matching records and this table would be useful to capture historical performance of slowness of RPC calls with detailed analysis.
7874
7875 hbase:slowlog consists of single ColumnFamily info. info consists of multiple qualifiers similar to the attributes available to query as part of Admin API: get\_slowlog\_responses.
7876
7877 One example of a row from hbase:slowlog scan result (Attached a sample screenshot in the Jira) :
7878
7879  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:call\_details, timestamp=2020-05-16T14:59:58.764Z, value=Scan(org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ScanRequest)
7880  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:client\_address, timestamp=2020-05-16T14:59:58.764Z, value=172.20.10.2:57348
7881  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:method\_name, timestamp=2020-05-16T14:59:58.764Z, value=Scan
7882  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:param, timestamp=2020-05-16T14:59:58.764Z, value=region { type: REGION\_NAME value: "cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf." } scan { a
7883                                                              ttribute { name: "\_isolationlevel\_" value: "\\x5C000" } start\_row: "cccccccc" time\_range { from: 0 to: 9223372036854775807 } max\_versions: 1 cache\_blocks: true max\_result\_size: 2
7884                                                              097152 caching: 2147483647 include\_stop\_row: false } number\_of\_rows: 2147483647 close\_scanner: false client\_handles\_partials: true client\_handles\_heartbeats: true track\_scan\_met
7885                                                              rics: false
7886  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:processing\_time, timestamp=2020-05-16T14:59:58.764Z, value=24
7887  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:queue\_time, timestamp=2020-05-16T14:59:58.764Z, value=0
7888  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:region\_name, timestamp=2020-05-16T14:59:58.764Z, value=cluster\_test,cccccccc,1589635796466.aa45e1571d533f5ed0bb31cdccaaf9cf.
7889  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:response\_size, timestamp=2020-05-16T14:59:58.764Z, value=211227
7890  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:server\_class, timestamp=2020-05-16T14:59:58.764Z, value=HRegionServer
7891  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:start\_time, timestamp=2020-05-16T14:59:58.764Z, value=1589640743932
7892  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:type, timestamp=2020-05-16T14:59:58.764Z, value=ALL
7893  \\x024\\xC1\\x06X\\x81\\xF6\\xEC                                  column=info:username, timestamp=2020-05-16T14:59:58.764Z, value=vjasani
7894
7895
7896 ---
7897
7898 * [HBASE-24271](https://issues.apache.org/jira/browse/HBASE-24271) | *Major* | **Set values in \`conf/hbase-site.xml\` that enable running on \`LocalFileSystem\` out of the box**
7899
7900 <!-- markdown -->
7901 HBASE-24271 makes changes the the default `conf/hbase-site.xml` such that `bin/hbase` will run directly out of the binary tarball or a compiled source tree without any configuration modifications vs. Hadoop 2.8+. This changes our long-standing history of shipping no configured values in `conf/hbase-site.xml`, so existing processes that assume this file is empty of configuration properties may require attention.
7902
7903
7904 ---
7905
7906 * [HBASE-24310](https://issues.apache.org/jira/browse/HBASE-24310) | *Major* | **Use Slf4jRequestLog for hbase-http**
7907
7908 Use Slf4jRequestLog instead of the log4j HttpRequestLogAppender in HttpServer.
7909
7910 The request log is disabled by default in conf/log4j.properties by the following lines:
7911
7912 # Disable request log by default, you can enable this by changing the appender
7913 log4j.category.http.requests=INFO,NullAppender
7914 log4j.additivity.http.requests=false
7915
7916 Change the 'NullAppender' to what ever you want if you want to enable request log.
7917
7918 Notice that, the logger name for master status http server is 'http.requests.master', and for region server it is 'http.requests.regionserver'
7919
7920
7921 ---
7922
7923 * [HBASE-24335](https://issues.apache.org/jira/browse/HBASE-24335) | *Major* | **Support deleteall with ts but without column in shell mode**
7924
7925 Use a empty string to represent no column specified for deleteall in shell mode.
7926 useage:
7927 deleteall 'test','r1','',12345
7928 deleteall 'test', {ROWPREFIXFILTER =\> 'prefix'}, '', 12345
7929
7930
7931 ---
7932
7933 * [HBASE-24304](https://issues.apache.org/jira/browse/HBASE-24304) | *Major* | **Separate a hbase-asyncfs module**
7934
7935 Added a new hbase-asyncfs module to hold the asynchronous dfs output stream implementation for implementing WAL.
7936
7937
7938 ---
7939
7940 * [HBASE-22710](https://issues.apache.org/jira/browse/HBASE-22710) | *Major* | **Wrong result in one case of scan that use  raw and versions and filter together**
7941
7942 Make the logic of the versions chosen more reasonable for raw scan, to avoid lose result when using filter.
7943
7944
7945 ---
7946
7947 * [HBASE-24285](https://issues.apache.org/jira/browse/HBASE-24285) | *Major* | **Move to hbase-thirdparty-3.3.0**
7948
7949 Moved to hbase-thirdparty 3.3.0.
7950
7951
7952 ---
7953
7954 * [HBASE-24252](https://issues.apache.org/jira/browse/HBASE-24252) | *Major* | **Implement proxyuser/doAs mechanism for hbase-http**
7955
7956 This feature enables the HBase Web UI's to accept a 'proxyuser' via the HTTP Request's query string. When the parameter \`hbase.security.authentication.spnego.kerberos.proxyuser.enable\` is set to \`true\` in hbase-site.xml (default is \`false\`), the HBase UI will attempt to impersonate the user specified by the query parameter "doAs". This query parameter is checked case-insensitively. When this option is not provided, the user who executed the request is the "real" user and there is no ability to execute impersonation against the WebUI.
7957
7958 For example, if the user "bob" with Kerberos credentials executes a request against the WebUI with this feature enabled and a query string which includes \`doAs=alice\`, the HBase UI will treat this request as executed as \`alice\`, not \`bob\`.
7959
7960 The standard Hadoop proxyuser configuration properties to limit users who may impersonate others apply to this change (e.g. to enable \`bob\` to impersonate \`alice\`). See the Hadoop documentation for more information on how to configure these proxyuser rules.
7961
7962
7963 ---
7964
7965 * [HBASE-24143](https://issues.apache.org/jira/browse/HBASE-24143) | *Major* | **[JDK11] Switch default garbage collector from CMS**
7966
7967 <!-- markdown -->
7968 `bin/hbase` will now dynamically select a Garbage Collector implementation based on the detected JVM version. JDKs 8,9,10 use `-XX:+UseConcMarkSweepGC`, while JDK11+ use `-XX:+UseG1GC`.
7969
7970 Notice a slight compatibility change. Previously, the garbage collector choice would always be appended to a user-provided value for `HBASE_OPTS`. As of this change, this setting will only be applied when `HBASE_OPTS` is unset. That means that operators who provide a value for this variable will now need to also specify the collector. This is especially important for those on JDK8, where the vm default GC is not the recommended ConcMarkSweep.
7971
7972
7973 ---
7974
7975 * [HBASE-24024](https://issues.apache.org/jira/browse/HBASE-24024) | *Major* | **Optionally reject multi() requests with very high no of rows**
7976
7977 New Config: hbase.rpc.rows.size.threshold.reject
7978 -----------------------------------------------------------------------
7979
7980 Default value: false
7981 Description:
7982 If value is true, RegionServer will abort batch requests of Put/Delete with number of rows in a batch operation exceeding threshold defined by value of config: hbase.rpc.rows.warning.threshold.
7983
7984
7985 ---
7986
7987 * [HBASE-24139](https://issues.apache.org/jira/browse/HBASE-24139) | *Critical* | **Balancer should avoid leaving idle region servers**
7988
7989 StochasticLoadBalancer functional improvement:
7990
7991 StochasticLoadBalancer would rebalance the cluster if there are any idle RegionServers in the cluster (RegionServer having no region), while other RegionServers have at least 1 region available.
7992
7993
7994 ---
7995
7996 * [HBASE-24196](https://issues.apache.org/jira/browse/HBASE-24196) | *Major* | **[Shell] Add rename rsgroup command in hbase shell**
7997
7998 user or admin can now use
7999 hbase shell \> rename\_rsgroup 'oldname', 'newname'
8000 to rename rsgroup.
8001
8002
8003 ---
8004
8005 * [HBASE-24218](https://issues.apache.org/jira/browse/HBASE-24218) | *Major* | **Add hadoop 3.2.x in hadoop check**
8006
8007 Add hadoop-3.2.0 and hadoop-3.2.1 in hadoop check and when '--quick-hadoopcheck' we will only check hadoop-3.2.1.
8008
8009 Notice that, for aligning the personality scripts across all the active branches, we will commit the patch to all active branches, but the hadoop-3.2.x support in hadoopcheck is only applied to branch-2.2+.
8010
8011
8012 ---
8013
8014 * [HBASE-23829](https://issues.apache.org/jira/browse/HBASE-23829) | *Major* | **Get \`-PrunSmallTests\` passing on JDK11**
8015
8016 \`-PrunSmallTests\` now pass on JDK11 when using \`-Phadoop.profile=3.0\`.
8017
8018
8019 ---
8020
8021 * [HBASE-24185](https://issues.apache.org/jira/browse/HBASE-24185) | *Major* | **Junit tests do not behave well with System.exit or Runtime.halt or JVM exits in general.**
8022
8023 Tests that fail because a process -- RegionServer or Master -- called System.exit, will now instead throw an exception.
8024
8025
8026 ---
8027
8028 * [HBASE-24072](https://issues.apache.org/jira/browse/HBASE-24072) | *Major* | **Nightlies reporting OutOfMemoryError: unable to create new native thread**
8029
8030 Hadoop hosts have had their ulimit -u raised from 10000 to 30000 (per user, by INFRA). The Docker build container has had its limit raised from 10000 to 12500.
8031
8032
8033 ---
8034
8035 * [HBASE-24112](https://issues.apache.org/jira/browse/HBASE-24112) | *Major* | **[RSGroup] Support renaming rsgroup**
8036
8037 Support RSGroup renaming in core codebase. New API Admin#renameRSGroup(String, String) is introduced in 3.0.0.
8038
8039
8040 ---
8041
8042 * [HBASE-23994](https://issues.apache.org/jira/browse/HBASE-23994) | *Trivial* | ** Add WebUI to Canary**
8043
8044 <!-- markdown -->
8045 The Canary tool now offers a WebUI when run in `region` mode (the default mode). It is enabled by default, and by default, it binds to `0.0.0.0:16050`. This can be overridden by setting `hbase.canary.info.bindAddress` and `hbase.canary.info.port`. To disable entirely, set the port to `-1`.
8046
8047
8048 ---
8049
8050 * [HBASE-23779](https://issues.apache.org/jira/browse/HBASE-23779) | *Major* | **Up the default fork count to make builds complete faster; make count relative to CPU count**
8051
8052 Pass --threads=2 building on jenkins. It shortens nightly build times by about ~25%.
8053
8054 It works by running module build/test in parallel when dependencies allow. Upping the forkcount beyond the pom default of 0.25C would have us broach our CPU budget on jenkins when two modules are running in parallel (2 modules at 0.25% of CPU each makes 0.5C and on jenkins, hadoop nodes run two jenkins executors per host).  Higher forkcounts also seems to threaten build stability.
8055
8056 For running tests locally, to go faster, up fork count.
8057
8058 $ x="0.5C"  ;  mvn --threads=2  -Dsurefire.firstPartForkCount=$x -Dsurefire.secondPartForkCount=$x test -PrunAllTests
8059
8060 You could up the x from 0.5C to 1.0C but YMMV (On overcommitted hardware, tests start bombing out pretty soon after startup). You could try upping thread count but on occasion are likely to overcommit hardware.
8061
8062
8063 ---
8064
8065 * [HBASE-24126](https://issues.apache.org/jira/browse/HBASE-24126) | *Major* | **Up the container nproc uplimit from 10000 to 12500**
8066
8067 Start docker with upped ulimit for nproc passing '--ulimit nproc=12500'. It was 10000, the default, but made it 12500. Then, set PROC\_LIMIT in hbase-personality so when yetus runs, it is w/ the new 12500 value.
8068
8069
8070 ---
8071
8072 * [HBASE-24150](https://issues.apache.org/jira/browse/HBASE-24150) | *Major* | **Allow module tests run in parallel**
8073
8074 Pass -T2 to mvn. Makes it so we do two modules-at-a-time dependencies willing. Helps speed build and testing. Doubles the resource usage when running modules in parallel.
8075
8076
8077 ---
8078
8079 * [HBASE-24121](https://issues.apache.org/jira/browse/HBASE-24121) | *Major* | **[Authorization] ServiceAuthorizationManager isn't dynamically updatable. And it should be.**
8080
8081 Master & RegionService now support refresh policy authorization defined in hbase-policy.xml without restarting service. To refresh policy, please execute hbase shell command: update\_config or update\_config\_all after policy file updated and synced on all nodes.
8082
8083
8084 ---
8085
8086 * [HBASE-24099](https://issues.apache.org/jira/browse/HBASE-24099) | *Major* | **Use a fair ReentrantReadWriteLock for the region close lock**
8087
8088 This change modifies the default acquisition policy for the region's close lock in order to prevent observed starvation of close requests. The new boolean configuration parameter 'hbase.regionserver.fair.region.close.lock' controls the lock acquisition policy: if true, the lock is created in fair mode (default); if false, the lock is created in nonfair mode (the old default).
8089
8090
8091 ---
8092
8093 * [HBASE-23153](https://issues.apache.org/jira/browse/HBASE-23153) | *Major* | **PrimaryRegionCountSkewCostFunction SLB function should implement CostFunction#isNeeded**
8094
8095 <!-- markdown -->
8096 The `PrimaryRegionCountSkewCostFunction` for the `StochasticLoadBalancer` is only needed when the read replicas feature is enabled. With this change, that function now properly indicates that it is not needed when the read replica feature is off.
8097
8098 If this improvement is not available, operators with clusters that are not using the read replica feature should manually disable it by setting `hbase.master.balancer.stochastic.primaryRegionCountCost` to `0.0` in hbase-site.xml for all HBase Masters.
8099
8100
8101 ---
8102
8103 * [HBASE-24055](https://issues.apache.org/jira/browse/HBASE-24055) | *Major* | **Make AsyncFSWAL can run on EC cluster**
8104
8105 Now AsyncFSWAL can also be used against the directory which has EC enabled. Need to make sure you also make use of the hadoop 3.x client as the option is only available in hadoop 3.x.
8106
8107
8108 ---
8109
8110 * [HBASE-24113](https://issues.apache.org/jira/browse/HBASE-24113) | *Major* | **Upgrade the maven we use from 3.5.4 to 3.6.3 in nightlies**
8111
8112 Branches-2.3+ use maven 3.5.3 building. Older branches use 3.5.4 still.
8113
8114
8115 ---
8116
8117 * [HBASE-24122](https://issues.apache.org/jira/browse/HBASE-24122) | *Major* | **Change machine ulimit-l to ulimit-a so dumps full ulimit rather than just 'max locked memory'**
8118
8119 Our 'Build Artifacts' have a machine directory under which we emit vitals on the host the build was run on. We used to emit the result of 'ulimit -l' as a file named 'ulimit-l'. This has been hijacked to instead emit result of running 'ulimit -a' which includes stat on ulimit -l.
8120
8121
8122 ---
8123
8124 * [HBASE-23678](https://issues.apache.org/jira/browse/HBASE-23678) | *Major* | **Literate builder API for version management in schema**
8125
8126 ColumnFamilyDescriptor new builder API:
8127
8128     /\*\*
8129      \* Retain all versions for a given TTL(retentionInterval), and then only a specific number
8130      \* of versions(versionAfterInterval) after that interval elapses.
8131      \*
8132      \* @param retentionInterval Retain all versions for this interval
8133      \* @param versionAfterInterval Retain no of versions to retain after retentionInterval
8134      \*/
8135     public ModifyableColumnFamilyDescriptor setVersionsWithTimeToLive(
8136         final int retentionInterval, final int versionAfterInterval)
8137
8138
8139 ---
8140
8141 * [HBASE-24050](https://issues.apache.org/jira/browse/HBASE-24050) | *Major* | **Deprecated PBType on all 2.x branches**
8142
8143 org.apache.hadoop.hbase.types.PBType is marked as deprecated without any replacement. It will be moved to hbase-example module and marked as IA.Private in 3.0.0. This is a mistake as it should not be part of our public API. Users who depend on this class should just copy the code your own code base.
8144
8145
8146 ---
8147
8148 * [HBASE-8868](https://issues.apache.org/jira/browse/HBASE-8868) | *Minor* | **add metric to report client shortcircuit reads**
8149
8150 Expose file system level read metrics for RegionServer.
8151
8152 If the HBase RS runs on top of HDFS, calculate the aggregation of
8153 ReadStatistics of each HdfsFileInputStream. These metrics include:
8154 (1) total number of bytes read from HDFS.
8155 (2) total number of bytes read from local DataNode.
8156 (3) total number of bytes read locally through short-circuit read.
8157 (4) total number of bytes read locally through zero-copy read.
8158
8159 Because HDFS ReadStatistics is calculated per input stream, it is not
8160 feasible to update the aggregated number in real time. Instead, the
8161 metrics are updated when an input stream is closed.
8162
8163
8164 ---
8165
8166 * [HBASE-24032](https://issues.apache.org/jira/browse/HBASE-24032) | *Major* | **[RSGroup] Assign created tables to respective rsgroup automatically instead of manual operations**
8167
8168 Admin can determine which tables go to which rsgroup by script  (setting hbase.rsgroup.table.mapping.script with local filystem path) on Master side which aims to lighten the burden of admin operations.  Note, since HBase 3+, rsgroup can be specified in TableDescriptor as well, if clients specify this, master will skip the determination from script.
8169
8170 Here is a simple example of script:
8171 {code}
8172 # Input consists of two string, 1st is the namespace of the table, 2nd is the table name of the table
8173 #!/bin/bash
8174 namespace=$1
8175 tablename=$2
8176 if [[ $namespace == test ]]; then
8177   echo test
8178 elif [[ $tablename == \*foo\* ]]; then
8179   echo other
8180 else
8181   echo default
8182 fi
8183 {code}
8184
8185
8186 ---
8187
8188 * [HBASE-23993](https://issues.apache.org/jira/browse/HBASE-23993) | *Major* | **Use loopback for zk standalone server in minizkcluster**
8189
8190 MiniZKCluster now puts up its standalone node listening on loopback/127.0.0.1 rather than "localhost".
8191
8192
8193 ---
8194
8195 * [HBASE-23986](https://issues.apache.org/jira/browse/HBASE-23986) | *Major* | **Bump hadoop-two.version to 2.10.0 on master and branch-2**
8196
8197 Bumped hadoop-two.version to 2.10.0, which means we will drop the support for hadoop-2.8.x and hadoop-2.9.x.
8198
8199
8200 ---
8201
8202 * [HBASE-23930](https://issues.apache.org/jira/browse/HBASE-23930) | *Minor* | **Shell should attempt to format \`timestamp\` attributes as ISO-8601**
8203
8204 Change timestamp display to be ISO8601 when toString on Cell and outputting in shell....
8205
8206 User used to see....
8207
8208   column=table:state, timestamp=1583967620343 .....
8209
8210 ... but now sees:
8211
8212   column=table:state, timestamp=2020-03-11T23:00:20.343Z ....
8213
8214
8215 ---
8216
8217 * [HBASE-22827](https://issues.apache.org/jira/browse/HBASE-22827) | *Major* | **Expose multi-region merge in shell and Admin API**
8218
8219 merge\_region shell command can now be used to merge more than 2 regions as well. It takes a list of regions as comma separated values or as an array of regions, and not just 2 regions. The full regionnames and encoded regionnames are continued to be accepted.
8220
8221
8222 ---
8223
8224 * [HBASE-23767](https://issues.apache.org/jira/browse/HBASE-23767) | *Major* | **Add JDK11 compilation and unit test support to Github precommit**
8225
8226 Rebuild our Dockerfile with support for multiple JDK versions. Use multiple stages in the Jenkinsfile instead of yetus's multijdk because of YETUS-953. Run those multiple stages in parallel to speed up results.
8227
8228 Note that multiple stages means multiple Yetus invocations means multiple comments on the PreCommit. This should become more obvious to users once we can make use of GitHub Checks API, HBASE-23902.
8229
8230
8231 ---
8232
8233 * [HBASE-22978](https://issues.apache.org/jira/browse/HBASE-22978) | *Minor* | **Online slow response log**
8234
8235 get\_slowlog\_responses and clear\_slowlog\_responses are used to retrieve and clear slow RPC logs from RingBuffer maintained by RegionServers.
8236
8237 New Admin APIs:
8238 1.   List\<SlowLogRecord\> getSlowLogResponses(final Set\<ServerName\> serverNames,
8239       final SlowLogQueryFilter slowLogQueryFilter) throws IOException;
8240
8241 2.   List\<Boolean\> clearSlowLogResponses(final Set\<ServerName\> serverNames)
8242       throws IOException;
8243
8244 Configs:
8245
8246 1. hbase.regionserver.slowlog.ringbuffer.size:
8247 Default size of ringbuffer to be maintained by each RegionServer in order to store online slowlog responses. This is an in-memory ring buffer of requests that were judged to be too slow in addition to the responseTooSlow logging. The in-memory representation would be complete. For more details, please look into Doc Section: Get Slow Response Log from shell
8248
8249 Default
8250 256
8251
8252 2. hbase.regionserver.slowlog.buffer.enabled:
8253 Indicates whether RegionServers have ring buffer running for storing Online Slow logs in FIFO manner with limited entries. The size of the ring buffer is indicated by config: hbase.regionserver.slowlog.ringbuffer.size The default value is false, turn this on and get latest slowlog responses with complete data.
8254
8255 Default
8256 false
8257
8258
8259 For more details, please look into "Get Slow Response Log from shell" section from HBase book.
8260
8261
8262 ---
8263
8264 * [HBASE-23926](https://issues.apache.org/jira/browse/HBASE-23926) | *Major* | **[Flakey Tests] Down the flakies re-run ferocity; it makes for too many fails.**
8265
8266 Down the flakey re-rerun fork count from 1.0C -- i.e. a fork per CPU -- to 0.25C. On a recent run, the machine had 16 cores. 0.25 is 4 cores. We'd hardcoded fork count at 3 previous to changes made by parent.
8267
8268
8269 ---
8270
8271 * [HBASE-23146](https://issues.apache.org/jira/browse/HBASE-23146) | *Major* | **Support CheckAndMutate with multiple conditions**
8272
8273 Add a checkAndMutate(row, filter) method in the AsyncTable interface and the Table interface.
8274
8275 This method atomically checks if the row matches the specified filter. If it does, it adds the Put/Delete/RowMutations.
8276
8277 This is a fluent style API, the code is like:
8278
8279 For Table interface:
8280 {code}
8281 table.checkAndMutate(row, filter).thenPut(put);
8282 {code}
8283
8284 For AsyncTable interface:
8285 {code}
8286 table.checkAndMutate(row, filter).thenPut(put)
8287     .thenAccept(succ -\> {
8288       if (succ) {
8289         System.out.println("Check and put succeeded");
8290       } else {
8291         System.out.println("Check and put failed");
8292       }
8293     });
8294 {code}
8295
8296
8297 ---
8298
8299 * [HBASE-23874](https://issues.apache.org/jira/browse/HBASE-23874) | *Minor* | **Move Jira-attached file precommit definition from script in Jenkins config to dev-support**
8300
8301 The Jira Precommit job (https://builds.apache.org/job/PreCommit-HBASE-Build/) will now look for a file within the source tree (dev-support/jenkins\_precommit\_jira\_yetus.sh) instead of depending on a script section embedded in the job.
8302
8303
8304 ---
8305
8306 * [HBASE-23865](https://issues.apache.org/jira/browse/HBASE-23865) | *Major* | **Up flakey history from 5 to 10**
8307
8308 Changed flakey list reporting to show 5 rather than 10 items. Also changed the second and first part fort counts to be 1C rather than hardcoded 3.
8309
8310
8311 ---
8312
8313 * [HBASE-23554](https://issues.apache.org/jira/browse/HBASE-23554) | *Major* | **Encoded regionname to regionname utility**
8314
8315     Adds shell command regioninfo:
8316
8317       hbase(main):001:0\>  regioninfo '0e6aa5c19ae2b2627649dc7708ce27d0'
8318       {ENCODED =\> 0e6aa5c19ae2b2627649dc7708ce27d0, NAME =\> 'TestTable,,1575941375972.0e6aa5c19ae2b2627649dc7708ce27d0.', STARTKEY =\> '', ENDKEY =\> '00000000000000000000299441'}
8319       Took 0.4737 seconds
8320
8321
8322 ---
8323
8324 * [HBASE-23350](https://issues.apache.org/jira/browse/HBASE-23350) | *Major* | **Make compaction files cacheonWrite configurable based on threshold**
8325
8326 This JIRA adds a new configuration - \`hbase.rs.cachecompactedblocksonwrite.threshold\`. This configuration is the maximum total size (in bytes) of the compacted files below which the configuration \`hbase.rs.cachecompactedblocksonwrite\` is honoured. If the total size of the compacted fies exceeds this threshold, even when \`hbase.rs.cachecompactedblocksonwrite\` is enabled, the data blocks are not cached. Caching index and bloom blocks is not affected by this configuration (user configuration is always honoured).
8327
8328 Default value of this configuration is Long.MAX\_VALUE. This means whatever the total size of the compacted files, it wil be cached.
8329
8330
8331 ---
8332
8333 * [HBASE-17115](https://issues.apache.org/jira/browse/HBASE-17115) | *Major* | **HMaster/HRegion Info Server does not honour admin.acl**
8334
8335 Implements authorization for the HBase Web UI by limiting access to certain endpoints which could be used to extract sensitive information from HBase.
8336
8337 Access to these restricted endpoints can be limited to a group of administrators, identified either by a list of users (hbase.security.authentication.spnego.admin.users) or by a list of groups
8338 (hbase.security.authentication.spnego.admin.groups).  By default, neither of these values are set which will preserve backwards compatibility (allowing all authenticated users to access all endpoints).
8339
8340 Further, users who have sensitive information in the HBase service configuration can set hbase.security.authentication.ui.config.protected to true which will treat the configuration endpoint as a protected, admin-only resource. By default, all authenticated users may access the configuration endpoint.
8341
8342
8343 ---
8344
8345 * [HBASE-23647](https://issues.apache.org/jira/browse/HBASE-23647) | *Major* | **Make MasterRegistry the default registry impl**
8346
8347 <!-- markdown -->
8348 Enables master based registry as the default registry used by clients to fetch connection metadata.
8349 Refer to the section "Master Registry" in the client documentation for more details and advantages
8350 of this implementation over the default Zookeeper based registry.
8351
8352 Configuration parameter that controls the registry in use: `hbase.client.registry.impl`
8353
8354 Where to set this: HBase client configuration (hbase-site.xml)
8355
8356 Possible values:
8357 - `org.apache.hadoop.hbase.client.ZKConnectionRegistry` (For ZK based registry implementation)
8358 - `org.apache.hadoop.hbase.client.MasterRegistry` (New, for master based registry implementation)
8359
8360 Notes on defaults:
8361
8362 - For v3.0.0 and later, MasterRegistry is the default registry
8363 - For all releases in 2.x line, ZK based registry is the default.
8364
8365 This feature has been back ported to 2.3.0 and later releases. MasterRegistry can be enabled by setting the following client configuration.
8366
8367 ```
8368 <property>
8369   <name>hbase.client.registry.impl</name>
8370   <value>org.apache.hadoop.hbase.client.MasterRegistry</value>
8371 </property>
8372 ```
8373
8374
8375 ---
8376
8377 * [HBASE-23069](https://issues.apache.org/jira/browse/HBASE-23069) | *Critical* | **periodic dependency bump for Sep 2019**
8378
8379 caffeine: 2.6.2 =\> 2.8.1
8380 commons-codec: 1.10 =\> 1.13
8381 commons-io: 2.5 =\> 2.6
8382 disrupter: 3.3.6 =\> 3.4.2
8383 httpcore: 4.4.6 =\> 4.4.13
8384 jackson: 2.9.10 =\> 2.10.1
8385 jackson.databind: 2.9.10.1 =\> 2.10.1
8386 jetty: 9.3.27.v20190418 =\> 9.3.28.v20191105
8387 protobuf.plugin: 0.5.0 =\> 0.6.1
8388 zookeeper: 3.4.10 =\> 3.4.14
8389 slf4j: 1.7.25 =\> 1.7.30
8390 rat: 0.12 =\> 0.13
8391 asciidoctor: 1.5.5 =\> 1.5.8
8392 asciidoctor.pdf: 1.5.0-alpha.15 =\> 1.5.0-rc.2
8393 error-prone: 2.3.3 =\> 2.3.4
8394
8395
8396 ---
8397
8398 * [HBASE-23686](https://issues.apache.org/jira/browse/HBASE-23686) | *Major* | **Revert binary incompatible change and remove reflection**
8399
8400 - Reverts a binary incompatible binary change for ByteRangeUtils
8401 - Usage of reflection inside CommonFSUtils removed
8402
8403
8404 ---
8405
8406 * [HBASE-23055](https://issues.apache.org/jira/browse/HBASE-23055) | *Major* | **Alter hbase:meta**
8407
8408 Adds being able to edit hbase:meta table schema. For example,
8409
8410 hbase(main):006:0\> alter 'hbase:meta', {NAME =\> 'info', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}
8411 Updating all regions with the new schema...
8412 All regions updated.
8413 Done.
8414 Took 1.2138 seconds
8415
8416 You can even add columnfamilies. Howevert, you cannot delete any of the core hbase:meta column families such as 'info' and 'table'.
8417
8418
8419 ---
8420
8421 * [HBASE-23347](https://issues.apache.org/jira/browse/HBASE-23347) | *Major* | **Pluggable RPC authentication**
8422
8423 This change introduces an internal abstraction layer which allows for new SASL-based authentication mechanisms to be used inside HBase services. All existing SASL-based authentication mechanism were ported to the new abstraction, making no external change in runtime semantics, client API, or RPC serialization format.
8424
8425 Developers familiar with extending HBase can implement authentication mechanism beyond simple Kerberos and DelegationTokens which authenticate HBase users against some other user database. HBase service authentication (Master to/from RegionServer) continue to operate solely over Kerberos.
8426
8427
8428 ---
8429
8430 * [HBASE-23156](https://issues.apache.org/jira/browse/HBASE-23156) | *Major* | **start-hbase.sh failed with ClassNotFoundException when build with hadoop3**
8431
8432 Introduce a new hbase-assembly/src/main/assembly/hadoop-three-compat.xml for build with hadoop 3.x.
8433
8434
8435 ---
8436
8437 * [HBASE-23680](https://issues.apache.org/jira/browse/HBASE-23680) | *Major* | **RegionProcedureStore missing cleaning of hfile archive**
8438
8439 Add a new config to hbase-default.xml
8440
8441   \<property\>
8442     \<name\>hbase.procedure.store.region.hfilecleaner.plugins\</name\>
8443     \<value\>org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner\</value\>
8444     \<description\>A comma-separated list of BaseHFileCleanerDelegate invoked by
8445     the RegionProcedureStore HFileCleaner service. These HFiles cleaners are
8446     called in order, so put the cleaner that prunes the most files in front. To
8447     implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath
8448     and add the fully qualified class name here. Always add the above
8449     default hfile cleaners in the list as they will be overwritten in
8450     hbase-site.xml.\</description\>
8451   \</property\>
8452
8453 It will share the same TTL with other HFileCleaners. And you can also implement your own cleaner and change this property to enable it.
8454
8455
8456 ---
8457
8458 * [HBASE-23675](https://issues.apache.org/jira/browse/HBASE-23675) | *Minor* | **Move to Apache parent POM version 22**
8459
8460 Updated parent pom to Apache version 22.
8461
8462
8463 ---
8464
8465 * [HBASE-23679](https://issues.apache.org/jira/browse/HBASE-23679) | *Critical* | **FileSystem instance leaks due to bulk loads with Kerberos enabled**
8466
8467 This issues fixes an issue with Bulk Loading on installations with Kerberos enabled and more than a single RegionServer. When multiple tables are involved in hosting a table's regions which are being bulk-loaded into, all but the RegionServer hosting the table's first Region will "leak" one DistributedFileSystem object onto the heap, never freeing that memory. Eventually, with enough bulk loads, this will create a situation for RegionServers where they have no free heap space and will either spend all time in JVM GC, lose their ZK session, or crash with an OutOfMemoryError.
8468
8469 The only mitigation for this issue is to periodically restart RegionServers. All earlier versions of HBase 2.x are subject to this issue (2.0.x, \<=2.1.8, \<=2.2.3)
8470
8471
8472 ---
8473
8474 * [HBASE-23286](https://issues.apache.org/jira/browse/HBASE-23286) | *Major* | **Improve MTTR: Split WAL to HFile**
8475
8476 Add a new feature to improve MTTR which have 3 steps to failover:
8477 1. Read WAL and write HFile to region’s column family’s recovered.hfiles directory.
8478 2. Open region.
8479 3. Bulkload the recovered.hfiles for every column family.
8480
8481 Compared to DLS(distributed log split), this feature will reduce region open time significantly.
8482
8483 Config hbase.wal.split.to.hfile to true to enable this featue.
8484
8485
8486 ---
8487
8488 * [HBASE-23619](https://issues.apache.org/jira/browse/HBASE-23619) | *Trivial* | **Use built-in formatting for logging in hbase-zookeeper**
8489
8490 Changed the logging in hbase-zookeeper to use built-in formatting
8491
8492
8493 ---
8494
8495 * [HBASE-23628](https://issues.apache.org/jira/browse/HBASE-23628) | *Minor* | **Replace Apache Commons Digest Base64 with JDK8 Base64**
8496
8497 From the PR:
8498
8499 "Yes. The two create the same output... I just wrote a small test suite to increase my confidence on that. I generated many tens of millions of random byte patterns and compared the output of the two algorithms. They came back identical every time.
8500
8501 "Just in case any inquiring minds would like to know, there is no longer an encoding required when generating the strings. The JDK implementation specifically specifies that strings returned are StandardCharsets.ISO\_8859\_1. This does not change anything because UTF8 and ISO\_8859 overlap for the limited character set (64 characters) the encoding uses."
8502
8503
8504 ---
8505
8506 * [HBASE-23651](https://issues.apache.org/jira/browse/HBASE-23651) | *Major* | **Region balance throttling can be disabled**
8507
8508 Set hbase.balancer.max.balancing to a int value which \<=0 will disable region balance throttling.
8509
8510
8511 ---
8512
8513 * [HBASE-23588](https://issues.apache.org/jira/browse/HBASE-23588) | *Major* | **Cache index blocks and bloom blocks on write if CacheCompactedBlocksOnWrite is enabled**
8514
8515 If cacheOnWrite is enabled during flush or compaction, index and bloom blocks(with data blocks) would be automatically cached during write.
8516
8517
8518 ---
8519
8520 * [HBASE-23369](https://issues.apache.org/jira/browse/HBASE-23369) | *Major* | **Auto-close 'unknown' Regions reported as OPEN on RegionServers**
8521
8522 If a RegionServer reports a Region as OPEN in disagreement with Master's status on the Region, the Master now tells the RegionServer to silently close the Region.
8523
8524
8525 ---
8526
8527 * [HBASE-23596](https://issues.apache.org/jira/browse/HBASE-23596) | *Major* | **HBCKServerCrashProcedure can double assign**
8528
8529 Makes it so the recently added HBCKServerCrashProcedure -- the SCP that gets invoked when an operator schedules an SCP via hbck2 scheduleRecoveries command -- now works the same as SCP EXCEPT if master knows nothing of the scheduled servername. In this latter case, HBCKSCP will do a full scan of hbase:meta looking for instances of the passed servername. If any found it will attempt cleanup of hbase:meta references by reassigning any found OPEN or OPENING and by closing any in CLOSING state.
8530
8531 Used to fix instances of what the 'HBCK Report' page shows as 'Unknown Servers'.
8532
8533
8534 ---
8535
8536 * [HBASE-23624](https://issues.apache.org/jira/browse/HBASE-23624) | *Major* | **Add a tool to dump the procedure info in HFile**
8537
8538 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.HFileProcedurePrettyPrinter to run the tool.
8539
8540
8541 ---
8542
8543 * [HBASE-23590](https://issues.apache.org/jira/browse/HBASE-23590) | *Major* | **Update maxStoreFileRefCount to maxCompactedStoreFileRefCount**
8544
8545 RegionsRecoveryChore introduced as part of HBASE-22460 tries to reopen regions based on config: hbase.regions.recovery.store.file.ref.count.
8546 Region reopen needs to take into consideration all compacted away store files that belong to the region and not store files(non-compacted).
8547
8548 Fixed this bug as part of this Jira.
8549 Updated description for corresponding configs:
8550
8551 1. hbase.master.regions.recovery.check.interval :
8552
8553 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8554
8555 2. hbase.regions.recovery.store.file.ref.count :
8556
8557 Very large number of ref count on a compacted store file indicates that it is a ref leak on that object(compacted store file). Such files can not be removed after it is invalidated via compaction. Only way to recover in such scenario is to reopen the region which can release all resources, like the refcount, leases, etc. This config represents Store files Ref Count threshold value considered for reopening regions. Any region with compacted store files ref count \> this value would be eligible for reopening by master. Here, we get the max refCount among all refCounts on all compacted away store files that belong to a particular region. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8558
8559
8560 ---
8561
8562 * [HBASE-23618](https://issues.apache.org/jira/browse/HBASE-23618) | *Major* | **Add a tool to dump procedure info in the WAL file**
8563
8564 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.WALProcedurePrettyPrinter to run the tool.
8565
8566
8567 ---
8568
8569 * [HBASE-23617](https://issues.apache.org/jira/browse/HBASE-23617) | *Major* | **Add a stress test tool for region based procedure store**
8570
8571 Use ./hbase org.apache.hadoop.hbase.procedure2.store.region.RegionProcedureStorePerformanceEvaluation to run the tool.
8572
8573
8574 ---
8575
8576 * [HBASE-23326](https://issues.apache.org/jira/browse/HBASE-23326) | *Critical* | **Implement a ProcedureStore which stores procedures in a HRegion**
8577
8578 Use a region based procedure store to replace the old customized WAL based procedure store. The procedure data migration is done automatically during upgrading. After upgrading, the MasterProcWALs directory will be deleted and a new MasterProc directory will be created. And notice that a region will still write WAL so we still have WAL files and they will be moved to the oldWALs directory. The file name is mostly like a normal WAL file, and the only difference is that it is ended with "$masterproc$".
8579
8580
8581 ---
8582
8583 * [HBASE-23320](https://issues.apache.org/jira/browse/HBASE-23320) | *Major* | **Upgrade surefire plugin to 3.0.0-M4**
8584
8585 Bumped surefire plugin to 3.0.0-M4
8586
8587
8588 ---
8589
8590 * [HBASE-20461](https://issues.apache.org/jira/browse/HBASE-20461) | *Major* | **Implement fsync for AsyncFSWAL**
8591
8592 Now AsyncFSWAL also supports Durability.FSYNC\_WAL.
8593
8594
8595 ---
8596
8597 * [HBASE-23066](https://issues.apache.org/jira/browse/HBASE-23066) | *Minor* | **Create a config that forces to cache blocks on compaction**
8598
8599 The configuration 'hbase.rs.cacheblocksonwrite' was used to enable caching the blocks on write. But purposefully we were not caching the blocks when we do compaction (since it may be very aggressive) as the caching happens as and when the writer completes a block.
8600 In cloud environments since they have bigger sized caches - though they try to enable 'hbase.rs.prefetchblocksonopen' (non - aggressive way of caching the blocks proactively on reader creation) it does not help them because it takes time to cache the compacted blocks.
8601 This feature creates a new configuration  'hbase.rs.cachecompactedblocksonwrite' which when set to 'true' will enable the blocks created out of compaction.
8602 Remember that since it is aggressive caching the user should be having enough cache space - if not it may lead to other active blocks getting evicted.
8603 From the shell this can be enabled by using the option per Column Family also by using the below format
8604 {code}
8605 create 't1', 'f1', {NUMREGIONS =\> 15, SPLITALGO =\> 'HexStringSplit', CONFIGURATION =\> {'hbase.rs.cachecompactedblocksonwrite' =\> 'true'}}
8606 {code}
8607
8608
8609 ---
8610
8611 * [HBASE-23239](https://issues.apache.org/jira/browse/HBASE-23239) | *Major* | **Reporting on status of backing MOB files from client-facing cells**
8612
8613 <!-- markdown -->
8614
8615 Users of the MOB feature can now use the `mobrefs` utility to get statistics about data in the MOB system and verify the health of backing files on HDFS.
8616
8617 ```
8618 HADOOP_CLASSPATH=/etc/hbase/conf:$(hbase mapredcp) yarn jar \
8619     /some/path/to/hbase-shaded-mapreduce.jar mobrefs mobrefs-report-output some_table foo
8620 ```
8621
8622 See javadocs of the class `MobRefReporter` for more details.
8623
8624 the reference guide has added some information about MOB internals and troubleshooting.
8625
8626
8627 ---
8628
8629 * [HBASE-23549](https://issues.apache.org/jira/browse/HBASE-23549) | *Minor* | **Document steps to disable MOB for a column family**
8630
8631 The reference guide now includes a walk through of disabling the MOB feature if needed while maintaining availability.
8632
8633
8634 ---
8635
8636 * [HBASE-23582](https://issues.apache.org/jira/browse/HBASE-23582) | *Minor* | **Unbalanced braces in string representation of table descriptor**
8637
8638 Fixed unbalanced braces in string representation within HBase shell
8639
8640
8641 ---
8642
8643 * [HBASE-23293](https://issues.apache.org/jira/browse/HBASE-23293) | *Minor* | **[REPLICATION] make ship edits timeout configurable**
8644
8645 The default rpc timeout for ReplicationSourceShipper#shipEdits is 60s, when bulkload replication enabled, timeout exception may be occurred.
8646 Now we can conf the timeout value through replication.source.shipedits.timeout, and it’s adaptive.
8647
8648
8649 ---
8650
8651 * [HBASE-23312](https://issues.apache.org/jira/browse/HBASE-23312) | *Major* | **HBase Thrift SPNEGO configs (HBASE-19852) should be backwards compatible**
8652
8653 The newer HBase Thrift SPNEGO configs should not be required. The hbase.thrift.spnego.keytab.file and hbase.thrift.spnego.principal configs will fall back to the hbase.thrift.keytab.file and hbase.thrift.kerberos.principal original configs. The older configs will log a deprecation warning. It is preferred to new the newer SPNEGO configurations.
8654
8655
8656 ---
8657
8658 * [HBASE-22969](https://issues.apache.org/jira/browse/HBASE-22969) | *Minor* | **A new binary component comparator(BinaryComponentComparator) to perform comparison of arbitrary length and position**
8659
8660 With BinaryComponentCompartor applications will be able to design diverse and powerful set of filters for rows and columns. See https://issues.apache.org/jira/browse/HBASE-22969 for example. In general, the comparator can be used with any filter taking ByteArrayComparable. As of now, following filters take ByteArrayComparable:
8661
8662 1. RowFilter
8663 2. ValueFilter
8664 3. QualifierFilter
8665 4. FamilyFilter
8666 5. ColumnValueFilter
8667
8668
8669 ---
8670
8671 * [HBASE-23234](https://issues.apache.org/jira/browse/HBASE-23234) | *Major* | **Provide .editorconfig based on checkstyle configuration**
8672
8673 Adds a .editorconfig file with configurations populated by IntelliJ, based on our checkstyle configuration. There's lots of IntelliJ-specific configs in here that I assume are not replicated to Eclipse or Netbeans users. Any devs using those tools should push whatever updates they see fit, but please start with the checkstyle configs as the origin of truth.
8674
8675
8676 ---
8677
8678 * [HBASE-23322](https://issues.apache.org/jira/browse/HBASE-23322) | *Minor* | **[hbck2] Simplification on HBCKSCP scheduling**
8679
8680 An hbck2 scheduleRecoveries will run a subclass of ServerCrashProcedure which asks Master what Regions were on the dead Server but it will also do a hbase:meta table scan to see if any vestiges of the old Server remain (for the case where an SCP failed mid-point leaving references in place or where Master and hbase:meta deviated in accounting).
8681
8682
8683 ---
8684
8685 * [HBASE-23321](https://issues.apache.org/jira/browse/HBASE-23321) | *Minor* | **[hbck2] fixHoles of fixMeta doesn't update in-memory state**
8686
8687 If holes in hbase:meta, hbck2 fixMeta now will update Master in-memory state so you do not need to restart master just so you can assign the new hole-bridging regions.
8688
8689
8690 ---
8691
8692 * [HBASE-23282](https://issues.apache.org/jira/browse/HBASE-23282) | *Major* | **HBCKServerCrashProcedure for 'Unknown Servers'**
8693
8694 hbck2 scheduleRecoveries will now run a SCP that also looks in hbase:meta for any references to the scheduled server -- not just consult Master in-memory state -- just in case vestiges of the server are leftover in hbase:meta
8695
8696
8697 ---
8698
8699 * [HBASE-19450](https://issues.apache.org/jira/browse/HBASE-19450) | *Minor* | **Add log about average execution time for ScheduledChore**
8700
8701 <!-- markdown -->
8702 HBase internal chores now log a moving average of how long execution of each chore takes at `INFO` level for the logger `org.apache.hadoop.hbase.ScheduledChore`.
8703
8704 Such messages will happen at most once per five minutes.
8705
8706
8707 ---
8708
8709 * [HBASE-23250](https://issues.apache.org/jira/browse/HBASE-23250) | *Minor* | **Log message about CleanerChore delegate initialization should be at INFO**
8710
8711 CleanerChore delegate initialization is now logged at INFO level instead of DEBUG
8712
8713
8714 ---
8715
8716 * [HBASE-23243](https://issues.apache.org/jira/browse/HBASE-23243) | *Major* | **[pv2] Filter out SUCCESS procedures; on decent-sized cluster, plethora overwhelms problems**
8717
8718 The 'Procedures & Locks' tab in Master UI only displays problematic Procedures now (RUNNABLE, WAITING-TIMEOUT, etc.). It no longer notes procedures whose state is SUCCESS.
8719
8720
8721 ---
8722
8723 * [HBASE-23227](https://issues.apache.org/jira/browse/HBASE-23227) | *Blocker* | **Upgrade jackson-databind to 2.9.10.1 to avoid recent CVEs**
8724
8725 <!-- markdown -->
8726
8727 the Apache HBase REST Proxy now uses Jackson Databind version 2.9.10.1 to address the following CVEs
8728
8729   - CVE-2019-16942
8730   - CVE-2019-16943
8731
8732 Users of prior releases with Jackson Databind 2.9.10 are advised to either upgrade to this release or to upgrade their local Jackson Databind jar directly.
8733
8734
8735 ---
8736
8737 * [HBASE-23222](https://issues.apache.org/jira/browse/HBASE-23222) | *Critical* | **Better logging and mitigation for MOB compaction failures**
8738
8739 <!-- markdown -->
8740
8741 The MOB compaction process in the HBase Master now logs more about its activity.
8742
8743 In the event that you run into the problems described in HBASE-22075, there is a new HFileCleanerDelegate that will stop all removal of MOB hfiles from the archive area. It can be configured by adding `org.apache.hadoop.hbase.mob.ManualMobMaintHFileCleaner` to the list configured for `hbase.master.hfilecleaner.plugins`. This new cleaner delegate will cause your archive area to grow unbounded; you will have to manually prune files which may be prohibitively complex. Consider if your use case will allow you to mitigate by disabling mob compactions instead.
8744
8745 Caveats:
8746 * Be sure the list of cleaner delegates still includes the default cleaners you will likely need: ttl, snapshot, and hlink.
8747 * Be mindful that if you enable this cleaner delegate then there will be *no* automated process for removing these mob hfiles. You should see a single region per table in `%hbase_root%/archive` that accumulates files over time. You will have to determine which of these files are safe or not to remove.
8748 * You should list this cleaner delegate after the snapshot and hlink delegates so that you can enable sufficient logging to determine when an archived mob hfile is needed by those subsystems. When set to `TRACE` logging, the CleanerChore logger will include archive retention decision justifications.
8749 * If your use case creates a large number of uniquely named tables, this new delegate will cause memory pressure on the master.
8750
8751
8752 ---
8753
8754 * [HBASE-15519](https://issues.apache.org/jira/browse/HBASE-15519) | *Major* | **Add per-user metrics**
8755
8756 Adds per-user metrics for reads/writes to each RegionServer. These metrics are exported by default. hbase.regionserver.user.metrics.enabled can be used to disable the feature if desired for any reason.
8757
8758
8759 ---
8760
8761 * [HBASE-22460](https://issues.apache.org/jira/browse/HBASE-22460) | *Minor* | **Reopen a region if store reader references may have leaked**
8762
8763 Leaked store files can not be removed even after it is invalidated via compaction. A reasonable mitigation for a reader reference leak would be a fast reopen of the region on the same server.
8764
8765 Configs:
8766
8767 1. hbase.master.regions.recovery.check.interval :
8768
8769 Regions Recovery Chore interval in milliseconds. This chore keeps running at this interval to find all regions with configurable max store file ref count and reopens them. Defaults to 20 mins
8770
8771 2. hbase.regions.recovery.store.file.ref.count :
8772
8773 This config represents Store files Ref Count threshold value considered for reopening regions. Any region with store files ref count \> this value would be eligible for reopening by master. Default value -1 indicates this feature is turned off. Only positive integer value should be provided to enable this feature.
8774
8775
8776 ---
8777
8778 * [HBASE-23172](https://issues.apache.org/jira/browse/HBASE-23172) | *Minor* | **HBase Canary region success count metrics reflect column family successes, not region successes**
8779
8780 Added a comment to make clear that read/write success counts are tallying column family success counts, not region success counts.
8781
8782 Additionally, the region read and write latencies previously only stored the latencies of the last column family of the region reads/writes. This has been fixed by using a map of each region to a list of read and write latency values.
8783
8784
8785 ---
8786
8787 * [HBASE-23177](https://issues.apache.org/jira/browse/HBASE-23177) | *Major* | **If fail to open reference because FNFE, make it plain it is a Reference**
8788
8789 Changes the message on the FNFE exception thrown when the file a Reference points to is missing; the message now includes detail on Reference as well as pointed-to file so can connect how FNFE relates to region open.
8790
8791
8792 ---
8793
8794 * [HBASE-20626](https://issues.apache.org/jira/browse/HBASE-20626) | *Major* | **Change the value of "Requests Per Second" on WEBUI**
8795
8796 Use 'totalRowActionRequestCount' to calculate QPS on web UI.
8797
8798
8799 ---
8800
8801 * [HBASE-22874](https://issues.apache.org/jira/browse/HBASE-22874) | *Critical* | **Define a public interface for Canary and move existing implementation to LimitedPrivate**
8802
8803 <!-- markdown -->
8804 Downstream users who wish to programmatically check the health of their HBase cluster may now rely on a public interface derived from the previously private implementation of the canary cli tool. The interface is named `Canary` and can be found in the user facing javadocs.
8805
8806 Downstream users who previously relied on the invoking the canary via the Java classname (either on the command line or programmatically) will need to change how they do so because the non-public implementation has moved.
8807
8808
8809 ---
8810
8811 * [HBASE-23035](https://issues.apache.org/jira/browse/HBASE-23035) | *Major* | **Retain region to the last RegionServer make the failover slower**
8812
8813 Since 2.0.0，when one regionserver crashed and back online again, AssignmentManager will retain the region locations and try assign the regions to this regionserver(same host:port with the crashed one) again. But for 1.x.x, the behavior is round-robin assignment for the regions belong to the crashed regionserver. This jira change the "retain" assignment to round-robin assignment, which is same with 1.x.x version. This change will make the failover faster and improve availability.
8814
8815
8816 ---
8817
8818 * [HBASE-23046](https://issues.apache.org/jira/browse/HBASE-23046) | *Minor* | **Remove compatibility case from truncate command**
8819
8820 Remove backward compatibility from \`truncate\` and \`truncate\_preserve\` shell commands. This means that these commands from HBase Clients are not compatible with pre-0.99 HBase clusters.
8821
8822
8823 ---
8824
8825 * [HBASE-23040](https://issues.apache.org/jira/browse/HBASE-23040) | *Minor* | **region mover gives NullPointerException instead of saying a host isn't in the cluster**
8826
8827 giving the region mover "unload" command a region server name that isn't recognized by the cluster results in a "I don't know about that host" message instead of a NPE.
8828
8829 set log level to DEBUG if you'd like the region mover to log the set of region server names it got back from the cluster.
8830
8831
8832 ---
8833
8834 * [HBASE-21874](https://issues.apache.org/jira/browse/HBASE-21874) | *Major* | **Bucket cache on Persistent memory**
8835
8836 Added a new IOEngine type for Bucket cache ie Persistent memory. In order to use BC over pmem configure IOEngine as
8837 \<property\>
8838     \<name\>hbase.bucketcache.ioengine\</name\>
8839     \<value\> pmem:///path in persistent memory \</value\>
8840   \</property\>
8841
8842
8843 ---
8844
8845 * [HBASE-22760](https://issues.apache.org/jira/browse/HBASE-22760) | *Major* | **Stop/Resume Snapshot Auto-Cleanup activity with shell command**
8846
8847 By default, snapshot auto cleanup based on TTL would be enabled for any new cluster. At any point in time, if snapshot cleanup is supposed to be stopped due to some snapshot restore activity or any other reason, it is advisable to disable it using shell command:
8848 hbase\> snapshot\_cleanup\_switch false
8849
8850 We can re-enable it using:
8851 hbase\> snapshot\_cleanup\_switch true
8852
8853 We can query whether snapshot auto cleanup is enabled for cluster using:
8854 hbase\> snapshot\_cleanup\_enabled
8855
8856
8857 ---
8858
8859 * [HBASE-22796](https://issues.apache.org/jira/browse/HBASE-22796) | *Major* | **[HBCK2] Add fix of overlaps to fixMeta hbck Service**
8860
8861 Adds fix of overlaps to the fixMeta hbck service method. Uses the bulk-merge facility. Merges a max of 10 at a time. Set hbase.master.metafixer.max.merge.count to higher if you want to do more than 10 in the one go.
8862
8863
8864 ---
8865
8866 * [HBASE-21745](https://issues.apache.org/jira/browse/HBASE-21745) | *Critical* | **Make HBCK2 be able to fix issues other than region assignment**
8867
8868 This issue adds via its subtasks:
8869
8870  \* An 'HBCK Report' page to the Master UI added by HBASE-22527+HBASE-22709+HBASE-22723+ (since 2.1.6, 2.2.1, 2.3.0). Lists consistency or anomalies found via new hbase:meta consistency checking extensions added to CatalogJanitor (holes, overlaps, bad servers) and by a new 'HBCK chore' that runs at a lesser periodicity that will note filesystem orphans and overlaps as well as the following conditions:
8871  \*\* Master thought this region opened, but no regionserver reported it.
8872  \*\* Master thought this region opened on Server1, but regionserver reported Server2
8873  \*\* More than one regionservers reported opened this region
8874  Both chores can be triggered from the shell to regenerate ‘new’ reports.
8875  \* Means of scheduling a ServerCrashProcedure (HBASE-21393).
8876  \* An ‘offline’ hbase:meta rebuild (HBASE-22680).
8877  \* Offline replace of hbase.version and hbase.id
8878  \* Documentation on how to use completebulkload tool to ‘adopt’ orphaned data found by new HBCK2 ‘filesystem’ check (see below) and ‘HBCK chore’ (HBASE-22859)
8879  \* A ‘holes’ and ‘overlaps’ fix that runs in the master that uses new bulk-merge facility to collapse many overlaps in the one go.
8880  \* hbase-operator-tools HBCK2 client tool got a bunch of additions:
8881  \*\* A specialized 'fix' for the case where operators ran old hbck 'offlinemeta' repair and destroyed their hbase:meta; it ties together holes in meta with orphaned data in the fs (HBASE-22567)
8882  \*\* A ‘filesystem’ command that reports on orphan data as well as bad references and hlinks with a ‘fix’ for the latter two options (based on hbck1 facility updated).
8883  \*\* Adds back the ‘replication’ fix facility from hbck1 (HBASE-22717)
8884
8885 The compound result is that hbck2 is now in excess of hbck1 abilities. The provided functionality is disaggregated as per the hbck2 philosophy of providing 'plumbing' rather than 'porcelain' so there is work to do still adding fix-it playbooks, scripting across outages, and automation.
8886
8887
8888 ---
8889
8890 * [HBASE-22802](https://issues.apache.org/jira/browse/HBASE-22802) | *Major* | **Avoid temp ByteBuffer allocation in FileIOEngine#read**
8891
8892 HBASE-21879 introduces a utility class (org.apache.hadoop.hbase.io.ByteBuffAllocator) used for allocating/freeing ByteBuffers from/to NIO ByteBuffer pool, when BucketCache enabled with file or mmap engine, we will use this ByteBuffer pool to avoid temp ByteBuffer allocation a lot.
8893
8894
8895 ---
8896
8897 * [HBASE-11062](https://issues.apache.org/jira/browse/HBASE-11062) | *Major* | **hbtop**
8898
8899 Introduces hbtop that's a real-time monitoring tool for HBase like Unix's top command. See the ref guide for the details: https://hbase.apache.org/book.html#hbtop
8900
8901
8902 ---
8903
8904 * [HBASE-21879](https://issues.apache.org/jira/browse/HBASE-21879) | *Major* | **Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose**
8905
8906 Before this issue, we've made the read path 100% offheap when block hit the BucketCache 100%, but if the cache missed then RS need to read the block by on-heap API, which would cause high young GC pressure.
8907 This issue will read the block by offheap even if reading the block from filesystem directly, it have some requirement for hadoop version(\>=2.9.3) but can also works with older hadoop version(means still works fine but will read block onheap). We have written a careful doc about the implementation, performance and practice here: https://docs.google.com/document/d/1xSy9axGxafoH-Qc17zbD2Bd--rWjjI00xTWQZ8ZwI\_E/edit#heading=h.nch5d72p27ex, for more details please read it.
8908
8909
8910 ---
8911
8912 * [HBASE-22618](https://issues.apache.org/jira/browse/HBASE-22618) | *Major* | **added the possibility to load custom cost functions**
8913
8914 <!-- markdown -->
8915 Extends `StochasticLoadBalancer` to support user-provided cost function. These are loaded in addition to the default set of cost functions. Custom function implementations must extend `StochasticLoadBalancer$CostFunction`. Enable any additional functions by placing them on the master class path and configuring `hbase.master.balancer.stochastic.additionalCostFunctions` with a comma-separated list of fully-qualified class names.
8916
8917
8918 ---
8919
8920 * [HBASE-22867](https://issues.apache.org/jira/browse/HBASE-22867) | *Critical* | **The ForkJoinPool in CleanerChore will spawn thousands of threads in our cluster with thousands table**
8921
8922 Replace the ForkJoinPool in CleanerChore by ThreadPoolExecutor which can limit the spawn thread size and avoid  the master GC frequently.  The replacement is an internal implementation in CleanerChore,  so no config key change, the upstream users can just upgrade the hbase master without any other change.
8923
8924
8925 ---
8926
8927 * [HBASE-22810](https://issues.apache.org/jira/browse/HBASE-22810) | *Major* | **Initialize an separate ThreadPoolExecutor for taking/restoring snapshot**
8928
8929 Introduced a new config key for the snapshot taking/restoring operations at master side:  hbase.master.executor.snapshot.threads, its default value is 3.  means we can have 3 snapshot operations running at the same time.
8930
8931
8932 ---
8933
8934 * [HBASE-22863](https://issues.apache.org/jira/browse/HBASE-22863) | *Major* | **Avoid Jackson versions and dependencies with known CVEs**
8935
8936 1. Stopped exposing vulnerable Jackson1 dependencies so that downstreamers would not pull it in from HBase.
8937 2. However, since Hadoop requires some Jackson1 dependencies, put vulnerable Jackson mapper at test scope in some HBase modules and hence, HBase tarball created by hbase-assembly contains Jackson1 mapper jar in lib. Still, downsteam applications can't pull in Jackson1 from HBase.
8938
8939
8940 ---
8941
8942 * [HBASE-22841](https://issues.apache.org/jira/browse/HBASE-22841) | *Major* | **TimeRange's factory functions do not support ranges, only \`allTime\` and \`at\`**
8943
8944 Add serveral API in TimeRange class for avoiding using the deprecated TimeRange constructor:
8945 \* TimeRange#from: Represents the time interval [minStamp, Long.MAX\_VALUE)
8946 \* TimeRange#until: Represents the time interval [0, maxStamp)
8947 \* TimeRange#between: Represents the time interval [minStamp, maxStamp)
8948
8949
8950 ---
8951
8952 * [HBASE-22833](https://issues.apache.org/jira/browse/HBASE-22833) | *Minor* | **MultiRowRangeFilter should provide a method for creating a filter which is functionally equivalent to multiple prefix filters**
8953
8954 Provide a public method in MultiRowRangeFilter class to speed the requirement of filtering with multiple row prefixes, it will expand the row prefixes as multiple rowkey ranges by MultiRowRangeFilter, it's more efficient.
8955 {code}
8956 public MultiRowRangeFilter(byte[][] rowKeyPrefixes);
8957 {code}
8958
8959
8960 ---
8961
8962 * [HBASE-22856](https://issues.apache.org/jira/browse/HBASE-22856) | *Major* | **HBASE-Find-Flaky-Tests fails with pip error**
8963
8964 Update the base docker image to ubuntu 18.04 for the find flaky tests jenkins job.
8965
8966
8967 ---
8968
8969 * [HBASE-22771](https://issues.apache.org/jira/browse/HBASE-22771) | *Major* | **[HBCK2] fixMeta method and server-side support**
8970
8971 Adds a fixMeta method to hbck Service. Fixes holes in hbase:meta. Follow-up to fix overlaps. See HBASE-22567 also.
8972
8973 Follow-on is adding a client-side to hbase-operator-tools that can exploit this new addition (HBASE-22825)
8974
8975
8976 ---
8977
8978 * [HBASE-22777](https://issues.apache.org/jira/browse/HBASE-22777) | *Major* | **Add a multi-region merge (for fixing overlaps, etc.)**
8979
8980 Changes merge so you can merge more than two regions at a time.  Currently only available inside HBase. HBASE-22827, a follow-on, is about exposing the facility in the Admin API (and then via the shell).
8981
8982
8983 ---
8984
8985 * [HBASE-15666](https://issues.apache.org/jira/browse/HBASE-15666) | *Critical* | **shaded dependencies for hbase-testing-util**
8986
8987 New shaded artifact for testing: hbase-shaded-testing-util.
8988
8989
8990 ---
8991
8992 * [HBASE-22776](https://issues.apache.org/jira/browse/HBASE-22776) | *Major* | **Rename config names in user scan snapshot feature**
8993
8994 After HBASE-22776, the steps to config user scan snapshot feature is as followings:
8995 1. Check HDFS configuration
8996 2. Add master coprocessor:
8997     hbase.coprocessor.master.classes=
8998     “org.apache.hadoop.hbase.security.access.AccessController,
8999 org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController”
9000 3. Enable this feature:
9001     hbase.acl.sync.to.hdfs.enable=true
9002 4. Modify table scheme to enable this feature for a table:
9003     alter 't1', CONFIGURATION =\> {'hbase.acl.sync.to.hdfs.enable' =\> 'true'}
9004
9005
9006 ---
9007
9008 * [HBASE-22539](https://issues.apache.org/jira/browse/HBASE-22539) | *Blocker* | **WAL corruption due to early DBBs re-use when Durability.ASYNC\_WAL is used**
9009
9010 We found a critical bug which can lead to WAL corruption when Durability.ASYNC\_WAL is used. The reason is that we release a ByteBuffer before actually persist the content into WAL file.
9011
9012 The problem maybe lead to several errors, for example, ArrayIndexOfOutBounds when replaying WAL. This is because that the ByteBuffer is reused by others.
9013
9014 ERROR org.apache.hadoop.hbase.executor.EventHandler: Caught throwable while processing event RS\_LOG\_REPLAY
9015 java.lang.ArrayIndexOutOfBoundsException: 18056
9016         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1365)
9017         at org.apache.hadoop.hbase.KeyValue.getFamilyLength(KeyValue.java:1358)
9018         at org.apache.hadoop.hbase.PrivateCellUtil.matchingFamily(PrivateCellUtil.java:735)
9019         at org.apache.hadoop.hbase.CellUtil.matchingFamily(CellUtil.java:816)
9020         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEditFamily(WALEdit.java:143)
9021         at org.apache.hadoop.hbase.wal.WALEdit.isMetaEdit(WALEdit.java:148)
9022         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:297)
9023         at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:195)
9024         at org.apache.hadoop.hbase.regionserver.SplitLogWorker$1.exec(SplitLogWorker.java:100)
9025
9026 And may even cause segmentation fault and crash the JVM directly. You will see a hs\_err\_pidXXX.log file and usually the problem is SIGSEGV. This is usually because that the ByteBuffer has already been returned to the OS and used for other purpose.
9027
9028 The problem has been reported several times in the past and this time Wellington Ramos Chevreuil provided the full logs and deeply analyzed the logs so we can find the root cause. And Lijin Bin figured out that the problem may only happen when Durability.ASYNC\_WAL is used. Thanks to them.
9029
9030 The problem only effects the 2.x releases, all users are highly recommand to upgrade to a release which has this fix in, especially that if you use Durability.ASYNC\_WAL.
9031
9032
9033 ---
9034
9035 * [HBASE-22737](https://issues.apache.org/jira/browse/HBASE-22737) | *Major* | **Add a new admin method and shell cmd to trigger the hbck chore to run**
9036
9037 Add a new method runHbckChore in Hbck interface and a new shell cmd hbck\_chore\_run to request HBCK chore to run at master side.
9038
9039
9040 ---
9041
9042 * [HBASE-22741](https://issues.apache.org/jira/browse/HBASE-22741) | *Major* | **Show catalogjanitor consistency complaints in new 'HBCK Report' page**
9043
9044 Adds a "CatalogJanitor hbase:meta Consistency Issues" section to the new 'HBCK Report' page added by HBASE-22709. This section is empty unless the most recent CatalogJanitor scan turned up problems. If so, will show table of issues found.
9045
9046
9047 ---
9048
9049 * [HBASE-22723](https://issues.apache.org/jira/browse/HBASE-22723) | *Major* | **Have CatalogJanitor report holes and overlaps; i.e. problems it sees when doing its regular scan of hbase:meta**
9050
9051 When CatalogJanitor runs, it now checks for holes, overlaps, empty info:regioninfo columns and bad servers. Dumps findings into log. Follow-up adds report to new 'HBCK Report' linked off the Master UI.
9052
9053 NOTE: All features but the badserver check made it into branch-2.1 and branch-2.0 backports.
9054
9055
9056 ---
9057
9058 * [HBASE-22714](https://issues.apache.org/jira/browse/HBASE-22714) | *Trivial* | **BuffferedMutatorParams opertationTimeOut() is misspelt**
9059
9060 The misspelled BufferedMutatorParams.opertationTimeout method has been marked as deprecated, and will be removed in 4.0.0. Please use the BufferedMutatorParams.operationTimeout method instead.
9061
9062
9063 ---
9064
9065 * [HBASE-22580](https://issues.apache.org/jira/browse/HBASE-22580) | *Major* | **Add a table attribute to make user scan snapshot feature configurable for table**
9066
9067 If a table user scan snapshots of the table, please config the following table scheme attribute to make granted users' ACLs are added to hfiles:
9068 alter 't1', CONFIGURATION =\> {'hbase.user.scan.snapshot.enable' =\> 'true'}
9069
9070
9071 ---
9072
9073 * [HBASE-22709](https://issues.apache.org/jira/browse/HBASE-22709) | *Major* | **Add a chore thread in master to do hbck checking and display results in 'HBCK Report' page**
9074
9075 1. Add a new chore thread in master to do hbck checking
9076 2. Add a new web ui "HBCK Report" page to display checking results.
9077
9078 This feature is enabled by default. And the hbck chore run per 60 minutes by default. You can config "hbase.master.hbck.checker.interval" to a value lesser than or equal to 0 for disabling the chore.
9079
9080 Notice: the config "hbase.master.hbck.checker.interval" was renamed to "hbase.master.hbck.chore.interval" in HBASE-22737.
9081
9082
9083 ---
9084
9085 * [HBASE-21773](https://issues.apache.org/jira/browse/HBASE-21773) | *Critical* | **rowcounter utility should respond to pleas for help**
9086
9087 This adds [-h\|-help] options to rowcounter. Passing either -h or -help will print rowcounter guide as below:
9088
9089 $hbase rowcounter -h
9090
9091 usage: hbase rowcounter \<tablename\> [options] [\<column1\> \<column2\>...]
9092 Options:
9093     --starttime=\<arg\>       starting time filter to start counting rows from.
9094     --endtime=\<arg\>         end time filter limit, to only count rows up to this timestamp.
9095     --range=\<arg\>           [startKey],[endKey][;[startKey],[endKey]...]]
9096     --expectedCount=\<arg\>   expected number of rows to be count.
9097 For performance, consider the following configuration properties:
9098 -Dhbase.client.scanner.caching=100
9099 -Dmapreduce.map.speculative=false
9100
9101
9102 ---
9103
9104 * [HBASE-22578](https://issues.apache.org/jira/browse/HBASE-22578) | *Major* | **HFileCleaner should not delete empty ns/table directories used for user san snapshot feature**
9105
9106 The HFileCleaner will clean the empty directories under archive, but if enable user scan snaphot feature, the user ACLs are set at there directories, so please config the following cleaner to make the directories with user ACLs not be cleaned:
9107 hbase.master.hfilecleaner.plugins=org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclCleaner
9108
9109
9110 ---
9111
9112 * [HBASE-22722](https://issues.apache.org/jira/browse/HBASE-22722) | *Blocker* | **Upgrade jackson databind dependencies to 2.9.9.1**
9113
9114 Upgrade jackson databind dependency to 2.9.9.1 due to CVEs
9115
9116 https://nvd.nist.gov/vuln/detail/CVE-2019-12814
9117
9118 https://nvd.nist.gov/vuln/detail/CVE-2019-12384
9119
9120
9121 ---
9122
9123 * [HBASE-22527](https://issues.apache.org/jira/browse/HBASE-22527) | *Major* | **[hbck2] Add a master web ui to show the problematic regions**
9124
9125 Add a new master web UI to show the potentially problematic opened regions. There are three case:
9126 1. Master thought this region opened, but no regionserver reported it.
9127 2. Master thought this region opened on Server1, but regionserver reported Server2
9128 3. More than one regionservers reported opened this region
9129
9130
9131 ---
9132
9133 * [HBASE-22648](https://issues.apache.org/jira/browse/HBASE-22648) | *Minor* | **Snapshot TTL**
9134
9135 Feature: Take a Snapshot With TTL for auto-cleanup
9136
9137 Attribute:
9138 1. TTL
9139      - Specify TTL in sec while creating snapshot. e.g. snapshot 'mytable', 'snapshot1234', {TTL =\> 86400}  (snapshot to be auto-cleaned after 24 hr)
9140
9141 Configs:
9142 1. Default Snapshot TTL:
9143      - FOREVER by default
9144      - User specified Default TTL(sec) with config: hbase.master.snapshot.ttl
9145
9146 2. If Snapshot cleanup is supposed to be stopped due to some snapshot restore activity, disable it with config:
9147      - hbase.master.cleaner.snapshot.disable: "true"
9148     With this config, HMaster needs restart just like any other hbase-site config.
9149
9150
9151 For more details, see the section "Take a Snapshot With TTL" in the HBase Reference Guide.
9152
9153
9154 ---
9155
9156 * [HBASE-22610](https://issues.apache.org/jira/browse/HBASE-22610) | *Trivial* | **[BucketCache] Rename "hbase.offheapcache.minblocksize"**
9157
9158 The config point "hbase.offheapcache.minblocksize" was wrong and is now deprecated. The new config point is "hbase.blockcache.minblocksize".
9159
9160
9161 ---
9162
9163 * [HBASE-22690](https://issues.apache.org/jira/browse/HBASE-22690) | *Major* | **Deprecate / Remove OfflineMetaRepair in hbase-2+**
9164
9165 OfflineMetaRepair is no longer supported in HBase-2+. Please refer to https://hbase.apache.org/book.html#HBCK2
9166
9167 This tool is deprecated in 2.x and will be removed in 3.0.
9168
9169
9170 ---
9171
9172 * [HBASE-22673](https://issues.apache.org/jira/browse/HBASE-22673) | *Major* | **Avoid to expose protobuf stuff in Hbck interface**
9173
9174 Mark the Hbck#scheduleServerCrashProcedure(List\<HBaseProtos.ServerName\> serverNames) as deprecated. Use Hbck#scheduleServerCrashProcedures(List\<ServerName\> serverNames) instead.
9175
9176
9177 ---
9178
9179 * [HBASE-22617](https://issues.apache.org/jira/browse/HBASE-22617) | *Blocker* | **Recovered WAL directories not getting cleaned up**
9180
9181 In HBASE-20734 we moved the recovered.edits onto the wal file system but when constructing the directory we missed the BASE\_NAMESPACE\_DIR('data'). So when using the default config, you will find that there are lots of new directories at the same level with the 'data' directory.
9182
9183 In this issue, we add the BASE\_NAMESPACE\_DIR back, and also try our best to clean up the wrong directories. But we can only clean up the region level directories, so if you want a clean fs layout on HDFS you still need to manually delete the empty directories at the same level with 'data'.
9184
9185 The effect versions are 2.2.0, 2.1.[1-5], 1.4.[8-10], 1.3.[3-5].
9186
9187
9188 ---
9189
9190 * [HBASE-21995](https://issues.apache.org/jira/browse/HBASE-21995) | *Major* | **Add a coprocessor to set HDFS ACL for hbase granted user**
9191
9192 Add a coprocessor to set HDFS acls to make hbase granted users with READ permission have the access to scan snapshots.
9193 To use this feature, please make sure the HDFS config is set:
9194 dfs.namenode.acls.enabled=true
9195 fs.permissions.umask-mode=027
9196
9197 and set the HBase config:
9198 hbase.coprocessor.master.classes="org.apache.hadoop.hbase.security.access.AccessController,org.apache.hadoop.hbase.security.access.SnapshotScannerHDFSAclController"
9199 hbase.user.scan.snapshot.enable=true
9200
9201
9202 ---
9203
9204 * [HBASE-22596](https://issues.apache.org/jira/browse/HBASE-22596) | *Minor* | **[Chore] Separate the execution period between CompactionChecker and PeriodicMemStoreFlusher**
9205
9206 hbase.regionserver.compaction.check.period is used for controlling how often the compaction checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9207
9208 hbase.regionserver.flush.check.period is used for controlling how ofter the flush checker runs. If unset, will use hbase.server.thread.wakefrequency as default value.
9209
9210
9211 ---
9212
9213 * [HBASE-22588](https://issues.apache.org/jira/browse/HBASE-22588) | *Major* | **Upgrade jaxws-ri dependency to 2.3.2**
9214
9215 <!-- markdown -->
9216
9217 When run with JDK11 HBase now uses more recent version of the jaxws reference implementation (v2.3.2).
9218
9219
9220 ---
9221
9222 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9223
9224 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9225
9226
9227 ---
9228
9229 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9230
9231 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9232
9233
9234 ---
9235
9236 * [HBASE-22459](https://issues.apache.org/jira/browse/HBASE-22459) | *Minor* | **Expose store reader reference count**
9237
9238 This change exposes the aggregate count of store reader references for a given store as 'storeRefCount' in region metrics and ClusterStatus.
9239
9240
9241 ---
9242
9243 * [HBASE-22469](https://issues.apache.org/jira/browse/HBASE-22469) | *Minor* | **replace md5 checksum in saveVersion script with sha512 for hbase version information**
9244
9245 The HBase "source checksum" now uses SHA512 instead of MD5.
9246
9247
9248 ---
9249
9250 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9251
9252 <!-- markdown -->
9253
9254 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9255
9256 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9257
9258
9259 ---
9260
9261 * [HBASE-20782](https://issues.apache.org/jira/browse/HBASE-20782) | *Minor* | **Fix duplication of TestServletFilter.access**
9262
9263 The access method was used to the HttpServerFunctionalTest class as a common place.
9264
9265
9266 ---
9267
9268 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9269
9270 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9271
9272
9273 ---
9274
9275 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9276
9277 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9278
9279
9280 ---
9281
9282 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9283
9284 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
9285
9286
9287 ---
9288
9289 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9290
9291 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9292
9293
9294 ---
9295
9296 * [HBASE-21048](https://issues.apache.org/jira/browse/HBASE-21048) | *Major* | **Get LogLevel is not working from console in secure environment**
9297
9298 Support get\|set LogLevel in secure(kerberized) environment.
9299
9300
9301 ---
9302
9303 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9304
9305 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
9306
9307
9308 ---
9309
9310 * [HBASE-22377](https://issues.apache.org/jira/browse/HBASE-22377) | *Major* | **Provide API to check the existence of a namespace which does not require ADMIN permissions**
9311
9312 This change adds the new method listNamespaces to the Admin interface, which can be used to retrieve a list of the namespaces present in the schema as an unprivileged operation. Formerly the only available method for accomplishing this was listNamespaceDescriptors, which requires GLOBAL CREATE or ADMIN permissions.
9313
9314
9315 ---
9316
9317 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9318
9319 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
9320
9321
9322 ---
9323
9324 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9325
9326 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9327
9328
9329 ---
9330
9331 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
9332
9333 Updated metrics core from 3.2.1 to 3.2.6.
9334
9335
9336 ---
9337
9338 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
9339
9340 The rubocop definition for the maximum method length was set to 75.
9341
9342
9343 ---
9344
9345 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
9346
9347 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
9348
9349
9350 ---
9351
9352 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
9353
9354 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
9355
9356
9357 ---
9358
9359 * [HBASE-22301](https://issues.apache.org/jira/browse/HBASE-22301) | *Minor* | **Consider rolling the WAL if the HDFS write pipeline is slow**
9360
9361 This change adds new conditions for rolling the WAL for when syncs on the HDFS writer pipeline are perceived to be slow.
9362
9363 As before the configuration parameter hbase.regionserver.wal.slowsync.ms sets the slow sync warning threshold.
9364
9365 If we encounter hbase.regionserver.wal.slowsync.roll.threshold number of slow syncs (default 100) within the interval defined by hbase.regionserver.wal.slowsync.roll.interval.ms (default 1 minute), we will request a WAL roll.
9366
9367 Or, if the time for any sync exceeds the threshold set by hbase.regionserver.wal.roll.on.sync.ms (default 10 seconds) we will request a WAL roll immediately.
9368
9369 Operators can monitor how often these new thresholds result in a WAL roll by looking at newly added metrics to the WAL related metric group:
9370 \* slowSyncRollRequest - How many times a roll was requested due to sync too slow on the write pipeline.
9371
9372 Additionally, as a part of this change there are also additional metrics for existing reasons for a WAL roll:
9373 \* errorRollRequest - How many times a roll was requested due to I/O or other errors.
9374 \* sizeRollRequest - How many times a roll was requested due to file size roll threshold.
9375
9376
9377 ---
9378
9379 * [HBASE-21883](https://issues.apache.org/jira/browse/HBASE-21883) | *Minor* | **Enhancements to Major Compaction tool**
9380
9381 MajorCompactorTTL Tool allows to compact all regions in a table that have been TTLed out. This saves space on DFS and is useful for tables which are similar to time series data. This is typically scheduled to run frequently (say via cron) to cleanup old data on an ongoing basis.
9382
9383 RSGroupMajorCompactionTTL tool is similar to MajorCompactorTTL but runs at a region server group level. If multiple tables in an rsgroup are similar to time-series data, then it runs a single command to clean them up. As more tables are added/removed from rsgroup, it's easy to have a single command to take care of all of them.
9384
9385
9386 ---
9387
9388 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
9389
9390 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
9391
9392
9393 ---
9394
9395 * [HBASE-22083](https://issues.apache.org/jira/browse/HBASE-22083) | *Minor* | **move eclipse specific configs into a profile**
9396
9397 <!-- markdown -->
9398 Maven project integration for Eclipse has been isolated into a maven profile to ensure it only is active when in an Eclipse project.
9399
9400 Things should continue to behave the same for Eclipse users. If something should go wrong folks should manually activate the `eclipse-specific` profile.
9401
9402
9403 ---
9404
9405 * [HBASE-22307](https://issues.apache.org/jira/browse/HBASE-22307) | *Major* | **Deprecated Preemptive Fail Fast**
9406
9407 Deprecated Preemptive Fail Fast related constants in HConstants, the support of this feature will be removed in 3.0.0 so use these constants will have no effect for 3.0.0+ releases. And the constants will be kept till 4.0.0.
9408
9409 Users can use 'hbase.client.perserver.requests.threshold' to control the number of concurrent requests to the same region server. Please see the release note of HBASE-16388 for more details.
9410
9411
9412 ---
9413
9414 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
9415
9416 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
9417
9418
9419 ---
9420
9421 * [HBASE-19222](https://issues.apache.org/jira/browse/HBASE-19222) | *Major* | **update jruby to 9.1.17.0**
9422
9423 <!-- markdown -->
9424
9425 The default version of JRuby shipped with HBase has been updated to the JRuby 9.1.17.0 release.
9426
9427 For details on changes see [the release notes for JRuby 9.1.17.0](https://www.jruby.org/2018/04/23/jruby-9-1-17-0)
9428
9429
9430 ---
9431
9432 * [HBASE-22279](https://issues.apache.org/jira/browse/HBASE-22279) | *Major* | **Add a getRegionLocator method in Table/AsyncTable interface**
9433
9434 Add below method in Table interface:
9435
9436 RegionLocator getRegionLocator() throws IOException;
9437
9438 Add below methods in AsyncTable interface:
9439
9440 AsyncTableRegionLocator getRegionLocator();
9441 CompletableFuture\<TableDescriptor\> getDescriptor();
9442
9443
9444 ---
9445
9446 * [HBASE-15560](https://issues.apache.org/jira/browse/HBASE-15560) | *Major* | **TinyLFU-based BlockCache**
9447
9448 LruBlockCache uses the Segmented LRU (SLRU) policy to capture frequency and recency of the working set. It achieves concurrency by using an O(n) background thread to prioritize the entries and evict. Accessing an entry is O(1) by a hash table lookup, recording its logical access time, and setting a frequency flag. A write is performed in O(1) time by updating the hash table and triggering an async eviction thread. This provides ideal concurrency and minimizes the latencies by penalizing the thread instead of the caller. However the policy does not age the frequencies and may not be resilient to various workload patterns.
9449
9450 This change introduces a new L1 policy, TinyLfuBlockCache, which records the frequency in a counting sketch, ages periodically by halving the counters, and orders entries by SLRU. An entry is discarded by comparing the frequency of the new arrival to the SLRU's victim, and keeping the one with the highest frequency. This allows the operations to be performed in O(1) time and, though the use of a compact sketch, a much larger history is retained beyond the current working set. In a variety of real world traces the policy had near optimal hit rates.
9451
9452 New configuration variable hfile.block.cache.policy sets the eviction policy for the L1 block cache. The default is "LRU" (LruBlockCache). Set to "TinyLFU" to use TinyLfuBlockCache instead.
9453
9454
9455 ---
9456
9457 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
9458
9459 Introduced
9460
9461 Future\<Void\> createTableAsync(TableDescriptor);
9462
9463
9464 ---
9465
9466 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
9467
9468 Introduced these methods:
9469 void move(byte[]);
9470 void move(byte[], ServerName);
9471 Future\<Void\> splitRegionAsync(byte[]);
9472
9473 These methods are deprecated:
9474 void move(byte[], byte[])
9475
9476
9477 ---
9478
9479 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
9480
9481 Add a new jenkins file for running pre commit check for GitHub PR.
9482
9483
9484 ---
9485
9486 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
9487
9488 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
9489
9490
9491 ---
9492
9493 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
9494
9495 When insufficient permissions, you now get:
9496
9497 HTTP/1.1 403 Forbidden
9498
9499 on the HTTP side, and in the message
9500
9501 Forbidden
9502 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
9503 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
9504 and the rest of the ADE stack
9505
9506
9507 ---
9508
9509 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
9510
9511 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
9512
9513
9514 ---
9515
9516 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
9517
9518 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
9519
9520
9521 ---
9522
9523 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
9524
9525 <!-- markdown -->
9526 Fixed awkward dependency issue that prevented site building.
9527
9528 #### note specific to HBase 2.1.4
9529 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
9530 ```
9531 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
9532 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
9533         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
9534         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
9535         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
9536         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
9537         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
9538         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
9539         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
9540         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
9541         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
9542         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
9543         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
9544         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
9545         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
9546         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
9547         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
9548         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
9549         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
9550         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
9551         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
9552         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
9553         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
9554         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
9555         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
9556         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
9557         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
9558         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
9559 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
9560         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
9561         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
9562         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
9563         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
9564         ... 26 more
9565
9566 ```
9567
9568 Workaround via any _one_ of the following:
9569 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
9570 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
9571 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
9572 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
9573 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
9574
9575
9576 ---
9577
9578 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
9579
9580 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
9581
9582
9583 ---
9584
9585 * [HBASE-22063](https://issues.apache.org/jira/browse/HBASE-22063) | *Major* | **Deprecated Admin.deleteSnapshot(byte[])**
9586
9587 Deprecate Admin.deleteSnapshot(byte[]), please use the String version instead.
9588
9589
9590 ---
9591
9592 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
9593
9594 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
9595
9596 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
9597
9598
9599 ---
9600
9601 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
9602
9603 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
9604
9605
9606 ---
9607
9608 * [HBASE-22044](https://issues.apache.org/jira/browse/HBASE-22044) | *Major* | **ByteBufferUtils should not be IA.Public API**
9609
9610 <!-- markdown -->
9611
9612 As of HBase 3.0, the ByteBufferUtils class is now marked as a Private API for internal project use only. Downstream users are advised that it no longer has any compatibility promises across releases.
9613
9614 As of earlier HBase release lines the class is now marked as deprecated to call attention to this planned transition.
9615
9616
9617 ---
9618
9619 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
9620
9621 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
9622
9623
9624 ---
9625
9626 * [HBASE-22001](https://issues.apache.org/jira/browse/HBASE-22001) | *Major* | **Polish the Admin interface**
9627
9628 Add a cloneSnapshotAsync method with restoreAcl parameter.
9629 Deprecated restoreSnapshotAsync method as it just ignores the failsafe configuration.
9630 Make snapshotAsync method returns a Future\<Void\>.
9631 Deprecated the snapshot related methods which take a 'byte[]' as the snapshot name.
9632 Use default methods to reduce the code base for implementation classes.
9633
9634
9635 ---
9636
9637 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
9638
9639 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
9640
9641
9642 ---
9643
9644 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
9645
9646 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
9647 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
9648
9649 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
9650
9651 For example:
9652 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
9653
9654
9655 ---
9656
9657 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
9658
9659 Adds below flush, split, and compaction metrics
9660
9661  +  // split related metrics
9662  +  private MutableFastCounter splitRequest;
9663  +  private MutableFastCounter splitSuccess;
9664  +  private MetricHistogram splitTimeHisto;
9665  +
9666  +  // flush related metrics
9667  +  private MetricHistogram flushTimeHisto;
9668  +  private MetricHistogram flushMemstoreSizeHisto;
9669  +  private MetricHistogram flushOutputSizeHisto;
9670  +  private MutableFastCounter flushedMemstoreBytes;
9671  +  private MutableFastCounter flushedOutputBytes;
9672  +
9673  +  // compaction related metrics
9674  +  private MetricHistogram compactionTimeHisto;
9675  +  private MetricHistogram compactionInputFileCountHisto;
9676  +  private MetricHistogram compactionInputSizeHisto;
9677  +  private MetricHistogram compactionOutputFileCountHisto;
9678  +  private MetricHistogram compactionOutputSizeHisto;
9679  +  private MutableFastCounter compactedInputBytes;
9680  +  private MutableFastCounter compactedOutputBytes;
9681  +
9682  +  private MetricHistogram majorCompactionTimeHisto;
9683  +  private MetricHistogram majorCompactionInputFileCountHisto;
9684  +  private MetricHistogram majorCompactionInputSizeHisto;
9685  +  private MetricHistogram majorCompactionOutputFileCountHisto;
9686  +  private MetricHistogram majorCompactionOutputSizeHisto;
9687  +  private MutableFastCounter majorCompactedInputBytes;
9688  +  private MutableFastCounter majorCompactedOutputBytes;
9689
9690
9691 ---
9692
9693 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
9694
9695 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
9696
9697
9698 ---
9699
9700 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
9701
9702 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
9703 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
9704
9705
9706 ---
9707
9708 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
9709
9710 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
9711
9712 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
9713
9714
9715 ---
9716
9717 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
9718
9719 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
9720 Shell commands are as follows:
9721 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9722
9723 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
9724 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
9725 Shell commands are as follows:
9726 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
9727 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
9728 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
9729
9730
9731 ---
9732
9733 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
9734
9735 Change spotbugs version to 3.1.11.
9736
9737
9738 ---
9739
9740 * [HBASE-21505](https://issues.apache.org/jira/browse/HBASE-21505) | *Major* | **Several inconsistencies on information reported for Replication Sources by hbase shell status 'replication' command.**
9741
9742 This modifies "status 'replication'" output, fixing inconsistencies on the reporting times and ages of last shipped edits, as well as wrong calculation of replication lags.
9743
9744 It also introduces additional info for each recovery queue, which was not accounted by this command before.
9745
9746 The new output for "status 'replication'" command is explained in details below:
9747 a) Source started, target stopped, no edits arrived on source yet:
9748 ...
9749  SOURCE: PeerID=1
9750          Normal Queue: 1
9751            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9752 ...
9753 b) Source started, target stopped, add edit on source:
9754 ...
9755 Normal Queue: 1
9756            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:21:00 GMT 2018, Replication Lag=2459
9757 ...
9758 c) Source started, target stopped, edit added on source, restart source:
9759 ...
9760 SOURCE: PeerID=1
9761          Normal Queue: 1
9762            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9763          Recovered Queue: 1-hbase01.home,16020,1542784524057
9764            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:23:00 GMT 2018, Replication Lag=201495
9765 ...
9766 d) Source started, target stopped, add edit on source, restart source, add another edit on source:
9767 ...
9768 SOURCE: PeerID=1
9769          Normal Queue: 1
9770            No Ops shipped since last restart, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=6349
9771          Recovered Queue: 1-hbase01.home,16020,1542782758742
9772            No Ops shipped since last restart, SizeOfLogQueue=0, TimeStampOfLastArrivedInSource=Wed Nov 21 06:53:05 GMT 2018, Replication Lag=569394
9773 ...
9774 e) Source started, target stopped, add edit on source, restart source, add another edit on source, start target:
9775 ...
9776        SOURCE: PeerID=1
9777          Normal Queue: 1
9778            AgeOfLastShippedOp=30000, TimeStampOfLastShippedOp=Wed Nov 21 07:07:58 GMT 2018, SizeOfLogQueue=1, TimeStampOfLastArrivedInSource=Wed Nov 21 07:02:28 GMT 2018, Replication Lag=0
9779 ...
9780 f) Source started, target stopped, add edit on source, restart source, restart target:
9781 ...
9782 SOURCE: PeerID=1
9783          Normal Queue: 1
9784            No Ops shipped since last restart, SizeOfLogQueue=1, No edits for this source since it started, Replication Lag=0
9785 ...
9786
9787
9788 ---
9789
9790 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
9791
9792 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
9793
9794
9795 ---
9796
9797 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
9798
9799 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
9800 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
9801 disable\_exceed\_throttle\_quota
9802 There are two limits when enable exceed throttle quota:
9803 1. Must set at least one read and one write region server throttle quota;
9804 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
9805
9806
9807 ---
9808
9809 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
9810
9811 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
9812
9813
9814 ---
9815
9816 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
9817
9818 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
9819
9820
9821 ---
9822
9823 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
9824
9825 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
9826
9827
9828 ---
9829
9830 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
9831
9832 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
9833
9834 hbase\> help 'scan'
9835
9836
9837 ---
9838
9839 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
9840
9841 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
9842
9843 For example:
9844 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
9845
9846
9847 ---
9848
9849 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
9850
9851 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
9852 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
9853
9854
9855 ---
9856
9857 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
9858
9859 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
9860
9861
9862 ---
9863
9864 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
9865
9866 Make StoppedRpcClientException extend DoNotRetryIOException.
9867
9868
9869 ---
9870
9871 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
9872
9873 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
9874 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
9875
9876
9877 ---
9878
9879 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
9880
9881 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
9882
9883 The effect releases are:
9884 2.1.x: 2.1.2 and below
9885 2.0.x: 2.0.4 and below
9886 1.x: 1.4.x and below
9887
9888 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
9889
9890
9891 ---
9892
9893 * [HBASE-20894](https://issues.apache.org/jira/browse/HBASE-20894) | *Major* | **Move BucketCache from java serialization to protobuf**
9894
9895 For users who have configured hbase.bucketcache.ioengine with either the file:, files:, or mmap: prefix, and configured it to be persistent via the hbase.bucketcache.persistent.path property, the serialization format of the bucket cache has changed between versions. The old state will not be read during startup, and there is currently no migration path. The impact is expected to be minimal, however, since the cache will rebuild over time as access patterns dictate.
9896
9897
9898
9899
9900 # HBASE  2.2.0 Release Notes
9901
9902 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
9903
9904
9905 ---
9906
9907 * [HBASE-21970](https://issues.apache.org/jira/browse/HBASE-21970) | *Major* | **Document that how to upgrade from 2.0 or 2.1 to 2.2+**
9908
9909 See the document http://hbase.apache.org/book.html#upgrade2.2 about how to upgrade from 2.0 or 2.1 to 2.2+.
9910
9911 HBase 2.2+ uses a new Procedure form assiging/unassigning/moving Regions. It does not process HBase 2.1 and 2.0's Unassign/Assign Procedure types. Upgrade requires that we first drain the Master Procedure Store of old style Procedures before starting the new 2.2 Master. So you need to make sure that before you kill the old version (2.0 or 2.1) Master, there is no region in transition. And once the new version (2.2+) Master is up, you can rolling upgrade RegionServers one by one.
9912
9913 And there is a more safer way if you are running 2.1.1+ or 2.0.3+ cluster. It need four steps to upgrade Master.
9914
9915 1. Shutdown both active and standby Masters (Your cluster will continue to server reads and writes without interruption).
9916 2. Set the property hbase.procedure.upgrade-to-2-2 to true in hbase-site.xml for the Master, and start only one Master, still using the 2.1.1+ (or 2.0.3+) version.
9917 3. Wait until the Master quits. Confirm that there is a 'READY TO ROLLING UPGRADE' message in the Master log as the cause of the shutdown. The Procedure Store is now empty.
9918 4. Start new Masters with the new 2.2+ version.
9919
9920 Then you can rolling upgrade RegionServers one by one. See HBASE-21075 for more details.
9921
9922
9923 ---
9924
9925 * [HBASE-21536](https://issues.apache.org/jira/browse/HBASE-21536) | *Trivial* | **Fix completebulkload usage instructions**
9926
9927 Added completebulkload short name for BulkLoadHFilesTool to bin/hbase.
9928
9929
9930 ---
9931
9932 * [HBASE-22500](https://issues.apache.org/jira/browse/HBASE-22500) | *Blocker* | **Modify pom and jenkins jobs for hadoop versions**
9933
9934 Change the default hadoop-3 version to 3.1.2. Drop the support for the releases which are effected by CVE-2018-8029, see this email https://lists.apache.org/thread.html/3d6831c3893cd27b6850aea2feff7d536888286d588e703c6ffd2e82@%3Cuser.hadoop.apache.org%3E
9935
9936
9937 ---
9938
9939 * [HBASE-22148](https://issues.apache.org/jira/browse/HBASE-22148) | *Blocker* | **Provide an alternative to CellUtil.setTimestamp**
9940
9941 <!-- markdown -->
9942
9943 The `CellUtil.setTimestamp` method changes to be an API with audience `LimitedPrivate(COPROC)` in HBase 3.0. With that designation the API should remain stable within a given minor release line, but may change between minor releases.
9944
9945 Previously, this method was deprecated in HBase 2.0 for removal in HBase 3.0. Deprecation messages in HBase 2.y releases have been updated to indicate the expected API audience change.
9946
9947
9948 ---
9949
9950 * [HBASE-21991](https://issues.apache.org/jira/browse/HBASE-21991) | *Major* | **Fix MetaMetrics issues - [Race condition, Faulty remove logic], few improvements**
9951
9952 The class LossyCounting was unintentionally marked Public but was never intended to be part of our public API. This oversight has been corrected and LossyCounting is now marked as Private and going forward may be subject to additional breaking changes or removal without notice. If you have taken a dependency on this class we recommend cloning it locally into your project before upgrading to this release.
9953
9954
9955 ---
9956
9957 * [HBASE-22226](https://issues.apache.org/jira/browse/HBASE-22226) | *Trivial* | **Incorrect level for headings in asciidoc**
9958
9959 Warnings for level headings are corrected in the book for the HBase Incompatibilities section.
9960
9961
9962 ---
9963
9964 * [HBASE-20970](https://issues.apache.org/jira/browse/HBASE-20970) | *Major* | **Update hadoop check versions for hadoop3 in hbase-personality**
9965
9966 Add hadoop 3.0.3, 3.1.1 3.1.2 in our hadoop check jobs.
9967
9968
9969 ---
9970
9971 * [HBASE-21784](https://issues.apache.org/jira/browse/HBASE-21784) | *Major* | **Dump replication queue should show list of wal files ordered chronologically**
9972
9973 The DumpReplicationQueues tool will now list replication queues sorted in chronological order.
9974
9975
9976 ---
9977
9978 * [HBASE-22384](https://issues.apache.org/jira/browse/HBASE-22384) | *Minor* | **Formatting issues in administration section of book**
9979
9980 Fixes a formatting issue in the administration section of the book, where listing indentation were a little bit off.
9981
9982
9983 ---
9984
9985 * [HBASE-22399](https://issues.apache.org/jira/browse/HBASE-22399) | *Major* | **Change default hadoop-two.version to 2.8.x and remove the 2.7.x hadoop checks**
9986
9987 Now the default hadoop-two.version has been changed to 2.8.5, and all hadoop versions before 2.8.2(exclude) will not be supported any more.
9988
9989
9990 ---
9991
9992 * [HBASE-22392](https://issues.apache.org/jira/browse/HBASE-22392) | *Trivial* | **Remove extra/useless +**
9993
9994 Removed extra + in HRegion, HStore and LoadIncrementalHFiles for branch-2 and HRegion and HStore for branch-1.
9995
9996
9997 ---
9998
9999 * [HBASE-20494](https://issues.apache.org/jira/browse/HBASE-20494) | *Major* | **Upgrade com.yammer.metrics dependency**
10000
10001 Updated metrics core from 3.2.1 to 3.2.6.
10002
10003
10004 ---
10005
10006 * [HBASE-22358](https://issues.apache.org/jira/browse/HBASE-22358) | *Minor* | **Change rubocop configuration for method length**
10007
10008 The rubocop definition for the maximum method length was set to 75.
10009
10010
10011 ---
10012
10013 * [HBASE-22379](https://issues.apache.org/jira/browse/HBASE-22379) | *Minor* | **Fix Markdown for "Voting on Release Candidates" in book**
10014
10015 Fixes the formatting of the "Voting on Release Candidates" to actually show the quote and code formatting of the RAT check.
10016
10017
10018 ---
10019
10020 * [HBASE-20851](https://issues.apache.org/jira/browse/HBASE-20851) | *Minor* | **Change rubocop config for max line length of 100**
10021
10022 The rubocop configuration in the hbase-shell module now allows a line length with 100 characters, instead of 80 as before. For everything before 2.1.5 this change introduces rubocop itself.
10023
10024
10025 ---
10026
10027 * [HBASE-22054](https://issues.apache.org/jira/browse/HBASE-22054) | *Minor* | **Space Quota: Compaction is not working for super user in case of NO\_WRITES\_COMPACTIONS**
10028
10029 This change allows the system and superusers to initiate compactions, even when a space quota violation policy disallows compactions from happening. The original intent behind disallowing of compactions was to prevent end-user compactions from creating undue I/O load, not disallowing \*any\* compaction in the system.
10030
10031
10032 ---
10033
10034 * [HBASE-22292](https://issues.apache.org/jira/browse/HBASE-22292) | *Blocker* | **PreemptiveFastFailInterceptor clean repeatedFailuresMap issue**
10035
10036 Adds new configuration hbase.client.failure.map.cleanup.interval which defaults to ten minutes.
10037
10038
10039 ---
10040
10041 * [HBASE-22155](https://issues.apache.org/jira/browse/HBASE-22155) | *Major* | **Move 2.2.0 on to hbase-thirdparty-2.2.0**
10042
10043  Updates libs used internally by hbase via hbase-thirdparty as follows:
10044
10045  gson 2.8.1 -\\\> 2.8.5
10046  guava 22.0 -\\\> 27.1-jre
10047  pb 3.5.1 -\\\> 3.7.0
10048  netty 4.1.17 -\\\> 4.1.34
10049  commons-collections4 4.1 -\\\> 4.3
10050
10051
10052 ---
10053
10054 * [HBASE-22178](https://issues.apache.org/jira/browse/HBASE-22178) | *Major* | **Introduce a createTableAsync with TableDescriptor method in Admin**
10055
10056 Introduced
10057
10058 Future\<Void\> createTableAsync(TableDescriptor);
10059
10060
10061 ---
10062
10063 * [HBASE-22108](https://issues.apache.org/jira/browse/HBASE-22108) | *Major* | **Avoid passing null in Admin methods**
10064
10065 Introduced these methods:
10066 void move(byte[]);
10067 void move(byte[], ServerName);
10068 Future\<Void\> splitRegionAsync(byte[]);
10069
10070 These methods are deprecated:
10071 void move(byte[], byte[])
10072
10073
10074 ---
10075
10076 * [HBASE-22152](https://issues.apache.org/jira/browse/HBASE-22152) | *Major* | **Create a jenkins file for yetus to processing GitHub PR**
10077
10078 Add a new jenkins file for running pre commit check for GitHub PR.
10079
10080
10081 ---
10082
10083 * [HBASE-22007](https://issues.apache.org/jira/browse/HBASE-22007) | *Major* | **Add restoreSnapshot and cloneSnapshot with acl methods in AsyncAdmin**
10084
10085 Add cloneSnapshot/restoreSnapshot with acl methods in AsyncAdmin.
10086
10087
10088 ---
10089
10090 * [HBASE-22123](https://issues.apache.org/jira/browse/HBASE-22123) | *Minor* | **REST gateway reports Insufficient permissions exceptions as 404 Not Found**
10091
10092 When insufficient permissions, you now get:
10093
10094 HTTP/1.1 403 Forbidden
10095
10096 on the HTTP side, and in the message
10097
10098 Forbidden
10099 org.apache.hadoop.hbase.security.AccessDeniedException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions for user ‘myuser',action: get, tableName:mytable, family:cf.
10100 at org.apache.ranger.authorization.hbase.RangerAuthorizationCoprocessor.authorizeAccess(RangerAuthorizationCoprocessor.java:547)
10101 and the rest of the ADE stack
10102
10103
10104 ---
10105
10106 * [HBASE-22100](https://issues.apache.org/jira/browse/HBASE-22100) | *Minor* | **False positive for error prone warnings in pre commit job**
10107
10108 Now we will sort the javac WARNING/ERROR before generating diff in pre-commit so we can get a stable output for the error prone. The downside is that we just sort the output lexicographically so the line number will also be sorted lexicographically, which is a bit strange to human.
10109
10110
10111 ---
10112
10113 * [HBASE-22057](https://issues.apache.org/jira/browse/HBASE-22057) | *Major* | **Impose upper-bound on size of ZK ops sent in a single multi()**
10114
10115 Exposes a new configuration property "zookeeper.multi.max.size" which dictates the maximum size of deletes that HBase will make to ZooKeeper in a single RPC. This property defaults to 1MB, which should fall beneath the default ZooKeeper limit of 2MB, controlled by "jute.maxbuffer".
10116
10117
10118 ---
10119
10120 * [HBASE-22052](https://issues.apache.org/jira/browse/HBASE-22052) | *Major* | **pom cleaning; filter out jersey-core in hadoop2 to match hadoop3 and remove redunant version specifications**
10121
10122 <!-- markdown -->
10123 Fixed awkward dependency issue that prevented site building.
10124
10125 #### note specific to HBase 2.1.4
10126 HBase 2.1.4 shipped with an early version of this fix that incorrectly altered the libraries included in our binary assembly for using Apache Hadoop 2.7 (the current build default Hadoop version for 2.1.z). For folks running out of the box against a Hadoop 2.7 cluster (or folks who skip the installation step of [replacing the bundled Hadoop libraries](http://hbase.apache.org/book.html#hadoop)) this will result in a failure at Region Server startup due to a missing class definition. e.g.:
10127 ```
10128 2019-03-27 09:02:05,779 ERROR [main] regionserver.HRegionServer: Failed construction RegionServer
10129 java.lang.NoClassDefFoundError: org/apache/htrace/SamplerBuilder
10130         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:644)
10131         at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:628)
10132         at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:149)
10133         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2667)
10134         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:93)
10135         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2701)
10136         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2683)
10137         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:372)
10138         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:171)
10139         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:356)
10140         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:295)
10141         at org.apache.hadoop.hbase.util.CommonFSUtils.getRootDir(CommonFSUtils.java:362)
10142         at org.apache.hadoop.hbase.util.CommonFSUtils.isValidWALRootDir(CommonFSUtils.java:411)
10143         at org.apache.hadoop.hbase.util.CommonFSUtils.getWALRootDir(CommonFSUtils.java:387)
10144         at org.apache.hadoop.hbase.regionserver.HRegionServer.initializeFileSystem(HRegionServer.java:704)
10145         at org.apache.hadoop.hbase.regionserver.HRegionServer.<init>(HRegionServer.java:613)
10146         at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
10147         at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
10148         at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
10149         at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
10150         at org.apache.hadoop.hbase.regionserver.HRegionServer.constructRegionServer(HRegionServer.java:3029)
10151         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.start(HRegionServerCommandLine.java:63)
10152         at org.apache.hadoop.hbase.regionserver.HRegionServerCommandLine.run(HRegionServerCommandLine.java:87)
10153         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
10154         at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:149)
10155         at org.apache.hadoop.hbase.regionserver.HRegionServer.main(HRegionServer.java:3047)
10156 Caused by: java.lang.ClassNotFoundException: org.apache.htrace.SamplerBuilder
10157         at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
10158         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
10159         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
10160         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
10161         ... 26 more
10162
10163 ```
10164
10165 Workaround via any _one_ of the following:
10166 * If you are running against a Hadoop cluster that is 2.8+, ensure you replace the Hadoop libaries in the default binary assembly with those for your version.
10167 * If you are running against a Hadoop cluster that is 2.8+, build the binary assembly from the source release while specifying your Hadoop version.
10168 * If you are running against a Hadoop cluster that is a supported 2.7 release, ensure the `hadoop` executable is in the `PATH` seen at Region Server startup and that you are not using the `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` bypass.
10169 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers via the HBASE_CLASSPATH environment variable.
10170 * For any supported Hadoop version, manually make the Apache HTrace artifact `htrace-core-3.1.0-incubating.jar` available to all Region Servers by copying it into the directory `${HBASE_HOME}/lib/client-facing-thirdparty/`.
10171
10172
10173 ---
10174
10175 * [HBASE-22065](https://issues.apache.org/jira/browse/HBASE-22065) | *Major* | **Add listTableDescriptors(List\<TableName\>) method in AsyncAdmin**
10176
10177 Add a listTableDescriptors(List\<TableName\>) method in the AsyncAdmin interface, to align with the Admin interface.
10178
10179
10180 ---
10181
10182 * [HBASE-22040](https://issues.apache.org/jira/browse/HBASE-22040) | *Major* | **Add mergeRegionsAsync with a List of region names method in AsyncAdmin**
10183
10184 Add a mergeRegionsAsync(byte[][], boolean) method in the AsyncAdmin interface.
10185
10186 Instead of using assert, now we will throw IllegalArgumentException when you want to merge less than 2 regions at client side. And also, at master side, instead of using assert, now we will throw DoNotRetryIOException if you want merge more than 2 regions, since we only support merging two regions at once for now.
10187
10188
10189 ---
10190
10191 * [HBASE-22039](https://issues.apache.org/jira/browse/HBASE-22039) | *Major* | **Should add the synchronous parameter for the XXXSwitch method in AsyncAdmin**
10192
10193 Add drainXXX parameter for balancerSwitch/splitSwitch/mergeSwitch methods in the AsyncAdmin interface, which has the same meaning with the synchronous parameter for these methods in the Admin interface.
10194
10195
10196 ---
10197
10198 * [HBASE-21810](https://issues.apache.org/jira/browse/HBASE-21810) | *Major* | **bulkload  support set hfile compression on client**
10199
10200 bulkload (HFileOutputFormat2)  support config the compression on client ,you can set the job configuration "hbase.mapreduce.hfileoutputformat.compression"  override the auto-detection of the target table's compression
10201
10202
10203 ---
10204
10205 * [HBASE-22000](https://issues.apache.org/jira/browse/HBASE-22000) | *Major* | **Deprecated isTableAvailable with splitKeys**
10206
10207 Deprecated AsyncTable.isTableAvailable(TableName, byte[][]).
10208
10209
10210 ---
10211
10212 * [HBASE-21871](https://issues.apache.org/jira/browse/HBASE-21871) | *Major* | **Support to specify a peer table name in VerifyReplication tool**
10213
10214 After HBASE-21871, we can specify a peer table name with --peerTableName in VerifyReplication tool like the following:
10215 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable 5 TestTable
10216
10217 In addition, we can compare any 2 tables in any remote clusters with specifying both peerId and --peerTableName.
10218
10219 For example:
10220 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication --peerTableName=peerTable zk1,zk2,zk3:2181/hbase TestTable
10221
10222
10223 ---
10224
10225 * [HBASE-15728](https://issues.apache.org/jira/browse/HBASE-15728) | *Major* | **Add remaining per-table region / store / flush / compaction related metrics**
10226
10227 Adds below flush, split, and compaction metrics
10228
10229  +  // split related metrics
10230  +  private MutableFastCounter splitRequest;
10231  +  private MutableFastCounter splitSuccess;
10232  +  private MetricHistogram splitTimeHisto;
10233  +
10234  +  // flush related metrics
10235  +  private MetricHistogram flushTimeHisto;
10236  +  private MetricHistogram flushMemstoreSizeHisto;
10237  +  private MetricHistogram flushOutputSizeHisto;
10238  +  private MutableFastCounter flushedMemstoreBytes;
10239  +  private MutableFastCounter flushedOutputBytes;
10240  +
10241  +  // compaction related metrics
10242  +  private MetricHistogram compactionTimeHisto;
10243  +  private MetricHistogram compactionInputFileCountHisto;
10244  +  private MetricHistogram compactionInputSizeHisto;
10245  +  private MetricHistogram compactionOutputFileCountHisto;
10246  +  private MetricHistogram compactionOutputSizeHisto;
10247  +  private MutableFastCounter compactedInputBytes;
10248  +  private MutableFastCounter compactedOutputBytes;
10249  +
10250  +  private MetricHistogram majorCompactionTimeHisto;
10251  +  private MetricHistogram majorCompactionInputFileCountHisto;
10252  +  private MetricHistogram majorCompactionInputSizeHisto;
10253  +  private MetricHistogram majorCompactionOutputFileCountHisto;
10254  +  private MetricHistogram majorCompactionOutputSizeHisto;
10255  +  private MutableFastCounter majorCompactedInputBytes;
10256  +  private MutableFastCounter majorCompactedOutputBytes;
10257
10258
10259 ---
10260
10261 * [HBASE-20886](https://issues.apache.org/jira/browse/HBASE-20886) | *Critical* | **[Auth] Support keytab login in hbase client**
10262
10263 From 2.2.0, hbase supports client login via keytab. To use this feature, client should specify \`hbase.client.keytab.file\` and \`hbase.client.keytab.principal\` in hbase-site.xml, then the connection will contain the needed credentials which be renewed periodically to communicate with kerberized hbase cluster.
10264
10265
10266 ---
10267
10268 * [HBASE-21410](https://issues.apache.org/jira/browse/HBASE-21410) | *Major* | **A helper page that help find all problematic regions and procedures**
10269
10270 After HBASE-21410, we add a helper page to Master UI. This helper page is mainly to help HBase operator quickly found all regions and pids that are get stuck.
10271 There are 2 entries to get in this page.
10272 One is showing in the Regions in Transition section, it made "num region(s) in transition" a link that you can click and check all regions in transition and their related procedure IDs.
10273 The other one is showing in the table details section, it made the number of CLOSING or OPENING regions a link, which you can click and check regions and related procedure IDs of CLOSING or OPENING regions of a certain table.
10274 In this helper page, not only you can see all regions and related procedures, there are 2 buttons at the top which will show these regions or procedure IDs in text format. This is mainly aim to help operator to easily copy and paste all problematic procedure IDs and encoded region names to HBCK2's command line, by which we HBase operator can bypass these procedures or assign these regions.
10275
10276
10277 ---
10278
10279 * [HBASE-21588](https://issues.apache.org/jira/browse/HBASE-21588) | *Major* | **Procedure v2 wal splitting implementation**
10280
10281 After HBASE-21588, we introduce a new way to do WAL splitting coordination by procedure framework. This can simplify the process of WAL splitting and no need to connect zookeeper any more.
10282 During ServerCrashProcedure, it will create a SplitWALProcedure for each WAL that need to split. Then each SplitWALProcedure will spawn a SplitWALRemoteProcedure to send the request to regionserver.
10283 At the RegionServer side, whole process is handled by SplitWALCallable. It split the WAL and return the result to master.
10284 According to my test, this patch has a better performance as the number of WALs that need to split increase. And it can relieve the pressure on zookeeper.
10285
10286
10287 ---
10288
10289 * [HBASE-20734](https://issues.apache.org/jira/browse/HBASE-20734) | *Major* | **Colocate recovered edits directory with hbase.wal.dir**
10290
10291 Previously the recovered.edits directory was under the root directory. This JIRA moves the recovered.edits directory to be under the hbase.wal.dir if set. It also adds a check for any recovered.edits found under the root directory for backwards compatibility. This gives improvements when a faster media(like SSD) or more local FileSystem is used for the hbase.wal.dir than the root dir.
10292
10293
10294 ---
10295
10296 * [HBASE-20401](https://issues.apache.org/jira/browse/HBASE-20401) | *Minor* | **Make \`MAX\_WAIT\` and \`waitIfNotFinished\` in CleanerContext configurable**
10297
10298 When oldwals (and hfile) cleaner cleans stale wals (and hfiles), it will periodically check and wait the clean results from filesystem, the total wait time will be no more than a max time.
10299
10300 The periodically wait and check configurations are hbase.oldwals.cleaner.thread.check.interval.msec (default is 500 ms) and hbase.regionserver.hfilecleaner.thread.check.interval.msec (default is 1000 ms).
10301
10302 Meanwhile, The max time configurations are hbase.oldwals.cleaner.thread.timeout.msec and hbase.regionserver.hfilecleaner.thread.timeout.msec, they are set to 60 seconds by default.
10303
10304 All support dynamic configuration.
10305
10306 e.g. in the oldwals cleaning scenario, one may consider tuning hbase.oldwals.cleaner.thread.timeout.msec and hbase.oldwals.cleaner.thread.check.interval.msec
10307
10308 1. While deleting a oldwal never complete (strange but possible), then delete file task needs to wait for a max of 60 seconds. Here, 60 seconds might be too long, or the opposite way is to increase more than 60 seconds in the use cases of slow file delete.
10309 2. The check and wait of a file delete is set to default in the period of 500 milliseconds, one might want to tune this checking period to a short interval to check more frequently or to a longer interval to avoid checking too often to manage their delete file task checking period (the longer interval may be use to avoid checking too fast while using a high latency storage).
10310
10311
10312 ---
10313
10314 * [HBASE-21481](https://issues.apache.org/jira/browse/HBASE-21481) | *Major* | **[acl] Superuser's permissions should not be granted or revoked by any non-su global admin**
10315
10316 HBASE-21481 improves the quality of access control, by strengthening the protection of super users's privileges.
10317
10318
10319 ---
10320
10321 * [HBASE-21082](https://issues.apache.org/jira/browse/HBASE-21082) | *Critical* | **Reimplement assign/unassign related procedure metrics**
10322
10323 Now we have four types of RIT procedure metrics, assign, unassign, move, reopen. The meaning of assign/unassign is changed, as we will not increase the unassign metric and then the assign metric when moving a region.
10324 Also introduced two new procedure metrics, open and close, which are used to track the open/close region calls to region server. We may send open/close multiple times to finish a RIT since we may retry multiple times.
10325
10326
10327 ---
10328
10329 * [HBASE-20724](https://issues.apache.org/jira/browse/HBASE-20724) | *Critical* | **Sometimes some compacted storefiles are still opened after region failover**
10330
10331 Problem: This is an old problem since HBASE-2231. The compaction event marker was only writed to WAL. But after flush, the WAL may be archived, which means an useful compaction event marker be deleted, too. So the compacted store files cannot be archived when region open and replay WAL.
10332
10333 Solution: After this jira, the compaction event tracker will be writed to HFile. When region open and load store files, read the compaction evnet tracker from HFile and archive the compacted store files which still exist.
10334
10335
10336 ---
10337
10338 * [HBASE-21820](https://issues.apache.org/jira/browse/HBASE-21820) | *Major* | **Implement CLUSTER quota scope**
10339
10340 HBase contains two quota scopes: MACHINE and CLUSTER. Before this patch, set quota operations did not expose scope option to client api and use MACHINE as default, CLUSTER scope can not be set and used.
10341 Shell commands are as follows:
10342 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10343
10344 This issue implements CLUSTER scope in a simple way: For user, namespace, user over namespace quota, use [ClusterLimit / RSNum] as machine limit. For table and user over table quota, use [ClusterLimit / TotalTableRegionNum \* MachineTableRegionNum] as machine limit.
10345 After this patch, user can set CLUSTER scope quota, but MACHINE is still default if user ignore scope.
10346 Shell commands are as follows:
10347 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec'
10348 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> MACHINE
10349 set\_quota, TYPE =\> THROTTLE, TABLE =\> 't1', LIMIT =\> '10req/sec', SCOPE =\> CLUSTER
10350
10351
10352 ---
10353
10354 * [HBASE-21057](https://issues.apache.org/jira/browse/HBASE-21057) | *Minor* | **upgrade to latest spotbugs**
10355
10356 Change spotbugs version to 3.1.11.
10357
10358
10359 ---
10360
10361 * [HBASE-21922](https://issues.apache.org/jira/browse/HBASE-21922) | *Major* | **BloomContext#sanityCheck may failed when use ROWPREFIX\_DELIMITED bloom filter**
10362
10363 Remove bloom filter type ROWPREFIX\_DELIMITED. May add it back when find a better solution.
10364
10365
10366 ---
10367
10368 * [HBASE-21783](https://issues.apache.org/jira/browse/HBASE-21783) | *Major* | **Support exceed user/table/ns throttle quota if region server has available quota**
10369
10370 Support enable or disable exceed throttle quota. Exceed throttle quota means, user can over consume user/namespace/table quota if region server has additional available quota because other users don't consume at the same time.
10371 Use the following shell commands to enable/disable exceed throttle quota: enable\_exceed\_throttle\_quota
10372 disable\_exceed\_throttle\_quota
10373 There are two limits when enable exceed throttle quota:
10374 1. Must set at least one read and one write region server throttle quota;
10375 2. All region server throttle quotas must be in seconds time unit. Because once previous requests exceed their quota and consume region server quota, quota in other time units may be refilled in a long time, this may affect later requests.
10376
10377
10378 ---
10379
10380 * [HBASE-20587](https://issues.apache.org/jira/browse/HBASE-20587) | *Major* | **Replace Jackson with shaded thirdparty gson**
10381
10382 Remove jackson dependencies from most hbase modules except hbase-rest, use shaded gson instead. The output json will be a bit different since jackson can use getter/setter, but gson will always use the fields.
10383
10384
10385 ---
10386
10387 * [HBASE-21928](https://issues.apache.org/jira/browse/HBASE-21928) | *Major* | **Deprecated HConstants.META\_QOS**
10388
10389 Mark HConstants.META\_QOS as deprecated. It is for internal use only, which is the highest priority. You should not try to set a priority greater than or equal to this value, although it is no harm but also useless.
10390
10391
10392 ---
10393
10394 * [HBASE-17942](https://issues.apache.org/jira/browse/HBASE-17942) | *Major* | **Disable region splits and merges per table**
10395
10396 This patch adds the ability to disable split and/or merge for a table (By default, split and merge are enabled for a table).
10397
10398
10399 ---
10400
10401 * [HBASE-21636](https://issues.apache.org/jira/browse/HBASE-21636) | *Major* | **Enhance the shell scan command to support missing scanner specifications like ReadType, IsolationLevel etc.**
10402
10403 Allows shell to set Scan options previously not exposed. See additions as part of the scan help by typing following hbase shell:
10404
10405 hbase\> help 'scan'
10406
10407
10408 ---
10409
10410 * [HBASE-21201](https://issues.apache.org/jira/browse/HBASE-21201) | *Major* | **Support to run VerifyReplication MR tool without peerid**
10411
10412 We can specify peerQuorumAddress instead of peerId in VerifyReplication tool. So it no longer requires peerId to be setup when using this tool.
10413
10414 For example:
10415 hbase org.apache.hadoop.hbase.mapreduce.replication.VerifyReplication zk1,zk2,zk3:2181/hbase testTable
10416
10417
10418 ---
10419
10420 * [HBASE-21838](https://issues.apache.org/jira/browse/HBASE-21838) | *Major* | **Create a special ReplicationEndpoint just for verifying the WAL entries are fine**
10421
10422 Introduce a VerifyWALEntriesReplicationEndpoint which replicates nothing but only verifies if all the cells are valid.
10423 It can be used to capture bugs for writing WAL, as most times we will not read the WALs again after writing it if there are no region server crashes.
10424
10425
10426 ---
10427
10428 * [HBASE-21727](https://issues.apache.org/jira/browse/HBASE-21727) | *Minor* | **Simplify documentation around client timeout**
10429
10430 Deprecated HBaseConfiguration#getInt(Configuration, String, String, int) method and removed it from 3.0.0 version.
10431
10432
10433 ---
10434
10435 * [HBASE-21764](https://issues.apache.org/jira/browse/HBASE-21764) | *Major* | **Size of in-memory compaction thread pool should be configurable**
10436
10437 Introduced an new config key in this issue: hbase.regionserver.inmemory.compaction.pool.size. the default value would be 10.  you can configure this to set the pool size of in-memory compaction pool. Note that all memstores in one region server will share the same pool, so if you have many regions in one region server,  you need to set this larger to compact faster for better read performance.
10438
10439
10440 ---
10441
10442 * [HBASE-21684](https://issues.apache.org/jira/browse/HBASE-21684) | *Major* | **Throw DNRIOE when connection or rpc client is closed**
10443
10444 Make StoppedRpcClientException extend DoNotRetryIOException.
10445
10446
10447 ---
10448
10449 * [HBASE-21739](https://issues.apache.org/jira/browse/HBASE-21739) | *Major* | **Move grant/revoke from regionserver to master**
10450
10451 To implement user permission control in Precedure V2, move grant and revoke method from AccessController to master firstly.
10452 Mark AccessController#grant and AccessController#revoke as deprecated and please use Admin#grant and Admin#revoke instead.
10453
10454
10455 ---
10456
10457 * [HBASE-21791](https://issues.apache.org/jira/browse/HBASE-21791) | *Blocker* | **Upgrade thrift dependency to 0.12.0**
10458
10459 IMPORTANT: Due to security issues, all users who use hbase thrift should avoid using releases which do not have this fix.
10460
10461 The effect releases are:
10462 2.1.x: 2.1.2 and below
10463 2.0.x: 2.0.4 and below
10464 1.x: 1.4.x and below
10465
10466 If you are using the effect releases above, please consider upgrading to a newer release ASAP.
10467
10468
10469 ---
10470
10471 * [HBASE-21792](https://issues.apache.org/jira/browse/HBASE-21792) | *Major* | **Mark HTableMultiplexer as deprecated and remove it in 3.0.0**
10472
10473 HTableMultiplexer exposes the implementation class, and it is incomplete, so we mark it as deprecated and remove it in 3.0.0 release.
10474
10475 There is no direct replacement for HTableMultiplexer, please use BufferedMutator if you want to batch mutations to a table.
10476
10477
10478 ---
10479
10480 * [HBASE-21782](https://issues.apache.org/jira/browse/HBASE-21782) | *Major* | **LoadIncrementalHFiles should not be IA.Public**
10481
10482 Introduce a BulkLoadHFiles interface which is marked as IA.Public, for doing bulk load programmatically.
10483 Introduce a BulkLoadHFilesTool which extends BulkLoadHFiles, and is marked as IA.LimitedPrivate(TOOLS), for using from command line.
10484 The old LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
10485
10486
10487 ---
10488
10489 * [HBASE-21762](https://issues.apache.org/jira/browse/HBASE-21762) | *Major* | **Move some methods in ClusterConnection to Connection**
10490
10491 Move the two getHbck method from ClusterConnection to Connection, and mark the methods as IA.LimitedPrivate(HBCK), as ClusterConnection is IA.Private and should not be depended by HBCK2.
10492
10493 Add a clearRegionLocationCache method in Connection to clear the region location cache for all the tables. As in RegionLocator, most of the methods have a 'reload' parameter, which implicitly tells user that we have a region location cache, so adding a method to clear the cache is fine.
10494
10495
10496 ---
10497
10498 * [HBASE-21713](https://issues.apache.org/jira/browse/HBASE-21713) | *Major* | **Support set region server throttle quota**
10499
10500 Support set region server rpc throttle quota which represents the read/write ability of region servers and throttles when region server's total requests exceeding the limit.
10501
10502 Use the following shell command to set RS quota:
10503 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', THROTTLE\_TYPE =\> WRITE, LIMIT =\> '20000req/sec'
10504 set\_quota TYPE =\> THROTTLE, REGIONSERVER =\> 'all', LIMIT =\> NONE
10505 "all" represents the throttle quota of all region servers and setting specified region server quota isn't supported currently.
10506
10507
10508 ---
10509
10510 * [HBASE-21689](https://issues.apache.org/jira/browse/HBASE-21689) | *Minor* | **Make table/namespace specific current quota info available in shell(describe\_namespace & describe)**
10511
10512 In shell commands "describe\_namespace" and "describe", which are used to see the descriptors of the namespaces and tables respectively, quotas set on that particular namespace/table will also be printed along.
10513
10514
10515 ---
10516
10517 * [HBASE-17370](https://issues.apache.org/jira/browse/HBASE-17370) | *Major* | **Fix or provide shell scripts to drain and decommission region server**
10518
10519 Adds shell support for the following:
10520 - List decommissioned/draining region servers
10521 - Decommission a list of region servers, optionally offload corresponding regions
10522 - Recommission a region server, optionally load a list of passed regions
10523
10524
10525 ---
10526
10527 * [HBASE-21734](https://issues.apache.org/jira/browse/HBASE-21734) | *Major* | **Some optimization in FilterListWithOR**
10528
10529 After HBASE-21620, the filterListWithOR has been a bit slow because we need to merge each sub-filter's RC , while before HBASE-21620, we will skip many RC merging, but the logic was wrong. So here we choose another way to optimaze the performance: removing the KeyValueUtil#toNewKeyCell.
10530 Anoop Sam John suggested that the KeyValueUtil#toNewKeyCell can save some GC before because if we copy key part of cell into a single byte[], then the block the cell refering won't be refered by the filter list any more, the upper layer can GC the data block quickly. while after HBASE-21620, we will update the prevCellList for every encountered cell now, so the lifecycle of cell in prevCellList for FilterList will be quite shorter. so just use the cell ref for saving cpu.
10531 BTW, we removed all the arrays streams usage in filter list, because it's also quite time-consuming in our test.
10532
10533
10534 ---
10535
10536 * [HBASE-21738](https://issues.apache.org/jira/browse/HBASE-21738) | *Critical* | **Remove all the CSLM#size operation in our memstore because it's an quite time consuming.**
10537
10538 We found the memstore snapshotting would cost much time because of calling the time-consuming ConcurrentSkipListMap#Size, it would make the p999 latency spike happen. So in this issue, we remove all ConcurrentSkipListMap#size in memstore by counting the cellsCount in MemstoreSizeing. As the issue described, the p999 latency spike was mitigated.
10539
10540
10541 ---
10542
10543 * [HBASE-21034](https://issues.apache.org/jira/browse/HBASE-21034) | *Major* | **Add new throttle type: read/write capacity unit**
10544
10545 Provides a new throttle type: capacity unit. One read/write/request capacity unit represents that read/write/read+write up to 1K data. If data size is more than 1K, then consume additional capacity units.
10546
10547 Use shell command to set capacity unit(CU):
10548 set\_quota TYPE =\> THROTTLE, THROTTLE\_TYPE =\> WRITE, USER =\> 'u1', LIMIT =\> '10CU/sec'
10549
10550 Use the "hbase.quota.read.capacity.unit" property to set the data size of one read capacity unit in bytes, the default value is 1K. Use the "hbase.quota.write.capacity.unit" property to set the data size of one write capacity unit in bytes, the default value is 1K.
10551
10552
10553 ---
10554
10555 * [HBASE-21595](https://issues.apache.org/jira/browse/HBASE-21595) | *Minor* | **Print thread's information and stack traces when RS is aborting forcibly**
10556
10557 Does thread dump on stdout on abort.
10558
10559
10560 ---
10561
10562 * [HBASE-21732](https://issues.apache.org/jira/browse/HBASE-21732) | *Critical* | **Should call toUpperCase before using Enum.valueOf in some methods for ColumnFamilyDescriptor**
10563
10564 Now all the Enum configs in ColumnFamilyDescriptor can accept lower case config value.
10565
10566
10567 ---
10568
10569 * [HBASE-21712](https://issues.apache.org/jira/browse/HBASE-21712) | *Minor* | **Make submit-patch.py python3 compatible**
10570
10571 Python3 support was added to dev-support/submit-patch.py. To install newly required dependencies run \`pip install -r dev-support/python-requirements.txt\` command.
10572
10573
10574 ---
10575
10576 * [HBASE-21657](https://issues.apache.org/jira/browse/HBASE-21657) | *Major* | **PrivateCellUtil#estimatedSerializedSizeOf has been the bottleneck in 100% scan case.**
10577
10578 In HBASE-21657,  I simplified the path of estimatedSerialiedSize() & estimatedSerialiedSizeOfCell() by moving the general getSerializedSize()
10579 and heapSize() from ExtendedCell to Cell interface. The patch also included some other improvments:
10580
10581 1. For 99%  of case, our cells has no tags, so let the HFileScannerImpl just return the NoTagsByteBufferKeyValue if no tags, which means we can save
10582    lots of cpu time when sending no tags cell to rpc because can just return the length instead of getting the serialize size by caculating offset/length
10583    of each fields(row/cf/cq..)
10584 2. Move the subclass's getSerializedSize implementation from ExtendedCell to their own class, which mean we did not need to call ExtendedCell's
10585    getSerialiedSize() firstly, then forward to subclass's getSerializedSize(withTags).
10586 3. Give a estimated result arraylist size for avoiding the frequent list extension when in a big scan, now we estimate the array size as min(scan.rows, 512).
10587    it's also help a lot.
10588
10589 We gain almost ~40% throughput improvement in 100% scan case for branch-2 (cacheHitRatio~100%)[1], it's a good thing. While it's a incompatible change in
10590 some case, such as if the upstream user implemented their own Cells, although it's rare but can happen, then their compile will be error.
10591
10592
10593 ---
10594
10595 * [HBASE-21647](https://issues.apache.org/jira/browse/HBASE-21647) | *Major* | **Add status track for splitting WAL tasks**
10596
10597 Adds task monitor that shows ServerCrashProcedure progress in UI.
10598
10599
10600 ---
10601
10602 * [HBASE-21652](https://issues.apache.org/jira/browse/HBASE-21652) | *Major* | **Refactor ThriftServer making thrift2 server inherited from thrift1 server**
10603
10604 Before this issue, thrift1 server and thrift2 server are totally different servers. If a new feature is added to thrift1 server, thrfit2 server have to make the same change to support it(e.g. authorization). After this issue, thrift2 server is inherited from thrift1, thrift2 server now have all the features thrift1 server has(e.g http support, which thrift2 server doesn't have before).  The way to start thrift1 or thrift2 server remain the same after this issue.
10605
10606
10607 ---
10608
10609 * [HBASE-21661](https://issues.apache.org/jira/browse/HBASE-21661) | *Major* | **Provide Thrift2 implementation of Table/Admin**
10610
10611 ThriftAdmin/ThriftTable are implemented based on Thrift2. With ThriftAdmin/ThriftTable, People can use thrift2 protocol just like HTable/HBaseAdmin.
10612 Example of using ThriftConnection
10613 Configuration conf = HBaseConfiguration.create();
10614 conf.set(ClusterConnection.HBASE\_CLIENT\_CONNECTION\_IMPL,ThriftConnection.class.getName());
10615 Connection conn = ConnectionFactory.createConnection(conf);
10616 Table table = conn.getTable(tablename)
10617 It is just like a normal Connection, similar use experience with the default ConnectionImplementation
10618
10619
10620 ---
10621
10622 * [HBASE-21618](https://issues.apache.org/jira/browse/HBASE-21618) | *Critical* | **Scan with the same startRow(inclusive=true) and stopRow(inclusive=false) returns one result**
10623
10624 There was a bug when scan with the same startRow(inclusive=true) and stopRow(inclusive=false). The old incorrect behavior is return one result. After this fix, the new correct behavior is return nothing.
10625
10626
10627 ---
10628
10629 * [HBASE-21159](https://issues.apache.org/jira/browse/HBASE-21159) | *Major* | **Add shell command to switch throttle on or off**
10630
10631 Support enable or disable rpc throttle when hbase quota is enabled. If hbase quota is enabled, rpc throttle is enabled by default.  When disable rpc throttle, HBase will not throttle any request. Use the following commands to switch rpc throttle : enable\_rpc\_throttle / disable\_rpc\_throttle.
10632
10633
10634 ---
10635
10636 * [HBASE-21659](https://issues.apache.org/jira/browse/HBASE-21659) | *Minor* | **Avoid to load duplicate coprocessors in system config and table descriptor**
10637
10638 Add a new configuration "hbase.skip.load.duplicate.table.coprocessor". The default value is false to keep compatible with the old behavior. Config it true to skip load duplicate table coprocessor.
10639
10640
10641 ---
10642
10643 * [HBASE-21650](https://issues.apache.org/jira/browse/HBASE-21650) | *Major* | **Add DDL operation and some other miscellaneous to thrift2**
10644
10645 Added DDL operations and some other structure definition to thrift2. Methods added:
10646 create/modify/addColumnFamily/deleteColumnFamily/modifyColumnFamily/enable/disable/truncate/delete table
10647 create/modify/delete namespace
10648 get(list)TableDescriptor(s)/get(list)NamespaceDescirptor(s)
10649 tableExists/isTableEnabled/isTableDisabled/isTableAvailabe
10650 And some class definitions along with those methods
10651
10652
10653 ---
10654
10655 * [HBASE-21643](https://issues.apache.org/jira/browse/HBASE-21643) | *Major* | **Introduce two new region coprocessor method and deprecated postMutationBeforeWAL**
10656
10657 Deprecated region coprocessor postMutationBeforeWAL and introduce two new region coprocessor postIncrementBeforeWAL and postAppendBeforeWAL instead.
10658
10659
10660 ---
10661
10662 * [HBASE-21635](https://issues.apache.org/jira/browse/HBASE-21635) | *Major* | **Use maven enforcer to ban imports from illegal packages**
10663
10664 Use de.skuzzle.enforcer.restrict-imports-enforcer-rule extension for maven enforcer plugin to ban illegal imports at compile time. Now if you use illegal imports, for example, import com.google.common.\*, there will be a compile error, instead of a checkstyle warning.
10665
10666
10667 ---
10668
10669 * [HBASE-21401](https://issues.apache.org/jira/browse/HBASE-21401) | *Critical* | **Sanity check when constructing the KeyValue**
10670
10671 Add a sanity check when constructing KeyValue from a byte[]. we use the constructor when we're reading kv from socket or HFIle or WAL(replication). the santiy check isn't designed for discovering the bits corruption in network transferring or disk IO. It is designed to detect bugs inside HBase in advance. and HBASE-21459 indicated that there's extremely small performance loss for diff kinds of keyvalue.
10672
10673
10674 ---
10675
10676 * [HBASE-21554](https://issues.apache.org/jira/browse/HBASE-21554) | *Minor* | **Show replication endpoint classname for replication peer on master web UI**
10677
10678 The replication UI on master will show the replication endpoint classname.
10679
10680
10681 ---
10682
10683 * [HBASE-21549](https://issues.apache.org/jira/browse/HBASE-21549) | *Major* | **Add shell command for serial replication peer**
10684
10685 Add a SERIAL flag for add\_peer command to identifiy whether or not the replication peer is a serial replication peer. The default serial flag is false.
10686
10687
10688 ---
10689
10690 * [HBASE-21453](https://issues.apache.org/jira/browse/HBASE-21453) | *Major* | **Convert ReadOnlyZKClient to DEBUG instead of INFO**
10691
10692 Log level of ReadOnlyZKClient moved to debug.
10693
10694
10695 ---
10696
10697 * [HBASE-21283](https://issues.apache.org/jira/browse/HBASE-21283) | *Minor* | **Add new shell command 'rit' for listing regions in transition**
10698
10699 <!-- markdown -->
10700
10701 The HBase `shell` now includes a command to list regions currently in transition.
10702
10703 ```
10704 HBase Shell
10705 Use "help" to get list of supported commands.
10706 Use "exit" to quit this interactive shell.
10707 Version 1.5.0-SNAPSHOT, r9bb6d2fa8b760f16cd046657240ebd4ad91cb6de, Mon Oct  8 21:05:50 UTC 2018
10708
10709 hbase(main):001:0> help 'rit'
10710 List all regions in transition.
10711 Examples:
10712   hbase> rit
10713
10714 hbase(main):002:0> create ...
10715 0 row(s) in 2.5150 seconds
10716 => Hbase::Table - IntegrationTestBigLinkedList
10717
10718 hbase(main):003:0> rit
10719 0 row(s) in 0.0340 seconds
10720
10721 hbase(main):004:0> unassign '56f0c38c81ae453d19906ce156a2d6a1'
10722 0 row(s) in 0.0540 seconds
10723
10724 hbase(main):005:0> rit
10725 IntegrationTestBigLinkedList,L\xCC\xCC\xCC\xCC\xCC\xCC\xCB,1539117183224.56f0c38c81ae453d19906ce156a2d6a1. state=PENDING_CLOSE, ts=Tue Oct 09 20:33:34 UTC 2018 (0s ago), server=null
10726 1 row(s) in 0.0170 seconds
10727 ```
10728
10729
10730 ---
10731
10732 * [HBASE-21567](https://issues.apache.org/jira/browse/HBASE-21567) | *Major* | **Allow overriding configs starting up the shell**
10733
10734 Allow passing of -Dkey=value option to shell to override hbase-\* configuration: e.g.:
10735
10736 $ ./bin/hbase shell -Dhbase.zookeeper.quorum=ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org -Draining=false
10737 ...
10738 hbase(main):001:0\> @shell.hbase.configuration.get("hbase.zookeeper.quorum")
10739 =\> "ZK0.remote.cluster.example.org,ZK1.remote.cluster.example.org,ZK2.remote.cluster.example.org"
10740 hbase(main):002:0\> @shell.hbase.configuration.get("raining")
10741 =\> "false"
10742
10743
10744 ---
10745
10746 * [HBASE-21560](https://issues.apache.org/jira/browse/HBASE-21560) | *Major* | **Return a new TableDescriptor for MasterObserver#preModifyTable to allow coprocessor modify the TableDescriptor**
10747
10748 Incompatible change. Allow MasterObserver#preModifyTable to return a new TableDescriptor. And master will use this returned TableDescriptor to modify table.
10749
10750
10751 ---
10752
10753 * [HBASE-21551](https://issues.apache.org/jira/browse/HBASE-21551) | *Blocker* | **Memory leak when use scan with STREAM at server side**
10754
10755 <!-- markdown -->
10756 ### Summary
10757 HBase clusters will experience Region Server failures due to out of memory errors due to a leak given any of the following:
10758
10759 * User initiates Scan operations set to use the STREAM reading type
10760 * User initiates Scan operations set to use the default reading type that read more than 4 * the block size of column families involved in the scan (e.g. by default 4*64KiB)
10761 * Compactions run
10762
10763 ### Root cause
10764
10765 When there are long running scans the Region Server process attempts to optimize access by using a different API geared towards sequential access. Due to an error in HBASE-20704 for HBase 2.0+ the Region Server fails to release related resources when those scans finish. That same optimization path is always used for the HBase internal file compaction process.
10766
10767 ### Workaround
10768
10769 Impact for this error can be minimized by setting the config value “hbase.storescanner.pread.max.bytes” to MAX_INT to avoid the optimization for default user scans. Clients should also be checked to ensure they do not pass the STREAM read type to the Scan API. This will have a severe impact on performance for long scans.
10770
10771 Compactions always use this sequential optimized reading mechanism so downstream users will need to periodically restart Region Server roles after compactions have happened.
10772
10773
10774 ---
10775
10776 * [HBASE-21550](https://issues.apache.org/jira/browse/HBASE-21550) | *Major* | **Add a new method preCreateTableRegionInfos for MasterObserver which allows CPs to modify the TableDescriptor**
10777
10778 Add a new method preCreateTableRegionInfos for MasterObserver, which will be called before creating region infos for the given table,  before the preCreateTable method. It allows you to return a new TableDescritor to override the original one. Returns null or throws exception will stop the creation.
10779
10780
10781 ---
10782
10783 * [HBASE-21492](https://issues.apache.org/jira/browse/HBASE-21492) | *Critical* | **CellCodec Written To WAL Before It's Verified**
10784
10785 After HBASE-21492 the return type of WALCellCodec#getWALCellCodecClass has been changed from String to Class
10786
10787
10788 ---
10789
10790 * [HBASE-21387](https://issues.apache.org/jira/browse/HBASE-21387) | *Major* | **Race condition surrounding in progress snapshot handling in snapshot cache leads to loss of snapshot files**
10791
10792 To prevent race condition between in progress snapshot (performed by TakeSnapshotHandler) and HFileCleaner which results in data loss, this JIRA introduced mutual exclusion between taking snapshot and running HFileCleaner. That is, at any given moment, either some snapshot can be taken or, HFileCleaner checks hfiles which are not referenced, but not both can be running.
10793
10794
10795 ---
10796
10797 * [HBASE-21452](https://issues.apache.org/jira/browse/HBASE-21452) | *Major* | **Illegal character in hbase counters group name**
10798
10799 Changes group name of hbase metrics from "HBase Counters" to "HBaseCounters".
10800
10801
10802 ---
10803
10804 * [HBASE-21443](https://issues.apache.org/jira/browse/HBASE-21443) | *Major* | **[hbase-connectors] Purge hbase-\* modules from core now they've been moved to hbase-connectors**
10805
10806 Parent issue moved hbase-spark\* modules to hbase-connectors. This issue removes hbase-spark\* modules from hbase core repo.
10807
10808
10809 ---
10810
10811 * [HBASE-21430](https://issues.apache.org/jira/browse/HBASE-21430) | *Major* | **[hbase-connectors] Move hbase-spark\* modules to hbase-connectors repo**
10812
10813 hbase-spark\* modules have been cloned to https://github.com/apache/hbase-connectors All spark connector dev is to happen in that repo from here on out.
10814
10815 Let me file a subtask to remove hbase-spark\* modules from hbase core.
10816
10817
10818 ---
10819
10820 * [HBASE-21417](https://issues.apache.org/jira/browse/HBASE-21417) | *Critical* | **Pre commit build is broken due to surefire plugin crashes**
10821
10822 Add -Djdk.net.URLClassPath.disableClassPathURLCheck=true when executing surefire plugin.
10823
10824
10825 ---
10826
10827 * [HBASE-21191](https://issues.apache.org/jira/browse/HBASE-21191) | *Major* | **Add a holding-pattern if no assign for meta or namespace (Can happen if masterprocwals have been cleared).**
10828
10829 Puts master startup into holding pattern if meta is not assigned (previous it would exit). To make progress again, operator needs to inject an assign (Caveats and instruction can be found in HBASE-21035).
10830
10831
10832 ---
10833
10834 * [HBASE-21322](https://issues.apache.org/jira/browse/HBASE-21322) | *Critical* | **Add a scheduleServerCrashProcedure() API to HbckService**
10835
10836 Adds scheduleServerCrashProcedure to the HbckService.
10837
10838
10839 ---
10840
10841 * [HBASE-21325](https://issues.apache.org/jira/browse/HBASE-21325) | *Major* | **Force to terminate regionserver when abort hang in somewhere**
10842
10843 Add two new config hbase.regionserver.abort.timeout and hbase.regionserver.abort.timeout.task. If regionserver abort timeout, it will schedule an abort timeout task to run. The default abort task is SystemExitWhenAbortTimeout, which will force to terminate region server when abort timeout. And you can config a special abort timeout task by hbase.regionserver.abort.timeout.task.
10844
10845
10846 ---
10847
10848 * [HBASE-21215](https://issues.apache.org/jira/browse/HBASE-21215) | *Major* | **Figure how to invoke hbck2; make it easy to find**
10849
10850 Adds to bin/hbase means of invoking hbck2. Pass the new '-j' option on the 'hbck' command with a value of the full path to the HBCK2.jar.
10851
10852 E.g:
10853
10854 $ ./bin/hbase hbck -j ~/checkouts/hbase-operator-tools/hbase-hbck2/target/hbase-hbck2-1.0.0-SNAPSHOT.jar  setTableState x ENABLED
10855
10856
10857 ---
10858
10859 * [HBASE-21372](https://issues.apache.org/jira/browse/HBASE-21372) | *Major* | **Set hbase.assignment.maximum.attempts to Long.MAX**
10860
10861 Retry assigns 'forever' (or until an intervention such as a ServerCrashProcedure).
10862
10863 Previous retry was a maximum of ten times but on failure, handling was an indeterminate.
10864
10865
10866 ---
10867
10868 * [HBASE-21338](https://issues.apache.org/jira/browse/HBASE-21338) | *Major* | **[balancer] If balancer is an ill-fit for cluster size, it gives little indication**
10869
10870 The description claims the balancer not dynamically configurable but this is an error; it is http://hbase.apache.org/book.html#dyn\_config
10871
10872 Also, if balancer is seen to be cutting out too soon, try setting "hbase.master.balancer.stochastic.runMaxSteps" to true.
10873
10874 Adds cleaner logging around balancer start.
10875
10876
10877 ---
10878
10879 * [HBASE-21073](https://issues.apache.org/jira/browse/HBASE-21073) | *Major* | **"Maintenance mode" master**
10880
10881     Instead of being an ephemeral state set by hbck, maintenance mode is now
10882     an explicit toggle set by either configuration property or environment
10883     variable. In maintenance mode, master will host system tables and not
10884     assign any user-space tables to RSs. This gives operators the ability to
10885     affect repairs to meta table with fewer moving parts.
10886
10887
10888 ---
10889
10890 * [HBASE-21335](https://issues.apache.org/jira/browse/HBASE-21335) | *Critical* | **Change the default wait time of HBCK2 tool**
10891
10892 Changed waitTime parameter to lockWait on bypass. Changed default waitTime from 0 -- i.e. wait for ever -- to 1ms so if lock is held, we'll go past it and if override enforce bypass.
10893
10894
10895 ---
10896
10897 * [HBASE-21291](https://issues.apache.org/jira/browse/HBASE-21291) | *Major* | **Add a test for bypassing stuck state-machine procedures**
10898
10899 bypass will now throw an Exception if passed a lockWait \<= 0; i.e bypass will prevent an operator getting stuck on an entity lock waiting forever (lockWait == 0)
10900
10901
10902 ---
10903
10904 * [HBASE-21320](https://issues.apache.org/jira/browse/HBASE-21320) | *Major* | **[canary] Cleanup of usage and add commentary**
10905
10906 Cleans up usage and docs around Canary.  Does not change command-line args (though we should -- smile).
10907
10908
10909 ---
10910
10911 * [HBASE-21278](https://issues.apache.org/jira/browse/HBASE-21278) | *Critical* | **Do not rollback successful sub procedures when rolling back a procedure**
10912
10913 For the sub procedures which are successfully finished, do not do rollback. This is a change in rollback behavior.
10914
10915 State changes which are done by sub procedures should be handled by parent procedures when rolling back. For example, when rolling back a MergeTableProcedure, we will schedule new procedures to bring the offline regions online instead of rolling back the original procedures which off-lined the regions (in fact these procedures can not be rolled back...).
10916
10917
10918 ---
10919
10920 * [HBASE-21158](https://issues.apache.org/jira/browse/HBASE-21158) | *Critical* | **Empty qualifier cell should not be returned if it does not match QualifierFilter**
10921
10922 <!-- markdown -->
10923
10924 Scans that make use of `QualifierFilter` previously would erroneously return both columns with an empty qualifier along with those that matched. After this change that behavior has changed to only return those columns that match.
10925
10926
10927 ---
10928
10929 * [HBASE-21098](https://issues.apache.org/jira/browse/HBASE-21098) | *Major* | **Improve Snapshot Performance with Temporary Snapshot Directory when rootDir on S3**
10930
10931 It is recommended to place the working directory on-cluster on HDFS as doing so has shown a strong performance increase due to data locality. It is important to note that the working directory should not overlap with any existing directories as the working directory will be cleaned out during the snapshot process. Beyond that, any well-named directory on HDFS should be sufficient.
10932
10933
10934 ---
10935
10936 * [HBASE-21185](https://issues.apache.org/jira/browse/HBASE-21185) | *Minor* | **WALPrettyPrinter: Additional useful info to be printed by wal printer tool, for debugability purposes**
10937
10938 This adds two extra features to WALPrettyPrinter tool:
10939
10940 1) Output for each cell combined size of cell descriptors, plus the cell value itself, in a given WAL edit. This is printed on the results as "cell total size sum:" info by default;
10941
10942 2) An optional -g/--goto argument, that allows to seek straight to that specific WAL file position, then sequentially reading the WAL from that point towards its end;
10943
10944
10945 ---
10946
10947 * [HBASE-21287](https://issues.apache.org/jira/browse/HBASE-21287) | *Major* | **JVMClusterUtil Master initialization wait time not configurable**
10948
10949 Local HBase cluster (as used by unit tests) wait times on startup and initialization can be configured via \`hbase.master.start.timeout.localHBaseCluster\` and \`hbase.master.init.timeout.localHBaseCluster\`
10950
10951
10952 ---
10953
10954 * [HBASE-21280](https://issues.apache.org/jira/browse/HBASE-21280) | *Trivial* | **Add anchors for each heading in UI**
10955
10956 Adds anchors #tables, #tasks, etc.
10957
10958
10959 ---
10960
10961 * [HBASE-21232](https://issues.apache.org/jira/browse/HBASE-21232) | *Major* | **Show table state in Tables view on Master home page**
10962
10963 Add table state column to the tables panel
10964
10965
10966 ---
10967
10968 * [HBASE-21223](https://issues.apache.org/jira/browse/HBASE-21223) | *Critical* | **[amv2] Remove abort\_procedure from shell**
10969
10970 Removed the abort\_procedure command from shell -- dangerous -- and deprecated abortProcedure in Admin API.
10971
10972
10973 ---
10974
10975 * [HBASE-20636](https://issues.apache.org/jira/browse/HBASE-20636) | *Major* | **Introduce two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED**
10976
10977 Add two bloom filter type : ROWPREFIX\_FIXED\_LENGTH and ROWPREFIX\_DELIMITED
10978 1. ROWPREFIX\_FIXED\_LENGTH: specify the length of the prefix
10979 2. ROWPREFIX\_DELIMITED: specify the delimiter of the prefix
10980 Need to specify parameters for these two types of bloomfilter, otherwise the table will fail to create
10981 Example:
10982 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_FIXED\_LENGTH', CONFIGURATION =\> {'RowPrefixBloomFilter.prefix\_length' =\> '10'}}
10983 create 't1', {NAME =\> 'f1', BLOOMFILTER =\> 'ROWPREFIX\_DELIMITED', CONFIGURATION =\> {'RowPrefixDelimitedBloomFilter.delimiter' =\> '#'}}
10984
10985
10986 ---
10987
10988 * [HBASE-21156](https://issues.apache.org/jira/browse/HBASE-21156) | *Critical* | **[hbck2] Queue an assign of hbase:meta and bulk assign/unassign**
10989
10990 Adds 'raw' assigns/unassigns to the Hbck Service. Takes a list of encoded region names and bulk assigns/unassigns. Skirts Master 'state' check and does not invoke Coprocessors. For repair only.
10991
10992 Here is what HBCK2 usage looks like now:
10993
10994 {code}
10995 $ java -cp hbase-hbck2-1.0.0-SNAPSHOT.jar  org.apache.hbase.HBCK2
10996 usage: HBCK2 \<OPTIONS\> COMMAND [\<ARGS\>]
10997
10998 Options:
10999  -d,--debug                      run with debug output
11000  -h,--help                       output this help message
11001     --hbase.zookeeper.peerport   peerport of target hbase ensemble
11002     --hbase.zookeeper.quorum     ensemble of target hbase
11003     --zookeeper.znode.parent     parent znode of target hbase
11004
11005 Commands:
11006  setTableState \<TABLENAME\> \<STATE\>
11007    Possible table states: ENABLED, DISABLED, DISABLING, ENABLING
11008    To read current table state, in the hbase shell run:
11009      hbase\> get 'hbase:meta', '\<TABLENAME\>', 'table:state'
11010    A value of \\x08\\x00 == ENABLED, \\x08\\x01 == DISABLED, etc.
11011    An example making table name 'user' ENABLED:
11012      $ HBCK2 setTableState users ENABLED
11013    Returns whatever the previous table state was.
11014
11015  assign \<ENCODED\_REGIONNAME\> ...
11016    A 'raw' assign that can be used even during Master initialization.
11017    Skirts Coprocessors. Pass one or more encoded RegionNames:
11018    e.g. 1588230740 is hard-coded encoding for hbase:meta region and
11019    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
11020    user-space encoded Region name looks like. For example:
11021      $ HBCK2 assign 1588230740 de00010733901a05f5a2a3a382e27dd4
11022    Returns the pid of the created AssignProcedure or -1 if none.
11023
11024  unassign \<ENCODED\_REGIONNAME\> ...
11025    A 'raw' unassign that can be used even during Master initialization.
11026    Skirts Coprocessors. Pass one or more encoded RegionNames:
11027    Skirts Coprocessors. Pass one or more encoded RegionNames:
11028    de00010733901a05f5a2a3a382e27dd4 is an example of what a random
11029    user-space encoded Region name looks like. For example:
11030      $ HBCK2 unassign 1588230740 de00010733901a05f5a2a3a382e27dd4
11031    Returns the pid of the created UnassignProcedure or -1 if none.
11032 {code}
11033
11034
11035 ---
11036
11037 * [HBASE-21021](https://issues.apache.org/jira/browse/HBASE-21021) | *Major* | **Result returned by Append operation should be ordered**
11038
11039 This change ensures Append operations are assembled into the expected order.
11040
11041
11042 ---
11043
11044 * [HBASE-21171](https://issues.apache.org/jira/browse/HBASE-21171) | *Major* | **[amv2] Tool to parse a directory of MasterProcWALs standalone**
11045
11046 Make it so can run the WAL parse and load system in isolation. Here is an example:
11047
11048 {code}$ HBASE\_OPTS=" -XX:+UnlockDiagnosticVMOptions -XX:+UnlockCommercialFeatures -XX:+FlightRecorder -XX:+DebugNonSafepoints" ./bin/hbase org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore ~/big\_set\_of\_masterprocwals/
11049 {code}
11050
11051
11052 ---
11053
11054 * [HBASE-21107](https://issues.apache.org/jira/browse/HBASE-21107) | *Minor* | **add a metrics for netty direct memory**
11055
11056 Add a new nettyDirectMemoryUsage under server's ipc metrics to show direct memory usage for netty rpc server.
11057
11058
11059 ---
11060
11061 * [HBASE-21153](https://issues.apache.org/jira/browse/HBASE-21153) | *Major* | **Shaded client jars should always build in relevant phase to avoid confusion**
11062
11063 Client facing artifacts are now built whenever Maven is run through the "package" goal. Previously, the client facing artifacts would create placeholder jars that skipped repackaging HBase and third-party dependencies unless the "release" profile was active.
11064
11065 Build times may be noticeably longer depending on your build hardware. For example, the Jenkins worker nodes maintained by ASF Infra take ~14% longer to do a full packaging build. An example portability-focused personal laptop took ~25% longer.
11066
11067
11068 ---
11069
11070 * [HBASE-20942](https://issues.apache.org/jira/browse/HBASE-20942) | *Major* | **Improve RpcServer TRACE logging**
11071
11072 Allows configuration of the length of RPC messages printed to the log at TRACE level via "hbase.ipc.trace.param.size" in RpcServer.
11073
11074
11075 ---
11076
11077 * [HBASE-20649](https://issues.apache.org/jira/browse/HBASE-20649) | *Minor* | **Validate HFiles do not have PREFIX\_TREE DataBlockEncoding**
11078
11079 <!-- markdown -->
11080 Users who have previously made use of prefix tree encoding can now check that their existing HFiles no longer contain data that uses it with an additional preupgrade check command.
11081
11082 ```
11083 hbase pre-upgrade validate-hfile
11084 ```
11085
11086 Please see the "HFile Content validation" section of the ref guide's coverage of the pre-upgrade validator tool for usage details.
11087
11088
11089 ---
11090
11091 * [HBASE-20941](https://issues.apache.org/jira/browse/HBASE-20941) | *Major* | **Create and implement HbckService in master**
11092
11093 Adds an HBCK Service and a first method to force-change-in-table-state for use by an HBCK client effecting 'repair' to a malfunctioning HBase.
11094
11095
11096 ---
11097
11098 * [HBASE-21071](https://issues.apache.org/jira/browse/HBASE-21071) | *Major* | **HBaseTestingUtility::startMiniCluster() to use builder pattern**
11099
11100 Cleanup all the cluster start override combos in HBaseTestingUtility by adding a StartMiniClusterOption and Builder.
11101
11102
11103 ---
11104
11105 * [HBASE-21072](https://issues.apache.org/jira/browse/HBASE-21072) | *Major* | **Block out HBCK1 in hbase2**
11106
11107 Fence out hbase-1.x hbck1 instances. Stop them making state changes on an hbase-2.x cluster; they could do damage. We do this by writing the hbck1 lock file into place on hbase-2.x Master start-up.
11108
11109 To disable this new behavior, set hbase.write.hbck1.lock.file to false
11110
11111
11112 ---
11113
11114 * [HBASE-20881](https://issues.apache.org/jira/browse/HBASE-20881) | *Major* | **Introduce a region transition procedure to handle all the state transition for a region**
11115
11116 Introduced a new TransitRegionStateProcedure to replace the old AssignProcedure/UnassignProcedure/MoveRegionProcedure. In the old code, MRP will not be attached to RegionStateNode, so it can not be interrupted by ServerCrashProcedure, which introduces lots of tricky code to deal with races, and also causes lots of other difficulties on how to prevent scheduling redundant or even conflict procedures for a region.
11117
11118 And now TRSP is the only one procedure which can bring region online or offline. When you want to schedule one, you need to check whether there is already one attached to the RegionStateNode, under the lock of the RegionStateNode. If not just go ahead, and if there is one, then you should do something, for example, give up and fail directly, or tell the TRSP to give up(This is what SCP does). Since the check and attach are both under the lock of RSN, it will greatly reduce the possible races, and make the code much simpler.
11119
11120
11121 ---
11122
11123 * [HBASE-21012](https://issues.apache.org/jira/browse/HBASE-21012) | *Critical* | **Revert the change of serializing TimeRangeTracker**
11124
11125 HFiles generated by 2.0.0, 2.0.1, 2.1.0 are not forward compatible to 1.4.6-, 1.3.2.1-, 1.2.6.1-, and other inactive releases. Why HFile lose compatability is hbase in new versions (2.0.0, 2.0.1, 2.1.0) use protobuf to serialize/deserialize TimeRangeTracker (TRT) while old versions use DataInput/DataOutput. To solve this, We have to put HBASE-21012 to 2.x and put HBASE-21013 in 1.x. For more information, please check HBASE-21008.
11126
11127
11128 ---
11129
11130 * [HBASE-20965](https://issues.apache.org/jira/browse/HBASE-20965) | *Major* | **Separate region server report requests to new handlers**
11131
11132 After HBASE-20965, we can use MasterFifoRpcScheduler in master to separate RegionServerReport requests to indenpedent handler. To use this feature, please set "hbase.master.rpc.scheduler.factory.class" to
11133  "org.apache.hadoop.hbase.ipc.MasterFifoRpcScheduler". Use "hbase.master.server.report.handler.count" to set RegionServerReport handlers count, the default value is half of "hbase.regionserver.handler.count" value, but at least 1, and the other handlers count in master is "hbase.regionserver.handler.count" value minus RegionServerReport handlers count, but at least 1 too.
11134
11135
11136 ---
11137
11138 * [HBASE-20813](https://issues.apache.org/jira/browse/HBASE-20813) | *Minor* | **Remove RPC quotas when the associated table/Namespace is dropped off**
11139
11140 In previous releases, when a Space Quota was configured on a table or namespace and that table or namespace was deleted, the Space Quota was also deleted. This change improves the implementation so that the same is also done for RPC Quotas.
11141
11142
11143 ---
11144
11145 * [HBASE-20986](https://issues.apache.org/jira/browse/HBASE-20986) | *Major* | **Separate the config of block size when we do log splitting and write Hlog**
11146
11147 After HBASE-20986, we can set different value to block size of WAL and recovered edits. Both of their default value is 2 \* default HDFS blocksize. And hbase.regionserver.recoverededits.blocksize is for block size of recovered edits while hbase.regionserver.hlog.blocksize is for block size of WAL.
11148
11149
11150 ---
11151
11152 * [HBASE-20856](https://issues.apache.org/jira/browse/HBASE-20856) | *Minor* | **PITA having to set WAL provider in two places**
11153
11154 With this change if a WAL's meta provider (hbase.wal.meta\_provider) is not explicitly set, it now defaults to whatever hbase.wal.provider is set to. Previous, the two settings operated independently, each with its own default.
11155
11156 This change is operationally incompatible with previous HBase versions because the default WAL meta provider no longer defaults to AsyncFSWALProvider but to hbase.wal.provider.
11157
11158 The thought is that this is more in line with an operator's expectation, that a change in hbase.wal.provider is sufficient to change how WALs are written, especially given hbase.wal.meta\_provider is an obscure configuration and that the very idea that meta regions would have their own wal provider would likely come as a surprise.
11159
11160
11161 ---
11162
11163 * [HBASE-20538](https://issues.apache.org/jira/browse/HBASE-20538) | *Critical* | **Upgrade our hadoop versions to 2.7.7 and 3.0.3**
11164
11165 Update hadoop-two.version to 2.7.7 and hadoop-three.version to 3.0.3 due to a JDK issue which is solved by HADOOP-15473.
11166
11167
11168 ---
11169
11170 * [HBASE-20846](https://issues.apache.org/jira/browse/HBASE-20846) | *Major* | **Restore procedure locks when master restarts**
11171
11172 1. Make hasLock method final, and add a locked field in Procedure to record whether we have the lock. We will set it to true in doAcquireLock and to false in doReleaseLock. The sub procedures do not need to manage it any more.
11173
11174 2. Also added a locked field in the proto message. When storing, the field will be set according to the return value of hasLock. And when loading, there is a new field in Procedure called lockedWhenLoading. We will set it to true if the locked field in proto message is true.
11175
11176 3. The reason why we can not set the locked field directly to true by calling doAcquireLock is that, during initialization, most procedures need to wait until master is initialized. So the solution here is that, we introduced a new method called waitInitialized in Procedure, and move the wait master initialized related code from acquireLock to this method. And we added a restoreLock method to Procedure, if lockedWhenLoading is true, we will call the acquireLock to get the lock, but do not set locked to true. And later when we call doAcquireLock and pass the waitInitialized check, we will test lockedWhenLoading, if it is true, when we just set the locked field to true and return, without actually calling the acquireLock method since we have already called it once.
11177
11178
11179 ---
11180
11181 * [HBASE-20672](https://issues.apache.org/jira/browse/HBASE-20672) | *Minor* | **New metrics ReadRequestRate and WriteRequestRate**
11182
11183 Exposing 2 new metrics in HBase to provide ReadRequestRate and WriteRequestRate at region server level. These metrics give the rate of request handled by the region server and are reset after every monitoring interval.
11184
11185
11186 ---
11187
11188 * [HBASE-6028](https://issues.apache.org/jira/browse/HBASE-6028) | *Minor* | **Implement a cancel for in-progress compactions**
11189
11190 Added a new command to the shell to switch on/off compactions called "compaction\_switch". Disabling compactions will interrupt any currently ongoing compactions. This setting will be lost on restart of the server. Added the configuration hbase.regionserver.compaction.enabled so user can enable/disable compactions via hbase-site.xml.
11191
11192
11193 ---
11194
11195 * [HBASE-20884](https://issues.apache.org/jira/browse/HBASE-20884) | *Major* | **Replace usage of our Base64 implementation with java.util.Base64**
11196
11197 Class org.apache.hadoop.hbase.util.Base64 has been removed in it's entirety from HBase 2+. In HBase 1, unused methods have been removed from the class and the audience was changed from  Public to Private. This class was originally intended as an internal utility class that could be used externally but thinking since changed; these classes should not have been advertised as public to end-users.
11198
11199 This represents an incompatible change for users who relied on this implementation. An alternative implementation for affected clients is available at java.util.Base64 when using Java 8 or newer; be aware, it may encode/decode differently. For clients seeking to restore this specific implementation, it is available in the public domain for download at http://iharder.sourceforge.net/current/java/base64/
11200
11201
11202 ---
11203
11204 * [HBASE-20357](https://issues.apache.org/jira/browse/HBASE-20357) | *Major* | **AccessControlClient API Enhancement**
11205
11206 This enhances the AccessControlClient APIs to retrieve the permissions based on namespace, table name, family and qualifier for specific user. AccessControlClient can also validate a user whether allowed to perform specified operations on a particular table.
11207 Following APIs have been added,
11208 1) getUserPermissions(Connection connection, String tableRegex, byte[] columnFamily, byte[] columnQualifier, String userName)
11209          Scope of retrieving permission will be same as existing.
11210 2) hasPermission(onnection connection, String tableName, byte[] columnFamily, byte[] columnQualifier, String userName, Permission.Action... actions)
11211      Scope of validating user privilege,
11212            User can perform self check without any special privilege but ADMIN privilege will be required to perform check for other users.
11213            For example, suppose there are two users "userA" & "userB" then there can be below scenarios,
11214             a. When userA want to check whether userA have privilege to perform mentioned actions
11215                  userA don't need ADMIN privilege, as it's a self query.
11216             b. When userA want to check whether userB have privilege to perform mentioned actions,
11217                  userA must have ADMIN or superuser privilege, as it's trying to query for other user.
11218
11219
11220
11221 # HBASE  2.1.0 Release Notes
11222
11223 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11224
11225
11226 ---
11227
11228 * [HBASE-20691](https://issues.apache.org/jira/browse/HBASE-20691) | *Blocker* | **Storage policy should allow deferring to HDFS**
11229
11230 After HBASE-20691 we have changed the default setting of hbase.wal.storage.policy from "HOT" back to "NONE" which means we defer the policy to HDFS. This fixes the problem of release 2.0.0 that the storage policy of WAL directory will defer to HDFS and may not be "HOT" even if you explicitly set hbase.wal.storage.policy to "HOT"
11231
11232
11233 ---
11234
11235 * [HBASE-20839](https://issues.apache.org/jira/browse/HBASE-20839) | *Blocker* | **Fallback to FSHLog if we can not instantiated AsyncFSWAL when user does not specify AsyncFSWAL explicitly**
11236
11237 As we hack into the internal of DFSClient when implementing AsyncFSWAL to get better performance, a patch release of hadoop can make it broken.
11238
11239 So now, if user does not specify a wal provider, then we will first try to use 'asyncfs', i.e, the AsyncFSWALProvider. If we fail due to some compatible issues, we will fallback to 'filesystem', i.e, FSHLog.
11240
11241
11242 ---
11243
11244 * [HBASE-20193](https://issues.apache.org/jira/browse/HBASE-20193) | *Critical* | **Basic Replication Web UI - Regionserver**
11245
11246 After HBASE-20193, we add a section to web ui to show the replication status of each wal group. There are 2 parts of this section, they both show the peerId, wal group and current replicating log of each replication source. And one is showing the information of replication log queue, i.e. size of current log, log queue size and replicating offset. The other one is showing the delay of replication, i.e. last shipped age and replication delay.
11247 If the offset shows -1 and replication delay is UNKNOWN, that means replication is not started. This may be caused by this peer is disabled or the replicationEndpoint is sleeping due to some reason.
11248
11249
11250 ---
11251
11252 * [HBASE-19997](https://issues.apache.org/jira/browse/HBASE-19997) | *Blocker* | **[rolling upgrade] 1.x =\> 2.x**
11253
11254 Now we have a 'basically work' solution for rolling upgrade from 1.4.x to 2.x. Please see the "Rolling Upgrade from 1.x to 2.x" section in ref guide for more details.
11255
11256
11257 ---
11258
11259 * [HBASE-20270](https://issues.apache.org/jira/browse/HBASE-20270) | *Major* | **Turn off command help that follows all errors in shell**
11260
11261 <!-- markdown -->
11262 The command help that followed all errors, before, is now no longer available. Erroneous command inputs would now just show error-texts followed by the shell command to try for seeing the help message. It looks like: For usage try 'help “create”’. Operators can copy-paste the command to get the help message.
11263
11264
11265 ---
11266
11267 * [HBASE-20194](https://issues.apache.org/jira/browse/HBASE-20194) | *Critical* | **Basic Replication WebUI - Master**
11268
11269 After HBASE-20194, we added 2 parts to master's web page.
11270 One is Peers that shows all replication peers and some of their configurations, like peer id, cluster key, state, bandwidth, and which namespace or table it will replicate.
11271 The other one is replication status of all regionservers, we added a tab to region servers division, then we can check the replication delay of all region servers for any peer. This table shows AgeOfLastShippedOp, SizeOfLogQueue and ReplicationLag for each regionserver and the table is sort by ReplicationLag in descending order. By this way we can easily find the problematic region server. If the replication delay is UNKNOWN, that means this walGroup doesn't start replicate yet and it may get disabled. ReplicationLag will update once this peer start replicate.
11272
11273
11274 ---
11275
11276 * [HBASE-18569](https://issues.apache.org/jira/browse/HBASE-18569) | *Major* | **Add prefetch support for async region locator**
11277
11278 Add prefetch support for async region locator. The default value is 10. Set 'hbase.client.locate.prefetch.limit' in hbase-site.xml if you want to use another value for it.
11279
11280
11281 ---
11282
11283 * [HBASE-20642](https://issues.apache.org/jira/browse/HBASE-20642) | *Major* | **IntegrationTestDDLMasterFailover throws 'InvalidFamilyOperationException**
11284
11285 This changes client-side nonce generation to use the same nonce for re-submissions of client RPC DDL operations.
11286
11287
11288 ---
11289
11290 * [HBASE-20708](https://issues.apache.org/jira/browse/HBASE-20708) | *Blocker* | **Remove the usage of RecoverMetaProcedure in master startup**
11291
11292 Introduce an InitMetaProcedure to initialize meta table for a new HBase deploy. Marked RecoverMetaProcedure deprecated and remove the usage of it in the current code base. We still need to keep it in place for compatibility. The code in RecoverMetaProcedure has been moved to ServerCrashProcedure, and SCP will always be enabled and we will rely on it to bring meta region online.
11293
11294 For more on the issue addressed by this commit, see the design doc for overview and plan: https://docs.google.com/document/d/1\_872oHzrhJq4ck7f6zmp1J--zMhsIFvXSZyX1Mxg5MA/edit#heading=h.xy1z4alsq7uy
11295
11296
11297 ---
11298
11299 * [HBASE-20334](https://issues.apache.org/jira/browse/HBASE-20334) | *Major* | **add a test that expressly uses both our shaded client and the one from hadoop 3**
11300
11301 <!-- markdown -->
11302
11303 HBase now includes a helper script that can be used to run a basic functionality test for a given HBase installation at in `dev_support`. The test can optionally be given an HBase client artifact to rely on and can optionally be given specific Hadoop client artifacts to use.
11304
11305 For usage information see `./dev-support/hbase_nightly_pseudo-distributed-test.sh --help`.
11306
11307 The project nightly tests now make use of this test to check running on top of Hadoop 2, Hadoop 3, and Hadoop 3 with shaded client artifacts.
11308
11309
11310 ---
11311
11312 * [HBASE-19735](https://issues.apache.org/jira/browse/HBASE-19735) | *Major* | **Create a minimal "client" tarball installation**
11313
11314 <!-- markdown -->
11315
11316 The HBase convenience binary artifacts now includes a client focused tarball that a) includes more docs and b) does not include scripts or jars only needed for running HBase cluster services.
11317
11318 The new artifact is made as a normal part of the `assembly:single` maven command.
11319
11320
11321 ---
11322
11323 * [HBASE-20615](https://issues.apache.org/jira/browse/HBASE-20615) | *Major* | **emphasize use of shaded client jars when they're present in an install**
11324
11325 <!-- markdown -->
11326
11327 HBase's built in scripts now rely on the downstream facing shaded artifacts where possible. In particular interest to downstream users, the `hbase classpath` and `hbase mapredcp` commands now return the relevant shaded client artifact and only those third paty jars needed to make use of them (e.g. slf4j-api, commons-logging, htrace, etc).
11328
11329 Downstream users should note that by default the `hbase classpath` command will treat having `hadoop` on the shell's PATH as an implicit request to include the output of the `hadoop classpath` command in the returned classpath. This long-existing behavior can be opted out of by setting the environment variable `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP` to the value "true". For example: `HBASE_DISABLE_HADOOP_CLASSPATH_LOOKUP="true" bin/hbase classpath`.
11330
11331
11332 ---
11333
11334 * [HBASE-20333](https://issues.apache.org/jira/browse/HBASE-20333) | *Critical* | **break up shaded client into one with no Hadoop and one that's standalone**
11335
11336 <!-- markdown -->
11337
11338 Downstream users who need to use both HBase and Hadoop APIs should switch to relying on the new `hbase-shaded-client-byo-hadoop` artifact rather than the existing `hbase-shaded-client` artifact. The new artifact no longer includes and Hadoop classes.
11339
11340 It should work in combination with either the output of `hadoop classpath` or the Hadoop provided client-facing shaded artifacts in Hadoop 3+.
11341
11342
11343 ---
11344
11345 * [HBASE-20332](https://issues.apache.org/jira/browse/HBASE-20332) | *Critical* | **shaded mapreduce module shouldn't include hadoop**
11346
11347 <!-- markdown -->
11348
11349 The `hbase-shaded-mapreduce` artifact no longer include its own copy of Hadoop classes. Users who make use of the artifact via YARN should be able to get these classes from YARN's classpath without having to make any changes.
11350
11351
11352 ---
11353
11354 * [HBASE-20681](https://issues.apache.org/jira/browse/HBASE-20681) | *Major* | **IntegrationTestDriver fails after HADOOP-15406 due to missing hamcrest-core**
11355
11356 <!-- markdown -->
11357
11358 Users of our integration tests on Hadoop 3 can now add all needed dependencies by pointing at jars included in our binary convenience artifact.
11359
11360 Prior to this fix, downstream users on Hadoop 3 would need to get a copy of the Hamcrest v1.3 jar from elsewhere.
11361
11362
11363 ---
11364
11365 * [HBASE-19852](https://issues.apache.org/jira/browse/HBASE-19852) | *Major* | **HBase Thrift 1 server SPNEGO Improvements**
11366
11367 Adds two new properties for hbase-site.xml for THRIFT SPNEGO when in HTTP mode:
11368 \* hbase.thrift.spnego.keytab.file
11369 \* hbase.thrift.spnego.principal
11370
11371
11372 ---
11373
11374 * [HBASE-20590](https://issues.apache.org/jira/browse/HBASE-20590) | *Critical* | **REST Java client is not able to negotiate with the server in the secure mode**
11375
11376 Adds a negotiation logic between a secure java REST client and server. After this jira the Java REST client will start responding to the Negotiate challenge sent by the server. Adds RESTDemoClient which can be used to verify whether the secure Java REST client works against secure REST server or not.
11377
11378
11379 ---
11380
11381 * [HBASE-20634](https://issues.apache.org/jira/browse/HBASE-20634) | *Critical* | **Reopen region while server crash can cause the procedure to be stuck**
11382
11383 A second attempt at fixing HBASE-20173. Fixes unfinished keeping of server state inside AM (ONLINE=\>SPLITTING=\>OFFLINE=\>null). Concurrent unassigns look at server state to figure if they should wait on SCP to wake them up or not.
11384
11385
11386 ---
11387
11388 * [HBASE-20579](https://issues.apache.org/jira/browse/HBASE-20579) | *Minor* | **Improve snapshot manifest copy in ExportSnapshot**
11389
11390 This patch adds an FSUtil.copyFilesParallel() to help copy files in parallel, and it will return all the paths of directories and files traversed. Thus when we copy manifest in ExportSnapshot, we can copy reference files concurrently and use the paths it returns to help setOwner and setPermission.
11391 The size of thread pool is determined by the configuration snapshot.export.copy.references.threads, and its default value is the number of runtime available processors.
11392
11393
11394 ---
11395
11396 * [HBASE-18116](https://issues.apache.org/jira/browse/HBASE-18116) | *Major* | **Replication source in-memory accounting should not include bulk transfer hfiles**
11397
11398 Before this change we would incorrectly include the size of enqueued store files for bulk replication in the calculation for determining whether or not to rate limit the transfer of WAL edits. Because bulk replication uses a separate and asynchronous mechanism for file transfer this could incorrectly limit the batch sizes for WAL replication if bulk replication in progress, with negative impact on latency and throughput.
11399
11400
11401 ---
11402
11403 * [HBASE-20592](https://issues.apache.org/jira/browse/HBASE-20592) | *Minor* | **Create a tool to verify tables do not have prefix tree encoding**
11404
11405 PreUpgradeValidator tool with DataBlockEncoding validator was added to verify cluster is upgradable to HBase 2.
11406
11407
11408 ---
11409
11410 * [HBASE-20501](https://issues.apache.org/jira/browse/HBASE-20501) | *Blocker* | **Change the Hadoop minimum version to 2.7.1**
11411
11412 <!-- markdown -->
11413 HBase is no longer able to maintain compatibility with Apache Hadoop versions that are no longer receiving updates. This release raises the minimum supported version to Hadoop 2.7.1. Downstream users are strongly advised to upgrade to the latest Hadoop 2.7 maintenance release.
11414
11415 Downstream users of earlier HBase versions are similarly advised to upgrade to Hadoop 2.7.1+. When doing so, it is especially important to follow the guidance from [the HBase Reference Guide's Hadoop section](http://hbase.apache.org/book.html#hadoop) on replacing the Hadoop artifacts bundled with HBase.
11416
11417
11418 ---
11419
11420 * [HBASE-20601](https://issues.apache.org/jira/browse/HBASE-20601) | *Minor* | **Add multiPut support and other miscellaneous to PE**
11421
11422 1. Add multiPut support
11423 Set --multiPut=number to enable batchput(meanwhile, --autoflush need be set to false)
11424
11425 2. Add Connection Count support
11426 Added a new parameter connCount to PE. set --connCount=2 means all threads will share 2 connections.
11427 oneCon option and connCount option shouldn't be set at the same time.
11428
11429 3. Add avg RT and avg TPS/QPS statstic for all threads
11430
11431 4. Delete some redundant code
11432 Now RandomWriteTest is inherited from SequentialWrite.
11433
11434
11435 ---
11436
11437 * [HBASE-20544](https://issues.apache.org/jira/browse/HBASE-20544) | *Blocker* | **downstream HBaseTestingUtility fails with invalid port**
11438
11439 <!-- markdown -->
11440
11441 HBase now relies on an internal mechanism to determine when it is running a local hbase cluster meant for external interaction vs an encapsulated test. When created via the `HBaseTestingUtility`, ports for Master and RegionServer services and UIs will be set to random ports to allow for multiple parallel uses on a single machine. Normally when running a Standalone HBase Deployment (as described in the HBase Reference Guide) the ports will be picked according to the same defaults used in a full cluster set up. If you wish to instead use the random port assignment set `hbase.localcluster.assign.random.ports` to true.
11442
11443
11444 ---
11445
11446 * [HBASE-20004](https://issues.apache.org/jira/browse/HBASE-20004) | *Minor* | **Client is not able to execute REST queries in a secure cluster**
11447
11448 Added 'hbase.rest.http.allow.options.method' configuration property to allow user to decide whether Rest Server HTTP should allow OPTIONS method or not. By default it is enabled in HBase 2.1.0+ versions and in other versions it is disabled.
11449 Similarly 'hbase.thrift.http.allow.options.method' is added HBase 1.5, 2.1.0 and 3.0.0 versions. It is disabled by default.
11450
11451
11452 ---
11453
11454 * [HBASE-20327](https://issues.apache.org/jira/browse/HBASE-20327) | *Minor* | **When qualifier is not specified, append and incr operation do not work (shell)**
11455
11456 This change will enable users to perform append and increment operation with null qualifier via hbase-shell.
11457
11458
11459 ---
11460
11461 * [HBASE-18842](https://issues.apache.org/jira/browse/HBASE-18842) | *Minor* | **The hbase shell clone\_snaphost command returns bad error message**
11462
11463 <!-- markdown -->
11464
11465 When attempting to clone a snapshot but using a namespace that does not exist, the HBase shell will now correctly report the exception as caused by the passed namespace. Previously, the shell would report that the problem was an unknown namespace but it would claim the user provided table name was not found as a namespace. Both before and after this change the shell properly used the passed namespace to attempt to handle the request.
11466
11467
11468 ---
11469
11470 * [HBASE-20406](https://issues.apache.org/jira/browse/HBASE-20406) | *Major* | **HBase Thrift HTTP - Shouldn't handle TRACE/OPTIONS methods**
11471
11472 <!-- markdown -->
11473 When configured to do thrift-over-http, the HBase Thrift API Server no longer accepts the HTTP methods TRACE nor OPTIONS.
11474
11475
11476 ---
11477
11478 * [HBASE-20046](https://issues.apache.org/jira/browse/HBASE-20046) | *Major* | **Reconsider the implementation for serial replication**
11479
11480 Now in replication we can make sure the order of pushing logs is same as the order of requests from client. Set the serial flag to true for a replication peer to enable this feature.
11481
11482
11483 ---
11484
11485 * [HBASE-20159](https://issues.apache.org/jira/browse/HBASE-20159) | *Major* | **Support using separate ZK quorums for client**
11486
11487 After HBASE-20159 we allow client to use different ZK quorums by introducing three new properties: hbase.client.zookeeper.quorum and hbase.client.zookeeper.property.clientPort to specify client zookeeper properties (note that the combination of these two properties should be different from the server ZK quorums), and hbase.client.zookeeper.observer.mode to indicate whether the client ZK nodes are in observer mode (false by default)
11488
11489 HConstants.DEFAULT\_ZOOKEPER\_CLIENT\_PORT has been removed in HBase 3.0 and replaced by the correctly spelled DEFAULT\_ZOOKEEPER\_CLIENT\_PORT.
11490
11491
11492 ---
11493
11494 * [HBASE-20242](https://issues.apache.org/jira/browse/HBASE-20242) | *Major* | **The open sequence number will grow if we fail to open a region after writing the max sequence id file**
11495
11496 Now when opening a region, we will store the current max sequence id of the region to its max sequence id file instead of the 'next sequence id'. This could avoid the sequence id bumping when we fail to open a region, and also align to the behavior when we close a region.
11497
11498
11499 ---
11500
11501 * [HBASE-19024](https://issues.apache.org/jira/browse/HBASE-19024) | *Critical* | **Configurable default durability for synchronous WAL**
11502
11503 The default durability setting for the synchronous WAL is Durability.SYNC\_WAL, which triggers HDFS hflush() to flush edits to the datanodes. We also support Durability.FSYNC\_WAL, which instead triggers HDFS hsync() to flush \_and\_ fsync edits. This change introduces the new configuration setting "hbase.wal.hsync", defaulting to FALSE, that if set to TRUE changes the default durability setting for the synchronous WAL to  FSYNC\_WAL.
11504
11505
11506 ---
11507
11508 * [HBASE-19389](https://issues.apache.org/jira/browse/HBASE-19389) | *Critical* | **Limit concurrency of put with dense (hundreds) columns to prevent write handler exhausted**
11509
11510 After HBASE-19389 we introduced a RegionServer self-protection mechanism to prevent write handler getting exhausted by high concurrency put with dense columns, mainly through two new properties: hbase.region.store.parallel.put.limit.min.column.count to decide what kind of put (with how many columns within a single column family) to limit (100 by default) and hbase.region.store.parallel.put.limit to limit the concurrency (10 by default). There's another property for advanced user and please check source and javadoc of StoreHotnessProtector for more details.
11511
11512
11513 ---
11514
11515 * [HBASE-20148](https://issues.apache.org/jira/browse/HBASE-20148) | *Major* | **Make serial replication as a option for a peer instead of a table**
11516
11517 A new method setSerial has been added to the interface ReplicationPeerConfigBuilder which is marked as IA.Public. This interface is not supposed to be implemented by client code, but if you do, this will be an incompatible change as you need to add this method to your implementation too.
11518
11519
11520 ---
11521
11522 * [HBASE-19397](https://issues.apache.org/jira/browse/HBASE-19397) | *Major* | **Design  procedures for ReplicationManager to notify peer change event from master**
11523
11524 Introduce 5 procedures to do peer modifications:
11525 AddPeerProcedure
11526 RemovePeerProcedure
11527 UpdatePeerConfigProcedure
11528 EnablePeerProcedure
11529 DisablePeerProcedure
11530
11531 The procedures are all executed with the following stage:
11532 1. Call pre CP hook, if an exception is thrown then give up
11533 2. Check whether the operation is valid, if not then give up
11534 3. Update peer storage. Notice that if we have entered this stage, then we can not rollback any more.
11535 4. Schedule sub procedures to refresh the peer config on every RS.
11536 5. Do post cleanup if any.
11537 6. Call post CP hook. The exception thrown will be ignored since we have already done the work.
11538
11539 The procedure will hold an exclusive lock on the peer id, so now there is no concurrent modifications on a single peer.
11540
11541 And now it is guaranteed that once the procedure is done, the peer modification has already taken effect on all RSes.
11542
11543 Abstracte a storage layer for replication peer/queue manangement, and refactored the upper layer to remove zk related naming/code/comment.
11544
11545 Add pre/postExecuteProcedures CP hooks to RegionServerObserver, and add permission check for executeProcedures method which requires the caller to be system user or super user.
11546
11547 On rolling upgrade: just do not do any replication peer modifications during the rolling upgrading. There is no pb/layout changes on the peer/queue storage on zk.
11548 # HBASE  2.0.0 Release Notes
11549
11550
11551 These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.
11552
11553
11554 ---
11555
11556 * [HBASE-20464](https://issues.apache.org/jira/browse/HBASE-20464) | *Major* | **Disable IMC**
11557
11558 Change the default so that on creation of new tables, In-Memory Compaction BASIC is NOT enabled.
11559
11560 This change is in branch-2.0 only, not in branch-2.
11561
11562
11563 ---
11564
11565 * [HBASE-20276](https://issues.apache.org/jira/browse/HBASE-20276) | *Blocker* | **[shell] Revert shell REPL change and document**
11566
11567 <!-- markdown -->
11568
11569
11570
11571 The HBase shell now behaves as it did prior to the changes that started in HBASE-15965. Namely, some shell commands return values that may be further manipulated within the shell's IRB session.
11572
11573 The command line option `--return-values` is no longer acted on by the shell since it now always behaves as it did when passed this parameter. Passing the option results in a harmless warning about this change.
11574
11575 Users who wish to maintain the behavior seen in the 1.4.0-1.4.2 releases of the HBase shell should refer to the section _irbrc_ in the reference guide for how to configure their IRB session to avoid echoing expression results to the console.
11576
11577
11578 ---
11579
11580 * [HBASE-18792](https://issues.apache.org/jira/browse/HBASE-18792) | *Blocker* | **hbase-2 needs to defend against hbck operations**
11581
11582 As of HBase version 2.0, the hbck tool is significantly changed. In general, all Read-Only options are supported and can be be used safely. Most -fix/ -repair options are NOT supported. Please see usage below for details on which options are not supported:
11583
11584
11585 Usage: fsck [opts] {only tables}
11586  where [opts] are:
11587    -help Display help options (this)
11588    -details Display full report of all regions.
11589    -timelag \<timeInSeconds\>  Process only regions that  have not experienced any metadata updates in the last  \<timeInSeconds\> seconds.
11590    -sleepBeforeRerun \<timeInSeconds\> Sleep this many seconds before checking if the fix worked if run with -fix
11591    -summary Print only summary of the tables and status.
11592    -metaonly Only check the state of the hbase:meta table.
11593    -sidelineDir \<hdfs://\> HDFS path to backup existing meta.
11594    -boundaries Verify that regions boundaries are the same between META and store files.
11595    -exclusive Abort if another hbck is exclusive or fixing.
11596
11597   Datafile Repair options: (expert features, use with caution!)
11598    -checkCorruptHFiles     Check all Hfiles by opening them to make sure they are valid
11599    -sidelineCorruptHFiles  Quarantine corrupted HFiles.  implies -checkCorruptHFiles
11600
11601  Replication options
11602    -fixReplication   Deletes replication queues for removed peers
11603
11604   Metadata Repair options supported as of version 2.0: (expert features, use with caution!)
11605    -fixVersionFile   Try to fix missing hbase.version file in hdfs.
11606    -fixReferenceFiles  Try to offline lingering reference store files
11607    -fixHFileLinks  Try to offline lingering HFileLinks
11608    -noHdfsChecking   Don't load/check region info from HDFS. Assumes hbase:meta region info is good. Won't check/fix any HDFS issue, e.g. hole, orphan, or overlap
11609    -ignorePreCheckPermission  ignore filesystem permission pre-check
11610
11611 NOTE: Following options are NOT supported as of HBase version 2.0+.
11612
11613   UNSUPPORTED Metadata Repair options: (expert features, use with caution!)
11614    -fix              Try to fix region assignments.  This is for backwards compatiblity
11615    -fixAssignments   Try to fix region assignments.  Replaces the old -fix
11616    -fixMeta          Try to fix meta problems.  This assumes HDFS region info is good.
11617    -fixHdfsHoles     Try to fix region holes in hdfs.
11618    -fixHdfsOrphans   Try to fix region dirs with no .regioninfo file in hdfs
11619    -fixTableOrphans  Try to fix table dirs with no .tableinfo file in hdfs (online mode only)
11620    -fixHdfsOverlaps  Try to fix region overlaps in hdfs.
11621    -maxMerge \<n\>     When fixing region overlaps, allow at most \<n\> regions to merge. (n=5 by default)
11622    -sidelineBigOverlaps  When fixing region overlaps, allow to sideline big overlaps
11623    -maxOverlapsToSideline \<n\>  When fixing region overlaps, allow at most \<n\> regions to sideline per group. (n=2 by default)
11624    -fixSplitParents  Try to force offline split parents to be online.
11625    -removeParents    Try to offline and sideline lingering parents and keep daughter regions.
11626    -fixEmptyMetaCells  Try to fix hbase:meta entries not referencing any region (empty REGIONINFO\_QUALIFIER rows)
11627
11628   UNSUPPORTED Metadata Repair shortcuts
11629    -repair           Shortcut for -fixAssignments -fixMeta -fixHdfsHoles -fixHdfsOrphans -fixHdfsOverlaps -fixVersionFile -sidelineBigOverlaps -fixReferenceFiles-fixHFileLinks
11630    -repairHoles      Shortcut for -fixAssignments -fixMeta -fixHdfsHoles
11631
11632
11633 ---
11634
11635 * [HBASE-19994](https://issues.apache.org/jira/browse/HBASE-19994) | *Major* | **Create a new class for RPC throttling exception, make it retryable.**
11636
11637 A new RpcThrottlingException deprecates ThrottlingException. The new RpcThrottlingException is a retryable Exception that clients will retry when Rpc throttling quota is exceeded. The deprecated ThrottlingException is a nonretryable Exception.
11638
11639
11640 ---
11641
11642 * [HBASE-20224](https://issues.apache.org/jira/browse/HBASE-20224) | *Blocker* | **Web UI is broken in standalone mode**
11643
11644 Standalone webui was broken inadvertently by HBASE-20027.
11645
11646
11647 ---
11648
11649 * [HBASE-18784](https://issues.apache.org/jira/browse/HBASE-18784) | *Major* | **Use of filesystem that requires hflush / hsync / append / etc should query outputstream capabilities**
11650
11651 <!-- markdown -->
11652
11653
11654
11655 If HBase is run on top of Apache Hadoop libraries that support the needed APIs it will verify that underlying Filesystem implementations provide the needed durability mechanisms to safely operate. The needed APIs *should* be present in Hadoop 3 release and Hadoop 2 releases starting in the Hadoop 2.9 series. If the APIs are not available, HBase behaves as it has in previous releases (that is, it moves forward assuming such a check would pass).
11656
11657 Where this check fails, it is unsafe to rely on HBase in a production setting. In the event of process or node failure, the HBase RegionServer process may fail to have access to all the data it previously wrote to its write ahead log, resulting in data loss. In the event of process or node failure, the HBase master process may lose all or part of the write ahead log that it relies on for cluster management operations, leaving the cluster in an inconsistent state that we aren't sure it could recover from.
11658
11659 Notably, the LocalFileSystem implementation provided by Hadoop reports (accurately) via these new APIs that it can not provide the durability HBase needs to operate. As such, the current instructions for single-node HBase operation have been updated both with a) how to bypass this safety check and b) a strong warning about the dire consequences of doing so outside of a dev/test environment.
11660
11661
11662 ---
11663
11664 * [HBASE-20219](https://issues.apache.org/jira/browse/HBASE-20219) | *Critical* | **An error occurs when scanning with reversed=true and loadColumnFamiliesOnDemand=true**
11665
11666 Throws DoNotRetryIOException when you ask for a reverse scan loading adjacent column families on demand. Previous it threw IllegalStateException
11667
11668
11669 ---
11670
11671 * [HBASE-20358](https://issues.apache.org/jira/browse/HBASE-20358) | *Minor* | **Fix bin/hbase thrift usage text**
11672
11673 Cleanup usage message and command-line processing (no functional change).
11674
11675
11676 ---
11677
11678 * [HBASE-20182](https://issues.apache.org/jira/browse/HBASE-20182) | *Blocker* | **Can not locate region after split and merge**
11679
11680 Now if we hit a split parent when locating a region, we will skip to the next row and try again until the region does not contain our row. So there will be no RegionOfflineException for a split parent any more, instead, if the split children have not been onlined yet, i.e, we finally arrive at a region which does not contain our row, an IOException will be thrown.
11681
11682
11683 ---
11684
11685 * [HBASE-20149](https://issues.apache.org/jira/browse/HBASE-20149) | *Critical* | **Purge dev javadoc from bin tarball (or make a separate tarball of javadoc)**
11686
11687 We no longer include dev or dev test javadocs in our binary bundle. We still build them; they are just not included because they were half the size of the resultant tarball.
11688
11689 Here is our story on javadoc as of this commit:
11690
11691  \* apidocs - user facing main api javadocs. currently for a release line, published on website and linked from menu. included in the bin tarball
11692  \* devapidocs - hbase internal javadocs. currently for a release line, published on the website but not linked from the menu. no longer included in the bin tarball.
11693  \* testapidocs - user facing test scope api javadocs. currently for a release line, not published. included in the bin tarball.
11694  \* testdevapidocs - hbase internal test scope javadocs. currently for a release line, not published. no longer included in the bin tarball
11695
11696
11697 ---
11698
11699 * [HBASE-18828](https://issues.apache.org/jira/browse/HBASE-18828) | *Blocker* | **[2.0] Generate CHANGES.txt**
11700
11701 Moves us over to yetus releasedocmaker tooling generating CHANGES. CHANGES is not markdown (CHANGES.md) as opposed to CHANGES.txt. We've also added a new RELEASENOTES.md that lists JIRA release notes (courtesy of releasedocmaker).
11702
11703 CHANGES/RELEASENOTES are current as of now. Will need a 'freshening' when we cut the RC.
11704
11705
11706 ---
11707
11708 * [HBASE-14175](https://issues.apache.org/jira/browse/HBASE-14175) | *Critical* | **Adopt releasedocmaker for better generated release notes**
11709
11710 We will use yetus releasedocmaker to make our changes doc from here on out. A CHANGELOG.md will replace our current CHANGES.txt. Adjacent, we'll keep up a RELEASENOTES.md doc courtesy of releasedocmaker.
11711
11712 Over in HBASE-18828 is where we are working through steps for the RM integrating this new tooling.
11713
11714
11715 ---
11716
11717 * [HBASE-16499](https://issues.apache.org/jira/browse/HBASE-16499) | *Critical* | **slow replication for small HBase clusters**
11718
11719 Changed the default value for replication.source.ratio from 0.1 to 0.5. Which means now by default 50% of the total RegionServers in peer cluster(s) will participate in replication.
11720
11721
11722 ---
11723
11724 * [HBASE-16459](https://issues.apache.org/jira/browse/HBASE-16459) | *Trivial* | **Remove unused hbase shell --format option**
11725
11726 <!-- markdown -->
11727
11728
11729
11730
11731 The HBase `shell` command no longer recognizes the option `--format`. Previously this option only recognized the default value of 'console'. The default value is now always used.
11732
11733
11734 ---
11735
11736 * [HBASE-20259](https://issues.apache.org/jira/browse/HBASE-20259) | *Critical* | **Doc configs for in-memory-compaction and add detail to in-memory-compaction logging**
11737
11738 Disables in-memory compaction as default.
11739
11740 Adds logging of in-memory compaction configuration on creation.
11741
11742 Adds a chapter to the refguide on this new feature.
11743
11744
11745 ---
11746
11747 * [HBASE-20282](https://issues.apache.org/jira/browse/HBASE-20282) | *Major* | **Provide short name invocations for useful tools**
11748
11749 \`hbase regionsplitter\` is a new short invocation for \`hbase org.apache.hadoop.hbase.util.RegionSplitter\`
11750
11751
11752 ---
11753
11754 * [HBASE-20314](https://issues.apache.org/jira/browse/HBASE-20314) | *Major* | **Precommit build for master branch fails because of surefire fork fails**
11755
11756 Upgrade surefire plugin to 2.21.0.
11757
11758
11759 ---
11760
11761 * [HBASE-20130](https://issues.apache.org/jira/browse/HBASE-20130) | *Critical* | **Use defaults (16020 & 16030) as base ports when the RS is bound to localhost**
11762
11763 <!-- markdown -->
11764
11765
11766
11767 When region servers bind to localhost (mostly in pseudo distributed mode), default ports (16020 & 16030) are used as base ports. This will support up to 9 instances of region servers by default with `local-regionservers.sh` script. If additional instances are needed, see the reference guide on how to deploy with a different range using the environment variables `HBASE_RS_BASE_PORT` and `HBASE_RS_INFO_BASE_PORT`.
11768
11769
11770 ---
11771
11772 * [HBASE-20111](https://issues.apache.org/jira/browse/HBASE-20111) | *Critical* | **Able to split region explicitly even on shouldSplit return false from split policy**
11773
11774 When a split is requested on a Region, the RegionServer hosting that Region will now consult the configured SplitPolicy for that table when determining if a split of that Region is allowed. When a split is disallowed (due to the Region not being OPEN or the SplitPolicy denying the request), the operation will \*not\* be implicitly retried as it has previously done. Users will need to guard against and explicitly retry region split requests which are denied by the system.
11775
11776
11777 ---
11778
11779 * [HBASE-20223](https://issues.apache.org/jira/browse/HBASE-20223) | *Blocker* | **Use hbase-thirdparty 2.1.0**
11780
11781 Moves commons-cli and commons-collections4 into the HBase thirdparty shaded jar which means that these are no longer generally available for users on the classpath.
11782
11783
11784 ---
11785
11786 * [HBASE-19128](https://issues.apache.org/jira/browse/HBASE-19128) | *Major* | **Purge Distributed Log Replay from codebase, configurations, text; mark the feature as unsupported, broken.**
11787
11788 Removes Distributed Log Replay feature. Disable the feature before upgrading.
11789
11790
11791 ---
11792
11793 * [HBASE-19504](https://issues.apache.org/jira/browse/HBASE-19504) | *Major* | **Add TimeRange support into checkAndMutate**
11794
11795 1) checkAndMutate accept a TimeRange to query the specified cell
11796 2) remove writeToWAL flag from Region#checkAndMutate since it is useless (this is a incompatible change)
11797
11798
11799 ---
11800
11801 * [HBASE-20237](https://issues.apache.org/jira/browse/HBASE-20237) | *Critical* | **Put back getClosestRowBefore and throw UnknownProtocolException instead... for asynchbase client**
11802
11803 Throw UnknownProtocolException if a client connects and tries to invoke the old getClosestRowOrBefore method. Pre-hbase-1.0.0 or asynchbase do this instead of using its replacement, the reverse Scan.
11804
11805 getClosestRowOrBefore was implemented as a flag on Get. Before this patch though the flag was set, hbase2 were ignoring it. This made it look like a pre-1.0.0 client was 'working' but then it'd fail finding the appropriate Region for a client-specified row doing lookups into hbase:meta.
11806
11807
11808 ---
11809
11810 * [HBASE-20247](https://issues.apache.org/jira/browse/HBASE-20247) | *Major* | **Set version as 2.0.0 in branch-2.0 in prep for first RC**
11811
11812 Set version as 2.0.0 on branch-2.0.
11813
11814
11815 ---
11816
11817 * [HBASE-20090](https://issues.apache.org/jira/browse/HBASE-20090) | *Major* | **Properly handle Preconditions check failure in MemStoreFlusher$FlushHandler.run**
11818
11819 When there is concurrent region split, MemStoreFlusher may not find flushable region if the only candidate region left hasn't received writes (resulting in 0 data size).
11820 After this JIRA, such scenario wouldn't trigger Precondition assertion (replaced by an if statement to see whether there is any flushable region).
11821 If there is no flushable region, a DEBUG log would appear in region server log, saying "Above memory mark but there is no flushable region".
11822
11823
11824 ---
11825
11826 * [HBASE-19552](https://issues.apache.org/jira/browse/HBASE-19552) | *Major* | **update hbase to use new thirdparty libs**
11827
11828 hbase-thirdparty libs have moved to o.a.h.thirdparty offset. Netty shading system property is no longer necessary.
11829
11830
11831 ---
11832
11833 * [HBASE-20119](https://issues.apache.org/jira/browse/HBASE-20119) | *Minor* | **Introduce a pojo class to carry coprocessor information in order to make TableDescriptorBuilder accept multiple cp at once**
11834
11835 1) Make all methods in TableDescriptorBuilder be setter pattern.
11836 addCoprocessor -\> setCoprocessor
11837 addColumnFamily -\> setColumnFamily
11838 (addCoprocessor and addColumnFamily are still in branch-2 but they are marked as deprecated)
11839 2) add CoprocessorDescriptor to carry cp information
11840 3) add CoprocessorDescriptorBuilder to build CoprocessorDescriptor
11841 4) TD disallow user to set negative priority to coprocessor since parsing the negative value will cause a exception
11842
11843
11844 ---
11845
11846 * [HBASE-17165](https://issues.apache.org/jira/browse/HBASE-17165) | *Critical* | **Add retry to LoadIncrementalHFiles tool**
11847
11848 Adds retry to load of incremental hfiles. Pertinent key is HConstants.HBASE\_CLIENT\_RETRIES\_NUMBER. Default is HConstants.DEFAULT\_HBASE\_CLIENT\_RETRIES\_NUMBER.
11849
11850
11851 ---
11852
11853 * [HBASE-20108](https://issues.apache.org/jira/browse/HBASE-20108) | *Critical* | **\`hbase zkcli\` falls into a non-interactive prompt after HBASE-15199**
11854
11855 This issue fixes a runtime dependency issues where JLine is not made available on the classpath which causes the ZooKeeper CLI to appear non-interactive. JLine was being made available unintentionally via the JRuby jar file on the classpath for the HBase shell. While the JRuby jar is not always present, the fix made here was to selectively include the JLine dependency on the zkcli command's classpath.
11856
11857
11858 ---
11859
11860 * [HBASE-8770](https://issues.apache.org/jira/browse/HBASE-8770) | *Blocker* | **deletes and puts with the same ts should be resolved according to mvcc/seqNum**
11861
11862 This behavior is available as a new feature. See HBASE-15968 release note.
11863
11864 This issue is just about adding to the refguide documentation on the HBASE\_15968 feature.
11865
11866
11867 ---
11868
11869 * [HBASE-19114](https://issues.apache.org/jira/browse/HBASE-19114) | *Major* | **Split out o.a.h.h.zookeeper from hbase-server and hbase-client**
11870
11871 Splits out most of ZooKeeper related code into a separate new module: hbase-zookeeper.
11872 Also, renames some ZooKeeper related classes to follow a common naming pattern - "ZK" prefix - as compared to many different styles earlier.
11873
11874
11875 ---
11876
11877 * [HBASE-19437](https://issues.apache.org/jira/browse/HBASE-19437) | *Critical* | **Batch operation can't handle the null result for Append/Increment**
11878
11879 The result from server is changed from null to Result.EMPTY\_RESULT when Append/Increment operation can't retrieve any data from server,
11880
11881
11882 ---
11883
11884 * [HBASE-17448](https://issues.apache.org/jira/browse/HBASE-17448) | *Major* | **Export metrics from RecoverableZooKeeper**
11885
11886 Committed to master and branch-1
11887
11888
11889 ---
11890
11891 * [HBASE-19400](https://issues.apache.org/jira/browse/HBASE-19400) | *Major* | **Add missing security checks in MasterRpcServices**
11892
11893 Added ACL check to following Admin functions:
11894 enableCatalogJanitor, runCatalogJanitor, cleanerChoreSwitch, runCleanerChore, execProcedure, execProcedureWithReturn, normalize, normalizerSwitch, coprocessorService.
11895 When ACL is enabled, only those with ADMIN rights will be able to invoke these operations successfully.
11896
11897
11898 ---
11899
11900 * [HBASE-20048](https://issues.apache.org/jira/browse/HBASE-20048) | *Blocker* | **Revert serial replication feature**
11901
11902 Revert the serial replication feature from all branches. Plan to reimplement it soon and land onto 2.1 release line.
11903
11904
11905 ---
11906
11907 * [HBASE-19166](https://issues.apache.org/jira/browse/HBASE-19166) | *Blocker* | **AsyncProtobufLogWriter persists ProtobufLogWriter as class name for backward compatibility**
11908
11909 For backward compatibility, AsyncProtobufLogWriter uses "ProtobufLogWriter" as writer class name and SecureAsyncProtobufLogWriter uses "SecureProtobufLogWriter" as writer class name.
11910
11911
11912 ---
11913
11914 * [HBASE-18596](https://issues.apache.org/jira/browse/HBASE-18596) | *Blocker* | **[TEST] A hbase1 cluster should be able to replicate to a hbase2 cluster; verify**
11915
11916 Replication between versions verified as basically working. 0.98.25-SNAPSHOT to beta-2 hbase2 and a 1.2-ish version tried.
11917
11918
11919 ---
11920
11921 * [HBASE-20017](https://issues.apache.org/jira/browse/HBASE-20017) | *Blocker* | **BufferedMutatorImpl submit the same mutation repeatedly**
11922
11923 This change fixes multithreading issues in the implementation of BufferedMutator. BufferedMutator should not be used with 1.4 releases prior to 1.4.2.
11924
11925
11926 ---
11927
11928 * [HBASE-20032](https://issues.apache.org/jira/browse/HBASE-20032) | *Minor* | **Receving multiple warnings for missing reporting.plugins.plugin.version**
11929
11930 Add (latest) version elements missing from reporting plugins in top-level pom.
11931
11932
11933 ---
11934
11935 * [HBASE-19954](https://issues.apache.org/jira/browse/HBASE-19954) | *Major* | **Separate TestBlockReorder into individual tests to avoid ShutdownHook suppression error against hadoop3**
11936
11937 hadoop3 minidfscluster removes all shutdown handlers when the cluster goes down which made this test that does FS-stuff fail (Fix was to break up the test so each test method ran with an unadulterated FS).
11938
11939
11940 ---
11941
11942 * [HBASE-20014](https://issues.apache.org/jira/browse/HBASE-20014) | *Major* | **TestAdmin1 Times out**
11943
11944 Ups the overall test timeout from 10 minutes to 13minutes. 15minutes is the surefire timeout.
11945
11946
11947 ---
11948
11949 * [HBASE-20020](https://issues.apache.org/jira/browse/HBASE-20020) | *Critical* | **Make sure we throw DoNotRetryIOException when ConnectionImplementation is closed**
11950
11951 Add checkClosed to core Client methods. Avoid unnecessary retry.
11952
11953
11954 ---
11955
11956 * [HBASE-19978](https://issues.apache.org/jira/browse/HBASE-19978) | *Major* | **The keepalive logic is incomplete in ProcedureExecutor**
11957
11958 Completes keep-alive logic and then enables it; ProcedureExecutor Workers will spin up more threads when need settling back to the core count after the burst in demand has passed. Default keep-alive is one minute. Default core-count is CPUs/4 or 16, which ever is greater. Maximum is an arbitrary core-count \* 10 (a limit that should never be hit and if it is, there is something else very wrong).
11959
11960
11961 ---
11962
11963 * [HBASE-19950](https://issues.apache.org/jira/browse/HBASE-19950) | *Minor* | **Introduce a ColumnValueFilter**
11964
11965 ColumnValueFilter provides a way to fetch matched cells only by providing specified column, value and a comparator, which is different from SingleValueFilter, fetching an entire row as soon as a matched cell found.
11966
11967
11968 ---
11969
11970 * [HBASE-18294](https://issues.apache.org/jira/browse/HBASE-18294) | *Major* | **Reduce global heap pressure: flush based on heap occupancy**
11971
11972 A region is flushed if its memory component exceeds the region flush threshold.
11973 A flush policy decides which stores to flush by comparing the size of the store to a column-family-flush threshold.
11974 If the overall size of all memstores in the machine exceeds the bounds defined by the administrator (denoted global pressure) a region is selected and flushed.
11975 HBASE-18294 changes flush decisions to be based on heap-occupancy and not data (key-value) size, consistently across levels. This rolls back some of the changes by HBASE-16747. Specifically,
11976 (1) RSs, Regions and stores track their overall on-heap and off-heap occupancy,
11977 (2) A region is flushed when its on-heap+off-heap size exceeds the region flush threshold specified in hbase.hregion.memstore.flush.size,
11978 (3) The store to be flushed is chosen based on its on-heap+off-heap size
11979 (4) At the RS level, a flush is triggered when the overall on-heap exceeds the on-heap limit, or when the overall off-heap size exceeds the off-heap limit (low/high water marks).
11980
11981 Note that when the region flush size is set to XXmb a region flush may be triggered even before writing keys and values of size XX because the total heap occupancy of the region which includes additional metadata exceeded the threshold.
11982
11983
11984 ---
11985
11986 * [HBASE-19116](https://issues.apache.org/jira/browse/HBASE-19116) | *Critical* | **Currently the tail of hfiles with CellComparator\* classname makes it so hbase1 can't open hbase2 written hfiles; fix**
11987
11988 hbase-2.x sets KeyValue Comparators into the tail of hfiles rather than CellComparator, what it uses internally, just so hbase-1.x can continue to read hbase-2.x written hfiles.
11989
11990
11991 ---
11992
11993 * [HBASE-19948](https://issues.apache.org/jira/browse/HBASE-19948) | *Major* | **Since HBASE-19873, HBaseClassTestRule, Small/Medium/Large has different semantic**
11994
11995 In subtask, fixed doc and annotations to be more explicit that test timings are for the whole Test Fixture/Test Class/Test Suite NOT the test method only as we'd measuring up to this (tother subtasks untethered Categorization and test timeout such that all categories now have a ten minute timeout -- no test can run longer than ten minutes or it gets killed/timedout).
11996
11997
11998 ---
11999
12000 * [HBASE-16060](https://issues.apache.org/jira/browse/HBASE-16060) | *Blocker* | **1.x clients cannot access table state talking to 2.0 cluster**
12001
12002 By default, we mirror table state to zookeeper so hbase-1.x clients will work against an hbase-2 cluster (With this patch, hbase-1.x clients can do most Admin functions including table create; hbase-1.x clients can do all Table/DML against hbase-2 cluster).
12003
12004 Flag to disable mirroring is hbase.mirror.table.state.to.zookeeper; set it to false in Configuration.
12005
12006 Related, Master on startup will look to see if there are table state znodes left over by an hbase-1 instance. If any found, it will migrate the table state to hbase-2 setting the state into the hbase:meta table where table state is now kept. We will do this check on every Master start. Notion is that this will be overall beneficial with low impediment. To disable the migration check, set hbase.migrate.table.state.from.zookeeper to false.
12007
12008
12009 ---
12010
12011 * [HBASE-19900](https://issues.apache.org/jira/browse/HBASE-19900) | *Critical* | **Region-level exception destroy the result of batch**
12012
12013 This fix makes the following changes to how client handle the both of action result and region exception.
12014 1) honor the action result rather than region exception. If the action have both of true result and region exception, the action is fine as the exception is caused by other actions which are in the same region.
12015 2) honor the action exception rather than region exception. If the action have both of action exception and region exception, we deal with the action exception only. If we also handle the region exception for the same action, it will introduce the negative count of actions in progress. The AsyncRequestFuture#waitUntilDone will block forever.
12016
12017
12018 ---
12019
12020 * [HBASE-19841](https://issues.apache.org/jira/browse/HBASE-19841) | *Major* | **Tests against hadoop3 fail with StreamLacksCapabilityException**
12021
12022 HBaseTestingUtility now assumes that all clusters will use local storage until a MiniDFSCluster is started or assigned.
12023
12024
12025 ---
12026
12027 * [HBASE-19528](https://issues.apache.org/jira/browse/HBASE-19528) | *Major* | **Major Compaction Tool**
12028
12029 Tool allows you to compact a cluster with given concurrency of regionservers compacting at a given time.  If tool completes successfully everything requested for compaction will be compacted, regardless of region moves, splits and merges.
12030
12031
12032 ---
12033
12034 * [HBASE-19919](https://issues.apache.org/jira/browse/HBASE-19919) | *Major* | **Tidying up logging**
12035
12036 (I thought this change innocuous but I made work for a co-worker when I upped interval between log cleaner runs -- meant a smoke test failed because we were slow doing an expected cleanup).
12037
12038 Edit of log lines removing redundancy. Shorten thread names shown in log.  Made some log TRACE instead of DEBUG.  Capitalizations.
12039
12040 Upped log cleaner interval from every minute to every ten minutes. hbase.master.cleaner.interval
12041
12042 Lowered default count of threads started by Procedure Executor from count of CPUs to 1/4 of count of CPUs.
12043
12044
12045 ---
12046
12047 * [HBASE-19901](https://issues.apache.org/jira/browse/HBASE-19901) | *Major* | **Up yetus proclimit on nightlies**
12048
12049 Pass to yetus a dockermemlimit of 20G and a proclimit of 10000. Defaults are 4G and 1G respectively.
12050
12051
12052 ---
12053
12054 * [HBASE-19912](https://issues.apache.org/jira/browse/HBASE-19912) | *Minor* | **The flag "writeToWAL" of Region#checkAndRowMutate is useless**
12055
12056 Remove useless 'writeToWAL' flag of Region#checkAndRowMutate & related class
12057
12058
12059 ---
12060
12061 * [HBASE-19911](https://issues.apache.org/jira/browse/HBASE-19911) | *Major* | **Convert some tests from small to medium because they are timing out: TestNettyRpcServer, TestClientClusterStatus, TestCheckTestClasses**
12062
12063 Changed a few tests so they are medium sized rather than small size.
12064
12065 Also, upped the time we wait on small tests to 60seconds from 30seconds. Small tests are tests that run in 15seconds or less. What we changed was the timeout watcher. It is now more lax, more tolerant of dodgy infrastructure that might be running tests slowly.
12066
12067
12068 ---
12069
12070 * [HBASE-19892](https://issues.apache.org/jira/browse/HBASE-19892) | *Major* | **Checking 'patch attach' and yetus 0.7.0 and move to Yetus 0.7.0**
12071
12072 Moved our internal yetus reference from 0.6.0 to 0.7.0. Concurrently, I changed hadoopqa to run with 0.7.0 (by editing the config in jenkins).
12073
12074
12075 ---
12076
12077 * [HBASE-19873](https://issues.apache.org/jira/browse/HBASE-19873) | *Major* | **Add a CategoryBasedTimeout ClassRule for all UTs**
12078
12079 Along with @category -- small, medium, large -- all hbase tests must now carry a ClassRule as follows:
12080
12081 +  @ClassRule
12082 +  public static final HBaseClassTestRule CLASS\_RULE =
12083 +      HBaseClassTestRule.forClass(TestInterfaceAudienceAnnotations.class);
12084
12085 where the class changes by test.
12086
12087 Currently the classrule enforces timeout for the whole test suite -- i.e. if a SmallTest Category then all the tests in the TestSuite must complete inside 60seconds, the timeout we set on SmallTest Category test suite -- but is meant to be a repository for general, runtime, hbase test facility.
12088
12089
12090 ---
12091
12092 * [HBASE-19770](https://issues.apache.org/jira/browse/HBASE-19770) | *Critical* | **Add '--return-values' option to Shell to print return values of commands in interactive mode**
12093
12094 Introduces a new option to the HBase shell: -r, --return-values. When the shell is in "interactive" mode (default), the return value of shell commands are not returned to the user as they dirty the console output. For those who desire this functionality, the "--return-values" option restores the old functionality of the commands passing their return value to the user.
12095
12096
12097 ---
12098
12099 * [HBASE-15321](https://issues.apache.org/jira/browse/HBASE-15321) | *Major* | **Ability to open a HRegion from hdfs snapshot.**
12100
12101 HRegion.openReadOnlyFileSystemHRegion() provides the ability to open HRegion from a read-only hdfs snapshot.  Because hdfs snapshots are read-only, no cleanup happens when using this API.
12102
12103
12104 ---
12105
12106 * [HBASE-17513](https://issues.apache.org/jira/browse/HBASE-17513) | *Critical* | **Thrift Server 1 uses different QOP settings than RPC and Thrift Server 2 and can easily be misconfigured so there is no encryption when the operator expects it.**
12107
12108 This change fixes an issue where users could have unintentionally configured the HBase Thrift1 server to run without wire-encryption, when they believed they had configured the Thrift1 server to do so.
12109
12110
12111 ---
12112
12113 * [HBASE-19828](https://issues.apache.org/jira/browse/HBASE-19828) | *Major* | **Flakey TestRegionsOnMasterOptions.testRegionsOnAllServers**
12114
12115 Disables TestRegionsOnMasterOptions because Regions on Master does not work reliably; see HBASE-19831.
12116
12117
12118 ---
12119
12120 * [HBASE-18963](https://issues.apache.org/jira/browse/HBASE-18963) | *Major* | **Remove MultiRowMutationProcessor and implement mutateRows... methods using batchMutate()**
12121
12122 Modified HRegion.mutateRow() APIs to use batchMutate() instead of processRowsWithLocks() with MultiRowMutationProcessor. MultiRowMutationProcessor is removed to have single write path that uses batchMutate().
12123
12124
12125 ---
12126
12127 * [HBASE-19163](https://issues.apache.org/jira/browse/HBASE-19163) | *Major* | **"Maximum lock count exceeded" from region server's batch processing**
12128
12129 When there are many mutations against the same row in a batch, as each mutation will acquire a shared row lock, it will exceed the maximum shared lock count the java ReadWritelock supports (64k). Along with other optimization, the batch is divided into multiple possible minibatches. A new config is added to limit the maximum number of mutations in the minibatch.
12130
12131    \<property\>
12132     \<name\>hbase.regionserver.minibatch.size\</name\>
12133     \<value\>20000\</value\>
12134    \</property\>
12135 The default value is 20000.
12136
12137
12138 ---
12139
12140 * [HBASE-19739](https://issues.apache.org/jira/browse/HBASE-19739) | *Minor* | **Include thrift IDL files in HBase binary distribution**
12141
12142 Thrift IDLs are now shipped, bundled up in the respective hbase-\*thrift.jars (look for files ending in .thrift).
12143
12144
12145 ---
12146
12147 * [HBASE-11409](https://issues.apache.org/jira/browse/HBASE-11409) | *Major* | **Add more flexibility for input directory structure to LoadIncrementalHFiles**
12148
12149 Allows for users to bulk load entire tables from hdfs by specifying the parameter -loadTable.  This allows you to pass in a table level directory and have all regions column families bulk loaded, if you do not specify the -loadTable parameter LoadIncrementalHFiles will work as before. Note: you must have a pre-created table to run with -loadTable it will not create one for you.
12150
12151
12152 ---
12153
12154 * [HBASE-19769](https://issues.apache.org/jira/browse/HBASE-19769) | *Critical* | **IllegalAccessError on package-private Hadoop metrics2 classes in MapReduce jobs**
12155
12156 Client-side ZooKeeper metrics which were added to 2.0.0 alpha/beta releases cause issues when launching MapReduce jobs via {{yarn jar}} on the command line. This stems from ClassLoader separation issues that YARN implements. It was chosen that the easiest solution was to remove these ZooKeeper metrics entirely.
12157
12158
12159 ---
12160
12161 * [HBASE-19783](https://issues.apache.org/jira/browse/HBASE-19783) | *Minor* | **Change replication peer cluster key/endpoint from a not-null value to null is not allowed**
12162
12163 To reduce the confusing behavior, now when you call updatePeerConfig with empty ClusterKey or ReplicationEndpointImpl, but the value of field of the to-be-updated ReplicationPeerConfig is not null, we will throw exception instead of ignoring them.
12164
12165
12166 ---
12167
12168 * [HBASE-19483](https://issues.apache.org/jira/browse/HBASE-19483) | *Major* | **Add proper privilege check for rsgroup commands**
12169
12170 This JIRA aims at refactoring AccessController, using ACL as core library in CPs.
12171 1. Stripping out a public class AccessChecker from AccessController, using ACL as core library in CPs. AccessChecker don't have any dependency on anything CP related. Create it's instance from other CPS.
12172 2. Change the default value of hbase.security.authorization to false.
12173 3. Don't use CP hooks to check access in RSGroup. Use the access checker instance directly in functions of RSGroupAdminServiceImpl.
12174
12175
12176 ---
12177
12178 * [HBASE-19358](https://issues.apache.org/jira/browse/HBASE-19358) | *Major* | **Improve the stability of splitting log when do fail over**
12179
12180 After HBASE-19358 we introduced a new property hbase.split.writer.creation.bounded to limit the opening writers for each WALSplitter. If set to true, we won't open any writer for recovered.edits until the entries accumulated in memory reaching hbase.regionserver.hlog.splitlog.buffersize (which defaults at 128M) and will write and close the file in one go instead of keeping the writer open. It's false by default and we recommend to set it to true if your cluster has a high region load (like more than 300 regions per RS), especially when you observed obvious NN/HDFS slow down during hbase (single RS or cluster) failover.
12181
12182
12183 ---
12184
12185 * [HBASE-19651](https://issues.apache.org/jira/browse/HBASE-19651) | *Minor* | **Remove LimitInputStream**
12186
12187 HBase had copied from guava the file LmiitedInputStream. This commit removes the copied file in favor of (our internal, shaded) guava's ByteStreams.limit. Guava 14.0's LIS noted: "Use ByteStreams.limit(java.io.InputStream, long) instead. This class is scheduled to be removed in Guava release 15.0."
12188
12189
12190 ---
12191
12192 * [HBASE-19691](https://issues.apache.org/jira/browse/HBASE-19691) | *Critical* | **Do not require ADMIN permission for obtaining ClusterStatus**
12193
12194 This change reverts an unintentional requirement for global ADMIN permission to obtain cluster status from the active HMaster.
12195
12196
12197 ---
12198
12199 * [HBASE-19486](https://issues.apache.org/jira/browse/HBASE-19486) | *Major* | ** Periodically ensure records are not buffered too long by BufferedMutator**
12200
12201 The BufferedMutator now supports two settings that are used to ensure records do not stay too long in the buffer of a BufferedMutator. For periodically flushing the BufferedMutator there is now a "Timeout": "How old may the oldest record in the buffer be before we force a flush" and a "TimerTick": How often do we check if the timeout has been exceeded. Using these settings you can make the BufferedMutator automatically flush the write buffer if after the specified number of milliseconds no flush has occurred.
12202
12203 This is mainly useful in streaming scenarios (i.e. writing data into HBase using Apache Flink/Beam/Storm) where it is common (especially in a test/development situation) to see small unpredictable bursts of data that need to be written into HBase. When using the BufferedMutator till now the effect was that records would remain in the write buffer until the buffer was full or an explicit flush was triggered. In practice this would mean that the 'last few records' of a burst would remain in the write buffer until the next burst arrives filling the buffer to capacity and thus triggering a flush.
12204
12205
12206 ---
12207
12208 * [HBASE-19670](https://issues.apache.org/jira/browse/HBASE-19670) | *Major* | **Workaround: Purge User API building from branch-2 so can make a beta-1**
12209
12210 Disable filtering of User API based off yetus annotation done in doclet. See parent issue for build failure currently being worked on but not done in time for a beta-1.
12211
12212
12213 ---
12214
12215 * [HBASE-19282](https://issues.apache.org/jira/browse/HBASE-19282) | *Major* | **CellChunkMap Benchmarking and User Interface**
12216
12217 When MSLAB is in use (that is the default config) , we will always use the CellChunkMap indexing variant for in memory flushed Immutable segments. When MSLAB is turned off, we will use CellAraryMap. These can not be changed with any configs.  The in memory flush threshold been made to be default to 10% of region flush size. This can be turned using 'hbase.memstore.inmemoryflush.threshold.factor'.
12218
12219
12220 ---
12221
12222 * [HBASE-19628](https://issues.apache.org/jira/browse/HBASE-19628) | *Major* | **ByteBufferCell should extend ExtendedCell**
12223
12224 ByteBufferCell → ByteBufferExtendedCell
12225 MapReduceCell → MapReduceExtendedCell
12226 ByteBufferChunkCell → ByteBufferChunkKeyValue
12227 NoTagByteBufferChunkCell → NoTagByteBufferChunkKeyValue
12228 KeyOnlyByteBufferCell → KeyOnlyByteBufferExtendedCell
12229 TagRewriteByteBufferCell → TagRewriteByteBufferExtendedCell
12230 ValueAndTagRewriteByteBufferCell → ValueAndTagRewriteByteBufferExtendedCell
12231 EmptyByteBufferCell → EmptyByteBufferExtendedCell
12232 FirstOnRowByteBufferCell → FirstOnRowByteBufferExtendedCell
12233 LastOnRowByteBufferCell → LastOnRowByteBufferExtendedCell
12234 FirstOnRowColByteBufferCell → FirstOnRowColByteBufferExtendedCell
12235 FirstOnRowColTSByteBufferCell → FirstOnRowColTSByteBufferExtendedCell
12236 LastOnRowColByteBufferCell → LastOnRowColByteBufferCell
12237 OffheapDecodedCell → OffheapDecodedExtendedCell
12238
12239
12240 ---
12241
12242 * [HBASE-19576](https://issues.apache.org/jira/browse/HBASE-19576) | *Major* | **Introduce builder for ReplicationPeerConfig and make it immutable**
12243
12244 Add a ReplicationPeerConfigBuilder to create ReplicationPeerConfig and make ReplicationPeerConfig immutable. Meanwhile, deprecated set\* methods in ReplicationPeerConfig.
12245
12246
12247 ---
12248
12249 * [HBASE-10092](https://issues.apache.org/jira/browse/HBASE-10092) | *Critical* | **Move to slf4j**
12250
12251 We now have slf4j as our front-end. Be careful adding logging from here on out; make sure it slf4j.
12252
12253 From here on out, as us devs go, we need to convert log messages from being 'guarded' -- i.e. surrounded by if (LOG.isDebugEnabled...) -- to instead being parameterized log messages. e.g. the latter rather than the former in the below:
12254
12255 logger.debug("The new entry is "+entry+".");
12256 logger.debug("The new entry is {}.", entry);
12257
12258 See [1] for background on perf benefits.
12259
12260 Note, FATAL log level is not present in slf4j. It is noted as a Marker but won't show in logs as a LEVEL.
12261
12262 1.  https://www.slf4j.org/faq.html#logging\_performance
12263
12264
12265 ---
12266
12267 * [HBASE-19148](https://issues.apache.org/jira/browse/HBASE-19148) | *Blocker* | **Reevaluate default values of configurations**
12268
12269 Removed unused hbase.fs.tmp.dir from hbase-default.xml.
12270
12271 Upped hbase.master.fileSplitTimeout from 30s to 10minutes (suggested by production experience)
12272
12273 Added note that handler-count should be ~CPU count.
12274
12275 hbase.regionserver.logroll.multiplier has been changed from 0.95 to 0.5 AND the default block size has been doubled.
12276
12277 A few of the core configs are now dumped to the log on startup.
12278
12279
12280 ---
12281
12282 * [HBASE-19492](https://issues.apache.org/jira/browse/HBASE-19492) | *Major* | **Add EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS support to replication peer config**
12283
12284 Add two new field:  EXCLUDE\_NAMESPACE and EXCLUDE\_TABLECFS to replication peer config.
12285
12286 If replicate\_all flag is true, it means all user tables will be replicated to peer cluster. Then allow config exclude namespaces or exclude table-cfs which can't be replicated to  peer cluster.
12287
12288 If replicate\_all flag is false, it means all user tables can't be replicated to peer cluster. Then allow to config namespaces or table-cfs which will be replicated to peer cluster.
12289
12290
12291 ---
12292
12293 * [HBASE-19494](https://issues.apache.org/jira/browse/HBASE-19494) | *Major* | **Create simple WALKey filter that can be plugged in on the Replication Sink**
12294
12295 Adds means of adding very basic filter on the sink side of replication. We already have a means of installing filter source-side, which is better place to filter edits before they are shipped over the network, but this facility is needed by hbase-indexer.
12296
12297 Set hbase.replication.sink.walentrysinkfilter with a no-param Constructor implementation. See test in patch for example.
12298
12299
12300 ---
12301
12302 * [HBASE-19112](https://issues.apache.org/jira/browse/HBASE-19112) | *Blocker* | **Suspect methods on Cell to be deprecated**
12303
12304 Adds method Cell#getType which returns enum describing Cell Type.
12305
12306 Deprecates the following Cell methods:
12307
12308  getTypeByte
12309  getSequenceId
12310  getTagsArray
12311  getTagsOffset
12312  getTagsLength
12313
12314 CPs trying to build cells should use RawCellBuilderFactory that supports  building cells with tags.
12315
12316
12317 ---
12318
12319 * [HBASE-14790](https://issues.apache.org/jira/browse/HBASE-14790) | *Major* | **Implement a new DFSOutputStream for logging WAL only**
12320
12321 Implement a FanOutOneBlockAsyncDFSOutput for writing WAL only, the WAL provider which uses this class is AsyncFSWALProvider.
12322
12323 It is based on netty, and will write to 3 DNs at the same time concurrently(fan-out) so generally it will lead to a lower latency. And it is also fail-fast, the stream will become unwritable immediately after there are any read/write errors, no pipeline recovery. You need to call recoverLease to force close the output for this case. And it only supports to write a file with a single block. For WAL this is a good behavior as we can always open a new file when the old one is broken. The performance analysis in HBASE-16890 shows that it has a better performance.
12324
12325 Behavior changes:
12326 1. As now we write to 3 DNs concurrently, according to the visibility guarantee of HDFS, the data will be available immediately when arriving at DN since all the DNs will be considered as the last one in pipeline. This means replication may read uncommitted data and replicate it to the remote cluster and cause data inconsistency. HBASE-14004 is used to solve the problem.
12327 2. There will be no sync failure. When the output is broken, we will open a new file and write all the unacked wal entries to the new file. This means that we may have duplicated entries in wal files. HBASE-14949 is used to solve this problem.
12328
12329
12330 ---
12331
12332 * [HBASE-15536](https://issues.apache.org/jira/browse/HBASE-15536) | *Critical* | **Make AsyncFSWAL as our default WAL**
12333
12334 Now the default WALProvider is AsyncFSWALProvider, i.e. 'asyncfs'.
12335 If you want to change back to use FSHLog, please add this in hbase-site.xml
12336 {code}
12337 \<property\>
12338 \<name\>hbase.wal.provider\</name\>
12339 \<value\>filesystem\</value\>
12340 \</property\>
12341 {code}
12342 If you want to use FSHLog with multiwal, please add this in hbase-site.xml
12343 {code}
12344 \<property\>
12345 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
12346 \<value\>filesystem\</value\>
12347 \</property\>
12348 {code}
12349
12350 This patch also sets hbase.wal.async.use-shared-event-loop to false so WAL has its own netty event group.
12351
12352
12353 ---
12354
12355 * [HBASE-19462](https://issues.apache.org/jira/browse/HBASE-19462) | *Major* | **Deprecate all addImmutable methods in Put**
12356
12357 Deprecates Put#addImmutable as of release 2.0.0, this will be removed in HBase 3.0.0. Use {@link #add(Cell)} and {@link org.apache.hadoop.hbase.CellBuilder} instead
12358
12359
12360 ---
12361
12362 * [HBASE-19213](https://issues.apache.org/jira/browse/HBASE-19213) | *Minor* | **Align check and mutate operations in Table and AsyncTable**
12363
12364 In Table interface deprecate checkAndPut, checkAndDelete and checkAndMutate methods.
12365 Similarly to AsyncTable a new method was added to replace the deprecated ones: CheckAndMutateBuilder checkAndMutate(byte[] row, byte[] family) with CheckAndMutateBuilder interface which can be used to construct the checkAnd\*() operations.
12366
12367
12368 ---
12369
12370 * [HBASE-19134](https://issues.apache.org/jira/browse/HBASE-19134) | *Major* | **Make WALKey an Interface; expose Read-Only version to CPs**
12371
12372 Made WALKey an Interface and added a WALKeyImpl implementation. WALKey comes through to Coprocessors. WALKey is read-only.
12373
12374
12375 ---
12376
12377 * [HBASE-18169](https://issues.apache.org/jira/browse/HBASE-18169) | *Blocker* | **Coprocessor fix and cleanup before 2.0.0 release**
12378
12379 Refactor of Coprocessor API for hbase2. Purged methods that exposed too much of our internals. Other hooks were recast so they no longer took or returned internal classes; instead we pass Interfaces or read-only versions of implementations.
12380
12381 Here is some overview doc on changes in hbase2 for Coprocessors including detail on why the change was made:
12382 https://github.com/apache/hbase/blob/branch-2.0/dev-support/design-docs/Coprocessor\_Design\_Improvements-Use\_composition\_instead\_of\_inheritance-HBASE-17732.adoc
12383
12384
12385 ---
12386
12387 * [HBASE-19301](https://issues.apache.org/jira/browse/HBASE-19301) | *Major* | **Provide way for CPs to create short circuited connection with custom configurations**
12388
12389 Provided a way for the CP users to create a short circuitable connection with custom configs.
12390
12391 createConnection(Configuration) is added to MasterCoprocessorEnvironment, RegionServerCoprocessorEnvironment and RegionCoprocessorEnvironment.
12392
12393 The getConnection() method already available in these Env interfaces returns the cluster connection used by the server (which the server also uses) where as this new method will create a new connection on request. The difference from connection created using ConnectionFactory APIs is that this connection can short circuit the calls to same server avoiding the RPC paths. The connection will NOT be cached/maintained by server. That should be done the CPs.
12394
12395 Be careful creating Connections out of a Coprocessor. See the javadoc on these createConnection and getConnection.
12396
12397
12398 ---
12399
12400 * [HBASE-19357](https://issues.apache.org/jira/browse/HBASE-19357) | *Major* | **Bucket cache no longer L2 for LRU cache**
12401
12402 Removed cacheDataInL1 option for HCD
12403 BucketCache is no longer the L2 for LRU on heap cache. When BC is used, data blocks will be strictly on BC only where as index/bloom blocks are on LRU L1 cache.
12404 Config 'hbase.bucketcache.combinedcache.enabled' is removed. There is no way set combined mode = false. Means make BC as victim handler for LRU cache.
12405 This will be one more noticeable change when one uses BucketCache in File mode.  Then the system table's data block(Including the META table)  will be cached in Bucket Cache files only. Plain scan from META files alone test reveal that the throughput of file mode BC is almost half only.  But for META entries we have RegionLocation cache at client side connections. So this would not be a big concern in a real cluster usage. Will check more on this and probably fix even when we do tiered BucketCache.
12406
12407
12408 ---
12409
12410 * [HBASE-19430](https://issues.apache.org/jira/browse/HBASE-19430) | *Major* | **Remove the SettableTimestamp and SettableSequenceId**
12411
12412 All the cells which are used in server side are of ExtendedCell now.
12413
12414
12415 ---
12416
12417 * [HBASE-19295](https://issues.apache.org/jira/browse/HBASE-19295) | *Major* | **The Configuration returned by CPEnv should be read-only.**
12418
12419 CoprocessorEnvironment#getConfiguration returns a READ-ONLY Configuration. Attempts at altering the returned Configuration -- whether setting or adding resources -- will result in an IllegalStateException warning of the Read-only condition of the returned Configuration.
12420
12421
12422 ---
12423
12424 * [HBASE-19410](https://issues.apache.org/jira/browse/HBASE-19410) | *Major* | **Move zookeeper related UTs to hbase-zookeeper and mark them as ZKTests**
12425
12426 There is a new HBaseZKTestingUtility which can only start a mini zookeeper cluster. And we will publish sources for test-jar for all modules.
12427
12428
12429 ---
12430
12431 * [HBASE-19323](https://issues.apache.org/jira/browse/HBASE-19323) | *Major* | **Make netty engine default in hbase2**
12432
12433 NettyRpcServer is now our default RPC server replacing SimpleRpcServer.
12434
12435
12436 ---
12437
12438 * [HBASE-19426](https://issues.apache.org/jira/browse/HBASE-19426) | *Major* | **Move has() and setTimestamp() to Mutation**
12439
12440 Moves #has and #setTimestamp back up to Mutation from the subclass Put so available to other Mutation implementations.
12441
12442
12443 ---
12444
12445 * [HBASE-19384](https://issues.apache.org/jira/browse/HBASE-19384) | *Critical* | **Results returned by preAppend hook in a coprocessor are replaced with null from other coprocessor even on bypass**
12446
12447 When a coprocessor sets 'bypass', we will skip calling subsequent Coprocessors that may be stacked-up on the method invocation; e.g. if a prePut has three coprocessors hooked up, if the first coprocessor decides to set 'bypass', we will not call the two subsequent coprocessors (this is similar to the 'complete' functionality that was in hbase1, removed in hbase2).
12448
12449
12450 ---
12451
12452 * [HBASE-19408](https://issues.apache.org/jira/browse/HBASE-19408) | *Trivial* | **Remove WALActionsListener.Base**
12453
12454 1) remove the WALActionsListener.Base
12455 2) provide default method implementation to WALActionsListener
12456 The person who want to receive the notification of WAL events should implements the WALActionsListener rather than WALActionsListener.Base.
12457
12458
12459 ---
12460
12461 * [HBASE-19339](https://issues.apache.org/jira/browse/HBASE-19339) | *Critical* | **Eager policy results in the negative size of memstore**
12462
12463 Enable TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy
12464
12465
12466 ---
12467
12468 * [HBASE-19336](https://issues.apache.org/jira/browse/HBASE-19336) | *Major* | **Improve rsgroup to allow assign all tables within a specified namespace by only writing namespace**
12469
12470 Add two new shell cmd.
12471 move\_namespaces\_rsgroup is used to reassign tables of specified namespaces from one RegionServer group to another.
12472 move\_servers\_namespaces\_rsgroup is used to reassign regionServers and tables of specified namespaces from one group to another.
12473
12474
12475 ---
12476
12477 * [HBASE-19285](https://issues.apache.org/jira/browse/HBASE-19285) | *Critical* | **Add per-table latency histograms**
12478
12479 Per-RegionServer table latency histograms have been returned to HBase (after being removed due to impacting performance). These metrics are exposed via a new JMX bean "TableLatencies" with the typical naming conventions: namespace, table, and histogram component.
12480
12481
12482 ---
12483
12484 * [HBASE-19359](https://issues.apache.org/jira/browse/HBASE-19359) | *Major* | **Revisit the default config of hbase client retries number**
12485
12486 The default value of hbase.client.retries.number was 35. It is now 10.
12487 And for server side, the default hbase.client.serverside.retries.multiplier was 10. So the server side retries number was 35 \* 10 = 350. It is now 3.
12488
12489
12490 ---
12491
12492 * [HBASE-18090](https://issues.apache.org/jira/browse/HBASE-18090) | *Major* | **Improve TableSnapshotInputFormat to allow more multiple mappers per region**
12493
12494 In this task, we make it possible to run multiple mappers per region in the table snapshot. The following code is primary table snapshot mapper initializatio:
12495
12496 TableMapReduceUtil.initTableSnapshotMapperJob(
12497           snapshotName,                     // The name of the snapshot (of a table) to read from
12498           scan,                                      // Scan instance to control CF and attribute selection
12499           mapper,                                 // mapper
12500           outputKeyClass,                   // mapper output key
12501           outputValueClass,                // mapper output value
12502           job,                                       // The current job to adjust
12503           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12504           restoreDir,                           // a temporary directory to copy the snapshot files into
12505 );
12506
12507 The job only run one map task per region in the table snapshot. With this feature, client can specify the desired num of mappers when init table snapshot mapper job：
12508
12509 TableMapReduceUtil.initTableSnapshotMapperJob(
12510           snapshotName,                     // The name of the snapshot (of a table) to read from
12511           scan,                                      // Scan instance to control CF and attribute selection
12512           mapper,                                 // mapper
12513           outputKeyClass,                   // mapper output key
12514           outputValueClass,                // mapper output value
12515           job,                                       // The current job to adjust
12516           true,                                     // upload HBase jars and jars for any of the configured job classes via the distributed cache (tmpjars)
12517           restoreDir,                           // a temporary directory to copy the snapshot files into
12518           splitAlgorithm,                     // splitAlgo algorithm to split, current split algorithms  support RegionSplitter.UniformSplit() and RegionSplitter.HexStringSplit()
12519           n                                         // how many input splits to generate per one region
12520 );
12521
12522
12523 ---
12524
12525 * [HBASE-19035](https://issues.apache.org/jira/browse/HBASE-19035) | *Major* | **Miss metrics when coprocessor use region scanner to read data**
12526
12527 1. Move read requests count to region level. Because RegionScanner is exposed to CP.
12528 2. Update write requests count in processRowsWithLocks.
12529 3. Remove requestRowActionCount in RSRpcServices. This metric can be computed by region's readRequestsCount and writeRequestsCount.
12530
12531
12532 ---
12533
12534 * [HBASE-19318](https://issues.apache.org/jira/browse/HBASE-19318) | *Critical* | **MasterRpcServices#getSecurityCapabilities explicitly checks for the HBase AccessController implementation**
12535
12536 Fixes an issue with loading customer coprocessor endpoint implementations inside of the HBase Master which breaks Apache Ranger.
12537
12538
12539 ---
12540
12541 * [HBASE-19092](https://issues.apache.org/jira/browse/HBASE-19092) | *Critical* | **Make Tag IA.LimitedPrivate and expose for CPs**
12542
12543 This JIRA aims at exposing Tags for Coprocessor usage.
12544 Tag interface is now exposed to Coprocessors and CPs can make use of this interface to create their own Tags.
12545 RawCell is a new interface that is a subtype of Cell and that is exposed to CPs. RawCell has the following APIs
12546
12547 List\<Tag\> getTags()
12548 Optional\<Tag\> getTag(byte type)
12549 byte[] cloneTags()
12550
12551 The above APIs helps to read tags from the Cell.
12552
12553 CellUtil#createCell(Cell cell, List\<Tag\> tags)
12554 CellUtil#createCell(Cell cell, byte[] tags)
12555 CellUtil#createCell(Cell cell, byte[] value, byte[] tags)
12556 are deprecated.
12557 If CPs want to create a cell with Tags they can use the RegionCoprocessorEnvironment#getCellBuilder() that returns an ExtendedCellBuilder.
12558 Using ExtendedCellBuilder the CP can create Cells with Tags. Other helper methods to work on Tags are available as static APIs in Tag interface.
12559
12560
12561 ---
12562
12563 * [HBASE-19266](https://issues.apache.org/jira/browse/HBASE-19266) | *Minor* | **TestAcidGuarantees should cover adaptive in-memory compaction**
12564
12565 separate the TestAcidGuarantees by the policy:
12566 1) NONE -\> TestAcidGuaranteesWithNoInMemCompaction
12567 2) BASIC -\> TestAcidGuaranteesWithBasicPolicy
12568 3) EAGER -\> TestAcidGuaranteesWithEagerPolicy
12569 4) ADAPTIVE -\> TestAcidGuaranteesWithAdaptivePolicy
12570
12571 TestAcidGuaranteesWithEagerPolicy and TestAcidGuaranteesWithAdaptivePolicy are disabled by default as the eager policy may cause the negative size of memstore.
12572
12573
12574 ---
12575
12576 * [HBASE-16868](https://issues.apache.org/jira/browse/HBASE-16868) | *Critical* | **Add a replicate\_all flag to avoid misuse the namespaces and table-cfs config of replication peer**
12577
12578 Add a replicate\_all flag to replication peer config. The default value is true, which means all user tables (REPLICATION\_SCOPE != 0 ) will be replicated to peer cluster.
12579
12580 How to config a peer from replicate all to only replicate special namespace/tablecfs?
12581 Step1. Add a new peer with no namespace/tablecfs config, the replicate\_all flag will be true automatically.
12582 Step2. User want only replicate some namespaces or tables, so set replicate\_all flag to false first.
12583 Step3. Add special namespaces or table-cfs config to the replication peer.
12584
12585 How to config a peer from replicate special namespace/tablecfs to replicate all?
12586 Step1. Add a new peer with special namespace/tablecfs config, the replicate\_all flag will be false automatically.
12587 Step2. User want replicate all user tables, so remove the special namespace/tablecfs config first.
12588 Step3. Set replicate\_all flag to true.
12589
12590 How to config replicate nothing?
12591 Set replicate\_all flag to false and no namespace/tablecfs config, then all tables cannot be replicated to peer cluster.
12592
12593
12594 ---
12595
12596 * [HBASE-19122](https://issues.apache.org/jira/browse/HBASE-19122) | *Critical* | **preCompact and preFlush can bypass by returning null scanner; shut it down**
12597
12598 Remove the ability to 'bypass' preFlush and preCompact by returning a null Scanner. Bypass is disallowed on these methods in hbase2.
12599
12600
12601 ---
12602
12603 * [HBASE-19200](https://issues.apache.org/jira/browse/HBASE-19200) | *Major* | **make hbase-client only depend on ZKAsyncRegistry and ZNodePaths**
12604
12605 ConnectionImplementation now uses asynchronous connections to zookeeper via ZKAsyncRegistry to get cluster id, master address, meta region location, etc.
12606 Since ZKAsyncRegistry uses curator framework, this change purges a lot of zookeeper dependencies in hbase-client.
12607 Now hbase-client only depends on only ZKAsyncRegistry, ZNodePaths and the newly introduced ZKMetadata.
12608
12609
12610 ---
12611
12612 * [HBASE-19311](https://issues.apache.org/jira/browse/HBASE-19311) | *Major* | **Promote TestAcidGuarantees to LargeTests and start mini cluster once to make it faster**
12613
12614 Introduce a AcidGuaranteesTestTool and expose as tool instead of TestAcidGuarantees. Now TestAcidGuarantees is just a UT.
12615
12616
12617 ---
12618
12619 * [HBASE-19293](https://issues.apache.org/jira/browse/HBASE-19293) | *Major* | **Support adding a new replication peer in disabled state**
12620
12621 Add a boolean parameter which means the new replication peer's state is enabled or disabled for Admin/AsyncAdmin's addReplicationPeer method. Meanwhile, you can use shell cmd to add a enabled/disabled replication peer. The STATE parameter is optional and the default state is enabled.
12622
12623 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "ENABLED"
12624 hbase\> add\_peer '1', CLUSTER\_KEY =\> "server1.cie.com:2181:/hbase", STATE =\> "DISABLED"
12625
12626
12627 ---
12628
12629 * [HBASE-19123](https://issues.apache.org/jira/browse/HBASE-19123) | *Major* | **Purge 'complete' support from Coprocesor Observers**
12630
12631 This issue removes the 'complete' facility that was in ObserverContext. It is no longer possible for a Coprocessor to cut the chain-of-invocation and insist its response prevails.
12632
12633
12634 ---
12635
12636 * [HBASE-18911](https://issues.apache.org/jira/browse/HBASE-18911) | *Major* | **Unify Admin and AsyncAdmin's methods name**
12637
12638 Deprecated 4 methods for Admin interface.
12639 Deprecated compactRegionServer(ServerName, boolean). Use compactRegionServer(ServerName) and majorCompactcompactRegionServer(ServerName) instead.
12640 Deprecated getRegionLoad(ServerName) method. Use getRegionLoads(ServerName) instead.
12641 Deprecated getRegionLoad(ServerName, TableName) method. Use getRegionLoads(ServerName, TableName) instead.
12642 Deprecated getQuotaRetriever(QuotaFilter) instead. Use  getQuota(QuotaFilter) instead.
12643
12644 Add 7 methods for Admin interface.
12645 ServerName getMaster();
12646 Collection\<ServerName\> getBackupMasters();
12647 Collection\<ServerName\> getRegionServers();
12648 boolean splitSwitch(boolean enabled, boolean synchronous);
12649 boolean mergeSwitch(boolean enabled, boolean synchronous);
12650 boolean isSplitEnabled();
12651 boolean isMergeEnabled();
12652
12653
12654 ---
12655
12656 * [HBASE-18703](https://issues.apache.org/jira/browse/HBASE-18703) | *Critical* | **Inconsistent behavior for preBatchMutate in doMiniBatchMutate and processRowsWithLocks**
12657
12658 Two write paths Region.batchMutate() and Region.mutateRows() are unified and inconsistencies are resolved.
12659
12660
12661 ---
12662
12663 * [HBASE-18964](https://issues.apache.org/jira/browse/HBASE-18964) | *Major* | **Deprecate RowProcessor and processRowsWithLocks() APIs that take RowProcessor as an argument**
12664
12665 RowProcessor and Region#processRowsWithLocks() methods that take RowProcessor as an argument are deprecated. Use Coprocessors if you want to customize handling.
12666
12667
12668 ---
12669
12670 * [HBASE-19251](https://issues.apache.org/jira/browse/HBASE-19251) | *Major* | **Merge RawAsyncTable and AsyncTable**
12671
12672 Merge the RawAsyncTable and AsyncTable interfaces. Use generic to reflection the difference between the observer style scan API. For the implementation which does not have a user specified thread pool, the observer is AdvancedScanResultConsumer. For the implementation which needs a user specified thread pool, the observer is ScanResultConsumer.
12673
12674
12675 ---
12676
12677 * [HBASE-19262](https://issues.apache.org/jira/browse/HBASE-19262) | *Major* | **Revisit checkstyle rules**
12678
12679 Change the import order rule that now we should put the shaded import at bottom. Ignore the VisibilityModifier warnings for test code.
12680
12681
12682 ---
12683
12684 * [HBASE-19187](https://issues.apache.org/jira/browse/HBASE-19187) | *Minor* | **Remove option to create on heap bucket cache**
12685
12686 Removing the on heap Bucket cache feature.
12687 The config "hbase.bucketcache.ioengine" no longer support the 'heap' value.
12688 Its supported values now are 'offheap',  'file:\<path\>', 'files:\<path\>'  and 'mmap:\<path\>'
12689
12690
12691 ---
12692
12693 * [HBASE-12350](https://issues.apache.org/jira/browse/HBASE-12350) | *Minor* | **Backport error-prone build support to branch-1 and branch-2**
12694
12695 This change introduces compile time support for running the error-prone suite of static analyses. Enable with -PerrorProne on the Maven command line. Requires JDK 8 or higher. (Don't enable if building with JDK 7.)
12696
12697
12698 ---
12699
12700 * [HBASE-14350](https://issues.apache.org/jira/browse/HBASE-14350) | *Blocker* | **Procedure V2 Phase 2: Assignment Manager**
12701
12702 (Incomplete)
12703
12704 = Incompatbiles
12705
12706 == Coprocessor Incompatibilities
12707
12708 Split/Merge have moved to the Master; it runs them now. Means hooks around Split/Merge are now noops. To intercept Split/Merge phases, CPs need to intercept on MasterObserver.
12709
12710
12711 ---
12712
12713 * [HBASE-19189](https://issues.apache.org/jira/browse/HBASE-19189) | *Major* | **Ad-hoc test job for running a subset of tests lots of times**
12714
12715 <!-- markdown -->
12716
12717
12718 Folks can now test out tests on an arbitrary release branch. Head over to [builds.a.o job "HBase-adhoc-run-tests"](https://builds.apache.org/view/H-L/view/HBase/job/HBase-adhoc-run-tests/), then pick "Build with parameters".
12719 Tests are specified as just names e.g. TestLogRollingNoCluster. can also be a glob. e.g. TestHFile*
12720
12721
12722 ---
12723
12724 * [HBASE-19220](https://issues.apache.org/jira/browse/HBASE-19220) | *Major* | **Async tests time out talking to zk; 'clusterid came back null'**
12725
12726 Changed retries from 3 to 30 for zk initial connect for registry.
12727
12728
12729 ---
12730
12731 * [HBASE-19002](https://issues.apache.org/jira/browse/HBASE-19002) | *Minor* | **Introduce more examples to show how to intercept normal region operations**
12732
12733 With the change in Coprocessor APIs, the hbase-examples module has been updated to provide additional examples that show how to write Coprocessors against the new API.
12734
12735
12736 ---
12737
12738 * [HBASE-18961](https://issues.apache.org/jira/browse/HBASE-18961) | *Major* | **doMiniBatchMutate() is big, split it into smaller methods**
12739
12740 HRegion.batchMutate()/ doMiniBatchMutate() is refactored with aim to unify batchMutate() and mutateRows() code paths later. batchMutate() currently handles 2 types of batches: MutationBatchOperations and ReplayBatchOperations. Common base class BatchOperations is augmented with common methods which are overridden in derived classes as needed. doMiniBatchMutate() is implemented using common methods in base class BatchOperations.
12741
12742
12743 ---
12744
12745 * [HBASE-19103](https://issues.apache.org/jira/browse/HBASE-19103) | *Minor* | **Add BigDecimalComparator for filter**
12746
12747 If BigDecimal is stored as value, and you need to add a matched comparator to the value filter when scanning, a BigDecimalComparator can be used.
12748
12749
12750 ---
12751
12752 * [HBASE-19111](https://issues.apache.org/jira/browse/HBASE-19111) | *Critical* | **Add missing CellUtil#isPut(Cell) methods**
12753
12754 A new public API method was added to CellUtil "isPut(Cell)" for clients to use to determine if the Cell is for a Put operation.
12755
12756 Additionally, other CellUtil API calls which expose Cell-implementation were marked as deprecated and will be removed in a future version.
12757
12758
12759 ---
12760
12761 * [HBASE-19160](https://issues.apache.org/jira/browse/HBASE-19160) | *Critical* | **Re-expose CellComparator**
12762
12763 CellComparator is now InterfaceAudience.Public
12764
12765
12766 ---
12767
12768 * [HBASE-19131](https://issues.apache.org/jira/browse/HBASE-19131) | *Major* | **Add the ClusterStatus hook and cleanup other hooks which can be replaced by ClusterStatus hook**
12769
12770 1) Add preGetClusterStatus() and postGetClusterStatus() hooks
12771 2) add preGetClusterStatus() to access control check - an admin action
12772
12773
12774 ---
12775
12776 * [HBASE-19095](https://issues.apache.org/jira/browse/HBASE-19095) | *Major* | **Add CP hooks in RegionObserver for in memory compaction**
12777
12778 Add 4 methods in RegionObserver:
12779 preMemStoreCompaction
12780 preMemStoreCompactionCompactScannerOpen
12781 preMemStoreCompactionCompact
12782 postMemStoreCompaction
12783 preMemStoreCompaction and postMemStoreCompaction will always be called for all in memory compactions. Under eager mode, preMemStoreCompactionCompactScannerOpen will be called before opening store scanner to allow you changing the max versions and TTL, and preMemStoreCompactionCompact will be called after the creation to let you do wrapping.
12784
12785
12786 ---
12787
12788 * [HBASE-19152](https://issues.apache.org/jira/browse/HBASE-19152) | *Trivial* | **Update refguide 'how to build an RC' and the make\_rc.sh script**
12789
12790 The make\_rc.sh script can run an hbase2 build now generating tarballs and pushing up to maven repository. TODO: Sign and checksum, check tarball, push to apache dist.....
12791
12792
12793 ---
12794
12795 * [HBASE-19179](https://issues.apache.org/jira/browse/HBASE-19179) | *Critical* | **Remove hbase-prefix-tree**
12796
12797 Purged the hbase-prefix-tree module and all references from the code base.
12798
12799 prefix-tree data block encoding was a super cool experimental feature that saw some usage initially but has since languished. If interested in carrying this sweet facility forward, write the dev list and we'll restore this module.
12800
12801
12802 ---
12803
12804 * [HBASE-19176](https://issues.apache.org/jira/browse/HBASE-19176) | *Major* | **Remove hbase-native-client from branch-2**
12805
12806 Removed the hbase-native-client module from branch-2 (it is still in Master). It is not complete. Look for a finished C++ client in the near future. Will restore native client to branch-2 at that point.
12807
12808
12809 ---
12810
12811 * [HBASE-19144](https://issues.apache.org/jira/browse/HBASE-19144) | *Major* | **[RSgroups] Retry assignments in FAILED\_OPEN state when servers (re)join the cluster**
12812
12813 When regionserver placement groups (RSGroups) is active, as servers join the cluster the Master will attempt to reassign regions in FAILED\_OPEN state.
12814
12815
12816 ---
12817
12818 * [HBASE-18770](https://issues.apache.org/jira/browse/HBASE-18770) | *Critical* | **Remove bypass method in ObserverContext and implement the 'bypass' logic case by case**
12819
12820 Removes blanket bypass mechanism (Observer#bypass). Instead, a curated subset of methods are bypassable.
12821
12822     Changes Coprocessor ObserverContext 'bypass' semantic. We flip the
12823     default so bypass is NOT supported on Observer invocations; only a
12824     couple of preXXX methods in RegionObserver allow it: e.g.  preGet
12825     and prePut but not preFlush, etc. Everywhere else, we throw
12826     a Exception if a Coprocessor Observer tries to invoke bypass. Master
12827     Observers can no longer stop or change move, split, assign, create table, etc.
12828     preBatchMutate can no longer be bypassed (bypass the finer-grained
12829     prePut, preDelete, etc. instead)
12830
12831     Ditto on complete, the mechanism that allowed a Coprocessor
12832     rule that all subsequent Coprocessors are skipped in an
12833     invocation chain; now, complete is only available to
12834     bypassable methods (and Coprocessors will get an exception if
12835     they try to 'complete' when it is not allowed).
12836
12837     See javadoc for whether a Coprocessor Observer method supports
12838     'bypass'. If no mention, 'bypass' is NOT supported.
12839
12840 The below methods have been marked deprecated in hbase2. We would have liked to have removed them because they use IA.Private parameters but they are in use by CoreCoprocessors or are critical to downstreamers and we have no alternatives to provide currently.
12841
12842 @Deprecated public boolean prePrepareTimeStampForDeleteVersion(final Mutation mutation, final Cell kv, final byte[] byteNow, final Get get) throws IOException {
12843
12844 @Deprecated public boolean preWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12845
12846 @Deprecated public void postWALRestore(final RegionInfo info, final WALKey logKey, final WALEdit logEdit) throws IOException {
12847
12848 @Deprecated public DeleteTracker postInstantiateDeleteTracker(DeleteTracker result) throws IOException
12849
12850 Metrics are updated now even if the Coprocessor does a bypass; e.g. The put count is updated even if a Coprocessor bypasses the core put operation (We do it this way so no need for Coprocessors to have access to our core metrics system).
12851
12852
12853 ---
12854
12855 * [HBASE-19033](https://issues.apache.org/jira/browse/HBASE-19033) | *Blocker* | **Allow CP users to change versions and TTL before opening StoreScanner**
12856
12857 Add back the three methods without a return value:
12858 preFlushScannerOpen
12859 preCompactScannerOpen
12860 preStoreScannerOpen
12861
12862 Introduce a ScanOptions interface to let CP users change the max versions and TTL of a ScanInfo. It will be passed as a parameter in the three methods above.
12863
12864 Inntroduce a new example WriteHeavyIncrementObserver which convert increment to put and do aggregating when get. It uses the above three methods.
12865
12866
12867 ---
12868
12869 * [HBASE-19110](https://issues.apache.org/jira/browse/HBASE-19110) | *Minor* | **Add default for Server#isStopping & #getFileSystem**
12870
12871 Made defaults for Server#isStopping and Server#getFileSystem. Should have done this when I added them (lesson learned, was actually mentioned in a review).
12872
12873
12874 ---
12875
12876 * [HBASE-19047](https://issues.apache.org/jira/browse/HBASE-19047) | *Critical* | **CP exposed Scanner types should not extend Shipper**
12877
12878 RegionObserver#preScannerOpen signature changed
12879 RegionScanner preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan,  RegionScanner s)   -\>   void preScannerOpen( ObserverContext\<RegionCoprocessorEnvironment\> c, Scan scan)
12880 The pre hook can no longer return a RegionScanner instance.
12881
12882
12883 ---
12884
12885 * [HBASE-18995](https://issues.apache.org/jira/browse/HBASE-18995) | *Critical* | **Move methods that are for internal usage from CellUtil to Private util class**
12886
12887 Split CellUtil into public CellUtil and PrivateCellUtil for Internal use only.
12888
12889
12890 ---
12891
12892 * [HBASE-18906](https://issues.apache.org/jira/browse/HBASE-18906) | *Critical* | **Provide Region#waitForFlushes API**
12893
12894 Provided an API in Region (Exposed to CPs)
12895 boolean waitForFlushes(long timeout)
12896 This call will make the current thread to be waiting for all flushes in this region to be finished.  (Upto the time out time being specified). The boolean return value specify whether the flushes are really over or the time out being elapsed. Return false when timeout elapsed but flushes are not over or  true when flushes are over
12897
12898
12899 ---
12900
12901 * [HBASE-18905](https://issues.apache.org/jira/browse/HBASE-18905) | *Major* | **Allow CPs to request flush on Region and know the completion of the requested flush**
12902
12903 Add a FlushLifeCycleTracker which is similiar to CompactionLifeCycleTracker for tracking flush.
12904 Add a requestFlush method in Region interface to let CP users request flush on a region. The operation is asynchronous, you need to use the FlushLifeCycleTracker to track the flush.
12905 The difference with CompactionLifeCycleTracker is that, flush is per region so we do not use Store as a parameter of the methods. And also, notExecuted means the whole flush has not been executed, and afterExecution means the whole flush has been finished, so we do not have a separated completed method. A flush will be ended either by notExecuted or afterExecution.
12906
12907
12908 ---
12909
12910 * [HBASE-19048](https://issues.apache.org/jira/browse/HBASE-19048) | *Major* | **Cleanup MasterObserver hooks which takes IA private params**
12911
12912 Purged InterfaceAudience.Private parameters from methods in MasterObserver.
12913
12914 preAbortProcedure no longer takes a ProcedureExecutor.
12915
12916 postGetProcedures no longer takes a list of Procedures.
12917
12918 postGetLocks no longer takes a list of locks.
12919
12920 preRequestLock and postRequestLock no longer take lock type.
12921
12922 preLockHeartbeat and postLockHeartbeat no longer takes a lock procedure.
12923
12924 The implication is that that the Coprocessors that depended on these params have had to coarsen so for example, the AccessController can not do access per Procedure or Lock but rather, makes a judgement on the general access (You'll need to be ADMIN to see list of procedures and locks).
12925
12926
12927 ---
12928
12929 * [HBASE-18994](https://issues.apache.org/jira/browse/HBASE-18994) | *Major* | **Decide if META/System tables should use Compacting Memstore or Default Memstore**
12930
12931 Added a new config 'hbase.systemtables.compacting.memstore.type"  for the system tables. By default all the system tables will have 'NONE' as the type and so it will be using the default memstore by default.
12932 {code}
12933  \<property\>
12934     \<name\>hbase.systemtables.compacting.memstore.type\</name\>
12935     \<value\>NONE\</value\>
12936   \</property\>
12937 {code}
12938
12939
12940 ---
12941
12942 * [HBASE-19029](https://issues.apache.org/jira/browse/HBASE-19029) | *Critical* | **Align RPC timout methods in Table and AsyncTableBase**
12943
12944 Deprecate the following methods in Table:
12945 - int getRpcTimeout()
12946 - int getReadRpcTimeout()
12947 - int getWriteRpcTimeout()
12948 - int getOperationTimeout()
12949
12950 Add the following methods to Table:
12951 - long getRpcTimeout(TimeUnit)
12952 - long getReadRpcTimeout(TimeUnit)
12953 - long getWriteRpcTimeout(TimeUnit)
12954 - long getOperationTimeout(TimeUnit)
12955
12956 Add missing deprecation tag for long getRpcTimeout(TimeUnit unit) in AsyncTableBase
12957
12958
12959 ---
12960
12961 * [HBASE-18410](https://issues.apache.org/jira/browse/HBASE-18410) | *Major* | **FilterList  Improvement.**
12962
12963 In this task, we fixed all existing bugs in FilterList, and did the code refactor which ensured interface compatibility .
12964
12965 The primary bug  fixes are :
12966 1. For sub-filter in FilterList with MUST\_PASS\_ONE, if previous filterKeyValue() of sub-filter returns NEXT\_COL, we cannot make sure that the next cell will be the first cell in next column, because FilterList choose the minimal forward step among sub-filters, and it may return a SKIP. so here we add an extra check to ensure that the next cell will match preivous return code for sub-filters.
12967 2. Previous logic about transforming cell of FilterList is incorrect, we should set the previous transform result (rather than the given cell in question) as the initial vaule of transform cell before call filterKeyValue() of FilterList.
12968 3. Handle the ReturnCodes which the previous code did not handle.
12969
12970 About code refactor, we divided the FilterList into two separated sub-classes: FilterListWithOR and FilterListWithAND,  The FilterListWithOR has been optimised to choose the next minimal step to seek cell rather than SKIP cell one by one, and the FilterListWithAND  has been optimised to choose the next maximal key to seek among sub-filters in filter list. All in all, The code in FilterList is clean and easier to follow now.
12971
12972 Note that ReturnCode NEXT\_ROW has been redefined as skipping to next row in current family,   not to next row in all family. it’s more reasonable, because ReturnCode is a concept in store level, not in region level.
12973
12974 Another bug that needs attention is: filterAllRemaining() in FilterList with MUST\_PASS\_ONE  will now return false if the filter list is empty whereas earlier it used to return true for Operator.MUST\_PASS\_ONE.  it's more reasonable now.
12975
12976
12977 ---
12978
12979 * [HBASE-19077](https://issues.apache.org/jira/browse/HBASE-19077) | *Critical* | **Have Region\*CoprocessorEnvironment provide an ImmutableOnlineRegions**
12980
12981 Adds getOnlineRegions to the RegionCoprocessorEnvironment (Context) and ditto to RegionServerCoprocessorEnvironment. Allows Coprocessor get list of Regions online on the currently hosting RegionServer.
12982
12983
12984 ---
12985
12986 * [HBASE-19021](https://issues.apache.org/jira/browse/HBASE-19021) | *Critical* | **Restore a few important missing logics for balancer in 2.0**
12987
12988 Re-enabled 'hbase.master.loadbalance.bytable', default 'false'.
12989 Draining servers are removed from consideration by blancer.balanceCluster() call.
12990
12991
12992 ---
12993
12994 * [HBASE-19049](https://issues.apache.org/jira/browse/HBASE-19049) | *Major* | **Update kerby to 1.0.1 GA release**
12995
12996 HBase now relies on Kerby version 1.0.1 for its test environment. No downstream facing change is expected.
12997
12998
12999 ---
13000
13001 * [HBASE-16290](https://issues.apache.org/jira/browse/HBASE-16290) | *Major* | **Dump summary of callQueue content; can help debugging**
13002
13003 Patch to print summary of call queues by size and count. This is displayed on the debug dump page of region server UI
13004
13005
13006 ---
13007
13008 * [HBASE-18846](https://issues.apache.org/jira/browse/HBASE-18846) | *Major* | **Accommodate the hbase-indexer/lily/SEP consumer deploy-type**
13009
13010 Makes it so hbase-indexer/lily can move off dependence on internal APIs and instead move to public APIs.
13011
13012 Adds being able to disable near-all HRegionServer services. This along with an existing plugin mechanism which allows configuring the RegionServer to host an alternate Connection implementation, makes it so we can put up a cluster of hollowed-out HRegionServers purposed to pose as a Replication Sink for a source HBase Cluster (Users do not need to figure our RPC, our PB encodings, build a distributed service, etc.). In the alternate supplied Connection implementation, hbase-indexer would install its own code to catch the Replication.
13013
13014 Below and attached are sample hbase-server.xml files and alternate Connection implementations. To start up an HRegionServer as a sink, first make sure there is a ZooKeeper ensemble we can talk to. If none, just start one:
13015 {code}
13016 ./bin/hbase-daemon.sh start zookeeper
13017 {code}
13018
13019 To start up a single RegionServer, put in place the below sample hbase-site.xml and a derviative of the below IndexerConnection on the CLASSPATH, and then start the RegionServer:
13020 {code}
13021 ./bin/hbase-daemon.sh  start  org.apache.hadoop.hbase.regionserver.HRegionServer
13022 {code}
13023 Stdout and Stderr will go into files under configured logs directory. Browse to localhost:16030 to find webui (unless disabled).
13024
13025 DETAILS
13026
13027 This patch adds configuration to disable RegionServer internal Services, Managers, Caches, etc., starting up.
13028
13029 By default a RegionServer starts up an Admin and Client Service. To disable either or both, use the below booleans:
13030 {code}
13031 hbase.regionserver.admin.service
13032 hbase.regionserver.client.service
13033 {code}
13034
13035 Both default true.
13036
13037 To make a HRegionServer startup and stay up without expecting to communicate with a master, set the below boolean to false:
13038
13039 {code}
13040 hbase.masterless
13041 {code]
13042 Default is false.
13043
13044 h3. Sample hbase-site.xml that disables internal HRegionServer Services
13045 Below is an example hbase-site.xml that turns off most Services and that then installs an alternate Connection implementation, one that is nulled out in all regards except in being able to return a "Table" that can catch a Replication Stream in its {code}batch(List\<? extends Row\> actions, Object[] results){code} method. i.e. what the hbase-indexer wants. I also add the example alternate Connection implementation below (both of these files are also attached to this issue). Expects there to be an up and running zookeeper ensemble.
13046
13047 {code}
13048 \<configuration\>
13049   \<!-- This file is an example for hbase-indexer. It shuts down
13050        facility in the regionserver and interjects a special
13051        Connection implementation which is how hbase-indexer will
13052        receive the replication stream from source hbase cluster.
13053        See the class referenced in the config.
13054
13055        Most of the config in here is booleans set to off and
13056        setting values to zero so services doon't start. Some of
13057        the flags are new via this patch.
13058 --\>
13059   \<!--Need this for the RegionServer to come up standalone--\>
13060   \<property\>
13061     \<name\>hbase.cluster.distributed\</name\>
13062     \<value\>true\</value\>
13063   \</property\>
13064
13065   \<!--This is what you implement, a Connection that returns a Table that
13066        overrides the batch call. It is at this point you do your indexer inserts.
13067     --\>
13068   \<property\>
13069     \<name\>hbase.client.connection.impl\</name\>
13070     \<value\>org.apache.hadoop.hbase.client.IndexerConnection\</value\>
13071     \<description\>A customs connection implementation just so we can interject our
13072       own Table class, one that has an override for the batch call which receives
13073       the replication stream edits; i.e. it is called by the replication sink
13074       #replicateEntries method.\</description\>
13075   \</property\>
13076
13077   \<!--Set hbase.regionserver.info.port to -1 for no webui--\>
13078
13079   \<!--Below are configs to shut down unused services in hregionserver--\>
13080   \<property\>
13081     \<name\>hbase.regionserver.admin.service\</name\>
13082     \<value\>false\</value\>
13083     \<description\>Do NOT stand up an Admin Service Interface on RPC\</description\>
13084   \</property\>
13085   \<property\>
13086     \<name\>hbase.regionserver.client.service\</name\>
13087     \<value\>false\</value\>
13088     \<description\>Do NOT stand up a client-facing Service on RPC\</description\>
13089   \</property\>
13090   \<property\>
13091     \<name\>hbase.wal.provider\</name\>
13092     \<value\>org.apache.hadoop.hbase.wal.DisabledWALProvider\</value\>
13093     \<description\>Set WAL service to be the null WAL\</description\>
13094   \</property\>
13095   \<property\>
13096     \<name\>hbase.regionserver.workers\</name\>
13097     \<value\>false\</value\>
13098     \<description\>Turn off all background workers, log splitters, executors, etc.\</description\>
13099   \</property\>
13100   \<property\>
13101     \<name\>hfile.block.cache.size\</name\>
13102     \<value\>0.0001\</value\>
13103     \<description\>Turn off block cache completely\</description\>
13104   \</property\>
13105   \<property\>
13106     \<name\>hbase.mob.file.cache.size\</name\>
13107     \<value\>0\</value\>
13108     \<description\>Disable MOB cache.\</description\>
13109   \</property\>
13110   \<property\>
13111     \<name\>hbase.masterless\</name\>
13112     \<value\>true\</value\>
13113     \<description\>Do not expect Master in cluster.\</description\>
13114   \</property\>
13115   \<property\>
13116     \<name\>hbase.regionserver.metahandler.count\</name\>
13117     \<value\>1\</value\>
13118     \<description\>How many priority handlers to run; we probably need none.
13119     Default is 20 which is too much on a server like this.\</description\>
13120   \</property\>
13121   \<property\>
13122     \<name\>hbase.regionserver.replication.handler.count\</name\>
13123     \<value\>1\</value\>
13124     \<description\>How many replication handlers to run; we probably need none.
13125     Default is 3 which is too much on a server like this.\</description\>
13126   \</property\>
13127   \<property\>
13128     \<name\>hbase.regionserver.handler.count\</name\>
13129     \<value\>10\</value\>
13130     \<description\>How many default handlers to run; tie to # of CPUs.
13131     Default is 30 which is too much on a server like this.\</description\>
13132   \</property\>
13133   \<property\>
13134     \<name\>hbase.ipc.server.read.threadpool.size\</name\>
13135     \<value\>3\</value\>
13136     \<description\>How many Listener request reaaders to run; tie to a portion # of CPUs (1/4?).
13137     Default is 10 which is too much on a server like this.\</description\>
13138   \</property\>
13139 \</configuration\>
13140 {code}
13141
13142 h2. Sample Connection Implementation
13143 Has call-out for where an hbase-indexer would insert its capture code.
13144 {code}
13145 package org.apache.hadoop.hbase.client;
13146
13147 import com.google.protobuf.Descriptors;
13148 import com.google.protobuf.Message;
13149 import com.google.protobuf.Service;
13150 import com.google.protobuf.ServiceException;
13151 import org.apache.hadoop.conf.Configuration;
13152 import org.apache.hadoop.hbase.CompareOperator;
13153 import org.apache.hadoop.hbase.HTableDescriptor;
13154 import org.apache.hadoop.hbase.TableName;
13155 import org.apache.hadoop.hbase.client.coprocessor.Batch;
13156 import org.apache.hadoop.hbase.filter.CompareFilter;
13157 import org.apache.hadoop.hbase.ipc.CoprocessorRpcChannel;
13158 import org.apache.hadoop.hbase.security.User;
13159
13160 import java.io.IOException;
13161 import java.util.List;
13162 import java.util.Map;
13163 import java.util.concurrent.ExecutorService;
13164
13165
13166 /\*\*
13167  \* Sample class for hbase-indexer.
13168  \* DO NOT COMMIT TO HBASE CODEBASE!!!
13169  \* Overrides Connection just so we can return a Table that has the
13170  \* method that the replication sink calls, i.e. Table#batch.
13171  \* It is at this point that the hbase-indexer catches the replication
13172  \* stream so it can insert into the lucene index.
13173  \*/
13174 public class IndexerConnection implements Connection {
13175   private final Configuration conf;
13176   private final User user;
13177   private final ExecutorService pool;
13178   private volatile boolean closed = false;
13179
13180   public IndexerConnection(Configuration conf, ExecutorService pool, User user) throws IOException {
13181     this.conf = conf;
13182     this.user = user;
13183     this.pool = pool;
13184   }
13185
13186   @Override
13187   public void abort(String why, Throwable e) {}
13188
13189   @Override
13190   public boolean isAborted() {
13191     return false;
13192   }
13193
13194   @Override
13195   public Configuration getConfiguration() {
13196     return this.conf;
13197   }
13198
13199   @Override
13200   public BufferedMutator getBufferedMutator(TableName tableName) throws IOException {
13201     return null;
13202   }
13203
13204   @Override
13205   public BufferedMutator getBufferedMutator(BufferedMutatorParams params) throws IOException {
13206     return null;
13207   }
13208
13209   @Override
13210   public RegionLocator getRegionLocator(TableName tableName) throws IOException {
13211     return null;
13212   }
13213
13214   @Override
13215   public Admin getAdmin() throws IOException {
13216     return null;
13217   }
13218
13219   @Override
13220   public void close() throws IOException {
13221     if (!this.closed) this.closed = true;
13222   }
13223
13224   @Override
13225   public boolean isClosed() {
13226     return this.closed;
13227   }
13228
13229   @Override
13230   public TableBuilder getTableBuilder(final TableName tn, ExecutorService pool) {
13231     if (isClosed()) {
13232       throw new RuntimeException("IndexerConnection is closed.");
13233     }
13234     final Configuration passedInConfiguration = getConfiguration();
13235     return new TableBuilder() {
13236       @Override
13237       public TableBuilder setOperationTimeout(int timeout) {
13238         return null;
13239       }
13240
13241       @Override
13242       public TableBuilder setRpcTimeout(int timeout) {
13243         return null;
13244       }
13245
13246       @Override
13247       public TableBuilder setReadRpcTimeout(int timeout) {
13248         return null;
13249       }
13250
13251       @Override
13252       public TableBuilder setWriteRpcTimeout(int timeout) {
13253         return null;
13254       }
13255
13256       @Override
13257       public Table build() {
13258         return new Table() {
13259           private final Configuration conf = passedInConfiguration;
13260           private final TableName tableName = tn;
13261
13262           @Override
13263           public TableName getName() {
13264             return this.tableName;
13265           }
13266
13267           @Override
13268           public Configuration getConfiguration() {
13269             return this.conf;
13270           }
13271
13272           @Override
13273           public void batch(List\<? extends Row\> actions, Object[] results)
13274           throws IOException, InterruptedException {
13275             // Implementation goes here.
13276           }
13277
13278           @Override
13279           public HTableDescriptor getTableDescriptor() throws IOException {
13280             return null;
13281           }
13282
13283           @Override
13284           public TableDescriptor getDescriptor() throws IOException {
13285             return null;
13286           }
13287
13288           @Override
13289           public boolean exists(Get get) throws IOException {
13290             return false;
13291           }
13292
13293           @Override
13294           public boolean[] existsAll(List\<Get\> gets) throws IOException {
13295             return new boolean[0];
13296           }
13297
13298           @Override
13299           public \<R\> void batchCallback(List\<? extends Row\> actions, Object[] results, Batch.Callback\<R\> callback) throws IOException, InterruptedException {
13300
13301           }
13302
13303           @Override
13304           public Result get(Get get) throws IOException {
13305             return null;
13306           }
13307
13308           @Override
13309           public Result[] get(List\<Get\> gets) throws IOException {
13310             return new Result[0];
13311           }
13312
13313           @Override
13314           public ResultScanner getScanner(Scan scan) throws IOException {
13315             return null;
13316           }
13317
13318           @Override
13319           public ResultScanner getScanner(byte[] family) throws IOException {
13320             return null;
13321           }
13322
13323           @Override
13324           public ResultScanner getScanner(byte[] family, byte[] qualifier) throws IOException {
13325             return null;
13326           }
13327
13328           @Override
13329           public void put(Put put) throws IOException {
13330
13331           }
13332
13333           @Override
13334           public void put(List\<Put\> puts) throws IOException {
13335
13336           }
13337
13338           @Override
13339           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, byte[] value, Put put) throws IOException {
13340             return false;
13341           }
13342
13343           @Override
13344           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Put put) throws IOException {
13345             return false;
13346           }
13347
13348           @Override
13349           public boolean checkAndPut(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Put put) throws IOException {
13350             return false;
13351           }
13352
13353           @Override
13354           public void delete(Delete delete) throws IOException {
13355
13356           }
13357
13358           @Override
13359           public void delete(List\<Delete\> deletes) throws IOException {
13360
13361           }
13362
13363           @Override
13364           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, byte[] value, Delete delete) throws IOException {
13365             return false;
13366           }
13367
13368           @Override
13369           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, Delete delete) throws IOException {
13370             return false;
13371           }
13372
13373           @Override
13374           public boolean checkAndDelete(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, Delete delete) throws IOException {
13375             return false;
13376           }
13377
13378           @Override
13379           public void mutateRow(RowMutations rm) throws IOException {
13380
13381           }
13382
13383           @Override
13384           public Result append(Append append) throws IOException {
13385             return null;
13386           }
13387
13388           @Override
13389           public Result increment(Increment increment) throws IOException {
13390             return null;
13391           }
13392
13393           @Override
13394           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount) throws IOException {
13395             return 0;
13396           }
13397
13398           @Override
13399           public long incrementColumnValue(byte[] row, byte[] family, byte[] qualifier, long amount, Durability durability) throws IOException {
13400             return 0;
13401           }
13402
13403           @Override
13404           public void close() throws IOException {
13405
13406           }
13407
13408           @Override
13409           public CoprocessorRpcChannel coprocessorService(byte[] row) {
13410             return null;
13411           }
13412
13413           @Override
13414           public \<T extends Service, R\> Map\<byte[], R\> coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable) throws ServiceException, Throwable {
13415             return null;
13416           }
13417
13418           @Override
13419           public \<T extends Service, R\> void coprocessorService(Class\<T\> service, byte[] startKey, byte[] endKey, Batch.Call\<T, R\> callable, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13420
13421           }
13422
13423           @Override
13424           public \<R extends Message\> Map\<byte[], R\> batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype) throws ServiceException, Throwable {
13425             return null;
13426           }
13427
13428           @Override
13429           public \<R extends Message\> void batchCoprocessorService(Descriptors.MethodDescriptor methodDescriptor, Message request, byte[] startKey, byte[] endKey, R responsePrototype, Batch.Callback\<R\> callback) throws ServiceException, Throwable {
13430
13431           }
13432
13433           @Override
13434           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareFilter.CompareOp compareOp, byte[] value, RowMutations mutation) throws IOException {
13435             return false;
13436           }
13437
13438           @Override
13439           public boolean checkAndMutate(byte[] row, byte[] family, byte[] qualifier, CompareOperator op, byte[] value, RowMutations mutation) throws IOException {
13440             return false;
13441           }
13442
13443           @Override
13444           public void setOperationTimeout(int operationTimeout) {
13445
13446           }
13447
13448           @Override
13449           public int getOperationTimeout() {
13450             return 0;
13451           }
13452
13453           @Override
13454           public int getRpcTimeout() {
13455             return 0;
13456           }
13457
13458           @Override
13459           public void setRpcTimeout(int rpcTimeout) {
13460
13461           }
13462
13463           @Override
13464           public int getReadRpcTimeout() {
13465             return 0;
13466           }
13467
13468           @Override
13469           public void setReadRpcTimeout(int readRpcTimeout) {
13470
13471           }
13472
13473           @Override
13474           public int getWriteRpcTimeout() {
13475             return 0;
13476           }
13477
13478           @Override
13479           public void setWriteRpcTimeout(int writeRpcTimeout) {
13480
13481           }
13482         };
13483       }
13484     };
13485   }
13486 }
13487 {code}
13488
13489
13490 ---
13491
13492 * [HBASE-18873](https://issues.apache.org/jira/browse/HBASE-18873) | *Critical* | **Hide protobufs in GlobalQuotaSettings**
13493
13494 GlobalQuotaSettings was introduced to avoid protocol-specific Java classes from leaking into API which is users may leverage. This class has a number of methods which return plain-Java-objects instead of these protocol-specific classes in an effort to better provide stability in the future.
13495
13496
13497 ---
13498
13499 * [HBASE-18893](https://issues.apache.org/jira/browse/HBASE-18893) | *Major* | **Remove Add/Modify/DeleteColumnFamilyProcedure in favor of using ModifyTableProcedure**
13500
13501 The RPC calls for Add/Modify/DeleteColumn have been removed and are now backed by ModifyTable functionality. The corresponding permissions in AccessController have been removed as well.
13502
13503 The shell already bypassed these RPCs and used ModifyTable directly, and thus would not be getting these permission checks, this change brings the rest of the RPC inline with that.
13504
13505 Coprocessor hooks for pre/post Add/Modify/DeleteColumn have likewise been removed. Coprocessors needing to take special actions on schema change should instead process ModifyTable events (which they should have been doing already, but it was easy for developers to miss this nuance).
13506
13507
13508 ---
13509
13510 * [HBASE-16338](https://issues.apache.org/jira/browse/HBASE-16338) | *Major* | **update jackson to 2.y**
13511
13512 HBase has upgraded from Jackson 1 to Jackson 2. JSON output should not have changed and this should not be user facing, but server classpaths should be adjusted accordingly.
13513
13514
13515 ---
13516
13517 * [HBASE-19051](https://issues.apache.org/jira/browse/HBASE-19051) | *Minor* | **Add new split algorithm for num string**
13518
13519 Add new split algorithm DecimalStringSplit，row are decimal-encoded long values in the range "00000000" =\> "99999999" .
13520 create 't1','f', { NUMREGIONS =\> 10 , SPLITALGO =\> 'DecimalStringSplit' }
13521 The split point will be 10000000,20000000,...,90000000
13522
13523
13524 ---
13525
13526 * [HBASE-19067](https://issues.apache.org/jira/browse/HBASE-19067) | *Major* | **Do not expose getHDFSBlockDistribution in StoreFile**
13527
13528 Removed CP exposed StoreFile#getHDFSBlockDistribution
13529
13530
13531 ---
13532
13533 * [HBASE-18989](https://issues.apache.org/jira/browse/HBASE-18989) | *Major* | **Polish the compaction related CP hooks**
13534
13535 Add two new methods in CompactionLifeCycleTracker.
13536 The notExecuted method will be called if the selectCompaction failed or space quota limitation reached.
13537 The completed method will be called after all the requested compactions are finished. The compaction scheduling is pre Store so if you request compaction on a region it may lead to multiple compactions.
13538 Remove the User parameter in Region.requestCompaction methods as it is useless for CP users.
13539 Add a boolean parameter to indicate whether you want to do a major compaction. And so that the triggerMajorCompaction method is removed.
13540 Remove the getCompactionProgress method in Store interface.
13541 Add a UT to confirm that CompactionLifeCycleTracker works correctly, and it also shows how to use CompactionLifeCycleTracker to wait for the completion of a compaction.
13542
13543
13544 ---
13545
13546 * [HBASE-19046](https://issues.apache.org/jira/browse/HBASE-19046) | *Major* | **RegionObserver#postCompactSelection  Avoid passing shaded ImmutableList param**
13547
13548 RegionObserver#postCompactSelection signature is changed.
13549 Arg type org.apache.hadoop.hbase.shaded.com.google.common.collect.ImmutableList is replaced with java.util.List
13550
13551
13552 ---
13553
13554 * [HBASE-19043](https://issues.apache.org/jira/browse/HBASE-19043) | *Major* | **Purge TableWrapper and CoprocessorHConnnection**
13555
13556 Removes getTable from the CoprocessorEnvrionment Interface and from the BaseEnvironment implementation. Also removes TableWrapper and CoprocessorHConnection, two classes that were used by BaseEnvironment to keep a tag on Tables created by Coprocessors that BaseEnvironment might close them out on #shutdown.
13557
13558 Long after these classes and methods were added, in HBase 1.0.0, we moved to a mode where management of Tables was shifted from HBase to the Client; the Client is to manage lifecycle. Table also became a (relatively) lightweight construct so folks are used to getting a Table instance, using it, and then immediately closing it when done.
13559
13560 Coprocessors should do the same in hbase2.0.0.
13561
13562 CoprocessorHConnection short-circuited RPC. This feature has since been integrated into Server Connections; when they create a Connection, they get one that will short-circuit if the request is to a localhost so no need of CoprocessorHConnection any more.
13563
13564 Coprocessors get the Server Connection when they ask for a Connection from their \*CoprocessorEnvironment.
13565
13566
13567 ---
13568
13569 * [HBASE-19014](https://issues.apache.org/jira/browse/HBASE-19014) | *Major* | **surefire fails; When writing xml report stdout/stderr ... No such file or directory**
13570
13571 Running tests with a wildcard selector, i.e.{{-Dtest=org.apache.hadoop.hbase.server.\*}} no longer works.
13572
13573
13574 ---
13575
13576 * [HBASE-10367](https://issues.apache.org/jira/browse/HBASE-10367) | *Major* | **RegionServer graceful stop / decommissioning**
13577
13578 Added three top level Admin APIs to help decommissioning and graceful stop of region servers.
13579
13580   /\*\*
13581    \* Mark region server(s) as decommissioned to prevent additional regions from getting
13582    \* assigned to them. Optionally unload the regions on the servers. If there are multiple servers
13583    \* to be decommissioned, decommissioning them at the same time can prevent wasteful region
13584    \* movements. Region unloading is asynchronous.
13585    \* @param servers The list of servers to decommission.
13586    \* @param offload True to offload the regions from the decommissioned servers
13587    \*/
13588   void decommissionRegionServers(List\<ServerName\> servers, boolean offload) throws IOException;
13589
13590   /\*\*
13591    \* List region servers marked as decommissioned, which can not be assigned regions.
13592    \* @return List of decommissioned region servers.
13593    \*/
13594   List\<ServerName\> listDecommissionedRegionServers() throws IOException;
13595
13596   /\*\*
13597    \* Remove decommission marker from a region server to allow regions assignments.
13598    \* Load regions onto the server if a list of regions is given. Region loading is
13599    \* asynchronous.
13600    \* @param server The server to recommission.
13601    \* @param encodedRegionNames Regions to load onto the server.
13602    \*/
13603   void recommissionRegionServer(ServerName server, List\<byte[]\> encodedRegionNames)  throws IOException;
13604
13605
13606 ---
13607
13608 * [HBASE-19042](https://issues.apache.org/jira/browse/HBASE-19042) | *Blocker* | **Oracle Java 8u144 downloader broken in precommit check**
13609
13610 Precommit switched from Oracle JDK 8 to OpenJDK-8.
13611
13612
13613 ---
13614
13615 * [HBASE-18945](https://issues.apache.org/jira/browse/HBASE-18945) | *Major* | **Make a IA.LimitedPrivate interface for CellComparator**
13616
13617 CellCompartor has been added as an interface with IA.LimitedPrivate. It has the following methods
13618 #int compare(Cell leftCell, Cell rightCell);
13619 #int compareRows(Cell leftCell, Cell rightCell)
13620 #int compareRows(Cell cell, byte[] bytes, int offset, int length)
13621 #int compareWithoutRow(Cell leftCell, Cell rightCell)
13622 #int compareFamilies(Cell leftCell, Cell rightCell
13623 #int compareQualifiers(Cell leftCell, Cell rightCell)
13624 #int compareTimestamps(Cell leftCell, Cell rightCell)
13625 #int compareTimestamps(long leftCellts, long rightCellts)
13626
13627 This is exposed to CPs and CPs can make use of the above methods to do comparisons on the cells.
13628 For internal usage we have CellComparatorImpl and it has static references to COMPARATOR and META\_CELL\_COMPARATOR.
13629 So when a region or store is initialized we should use one of the above comparator. For META table we need the META\_CELL\_COMPARATOR and all other table's  regions/stores will use the COMPARTOR.
13630 While writing the comparator name in FixedFileTrailer of the Hfile we have now ensured that this rename of CellComparator.COMPARATOR/CellComparator.META\_CELL\_COMPARATOR to CellComparatorImpl.COMPARATOR/CellComparatorImpl.META\_CELL\_COMPARATOR is handled.
13631
13632 CellUtils is an util method that provides lot of APIs that helps to do compare, matching functionalities between two cells, or with a cell and a corrpesponding byte[] etc. Some of the APIs are internally used which will be cleaned up in a follow on JIRA HBASE-18995.
13633
13634
13635 ---
13636
13637 * [HBASE-19001](https://issues.apache.org/jira/browse/HBASE-19001) | *Major* | **Remove the hooks in RegionObserver which are designed to construct a StoreScanner which is marked as IA.Private**
13638
13639 These methods are removed:
13640 KeyValueScanner preStoreScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13641       Store store, Scan scan, NavigableSet\<byte[]\> targetCols, KeyValueScanner s, long readPt)
13642       throws IOException;
13643 InternalScanner preFlushScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13644       Store store, List\<KeyValueScanner\> scanners, InternalScanner s, long readPoint)
13645       throws IOException;
13646 InternalScanner preCompactScannerOpen(ObserverContext\<RegionCoprocessorEnvironment\> c,
13647       Store store, List\<? extends KeyValueScanner\> scanners, ScanType scanType, long earliestPutTs,
13648       InternalScanner s, CompactionLifeCycleTracker tracker, CompactionRequest request,
13649       long readPoint) throws IOException;
13650
13651 For flush and compaction, CP users are expected to wrap the InternalScanner in preFlush/preCompact. And for normal region operation, just use preGetOp/preScannerOpen to modify the Get/Scan object.
13652
13653 This method in Region interface is also removed as we do not need to use read point in CP hooks anymore:
13654 long getReadPoint(IsolationLevel isolationLevel);
13655
13656
13657 ---
13658
13659 * [HBASE-18350](https://issues.apache.org/jira/browse/HBASE-18350) | *Blocker* | **RSGroups are broken under AMv2**
13660
13661 Moves RSGroup on to AMv2. Reenables disabled RSGroups tests.
13662
13663
13664 ---
13665
13666 * [HBASE-18960](https://issues.apache.org/jira/browse/HBASE-18960) | *Major* | **A few bug fixes and minor improvements around batchMutate()**
13667
13668 All operations for which further processing is skipped by preBatchMutate coprocessor hook are treated as SUCCESS instead of FAILED.
13669
13670
13671 ---
13672
13673 * [HBASE-14247](https://issues.apache.org/jira/browse/HBASE-14247) | *Critical* | **Separate the old WALs into different regionserver directories**
13674
13675 Add a new config hbase.separate.oldlogdir.by.regionserver. The default value is false. If this config is true, the old wal dir will be separated by regionservers. This will change the oldWALs layout. The oldWALs is used by replication. So if a cluster didn't use replication, it can be rolling upgrade (upgrade this config from false to true) directly. If a cluster use replication, the oldWALs will be not found when layout changed. So the cluster need rolling upgrade twice. Firstly, only rolling cluster to use new version code. Secondly rolling the config from false to true. Because the cluster already rolling to new version code, so it can find the oldWALs in the new dir layout.
13676
13677
13678 ---
13679
13680 * [HBASE-18954](https://issues.apache.org/jira/browse/HBASE-18954) | *Major* | **Make \*CoprocessorHost classes private**
13681
13682 - Make CoprocessorHost and its implementations InterfaceAudience.Private
13683 - Configurations from "CoprocessorHost" have been moved to new "CoprocessorConfigurations" class.
13684
13685
13686 ---
13687
13688 * [HBASE-15410](https://issues.apache.org/jira/browse/HBASE-15410) | *Major* | **Utilize the max seek value when all Filters in MUST\_PASS\_ALL FilterList return SEEK\_NEXT\_USING\_HINT**
13689
13690 This optimization, targeting SEEK\_NEXT\_USING\_HINT return values, utilizes the max seek value and is transparent to Filters.
13691
13692
13693 ---
13694
13695 * [HBASE-18747](https://issues.apache.org/jira/browse/HBASE-18747) | *Critical* | **Introduce new example and helper classes to tell CP users how to do filtering on scanners**
13696
13697 Modify ZooKeeperScanPolicyObserver in hbase-examples to show how to do filtering in the CP hooks of flush and compaction in hbase-2.0.
13698
13699
13700 ---
13701
13702 * [HBASE-18108](https://issues.apache.org/jira/browse/HBASE-18108) | *Blocker* | **Procedure WALs are archived but not cleaned; fix**
13703
13704 The archived Procedure WALs are moved to \<hbase\_root\>/oldWALs/masterProcedureWALs
13705 directory. TimeToLiveProcedureWALCleaner class was added which regularly cleans the Procedure WAL files from there.
13706
13707 The TimeToLiveProcedureWALCleaner is added to hbase.master.logcleaner.plugins configuration value.
13708
13709 A new config parameter is added: hbase.master.procedurewalcleaner.ttl, which specifies how long a Procedure WAL should stay in the archive directory.
13710
13711
13712 ---
13713
13714 * [HBASE-18183](https://issues.apache.org/jira/browse/HBASE-18183) | *Major* | **Region interface cleanup for CP expose**
13715
13716 Below methods are removed from CP exposed Region interface
13717 getOpenSeqNum
13718 getOldestSeqIdOfStore
13719 isLoadingCfsOnDemandDefault
13720 getReadpoint
13721 updateReadRequestsCount
13722 updateWriteRequestsCount
13723 getRegionServicesForStores
13724 getMetrics
13725 getHDFSBlocksDistribution
13726 releaseRowLocks
13727 batchReplay
13728 get(Get get, boolean withCoprocessor, long nonceGroup, long nonce)
13729 bulkLoadHFiles
13730 execService
13731 registerService
13732 checkFamilies
13733 checkTimestamps
13734 prepareDelete
13735 prepareDeleteTimestamps
13736 updateCellTimestamps
13737 flush
13738 compact
13739 waitForFlushesAndCompactions
13740 waitForFlushes
13741
13742 Change signature of below methods by dropping params 'nonceGroup', 'nonce'
13743 append(Append append, long nonceGroup, long nonce)
13744 batchMutate(Mutation[] mutations, long nonceGroup, long nonce)
13745 increment(Increment increment, long nonceGroup, long nonce)
13746
13747
13748 ---
13749
13750 * [HBASE-18949](https://issues.apache.org/jira/browse/HBASE-18949) | *Major* | **Remove the CompactionRequest parameter in preCompactSelection**
13751
13752 Remove the CompactionRequest parameter in preCompactSelection as we do not have a CompactionRequest at that time.
13753
13754
13755 ---
13756
13757 * [HBASE-18909](https://issues.apache.org/jira/browse/HBASE-18909) | *Major* | **Deprecate Admin's methods which used String regex**
13758
13759 Pushed to master and branch-2. Thanks all for reviewing.
13760
13761
13762 ---
13763
13764 * [HBASE-18931](https://issues.apache.org/jira/browse/HBASE-18931) | *Major* | **Make ObserverContext an interface and remove private/testing methods**
13765
13766 Changes ObserverContext from a class to an interface and hides away constructor, testing functions and other internal-only functions in the implementation class.
13767
13768
13769 ---
13770
13771 * [HBASE-18878](https://issues.apache.org/jira/browse/HBASE-18878) | *Major* | **Use Optional\<T\> return types when T can be null**
13772
13773 **WARNING: No release note provided for this change.**
13774
13775
13776 ---
13777
13778 * [HBASE-18649](https://issues.apache.org/jira/browse/HBASE-18649) | *Major* | **Deprecate KV Usage in MR to move to Cells in 3.0**
13779
13780 All the mappers and reducers output type will be now of MapReduceCell type. No more KeyValue type. How ever in branch-2 for compatibility we have allowed the older interfaces/classes that work with KeyValue to stay in the code base but they have been marked as deprecated.
13781 The following interfaces/classes have been deprecated in branch-2
13782 Import#KeyValueWritableComparablePartitioner
13783 Import#KeyValueWritableComparator
13784 Import#KeyValueWritableComparable
13785 Import#KeyValueReducer
13786 Import#KeyValueSortImporter
13787 Import#KeyValueImporter
13788 KeyValueSortReducer
13789 KeyValueSerialization
13790 WALPlayer#WALKeyValueMapper
13791
13792 So any existing MR jobs that is using the above public interfaces/classes will continue to work in branch-2 and the expected output value type of those mappers and reducers can continue to be KeyValue type.
13793
13794 In branch-3 the mappers and reducers output will only expect MapReduceCell as the type and will no longer work with KeyValue type.
13795 The new public classes/interfaces added for branch-3 and in branch-2 are
13796 CellSerialization
13797 CellSortReducer
13798 Import#CellWritableComparablePartitioner
13799 Import#CellWritableComparable
13800 Import#CellWritableComparator
13801 Import#CellReducer
13802 Import#CellSortImporter
13803 Import#CellImporter
13804 WALPlayer#WALCellMapper
13805
13806
13807 ---
13808
13809 * [HBASE-18897](https://issues.apache.org/jira/browse/HBASE-18897) | *Major* | **Substitute MemStore for Memstore**
13810
13811 The changes of IA.Public/IA.LimitedPrivate classes are shown below:
13812 HTableDescriptor class
13813 \* boolean hasRegionMemstoreReplication()
13814 + boolean hasRegionMemStoreReplication()
13815 \* HTableDescriptor setRegionMemstoreReplication(boolean)
13816 + HTableDescriptor setRegionMemStoreReplication(boolean)
13817
13818 RegionLoadStats class
13819 \* int getMemstoreLoad()
13820 + int getMemStoreLoad()
13821
13822 ServerLoad class
13823 \* int getMemstoreSizeInMB()
13824 + int getMemStoreSizeMB()
13825
13826 Region class
13827 - long getMemstoreSize()
13828 + long getMemStoreSize()
13829
13830 Store class
13831 - MemstoreSize getMemStoreSize()
13832 + MemStoreSize getMemStoreSize()
13833 - MemstoreSize getFlushableSize()
13834 + MemStoreSize getFlushableSize()
13835 - MemstoreSize getSnapshotSize()
13836 + MemStoreSize getSnapshotSize()
13837
13838 StoreFile class
13839 - long getMaxMemstoreTS()
13840 + long getMaxMemStoreTS()
13841
13842
13843 ---
13844
13845 * [HBASE-18010](https://issues.apache.org/jira/browse/HBASE-18010) | *Major* | **Connect CellChunkMap to be used for flattening in CompactingMemStore**
13846
13847 The CellChunkMap is very dense index for Memstore ImmutableSegment and the only one that can be taken off-heap. However, CellChunkMap works on-heap as well. The coding of the entire flow of working with CellChunkMap is not yet finished, thus CellChunkMap is disabled for usage so far. The continuation is done under HBASE-18232.
13848
13849
13850 ---
13851
13852 * [HBASE-18883](https://issues.apache.org/jira/browse/HBASE-18883) | *Major* | **Upgrade to Curator 4.0**
13853
13854 Curator version has been updated from 2.x to 4.0 (running in ZK 3.4 compatibility mode).
13855
13856 Users who experience classpath issues due to version conflicts are recommended to use either the hbase-shaded-client or hbase-shaded-mapreduce artifacts.
13857
13858
13859 ---
13860
13861 * [HBASE-13844](https://issues.apache.org/jira/browse/HBASE-13844) | *Minor* | **Move static helper methods from KeyValue into CellUtils**
13862
13863 Move KeyValue.parseColumn() to CellUtil
13864
13865
13866 ---
13867
13868 * [HBASE-18839](https://issues.apache.org/jira/browse/HBASE-18839) | *Major* | **Apply RegionInfo to code base**
13869
13870 The incompatible changes of IA.Public/LimitedPrivate classes are shown below.
13871 + new method
13872 - removed method
13873 \* deprecated method
13874 -------------------------------------
13875 HRegionLocation class
13876 + RegionInfo getRegion()
13877 \* HRegionInfo getRegionInfo()
13878
13879 AsyncAdmin class
13880 + CompletableFuture\<List\<RegionInfo\>\> getOnlineRegions(ServerName serverName);
13881 - CompletableFuture\<List\<HRegionInfo\>\> getOnlineRegions(ServerName serverName);
13882 + CompletableFuture\<List\<RegionInfo\>\> getTableRegions(TableName tableName);
13883 - CompletableFuture\<List\<HRegionInfo\>\> getTableRegions(TableName tableName);
13884
13885 HBaseTestingUtility class
13886 - Table createTable(HTableDescriptor htd, byte[][] families, Configuration c)
13887 - Table createTable(HTableDescriptor htd, byte[][] families, byte[][] splitKeys, Configuration c)
13888 - Table createTable(HTableDescriptor htd, byte[][] splitRows)
13889 - void modifyTableSync(Admin admin, HTableDescriptor desc)
13890 - HRegion createLocalHRegion(HTableDescriptor desc, byte [] startKey, byte [] endKey)
13891 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc)
13892 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc)
13893 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc)
13894 - HRegion createLocalHRegion(HRegionInfo info, HTableDescriptor desc, WAL wal)
13895 - HRegion createLocalHRegion(HRegionInfo info, TableDescriptor desc, WAL wal)
13896 + HRegion createLocalHRegion(RegionInfo info, TableDescriptor desc, WAL wal)
13897 - List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13898 + List\<HRegionInfo\> createMultiRegionsInMeta(final Configuration conf,final TableDescriptor htd, byte [][] startKeys)
13899 - WAL createWal(final Configuration conf, final Path rootDir, final HRegionInfo hri)
13900 + WAL createWal(final Configuration conf, final Path rootDir, final RegionInfo hri)
13901 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir,final Configuration conf, final HTableDescriptor htd)
13902 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13903 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final TableDescriptor htd)
13904 - HRegion createRegionAndWAL(final HRegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13905 + HRegion createRegionAndWAL(final RegionInfo info, final Path rootDir, final Configuration conf, final HTableDescriptor htd, boolean initialize)
13906 - boolean assignRegion(final HRegionInfo regionInfo)
13907 + boolean assignRegion(final RegionInfo regionInfo)
13908 - void moveRegionAndWait(HRegionInfo destRegion, ServerName destServer)
13909 + void moveRegionAndWait(RegionInfo destRegion, ServerName destServer)
13910 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd)
13911 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor hcd, int numRegionsPerServer)
13912 - int createPreSplitLoadTestTable(Configuration conf, HTableDescriptor desc, HColumnDescriptor[] hcds, int numRegionsPerServer)
13913 - HRegion createTestRegion(String tableName, HColumnDescriptor cd)
13914
13915 WALEdit class
13916 - WALEdit createFlushWALEdit(HRegionInfo hri, FlushDescriptor f)
13917 + WALEdit createFlushWALEdit(RegionInfo hri, FlushDescriptor f)
13918 - WALEdit createRegionEventWALEdit(HRegionInfo hri,RegionEventDescriptor regionEventDesc)
13919 + WALEdit createRegionEventWALEdit(RegionInfo hri,RegionEventDescriptor regionEventDesc)
13920 - WALEdit createCompaction(final HRegionInfo hri, final CompactionDescriptor c)
13921 + WALEdit createCompaction(final RegionInfo hri, final CompactionDescriptor c)
13922 - byte[] getRowForRegion(HRegionInfo hri)
13923 + byte[] getRowForRegion(RegionInfo hri)
13924 - WALEdit createBulkLoadEvent(HRegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13925 + - WALEdit createBulkLoadEvent(RegionInfo hri, WALProtos.BulkLoadDescriptor bulkLoadDescriptor)
13926
13927 RegionScanner class
13928 - HRegionInfo getRegionInfo();
13929 + RegionInfo getRegionInfo();
13930
13931 RegionPlan class
13932 - RegionPlan(final HRegionInfo hri, ServerName source, ServerName dest)
13933 + RegionPlan(final RegionInfo hri, ServerName source, ServerName dest)
13934
13935 Region class
13936 - HRegionInfo getRegionInfo();
13937 + RegionInfo getRegionInfo();
13938
13939 TableSnapshotInputFormat.TableSnapshotRegionSplit class
13940 \* HRegionInfo getRegionInfo()
13941 + RegionInfo getRegion()
13942
13943 RawAsyncTable.CoprocessorCallback class
13944 - void onRegionComplete(HRegionInfo region, R resp)
13945 + void onRegionComplete(RegionInfo region, R resp)
13946 - void onRegionError(RegionInfo region, Throwable error);
13947 + void onRegionError(HRegionInfo region, Throwable error);
13948
13949
13950 ---
13951
13952 * [HBASE-18826](https://issues.apache.org/jira/browse/HBASE-18826) | *Major* | **Use HStore instead of Store in our own code base and remove unnecessary methods in Store interface**
13953
13954 **WARNING: No release note provided for this change.**
13955
13956
13957 ---
13958
13959 * [HBASE-17732](https://issues.apache.org/jira/browse/HBASE-17732) | *Critical* | **Coprocessor Design Improvements**
13960
13961 We are moving from Inheritence
13962 - Observer \*is\* Coprocessor
13963 - FooService \*is\* CoprocessorService
13964 To Composition
13965 - Coprocessor \*has\* Observer
13966 - Coprocessor \*has\* Service
13967 ------------------------------------------------------
13968 Summary
13969 ------------------------------------------------------
13970 - Adds four new interfaces - MasterCoprocessor, RegionCoprocessor, RegionServierCoprocessor,
13971   WALCoprocessor
13972 - These new \*Coprocessor interfaces have a get\*Observer() function for each observer type
13973   supported by them.
13974 - Added Coprocessor#getService() to base interface. All extending \*Coprocessor interfaces will
13975   get it from the base interface.
13976 - Added BulkLoadObserver hooks to RegionCoprocessorHost instad of SecureBulkLoadManager doing its
13977   own trickery.
13978 - CoprocessorHost#find\*() fuctions: Too many testing hooks digging into CP internals.
13979   Deleted if can, else marked @VisibleForTesting.
13980 ------------------------------------------------------
13981 Backward Compatibility
13982 ------------------------------------------------------
13983 - Old coprocessors implementing \*Observer won't get loaded (no backward compatibility guarantees).
13984 - Third party coprocessors only implementing Coprocessor will not get loaded (just like Observers).
13985 - Old coprocessors implementing CoprocessorService (for master/region host)
13986   /SingletonCoprocessorService (for RegionServer host) will continue to work with 2.0.
13987 - Added test to ensure backward compatibility of CoprocessorService/SingletonCoprocessorService
13988 - Note that if a coprocessor implements both observer and service in same class, its service
13989   component will continue to work but it's observer component won't work.
13990
13991
13992 ---
13993
13994 * [HBASE-18298](https://issues.apache.org/jira/browse/HBASE-18298) | *Critical* | **RegionServerServices Interface cleanup for CP expose**
13995
13996 We used to pass the RegionServerServices (RSS) which gave Coprocesosrs (CP) all sort of access to internal Server machinery. We now only allows the CP a subset of the RSS in the form of the CPRSS Interface. Particulars:
13997
13998 Removed method getRegionServerServices from CP exposed RegionCoprocessorEnvironment and RegionServerCoprocessorEnvironment and replaced with getCoprocessorRegionServerServices. This returns a new interface CoprocessorRegionServerServices which is only a subset of RegionServerServices. With that below methods are no longer exposed for CPs
13999 WAL getWAL(HRegionInfo regionInfo)
14000 List\<WAL\> getWALs()
14001 FlushRequester getFlushRequester()
14002 RegionServerAccounting getRegionServerAccounting()
14003 RegionServerRpcQuotaManager getRegionServerRpcQuotaManager()
14004 SecureBulkLoadManager getSecureBulkLoadManager()
14005 RegionServerSpaceQuotaManager getRegionServerSpaceQuotaManager()
14006 void postOpenDeployTasks(final PostOpenDeployContext context)
14007 void postOpenDeployTasks(final Region r)
14008 boolean reportRegionStateTransition(final RegionStateTransitionContext context)
14009 boolean reportRegionStateTransition(TransitionCode code, long openSeqNum, HRegionInfo... hris)
14010 boolean reportRegionStateTransition(TransitionCode code, HRegionInfo... hris)
14011 RpcServerInterface getRpcServer()
14012 ConcurrentMap\<byte[], Boolean\> getRegionsInTransitionInRS()
14013 Leases getLeases()
14014 ExecutorService getExecutorService()
14015 Map\<String, Region\> getRecoveringRegions()
14016 public ServerNonceManager getNonceManager()
14017 boolean registerService(Service service)
14018 HeapMemoryManager getHeapMemoryManager()
14019 double getCompactionPressure()
14020 ThroughputController getFlushThroughputController()
14021 double getFlushPressure()
14022 MetricsRegionServer getMetrics()
14023 EntityLock regionLock(List\<HRegionInfo\> regionInfos, String description, Abortable abort)
14024 void unassign(byte[] regionName)
14025 Configuration getConfiguration()
14026 ZooKeeperWatcher getZooKeeper()
14027 ClusterConnection getClusterConnection()
14028 MetaTableLocator getMetaTableLocator()
14029 CoordinatedStateManager getCoordinatedStateManager()
14030 ChoreService getChoreService()
14031 void stop(String why)
14032 void abort(String why, Throwable e)
14033 boolean isAborted()
14034 void updateRegionFavoredNodesMapping(String encodedRegionName, List\<ServerName\> favoredNodes)
14035 InetSocketAddress[] getFavoredNodesForRegion(String encodedRegionName)
14036 void addToOnlineRegions(Region region)
14037 boolean removeFromOnlineRegions(final Region r, ServerName destination)
14038
14039 Also 3 methods name have been changed
14040 List\<Region\> getOnlineRegions(TableName tableName) -\> List\<Region\> getRegions(TableName tableName)
14041 List\<Region\> getOnlineRegions() -\> List\<Region\> getRegions()
14042 Region getFromOnlineRegions(final String encodedRegionName) -\> Region getRegion(final String encodedRegionName)
14043
14044
14045 ---
14046
14047 * [HBASE-16769](https://issues.apache.org/jira/browse/HBASE-16769) | *Blocker* | **Deprecate/remove PB references from MasterObserver and RegionServerObserver**
14048
14049 Signature of below methods in MasterObserver changed and instead of org.apache.hadoop.hbase.shaded.protobuf.generated.SnapshotDescription param, we will be passing org.apache.hadoop.hbase.client.SnapshotDescription
14050 preListSnapshot
14051 postListSnapshot
14052 preSnapshot
14053 postSnapshot
14054 preCloneSnapshot
14055 postCloneSnapshot
14056 preRestoreSnapshot
14057 postRestoreSnapshot
14058 preDeleteSnapshot
14059 postDeleteSnapshot
14060
14061 Also changed signature of RegionServerObserver#preReplicateLogEntries and preReplicateLogEntries by removing params List\<org.apache.hadoop.hbase.shaded.protobuf.generated.AdminProtos.WALEntry\>, org.apache.hadoop.hbase.CellScanner
14062
14063
14064 ---
14065
14066 * [HBASE-18859](https://issues.apache.org/jira/browse/HBASE-18859) | *Major* | **Purge PB from BulkLoadObserver**
14067
14068 No longer pass the protobuf request to prePrepareBulkLoad and preCleanupBulkLoad in BulkLoadObserver as part of our effort to purge protobuf from our Coprocessor API Interface (if you need to read the Table and RegionInfo, pull it from the passed in RegionCoprocessorEnvironment ObserverContext).
14069
14070
14071 ---
14072
14073 * [HBASE-18731](https://issues.apache.org/jira/browse/HBASE-18731) | *Major* | **[compat 1-2] Mark protected methods of QuotaSettings that touch Protobuf internals as IA.Private**
14074
14075 The following methods in QuotaSettings were annotated InterfaceAudience.Private; they are for internal use only in hbase-2.0.0
14076
14077 buildSetQuotaRequestProto(final QuotaSettings settings)
14078 setupSetQuotaRequest(SetQuotaRequest.Builder builder)
14079
14080 Note that there were versions of these methods in HBase 1.y that used classes in the {{org.apache.hadoop.hbase.protobuf.generated}} package. That package no longer exists as a part of our cleanup of protobufs from our public facing API and the related methods have been removed.
14081
14082
14083 ---
14084
14085 * [HBASE-18825](https://issues.apache.org/jira/browse/HBASE-18825) | *Major* | **Use HStoreFile instead of StoreFile in our own code base and remove unnecessary methods in StoreFile interface**
14086
14087 Cleanup the StoreFile interface.
14088
14089 The metadata keys are moved to HStoreFile.
14090
14091 These methods are removed:
14092 CacheConfig getCacheConf();
14093 byte[] getMetadataValue(byte[] key);
14094 boolean isCompactedAway();
14095 boolean isReferencedInReads();
14096 void initReader() throws IOException;
14097 StoreFileScanner getPreadScanner(boolean cacheBlocks, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn);
14098 StoreFileScanner getStreamScanner(boolean canUseDropBehind, boolean cacheBlocks, boolean isCompaction, long readPt, long scannerOrder, boolean canOptimizeForNonNullColumn) throws IOException;
14099 StoreFileReader getReader();
14100 void closeReader(boolean evictOnClose) throws IOException;
14101 void markCompactedAway();
14102 void deleteReader() throws IOException;
14103
14104 Notice that these methods are still available in HStoreFile.
14105
14106 And the return value of getFirstKey and getLastKey are changed from Cell to Optional\<Cell\> to better indicate that they may not be available.
14107
14108
14109 ---
14110
14111 * [HBASE-18786](https://issues.apache.org/jira/browse/HBASE-18786) | *Major* | **FileNotFoundException should not be silently handled for primary region replicas**
14112
14113 FileNotFoundException opening a StoreFile in a primary replica now causes a RegionServer to crash out where before it would be ignored (or optionally handled via close/reopen).
14114
14115
14116 ---
14117
14118 * [HBASE-10504](https://issues.apache.org/jira/browse/HBASE-10504) | *Blocker* | **Define Replication Interface**
14119
14120 Adds a new plugin point ReplicationEndpoint. ReplicationSource, internal to hbase, tails the WAL and calls registered ReplicationEndpoints. ReplicationEndpoint implementations are responsible for actually shipping the edits to the other (hbase or non-hbase) cluster. ReplicationEndpoint can be defined per peer. Default inter-cluster replication works without any changes (lily etc should still work). ReplicationEndpoints have various facility including means for filtering out WAL edits source-side before they can be shipped to remote peers.
14121
14122
14123 ---
14124
14125 * [HBASE-18142](https://issues.apache.org/jira/browse/HBASE-18142) | *Major* | **Deletion of a cell deletes the previous versions too**
14126
14127 Now, delete.rb won't delete all versions of the specified column. It only delete the specified version (if user assigns a timestamp) or the latest version (default behavior)
14128
14129
14130 ---
14131
14132 * [HBASE-18446](https://issues.apache.org/jira/browse/HBASE-18446) | *Critical* | **Mark StoreFileScanner/StoreFileReader as IA.LimitedPrivate(Phoenix)**
14133
14134 Mark StoreFileScanner and StoreFileReader as IA.LimitPrivate(Phoenix).
14135 Deprecated the preStoreFileReaderOpen and postStoreFileReaderOpen method in RegionObserver to indicate that these methods are only supposed to be used by Phoenix.
14136
14137
14138 ---
14139
14140 * [HBASE-18798](https://issues.apache.org/jira/browse/HBASE-18798) | *Major* | **Remove the unused methods in RegionServerObserver**
14141
14142 Remove the following APIs from RegionServerObserver:
14143 # preRollBackMerge
14144 # postRollBackMerge
14145 # preMergeCommit
14146 # postMergeCommit
14147 # postMerge
14148 # preMerge
14149
14150
14151 ---
14152
14153 * [HBASE-18831](https://issues.apache.org/jira/browse/HBASE-18831) | *Major* | **Add explicit dependency on javax.el**
14154
14155 Specify an explicit version for javax.el. Without it we rely on repository cached metadata of which a prevalent version seems to list all versions between b01 and b08 but finishes with a b08-jbossorg which is in the jboss repo, a repo most of us do not list in our poms.
14156
14157
14158 ---
14159
14160 * [HBASE-17980](https://issues.apache.org/jira/browse/HBASE-17980) | *Major* | **Any HRegionInfo we give out should be immutable**
14161
14162 Provide alternate user-facing API that takes a RegionInfo Interface instead of a HRegionInfo; the old HRegionInfo methods have been deprecated in 2.0.0 and will be removed in 3.0.0.
14163
14164
14165 ---
14166
14167 * [HBASE-14004](https://issues.apache.org/jira/browse/HBASE-14004) | *Critical* | **[Replication] Inconsistency between Memstore and WAL may result in data in remote cluster that is not in the origin**
14168
14169 Now when replicating a wal file which is still opened for write, we will get its committed length from the WAL instance in the same RS to prevent replicating uncommit WALEdit.
14170
14171 This is very important if you use AsyncFSWAL, as we use fan-out in AsyncFSWAL. The data written to DN will be visible immediately as all DNs think it is the end of a pipeline, although the client has not received an ack, and also NN may truncate the file if the client crashes at the same time.
14172
14173
14174 ---
14175
14176 * [HBASE-18819](https://issues.apache.org/jira/browse/HBASE-18819) | *Major* | **Set version number to 2.0.0-alpha3 from 2.0.0-alpha3-SNAPSHOT**
14177
14178 Set version on branch-2 to be 2.0.0-alpha3 as part of RC making.
14179
14180
14181 ---
14182
14183 * [HBASE-18683](https://issues.apache.org/jira/browse/HBASE-18683) | *Major* | **Upgrade hbase to commons-math 3**
14184
14185 Moved on to commons-math3. Removed commons-math2.
14186
14187
14188 ---
14189
14190 * [HBASE-18453](https://issues.apache.org/jira/browse/HBASE-18453) | *Major* | **CompactionRequest should not be exposed to user directly**
14191
14192 Introduce a CompactionLifeCycleTracker to let the CP users know when the compaction starts and ends. CompactionRequest is marked as IA.Private and should be used in CP implementation any more.
14193
14194
14195 ---
14196
14197 * [HBASE-18794](https://issues.apache.org/jira/browse/HBASE-18794) | *Major* | **Remove deprecated methods in MasterObserver**
14198
14199 The removed APIs are shown below.
14200 # preCreateTableHandler
14201 # postCreateTableHandler
14202 # preDeleteTableHandler
14203 # postDeleteTableHandler
14204 # preTruncateTableHandler
14205 # postTruncateTableHandler
14206 # preModifyTableHandler
14207 # postModifyTableHandler
14208 # preAddColumn
14209 # postAddColumn
14210 # preAddColumnHandler
14211 # postAddColumnHandler
14212 # preModifyColumn
14213 # postModifyColumn
14214 # preModifyColumnHandler
14215 # postModifyColumnHandler
14216 # preDeleteColumn
14217 # postDeleteColumn
14218 # preDeleteColumnHandler
14219 # postDeleteColumnHandler
14220 # preEnableTableHandler
14221 # postEnableTableHandler
14222 # preDisableTableHandler
14223 # postDisableTableHandler
14224 # preDispatchMerge
14225 # postDispatchMerge
14226
14227
14228 ---
14229
14230 * [HBASE-14998](https://issues.apache.org/jira/browse/HBASE-14998) | *Blocker* | **Unify synchronous and asynchronous methods in Admin and cleanup**
14231
14232  \* Deprecates getAlterStatus. Everywhere else we talk of 'modify' rather
14233        'alter' and should use Future returned from async instead.
14234  \* isTableAvailable(TableName, byte [][]) has been deprecated to be
14235        removed; use the overrie instead. This is a weird method.
14236  \* Changed listTableDescriptor to getDescriptor.
14237  \* Renamed other like methods to have same pattern (deprecating the old):
14238         balancer =\> balance
14239         setBalancerRunning =\> balancerSwitch
14240         setNormalizerRunning =\> normalizerSwitch
14241         enableCatalogJanitor =\> catalogJanitorSwitch
14242         setCleanerChoreRunning =\> cleanerChoreSwitch
14243         setSplitOrMergeEnabled =\> splitOrMergeEnabledSwitch
14244
14245  \* Renamed (with deprecation of old) runCatalogScan =\> runCatalogJanitor.
14246  \* Reviewed generated javadoc and made some edits; purged reference to
14247        hbase issues from our API, fixed param names, etc.
14248  \* Made all the enable services methods have same pattern.
14249  \* Renamed takeSnapshotAsync as snapshotAsync (with deprecation of old)
14250  \* Renamed execProcedureWithRet as execProcedureWithReturn (with
14251        deprecation)
14252
14253
14254 ---
14255
14256 * [HBASE-18723](https://issues.apache.org/jira/browse/HBASE-18723) | *Major* | **[pom cleanup] Do a pass with dependency:analyze; remove unused and explicity list the dependencies we exploit**
14257
14258 Purged a bunch of dependencies included but unused. Added reference to dependencies we do use but did not list (transitively included). Purged all but junit from parent pom dependency set and did explicit include in modules instead; not all modules need mockito, etc. Still work to do: grey area around hadoop and its transitive includes need cleanup still to make the  dependency:analyze runs clean. Also figure how to purge junit from parent dependency list.
14259
14260
14261 ---
14262
14263 * [HBASE-17823](https://issues.apache.org/jira/browse/HBASE-17823) | *Major* | **Migrate to Apache Yetus Audience Annotations**
14264
14265 HBase now uses stability and audience annotations sourced from Apache Yetus, instead of the custom annotations that were previously in place.
14266
14267
14268 ---
14269
14270 * [HBASE-18793](https://issues.apache.org/jira/browse/HBASE-18793) | *Major* | **Remove deprecated methods in RegionObserver**
14271
14272 These deprecated methods are removed from RegionObserver:
14273 InternalScanner preFlushScannerOpen(ObserverContext, Store, List, InternalScanner) throws IOException;
14274 void preCompactSelection(ObserverContext, Store, List) throws IOException;
14275 void postCompactSelection(ObserverContext, Store, ImmutableList);
14276 InternalScanner preCompact(ObserverContext, Store, InternalScanner, ScanType) throws IOException;
14277 InternalScanner preCompactScannerOpen(ObserverContext, Store, List, ScanType, long, InternalScanner, CompactionRequest) throws IOException;
14278 InternalScanner preCompactScannerOpen( ObserverContext, Store store, List, ScanType, long, InternalScanner) throws IOException;
14279 void preSplit(ObserverContext) throws IOException;
14280 void preSplit(ObserverContext, byte[]) throws IOException;
14281 void postSplit(ObserverContext, Region, Region) throws IOException;
14282 void preSplitBeforePONR(ObserverContext, byte[], List) throws IOException;
14283 void preSplitAfterPONR(ObserverContext) throws IOException;
14284 void preRollBackSplit(ObserverContext) throws IOException;
14285 void postRollBackSplit(ObserverContext) throws IOException;
14286 void postCompleteSplit(ObserverContext) throws IOException;
14287 long preIncrementColumnValue(ObserverContext, byte[], byte[], byte[], long, boolean) throws IOException;
14288 long postIncrementColumnValue(ObserverContextc, byte[], byte[], byte[], long, boolean, long) throws IOException;
14289 KeyValueScanner preStoreScannerOpen(ObserverContext, Store, Scan, NavigableSet, KeyValueScanner) throws IOException;
14290 boolean postScannerFilterRow(ObserverContext, InternalScanner, byte[], int, short, boolean) throws IOException;
14291 boolean postBulkLoadHFile(ObserverContext, List, boolean) throws IOException;
14292
14293 And this method is also removed since we never call it in our code base:
14294 InternalScanner preFlushScannerOpen(ObserverContext, Store, KeyValueScanner, InternalScanner, long) throws IOException;
14295
14296 The deprecated annotation is removed for these two methods as they are still being used:
14297 void preFlush(ObserverContext) throws IOException;
14298 void postFlush(ObserverContextc) throws IOException;
14299
14300
14301 ---
14302
14303 * [HBASE-18733](https://issues.apache.org/jira/browse/HBASE-18733) | *Major* | **[compat 1-2] Hide WALKey**
14304
14305 WALKey, @InterfaceAudience.LimitedPrivate(HBaseInterfaceAudience.REPLICATION), changed a bunch for 2.0.0. See below. We figured it ok hiding it since it should be internals anyway -- only we should be making them.
14306
14307
14308 ---
14309
14310 * [HBASE-13271](https://issues.apache.org/jira/browse/HBASE-13271) | *Critical* | **Table#puts(List\<Put\>) operation is indeterminate; needs fixing**
14311
14312 Adds more spec on how Get, Delete, and Put work and how they differ to help the user.
14313
14314
14315 ---
14316
14317 * [HBASE-16479](https://issues.apache.org/jira/browse/HBASE-16479) | *Major* | **Move WALEdit from hbase.regionserver.wal package to hbase.wal package**
14318
14319 Incompatible move of WALEdit class from regionserver.wal to wal. Effects @InterfaceAudience.LimitedPrivate({ HBaseInterfaceAudience.REPLICATION,
14320     HBaseInterfaceAudience.COPROC })
14321
14322 (
14323
14324
14325 ---
14326
14327 * [HBASE-10240](https://issues.apache.org/jira/browse/HBASE-10240) | *Critical* | **Remove 0.94-\>0.96 migration code**
14328
14329 Purge 0.94=\>0.96 deprecated, migration code. This means that if you are on 0.94 and wish to go to hbase 2.0, you must first migrate to a version of hbase that is \>= 0.96.
14330
14331
14332 ---
14333
14334 * [HBASE-18783](https://issues.apache.org/jira/browse/HBASE-18783) | *Minor* | **Declare the builder of ClusterStatus as IA.Private, and remove the Writables from ClusterStatus**
14335
14336 **WARNING: No release note provided for this change.**
14337
14338
14339 ---
14340
14341 * [HBASE-18106](https://issues.apache.org/jira/browse/HBASE-18106) | *Critical* | **Redo ProcedureInfo and LockInfo**
14342
14343 Admin.listProcedures and Admin.listLocks were renamed to getProcedures and getLocks (listProcedures was added to hbase 1.2). This change was done in an incompatible way -- we just yanked listProcedures (Because Admin Interface is not compatible with hbase1).
14344
14345     Main changes:
14346     - ProcedureInfo and LockInfo were removed, we use JSON instead of them
14347     - Procedure and LockedResource are their server side equivalent
14348     - Procedure protobuf state\_data became obsolate, it is only kept for
14349       reading previously written WAL
14350     - Procedure protobuf contains a state\_message field, which stores the internal
14351       state messages (Any type instead of bytes)
14352     - Procedure.serializeStateData and deserializeStateData were changed slightly
14353     - Procedures internal states are available on client side
14354     - Procedures are displayed on web UI and in shell in the following jruby format:
14355       { ID =\> '1', PARENT\_ID = '-1', PARAMETERS =\> [ ..extra state information.. ] }
14356
14357
14358 ---
14359
14360 * [HBASE-18621](https://issues.apache.org/jira/browse/HBASE-18621) | *Major* | **Refactor ClusterOptions before applying to code base**
14361
14362 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14363 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14364
14365
14366 ---
14367
14368 * [HBASE-18780](https://issues.apache.org/jira/browse/HBASE-18780) | *Minor* | **Remove HLogPrettyPrinter and hlog command**
14369
14370 **WARNING: No release note provided for this change.**
14371
14372
14373 ---
14374
14375 * [HBASE-14997](https://issues.apache.org/jira/browse/HBASE-14997) | *Critical* | **Move compareOp and Comparators out of filter to client package**
14376
14377 Deprecate checkAnd\* APIs that take the filter CompareOp. Added new overrides that take a generic CompareOperator instead. CompareOperator will be used by checkAnd\* in Table API and by filters going forward.
14378
14379 Other nice improvements suggested by this issue have been moved out to HBASE-18774.
14380
14381
14382 ---
14383
14384 * [HBASE-17972](https://issues.apache.org/jira/browse/HBASE-17972) | *Minor* | **Remove mergePool from CompactSplitThread**
14385
14386 After this jira, mergePool will be permanently removed from CompactSplitThread.
14387
14388
14389 ---
14390
14391 * [HBASE-18704](https://issues.apache.org/jira/browse/HBASE-18704) | *Major* | **Upgrade hbase to commons-collections 4**
14392
14393 **WARNING: No release note provided for this change.**
14394
14395
14396 ---
14397
14398 * [HBASE-18697](https://issues.apache.org/jira/browse/HBASE-18697) | *Major* | **Need a shaded hbase-mapreduce module**
14399
14400 Replaces hbase-shaded-server-\<version\>.jar with hbase-shaded-mapreduce-\<version\>.jar.
14401
14402
14403 ---
14404
14405 * [HBASE-15607](https://issues.apache.org/jira/browse/HBASE-15607) | *Blocker* | **Remove PB references from Admin for 2.0**
14406
14407 All the references to Protos in Admin.java have been removed and replaced with respective POJO classes.
14408 The references to Protos that were removed are
14409 AdminProtos.GetRegionInfoResponse,
14410 HBaseProtos.SnapshotDescription, HBaseProtos.SnapshotDescription.Type,
14411  MasterProtos.SnapshotResponse.
14412 CompactionType, CompactionState and MasterSwitchType Enums have been moved out of Admin.java to standalone Enums.
14413
14414
14415 ---
14416
14417 * [HBASE-18674](https://issues.apache.org/jira/browse/HBASE-18674) | *Major* | **upgrade hbase to commons-lang3**
14418
14419 Move to commons-lang3 from common-lang (check it out!... Nice lib...Some nice utility)
14420
14421
14422 ---
14423
14424 * [HBASE-18736](https://issues.apache.org/jira/browse/HBASE-18736) | *Major* | **Cleanup the HTD/HCD for Admin**
14425
14426 Changed the passed arguments from HTD/HCD to TD/CFD for Admin.
14427
14428
14429 ---
14430
14431 * [HBASE-18699](https://issues.apache.org/jira/browse/HBASE-18699) | *Major* | **Copy LoadIncrementalHFiles to another package and mark the old one as deprecated**
14432
14433 Introduce a new o.a.h.h.tool.LoadIncrementalHFiles. The old o.a.h.h.mapreduce.LoadIncrementalHFiles is deprecated and will be removed in 3.0.0.
14434
14435
14436 ---
14437
14438 * [HBASE-18739](https://issues.apache.org/jira/browse/HBASE-18739) | *Major* | **Make all TimeRange Constructors InterfaceAudience Private.**
14439
14440 All constructors have already been deprecated. This change makes them InterfaceAudience Private.
14441
14442
14443 ---
14444
14445 * [HBASE-18675](https://issues.apache.org/jira/browse/HBASE-18675) | *Minor* | **Making {max,min}SessionTimeout configurable for MiniZooKeeperCluster**
14446
14447 <!-- markdown -->
14448
14449
14450 Standalone clusters and minicluster instances can now configure the session timeout for our embedded ZooKeeper quorum using `hbase.zookeeper.property.minSessionTimeout` and `hbase.zookeeper.property.maxSessionTimeout`.
14451
14452
14453 ---
14454
14455 * [HBASE-15806](https://issues.apache.org/jira/browse/HBASE-15806) | *Critical* | **An endpoint-based export tool**
14456
14457 org.apache.hadoop.hbase.coprocessor.Export
14458 Instructs HBase to dump the contents of table to HDFS in a sequence file
14459 + replaces MR by endpoint (see org.apache.hadoop.hbase.mapreduce.Export)
14460 + no large data to be transfered between hbase server and client
14461 + same command line as org.apache.hadoop.hbase.mapreduce.Export
14462 - user needs to alter table for deploying ExportEndpoint
14463 - user needs to adjust the endpoint timeout for dumping large data
14464 - user needs to get the EXECUTE permission
14465
14466
14467 ---
14468
14469 * [HBASE-18577](https://issues.apache.org/jira/browse/HBASE-18577) | *Critical* | **shaded client includes several non-relocated third party dependencies**
14470
14471 <!-- markdown -->
14472
14473
14474 The HBase shaded artifacts (hbase-shaded-client and hbase-shaded-server) no longer contain several non-relocated third party dependency classes that were mistakenly included. Downstream users who relied on these classes being present will need to add a runtime dependency onto an appropriate third party artifact.
14475
14476 Previously, we erroneously packaged several third party libs without relocating them. In some cases these libraries have now been relocated; in some cases they are no longer included at all.
14477
14478 Includes:
14479
14480 * jaxb
14481 * jetty
14482 * jersey
14483 * codahale metrics (HBase 1.4+ only)
14484 * commons-crypto
14485 * jets3t
14486 * junit
14487 * curator (HBase 1.4+)
14488 * netty 3 (HBase 1.1)
14489 * mokito-junit4 (HBase 1.1)
14490
14491 There is now testing to ensure that the shaded artifacts only contain expected relocated content. It can be run via `mvn -Dtest=noUnitTests -pl hbase-shaded/hbase-shaded-check-invariants -am -Prelease verify`.
14492
14493 For version 2.0+ this patch removes hadoop-mapreduce-client-core from the set of dependencies included for the hbase-client and hbase-shaded-client artifacts.
14494
14495 For 2.0+, the slf4j-log4j12 dependency is now optional for both shaded artifacts.
14496
14497
14498 ---
14499
14500 * [HBASE-14745](https://issues.apache.org/jira/browse/HBASE-14745) | *Blocker* | **Shade the last few dependencies in hbase-shaded-client**
14501
14502 Previously some dependencies in hbase-shaded-client were still leaking into the un-shaded namespace. This should now be fixed.
14503
14504 Additionally the rat checking on generated intermediate files from shading should be skipped.
14505
14506
14507 ---
14508
14509 * [HBASE-18665](https://issues.apache.org/jira/browse/HBASE-18665) | *Critical* | **ReversedScannerCallable invokes getRegionLocations incorrectly**
14510
14511 Performing reverse scan on tables used the meta cache incorrectly and fetched data from meta table every time. This fix solves this issue and which results in performance improvement for reverse scans.
14512
14513
14514 ---
14515
14516 * [HBASE-3935](https://issues.apache.org/jira/browse/HBASE-3935) | *Major* | **HServerLoad.storefileIndexSizeMB should be changed to storefileIndexSizeKB**
14517
14518 This patch removed the storefile\_index\_size\_MB in protobuf. It will cause the value of storefile\_index\_size\_MB is zero if user still use hbase-client 1.x.
14519
14520
14521 ---
14522
14523 * [HBASE-18640](https://issues.apache.org/jira/browse/HBASE-18640) | *Major* | **Move mapreduce out of hbase-server into separate hbase-mapreduce module**
14524
14525 - Moves all org.apache.hadoop.hbase.mapreduce.\* (except LoadIncrementalHFiles) and org.apache.hadoop.hbase.mapred.\* classes from hbase-server module to new hbase-mapreduce module.
14526 - Also moves following tools from hbase-server module to hbase-mapreduce module: CompactionTool, ExportSnapshot, PerformanceEvaluation, LoadTestTool
14527 - Very minor breakages in  LoadTestTool(LimitedPrivate HBaseInterfaceAudience.TOOLS)
14528
14529
14530 ---
14531
14532 * [HBASE-18519](https://issues.apache.org/jira/browse/HBASE-18519) | *Major* | **Use builder pattern to create cell**
14533
14534 Introduce the CellBuilder helper.
14535 1) Using CellBuilderFactory to get CellBuilder for creating cell with row,
14536     column, qualifier, type, and value.
14537 2) For internal use, the ExtendedCellBuilder, which is created by ExtendedCellBuilderFactory, is able to build cell with extra fields - sequence id and tags -
14538
14539
14540 ---
14541
14542 * [HBASE-18448](https://issues.apache.org/jira/browse/HBASE-18448) | *Minor* | **EndPoint example  for refreshing HFiles for stores**
14543
14544 Adds a new RefreshHFiles Coprocessor Endpoint example. Includes client and serverside-endpoint that iterates region Stores to call #refreshStoreFiles.
14545
14546
14547 ---
14548
14549 * [HBASE-18658](https://issues.apache.org/jira/browse/HBASE-18658) | *Major* | **Purge hokey hbase Service implementation; use (internal) Guava Service instead**
14550
14551 Removed hbase Service class. It was not fully-formed. Now Guava is relocated, use its Service instead internally; it has nice implementation facility too in AbstractService.
14552
14553
14554 ---
14555
14556 * [HBASE-15982](https://issues.apache.org/jira/browse/HBASE-15982) | *Blocker* | **Interface ReplicationEndpoint extends Guava's Service**
14557
14558     Breaking change to our ReplicationEndpoint and BaseReplicationEndpoint.
14559
14560     ReplicationEndpoint implemented Guava 0.12 Service. An abstract
14561     subclass, BaseReplicationEndpoint, provided default implementations
14562     and facility, among other things, by extending Guava's
14563     AbstractService class.
14564
14565     Both of these HBase classes were marked LimitedPrivate for
14566     REPLICATION so these classes were semi-public and made it so
14567     Guava 0.12 was part of our API.
14568
14569     Having Guava in our API was a mistake. It anchors us and the
14570     implementation of the Interface to Guava 0.12. This is untenable
14571     given Guava changes and that the Service Interface in particular
14572     has had extensive revamp and improvement done. We can't hold to
14573     the Guava Interface. It changed. We can't stay on Guava 0.12;
14574     implementors and others on our CLASSPATH won't abide being stuck
14575     on an old Guava.
14576
14577     So we make breaking changes. The unhitching of our Interface
14578     from Guava could only be done in a breaking manner. It undoes the
14579     LimitedPrivate on BaseReplicationEndpoint while keeping it for the RE
14580     Interface. It means consumers will have to copy/paste the
14581     AbstractService-based BRE into their own codebase also supplying their
14582     own Guava; HBase no longer 'supplies' this (our Guava usage has
14583     been internalized, relocated).
14584
14585     This patch then adds into RE the basic methods RE needs of the old
14586     Guava Service rather than return a Service to start/stop only to go
14587     back to the RE instance to do actual work. A few method names had to
14588     be changed so could make implementations with Guava Service internally
14589     and not have RE method names and types clash). Semantics remained the
14590     same otherwise. For example startAsync and stopAsync in Guava are start
14591     and stop in RE.
14592
14593
14594 ---
14595
14596 * [HBASE-18347](https://issues.apache.org/jira/browse/HBASE-18347) | *Major* | **Implement a BufferedMutator for async client**
14597
14598 Introduce an AsyncBufferedMutator for batching requests to HBase for a single table.
14599
14600 Use AsyncConnection.getBufferedMutator method to get an AsyncBufferedMutator instance.
14601
14602
14603 ---
14604
14605 * [HBASE-18546](https://issues.apache.org/jira/browse/HBASE-18546) | *Critical* | **Always overwrite the TS for Append/Increment unless no existing cells are found**
14606
14607 If there is no existing cell in submitting Append/Increment, the custom ts won't be overridden. By contrast, the cell's ts will always be overridden by server.
14608
14609
14610 ---
14611
14612 * [HBASE-18224](https://issues.apache.org/jira/browse/HBASE-18224) | *Critical* | **Upgrade jetty**
14613
14614 Moved from Jetty 9.3.x to 9.4.x.
14615
14616 Jetty returns more correct HTTP code when Header is too long, 431 instead of 413, and it requires more threads to start up (made default 16 instead of 10).
14617
14618
14619 ---
14620
14621 * [HBASE-17442](https://issues.apache.org/jira/browse/HBASE-17442) | *Critical* | **Move most of the replication related classes from hbase-client to hbase-replication package**
14622
14623 Move replication implementation's classes from hbase-client to hbase-replication package.
14624
14625
14626 ---
14627
14628 * [HBASE-18653](https://issues.apache.org/jira/browse/HBASE-18653) | *Major* | **Undo hbase2 check against \< hadoop2.6.x; i.e. implement agreed drop of hadoop 2.4 and 2.5 support in hbase2**
14629
14630 Change the yetus profile for branch-2 so it no longer runs hadoop 2.4.x and 2.5.x build checks.
14631
14632
14633 ---
14634
14635 * [HBASE-18630](https://issues.apache.org/jira/browse/HBASE-18630) | *Major* | **Prune dependencies; as is branch-2 has duplicates**
14636
14637 Removed doubled instances of javax.inject and commons-beanutils where the versions were close.
14638
14639 Other instances of 'double' includes have different groupids so wary pruning especially when transitive includes (hadoop or jetty et al.)
14640
14641
14642 ---
14643
14644 * [HBASE-18631](https://issues.apache.org/jira/browse/HBASE-18631) | *Minor* | **Allow configuration of ChaosMonkey properties via hbase-site**
14645
14646 This change invalidates the need for a separate Java properties file to configure the ChaosMonkey included with HBase. These properties can be provided directly in hbase-site.xml. If configuration in provided in both locations, the Java properties file takes precendence.
14647
14648
14649 ---
14650
14651 * [HBASE-18489](https://issues.apache.org/jira/browse/HBASE-18489) | *Major* | **Expose scan cursor in RawScanResultConsumer**
14652
14653 Add a 'cursor' method which returns an 'Optional\<Cursor\>' in 'RawScanResultConsumer.ScanController'. You can use this method to obtain the scan cursor if available.
14654
14655
14656 ---
14657
14658 * [HBASE-18511](https://issues.apache.org/jira/browse/HBASE-18511) | *Blocker* | **Default no regions on master**
14659
14660 Changes the configuration hbase.balancer.tablesOnMaster from list of table names that the can carry (with 'none' meaning no tables on the master) to instead be a boolean that is set to true if master carries tables/regions and false if it does not. If true, the master acts like any regionserver.
14661
14662 If false, then the master carries no tables. This is the default for hbase-2.0.0.
14663
14664 Another boolean configuration, hbase.balancer.tablesOnMaster.systemTablesOnly, when set to true, enables hbase.balancer.tablesOnMaster and makes it so the master hosts system tables exclusively (the long-time deploy mode of master branch and branch-2 up until this commit).
14665
14666 UPDATE: This is broke. See HBASE-19785.
14667 UPDATE2: Master carrying Regions does not work reliably, see HBASE-19828.
14668
14669 See HBASE-19831, the issue to fix regions on Master
14670
14671 The change of hbase.balancer.tablesOnMaster from String list to boolean and
14672 the addition of a simple boolean to enable system-tables on Master was done
14673 to constrain what operators might ask for via this master configuration.
14674 Stipulating what tables are bound to the Master server verges into
14675 regionserver grouping territory, a more robust means of specifying table
14676 and server combinations. Operators should use this latter if they want
14677 layouts more exotic than those supplied by the provided booleans.
14678
14679
14680 ---
14681
14682 * [HBASE-18553](https://issues.apache.org/jira/browse/HBASE-18553) | *Major* | **Expose scan cursor for asynchronous scanner**
14683
14684 The ResultScanner which is gotten from an AsyncTable will also return cursor results if Scan.isNeedCursorResult is true.
14685
14686
14687 ---
14688
14689 * [HBASE-18598](https://issues.apache.org/jira/browse/HBASE-18598) | *Minor* | **AsyncNonMetaRegionLocator use FIFO algorithm to get a candidate locate request**
14690
14691 Introduce FIFO algorithm to get a candidate locate request for AsyncNonMetaRegionLocator.
14692
14693
14694 ---
14695
14696 * [HBASE-18533](https://issues.apache.org/jira/browse/HBASE-18533) | *Major* | **Expose BucketCache values to be configured**
14697
14698 This patch exposes configuration for Bucketcache. These configs are very similar to those for the LRU cache, but are described below:
14699
14700 "hbase.bucketcache.single.factor"; /\*\* Single access bucket size \*/
14701 "hbase.bucketcache.multi.factor"; /\*\* Multiple access bucket size \*/
14702 "hbase.bucketcache.memory.factor"; /\*\* In-memory bucket size \*/
14703 "hbase.bucketcache.extrafreefactor"; /\*\* Free this floating point factor of extra blocks when evicting. For example free the number of blocks requested \* (1 + extraFreeFactor) \*/
14704 "hbase.bucketcache.acceptfactor"; /\*\* Acceptable size of cache (no evictions if size \< acceptable) \*/
14705 "hbase.bucketcache.minfactor"; /\*\* Minimum threshold of cache (when evicting, evict until size \< min) \*/
14706
14707
14708 ---
14709
14710 * [HBASE-18528](https://issues.apache.org/jira/browse/HBASE-18528) | *Critical* | **DON'T allow user to modify the passed table/column descriptor**
14711
14712 **WARNING: No release note provided for this change.**
14713
14714
14715 ---
14716
14717 * [HBASE-18271](https://issues.apache.org/jira/browse/HBASE-18271) | *Blocker* | **Shade netty**
14718
14719 Depend on hbase-thirdparty for our netty instead of directly relying on netty-all. netty is relocated in hbase-thirdparty from io.netty to org.apache.hadoop.hbase.shaded.io.netty. One kink is that netty bundles an .so. Its files also are relocated. So netty can find the .so content, need to specify on command-line a system property telling netty about the shading.
14720
14721 The .so trick is from
14722              https://stackoverflow.com/questions/33825743/rename-files-inside-a-jar-using-some-maven-plugin
14723
14724 In essence we need the below defined whenever we run tests or deploy:
14725
14726 -Dorg.apache.hadoop.hbase.shaded.io.netty.packagePrefix=org.apache.hadoop.hbase.shaded.
14727
14728 (The trailing '.' is required)
14729
14730 See toward the end of this issue for how to pass config: https://github.com/netty/netty/issues/6665
14731
14732 The system property has been added to bin/hbase. If starting hbase with other than bin/hbase, add this system property (at least on linux).
14733
14734 For devs, going forward, do not reference io.netty. Reference org.apache.hadoop.hbase.io.netty instead. Here is sample:
14735
14736 {code}
14737 -import io.netty.channel.Channel;
14738 -import io.netty.channel.EventLoop;
14739 +import org.apache.hadoop.hbase.shaded.io.netty.channel.Channel;
14740 +import org.apache.hadoop.hbase.shaded.io.netty.channel.EventLoop;
14741 {code}
14742
14743
14744 ---
14745
14746 * [HBASE-15511](https://issues.apache.org/jira/browse/HBASE-15511) | *Major* | **ClusterStatus should be able to return responses by scope**
14747
14748 Provide a new way to get desired ClusterStatus with a set of ClusterStatus.Option, such that the response back to client can be limited.
14749 Note that, the constructor way to new a ClusterStatus will be no longer support after 2.0.0,  and use ClusterStatus.Builder instead.
14750
14751
14752 ---
14753
14754 * [HBASE-18551](https://issues.apache.org/jira/browse/HBASE-18551) | *Major* | **[AMv2] UnassignProcedure and crashed regionservers**
14755
14756 Unassign will not proceed if it is unable to talk to the remote server. Now it will expire the server it is unable to communicate with and then wait until it is signaled by ServerCrashProcedure that the server's logs have been split. Only then will judge the unassign successful.
14757
14758 We do this because a subsequent assign lacking the crashed server context might open a region w/o first splitting logs.
14759
14760
14761 ---
14762
14763 * [HBASE-18469](https://issues.apache.org/jira/browse/HBASE-18469) | *Critical* | **Correct  RegionServer metric of  totalRequestCount**
14764
14765 In HBASE-18469 we introduced a new RegionServer metrics in name of "totalRowActionRequestCount" which counts in all row actions and equals to the sum of "readRequestCount" and "writeRequestCount". Meantime, we have changed "totalRequestCount" to count only once for multi request, while previously we will count in action number of the request. As a result, existing monitoring system on totalRequestCount will still work but see a smaller value, and we strongly recommend to change to use the new metrics to monitor server load.
14766
14767
14768 ---
14769
14770 * [HBASE-18500](https://issues.apache.org/jira/browse/HBASE-18500) | *Major* | **Performance issue: Don't use BufferedMutator for HTable's put method**
14771
14772 Remove the deprecated method get/setWriteBufferSize from Table and remove writeBufferSize from TableBuilder. Remove the BufferedMutatorImpl from HTable.
14773
14774
14775 ---
14776
14777 * [HBASE-18387](https://issues.apache.org/jira/browse/HBASE-18387) | *Minor* | **[Thrift] Make principal configurable in DemoClient.java**
14778
14779 This change allows the demonstration Thrift client to customize the server principal used by the Thrift server for instances secured with Kerberos.
14780
14781
14782 ---
14783
14784 * [HBASE-17125](https://issues.apache.org/jira/browse/HBASE-17125) | *Critical* | **Inconsistent result when use filter to read data**
14785
14786 Marked Scan and Get's setMaxVersions() and setMaxVersions(int) as deprecated. They are easy to misunderstand with column family's max versions, so use readAllVersions() and readVersions(int) instead.
14787
14788
14789 ---
14790
14791 * [HBASE-18492](https://issues.apache.org/jira/browse/HBASE-18492) | *Major* | **[AMv2] Embed code for selecting highest versioned region server for system table regions in AssignmentManager.processAssignQueue()**
14792
14793 Favors new servers over older versions when assigning system table regions (more to follow in this area; i.e. changes in the AM itself).
14794
14795
14796 ---
14797
14798 * [HBASE-18517](https://issues.apache.org/jira/browse/HBASE-18517) | *Major* | **limit max log message width in log4j**
14799
14800 Sets a log length max of 1000 characters.
14801
14802
14803 ---
14804
14805 * [HBASE-18502](https://issues.apache.org/jira/browse/HBASE-18502) | *Critical* | **Change MasterObserver to use TableDescriptor and ColumnFamilyDescriptor**
14806
14807 The methods which change to use TableDescriptor/ColumnFamilyDescriptor are shown below.
14808 + preCreateTable( ObserverContext,TableDescriptor, HRegionInfo[])
14809 + postCreateTable(ObserverContext ,TableDescriptor, HRegionInfo[])
14810 + preCreateTableAction(ObserverContext, TableDescriptor,HRegionInfo[])
14811 + postCompletedCreateTableAction(ObserverContext,TableDescriptor,HRegionInfo[])
14812 + preModifyTable(ObserverContext,TableName, TableDescriptor)
14813 + postModifyTable(ObserverContext,TableName, TableDescriptor)
14814 + preModifyTableAction( ObserverContext,TableName,TableDescriptor)
14815 + postCompletedModifyTableAction( ObserverContext,TableName,TableDescriptor)
14816 + preAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14817 + postAddColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14818 + preAddColumnFamilyAction(ObserverContext,TableName,ColumnFamilyDescriptor)
14819 + postCompletedAddColumnFamilyAction(ObserverContext,TableName, ColumnFamilyDescriptor)
14820 + preModifyColumnFamily(ObserverContext,TableName, ColumnFamilyDescriptor)
14821 + preModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment,TableName,ColumnFamilyDescriptor)
14822 + postCompletedModifyColumnFamilyAction(ObserverContext\<MasterCoprocessorEnvironment\>,TableName,ColumnFamilyDescriptor)
14823 + preCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescriptor)
14824 + postCloneSnapshot(ObserverContext\<MasterCoprocessorEnvironment\>,SnapshotDescription,TableDescripto)
14825 + preRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14826 + postRestoreSnapshot(ObserverContext\<MasterCoprocessorEnvironment,SnapshotDescription,TableDescriptor)
14827 + preGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14828 + postGetTableDescriptors(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableName\>, List\<TableDescriptor\>,String)
14829 + preGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14830 + postGetTableNames(ObserverContext\<MasterCoprocessorEnvironment\>,List\<TableDescriptor\>, String)
14831
14832
14833 ---
14834
14835 * [HBASE-18520](https://issues.apache.org/jira/browse/HBASE-18520) | *Minor* | **Add jmx value to determine true Master Start time**
14836
14837 This JIRA adds a JMX value to track when the Master has finished initializing.
14838 The jmx config is 'masterFinishedInitializationTime' and details the time in millis that the Master is fully usable and ready to serve requests.
14839
14840
14841 ---
14842
14843 * [HBASE-17056](https://issues.apache.org/jira/browse/HBASE-17056) | *Critical* | **Remove checked in PB generated files**
14844
14845 Purge all checked in generated protobuf files (30MB). Generate protobuf files inline with the build. Remove checked-in and patched protobuf. Get it from new hbase-thirdparty instead.
14846
14847 Side-effect: Our protobuf went from 3.1.0 to 3.3.1.
14848
14849 Build does not take noticeably longer (still about 2.5 minutes to do a mvn clean install -DskipTests).
14850
14851 IDEs will probably require a mvn build first else they'll complain about missing (generated) files.
14852
14853
14854 ---
14855
14856 * [HBASE-18374](https://issues.apache.org/jira/browse/HBASE-18374) | *Major* | **RegionServer Metrics improvements**
14857
14858 This change adds the latency metrics checkAndPut, checkAndDelete, putBatch and deleteBatch . Also the previous regionserver "mutate" latency metrics are renamed to "put" metrics. Batch metrics capture the latency of the entire batch containing put/delete whereas put/delete metrics capture latency per operation. Note this change will break existing monitoring based on regionserver "mutate" latency metric.
14859
14860
14861 ---
14862
14863 * [HBASE-18023](https://issues.apache.org/jira/browse/HBASE-18023) | *Minor* | **Log multi-\* requests for more than threshold number of rows**
14864
14865 HBASE-18023 introduces a warning message in the RegionServer log when an RPC is received from a client that has more than 5000 "actions" (where an "action" is a collection of mutations for a specific row) in a single RPC. Misbehaving clients who send large RPCs to RegionServers can be malicious, causing temporary pauses via garbage collection or denial of service via crashes. The threshold of 5000 actions per RPC is defined by the property "hbase.rpc.rows.warning.threshold" in hbase-site.xml.
14866
14867
14868 ---
14869
14870 * [HBASE-15968](https://issues.apache.org/jira/browse/HBASE-15968) | *Major* | **New behavior of versions considering mvcc and ts rather than ts only**
14871
14872 This issue resolved two long-term issues in HBase:
14873 Puts may be masked by a delete before them.
14874 Major compactions change query results.
14875
14876 This issue offer a new behavior to fix this issue with a little performance reduction. Set NEW\_VERSION\_BEHAVIOR to true to enable this feature in CF level. See HBASE-15968 for details.
14877 Note if you enable this feature, the order of Mutations matters. But replication will disorder the entries by default. So you have to enable serial replication if you have slave clusters. See HBASE-9465 for details.
14878
14879
14880 ---
14881
14882 * [HBASE-18107](https://issues.apache.org/jira/browse/HBASE-18107) | *Major* | **[AMv2] Remove DispatchMergingRegionsRequest & DispatchMergingRegions**
14883
14884 Removes merge region code added into branch-2 but that was not needed after all. Branch-2 replaced dispatchMergingRegions with MergeTableRegionsProcedure.
14885
14886 Removed:
14887
14888 # dispatchMergingRegions from Connection (was superceded long ago in branch-1).
14889 # mergeRegions from RsRpcServices (was not used).
14890
14891
14892 ---
14893
14894 * [HBASE-15816](https://issues.apache.org/jira/browse/HBASE-15816) | *Major* | **Provide client with ability to set priority on Operations**
14895
14896 Added setPriority(int priority) API to Put, Delete, Increment, Append, Get and Scan pojos.  So for all these ops, the user can provide a custom priority level.
14897
14898
14899 ---
14900
14901 * [HBASE-18430](https://issues.apache.org/jira/browse/HBASE-18430) | *Major* | **Typo in "contributing to documentation" page**
14902
14903 Pushed to {{master}}. Thanks, Coral! Congratulations on your first Apache HBase commit!
14904
14905
14906 ---
14907
14908 * [HBASE-17908](https://issues.apache.org/jira/browse/HBASE-17908) | *Critical* | **Upgrade guava**
14909
14910 Use relocated guava 22.0 gotten from the new hbase-thirdparty ancillary project.
14911
14912 Incompatible change. ReplicationEndpoint and subclasses extend guava Service which changed pretty radically between 12.0 and 22.0. Change is kosher because implementations are marked audience private. Still, this will likely cause grief for the likes of the downstream lily indexer.
14913
14914
14915 ---
14916
14917 * [HBASE-16993](https://issues.apache.org/jira/browse/HBASE-16993) | *Major* | **BucketCache throw java.io.IOException: Invalid HFile block magic when configuring hbase.bucketcache.bucket.sizes**
14918
14919 Any value for hbase.bucketcache.bucket.sizes  configuration to be multiple of 256.  If that is not the case, instantiation of L2 Bucket cache itself will fail throwing IllegalArgumentException.
14920
14921
14922 ---
14923
14924 * [HBASE-16090](https://issues.apache.org/jira/browse/HBASE-16090) | *Major* | **ResultScanner is not closed in SyncTable#finishRemainingHashRanges()**
14925
14926 pushed to 1.3 and 1.2. SyncTable was introduced in 1.2, so skipping 1.1.
14927
14928
14929 ---
14930
14931 * [HBASE-18332](https://issues.apache.org/jira/browse/HBASE-18332) | *Minor* | **Upgrade asciidoctor-maven-plugin**
14932
14933 Committed to master and branch-2. Thanks!
14934
14935
14936 ---
14937
14938 * [HBASE-18161](https://issues.apache.org/jira/browse/HBASE-18161) | *Minor* | **Incremental Load support for Multiple-Table HFileOutputFormat**
14939
14940 In order to use this feature, a user must
14941 1. Register their tables when configuring their job
14942  2. Create a composite key of the tablename and original rowkey to send as the mapper output key.
14943
14944   To register their tables (and configure their job for incremental load into multiple tables), a user must call the static MultiHFileOutputFormat.configureIncrementalLoad function to register the HBase tables that will be ingested into.
14945
14946 To create the composite key, a helper function MultiHFileOutputFormat2.createCompositeKey should be called with the destination tablename and rowkey as arguments, and the result should be output as the mapper key.
14947
14948  Before this JIRA, for HFileOutputFormat2 a configuration for the storage policy was set per Column Family. This was set manually by the user. In this JIRA, this is unchanged when using HFileOutputFormat2. However, when specifically using MultiHFileOutputFormat2, the user now has to manually set the prefix by creating a composite of the table name and the column family. The user can create the new composite value by calling MultiHFileOutputFormat2.createCompositeKey with the tablename and column family as arguments.
14949
14950 Changes added through this JIRA are backwards compatible with existing HFileOutputFormat2 apis and functionality.
14951
14952 The configuration parameter "hbase.mapreduce.hfileoutputformat.table.name" is now a REQUIRED parameter though it is normally set automatically when configureIncrementalLoad method is called within HFileOutputFormat2
14953
14954
14955 ---
14956
14957 * [HBASE-18229](https://issues.apache.org/jira/browse/HBASE-18229) | *Critical* | **create new Async Split API to embrace AM v2**
14958
14959 A new splitRegionAsync() API is added in client. The existing splitRegion()  and split() API will call the new API so client does not have to change its code.
14960
14961 Move HBaseAdmin.splitXXX() logic to master, client splitXXX() API now go to master directly instead of going to RegionServer first.
14962
14963 Also added splitSync() API
14964
14965
14966 ---
14967
14968 * [HBASE-18339](https://issues.apache.org/jira/browse/HBASE-18339) | *Major* | **Update test-patch to use hadoop 3.0.0-alpha4**
14969
14970 HBase now defaults to Apache Hadoop 3.0.0-alpha4 when the Hadoop 3 profile is active.
14971
14972
14973 ---
14974
14975 * [HBASE-18267](https://issues.apache.org/jira/browse/HBASE-18267) | *Major* | **The result from the postAppend is ignored**
14976
14977 **WARNING: No release note provided for this change.**
14978
14979
14980 ---
14981
14982 * [HBASE-18307](https://issues.apache.org/jira/browse/HBASE-18307) | *Major* | **Share the same EventLoopGroup for NettyRpcServer, NettyRpcClient and AsyncFSWALProvider at RS side**
14983
14984 There are two configuration name changes as the event loop configs will not only effect rpc server but be shared by different components in the same RS instance.
14985
14986 'hbase.rpc.server.nativetransport' -\> 'hbase.netty.nativetransport'
14987
14988 'hbase.netty.rpc.server.worker.count' -\> 'hbase.netty.worker.count'
14989
14990
14991 ---
14992
14993 * [HBASE-18241](https://issues.apache.org/jira/browse/HBASE-18241) | *Critical* | **Change client.Table, client.Admin, Region, Store, and HBaseTestingUtility to not use HTableDescriptor or HColumnDescriptor**
14994
14995 - : removed API
14996 + : new API
14997 \* : deprecated API
14998 ---------------------------
14999 Region class
15000 - HTableDescriptor getTableDesc()
15001 +TableDescriptor getTableDescriptor()
15002
15003 Store class
15004 - HColumnDescriptor getFamily()
15005 + ColumnFamilyDescriptor getColumnFamilyDescriptor()
15006
15007 Table class
15008 \* HTableDescriptor getTableDescriptor()
15009 + TableDescriptor getDescriptor()\|
15010
15011 \*Admin class\*
15012 \* HTableDescriptor getTableDescriptor(TableName)
15013 + List\<TableDescriptor\> listTableDescriptor(TableName)\|
15014 \* HTableDescriptor[] getTableDescriptors(List\<String\>)
15015 \* HTableDescriptor[] getTableDescriptorsByTableName(List\<TableName\>)
15016 + List\<TableDescriptor\> listTableDescriptors(List\<TableName\>)
15017 \* HTableDescriptor[] listTables()
15018 + List\<TableDescriptor\> listTableDescriptors()
15019 \* HTableDescriptor[] listTables(Pattern)
15020 + List\<TableDescriptor\> listTableDescriptors(Pattern)
15021 \* HTableDescriptor[] listTables(String)
15022 + List\<TableDescriptor\> listTableDescriptors(String)
15023 \* HTableDescriptor[] listTables(Pattern, boolean)
15024 + List\<TableDescriptor\> listTableDescriptors(Pattern, boolean)
15025 \* HTableDescriptor[] listTables(String, boolean)
15026 + List\<TableDescriptor\> listTableDescriptors(String, boolean)
15027 \* HTableDescriptor[] deleteTables(String)
15028 \* HTableDescriptor[] deleteTables(Pattern)
15029 \* HTableDescriptor[] enableTables(String)
15030 \* HTableDescriptor[] enableTables(Pattern)
15031 \* HTableDescriptor[] disableTables(String)
15032 \* HTableDescriptor[] disableTables(Pattern)
15033 \* void modifyTable(TableName, HTableDescriptor)
15034 + void modifyTable(TableDescriptor)
15035 \* void modifyTableAsync(TableName, HTableDescriptor)
15036 + void modifyTableAsync(TableDescriptor)
15037 \* HTableDescriptor[] listTableDescriptorsByNamespace(String)
15038 + List\<TableDescriptor\> listTableDescriptorsByNamespace(byte[])
15039 \* void createTable(HTableDescriptor)
15040 + void createTable(TableDescriptor)
15041 \* void createTable(HTableDescriptor, byte[], byte[], int)
15042 + void createTable({color:red}TableDescriptor, byte[], byte[], int)
15043 \* void createTable(HTableDescriptor, byte[][])
15044 + void createTable(TableDescriptor, byte[][])
15045 \* Future\<Void\> createTableAsync(HTableDescriptor, byte[][])
15046 + Future\<Void\> createTableAsync(TableDescriptor, byte[][])
15047
15048 \*HBaseTestingUtility class\*
15049 \* Table createTable(HTableDescriptor, byte[][], Configuration)
15050 + Table createTable(TableDescriptor, byte[][], Configuration)
15051 \* Table createTable(HTableDescriptor, byte[][], byte[][], Configuration)
15052 + Table createTable(TableDescriptor, byte[][], byte[][], Configuration)
15053 \* public Table createTable(HTableDescriptor, byte[][])
15054 + public Table createTable(TableDescriptor, byte[][])
15055 \* void modifyTableSync(Admin, HTableDescriptor)
15056 + void modifyTableSync(Admin, TableDescriptor)
15057 \* HRegion createLocalHRegion(HTableDescriptor, byte [], byte [])
15058 + HRegion createLocalHRegion(TableDescriptor, byte [], byte [])
15059 \* HRegion createLocalHRegion(HRegionInf, HTableDescriptor)
15060 + HRegion createLocalHRegion(HRegionInf, TableDescriptor)
15061 \* HRegion createLocalHRegion(HRegionInfo, HTableDescriptor, WAL)
15062 + HRegion createLocalHRegion(HRegionInfo, TableDescriptor, WAL)
15063 \* List createMultiRegionsInMeta(final Configuration, HTableDescriptor, byte [][])
15064 + List createMultiRegionsInMeta(final Configuration, TableDescriptor, byte [][])
15065 \* HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, HTableDescriptor)
15066 + HRegion createRegionAndWAL(HRegionInfo, Path, Configuration, TableDescriptor)
15067 \* HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, HTableDescriptor, boolean)
15068 + HRegion createRegionAndWAL(HRegionInfo, Pat, Configuration, TableDescriptor, boolean)
15069 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor)
15070 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor)
15071 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor, int)
15072 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor, int)
15073 \* int createPreSplitLoadTestTable(Configuration, HTableDescriptor, HColumnDescriptor[], int)
15074 + int createPreSplitLoadTestTable(Configuration, TableDescriptor, ColumnFamilyDescriptor[], int)
15075 \* int createPreSplitLoadTestTable(Configuration,HTableDescriptor, HColumnDescriptor[],SplitAlgorithm, int)
15076 + int createPreSplitLoadTestTable(Configuration,TableDescriptor, ColumnFamilyDescriptor[],SplitAlgorithm, int)
15077 \* HRegion createTestRegion(String, HColumnDescriptor)
15078 + HRegion createTestRegion(String, ColumnFamilyDescriptor)
15079
15080
15081 ---
15082
15083 * [HBASE-18083](https://issues.apache.org/jira/browse/HBASE-18083) | *Major* | **Make large/small file clean thread number configurable in HFileCleaner**
15084
15085 After HBASE-18083 we could configure HFileCleaner to use multiple threads for large/small (archived) hfile cleaning with hbase.regionserver.hfilecleaner.large.thread.count and hbase.regionserver.hfilecleaner.small.thread.count, both default to 1. These properties support online configuration change.
15086
15087
15088 ---
15089
15090 * [HBASE-17931](https://issues.apache.org/jira/browse/HBASE-17931) | *Blocker* | **Assign system tables to servers with highest version**
15091
15092 We usually keep compatibility between old client and new server so we can do rolling upgrade, HBase cluster first, then HBase client. But we don't guarantee new client can access old server.
15093 In an HBase cluster, we have system tables and region servers will access these tables so for servers they are also an HBase client. So if the system tables are in region servers with lower version we may get trouble because region servers with higher version may can not access them.
15094 After this patch, we will move all system regions to region servers with highest version. So when we do a rolling upgrade across two major or minor versions, we should ALWAYS UPGRADE MASTER FIRST and then upgrade region servers. The new master will handle system tables correctly.
15095
15096
15097 ---
15098
15099 * [HBASE-6581](https://issues.apache.org/jira/browse/HBASE-6581) | *Major* | **Build with hadoop.profile=3.0**
15100
15101 Make us build against hadoop trunk (3.0)
15102
15103
15104 ---
15105
15106 * [HBASE-16120](https://issues.apache.org/jira/browse/HBASE-16120) | *Minor* | **Add shell test for truncate\_preserve**
15107
15108 Add unit tests for truncate\_preserve
15109
15110
15111 ---
15112
15113 * [HBASE-18240](https://issues.apache.org/jira/browse/HBASE-18240) | *Major* | **Add hbase-thirdparty, a project with hbase utility including an hbase-shaded-thirdparty module with guava, netty, etc.**
15114
15115 Adds a new project, hbase-thirdparty, at https://git-wip-us.apache.org/repos/asf/hbase-thirdparty used by core hbase. GroupID org.apache.hbase.thirdparty. Version 1.0.0.
15116
15117 This project packages relocated third-party libraries used by Apache HBase such as protobuf, guava, and netty among others. HBase core depends on it.
15118
15119 It has threre submodules, one to patch and then relocate (shade) protobuf, and one to do messy .so renaming (netty). The remainder module relocates a bundle of other (unpatched) libs used by hbase. This latter set includes protobuf-util, netty-all, gson, and guava.
15120
15121 All shading is done using the same relocation offset of org.apache.hadoop.hbase.shaded; we add this prefix to the relocated thirdparty library class names.
15122
15123 See the pom.xml in hbase-thirdparty for the explicit version of each third-party lib included (of note, we update out internal protobuf from 3.1.0 to 3.3.1).
15124
15125
15126 ---
15127
15128 * [HBASE-15943](https://issues.apache.org/jira/browse/HBASE-15943) | *Major* | **Add page displaying JVM process metrics**
15129
15130 Adds new "Process Metrics' tab along the top which leads to new page that dumps mbean -- mostly jvm -- metrics
15131
15132
15133 ---
15134
15135 * [HBASE-14902](https://issues.apache.org/jira/browse/HBASE-14902) | *Major* | **Revert some of the stringency recently introduced by checkstyle tightening**
15136
15137 Changes the checkstyle so that on a continuation line for javadoc, instead of default four spaces, instead now it is two spaces. Also one line statements as in if (true) x =1; now pass checkstyle.
15138
15139
15140 ---
15141
15142 * [HBASE-17110](https://issues.apache.org/jira/browse/HBASE-17110) | *Major* | **Improve SimpleLoadBalancer to always take server-level balance into account**
15143
15144 After HBASE-17110 the bytable strategy for SimpleLoadBalancer will also take server level balance into account
15145
15146
15147 ---
15148
15149 * [HBASE-17928](https://issues.apache.org/jira/browse/HBASE-17928) | *Major* | **Shell tool to clear compaction queues**
15150
15151 Adds clear\_compaction\_queues to the hbase shell.
15152 {code}
15153   Clear compaction queues on a regionserver.
15154   The queue\_name contains short and long.
15155   short is shortCompactions's queue,long is longCompactions's queue.
15156
15157   Examples:
15158   hbase\> clear\_compaction\_queues 'host187.example.com,60020'
15159   hbase\> clear\_compaction\_queues 'host187.example.com,60020','long'
15160   hbase\> clear\_compaction\_queues 'host187.example.com,60020', ['long','short']
15161 {code}
15162
15163
15164 ---
15165
15166 * [HBASE-18164](https://issues.apache.org/jira/browse/HBASE-18164) | *Critical* | **Much faster locality cost function and candidate generator**
15167
15168 New locality cost function and candidate generator that use caching and incremental computation to allow the stochastic load balancer to consider ~20x more cluster configurations for big clusters.
15169
15170
15171 ---
15172
15173 * [HBASE-18226](https://issues.apache.org/jira/browse/HBASE-18226) | *Major* | **Disable reverse DNS lookup at HMaster and use the hostname provided by RegionServer**
15174
15175 The following config is added by this JIRA:
15176
15177 hbase.regionserver.hostname.disable.master.reversedns
15178
15179 This config is for experts: don't set its value unless you really know what you are doing.
15180 When set to true, regionserver will use the current node hostname for the servername and HMaster will skip reverse DNS lookup and use the hostname sent by regionserver instead. Note that this config and hbase.regionserver.hostname are mutually exclusive. See https://issues.apache.org/jira/browse/HBASE-18226 for more details.
15181
15182 Caution: please make sure rolling upgrade succeeds before turning on this feature.
15183
15184
15185 ---
15186
15187 * [HBASE-16242](https://issues.apache.org/jira/browse/HBASE-16242) | *Major* | **Upgrade Avro to 1.7.7**
15188
15189 Apache HBase now specifies that version 1.7.7 of the Apache Avro library should be pulled in by maven and included in the convenience binary tarball.
15190
15191
15192 ---
15193
15194 * [HBASE-18213](https://issues.apache.org/jira/browse/HBASE-18213) | *Major* | **Add documentation about the new async client**
15195
15196 Add documentation for async client in section '66. Client' in ref guide.
15197
15198
15199 ---
15200
15201 * [HBASE-17008](https://issues.apache.org/jira/browse/HBASE-17008) | *Critical* | **Examples to make AsyncClient go down easy**
15202
15203 Add two examples for async client. AsyncClientExample is a simple example to show you how to use AsyncTable. HttpProxyExample is an example for advance user to show you how to use RawAsyncTable to write a fully asynchronous HTTP proxy server. There is no extra thread pool, all operations are executed inside netty's event loop.
15204
15205
15206 ---
15207
15208 * [HBASE-18200](https://issues.apache.org/jira/browse/HBASE-18200) | *Major* | **Set hadoop check versions for branch-2 and branch-2.x in pre commit**
15209
15210 Allow setting different hadoop check versions for branch-2 and branch-2.x when running pre commit check.
15211
15212
15213 ---
15214
15215 * [HBASE-18187](https://issues.apache.org/jira/browse/HBASE-18187) | *Major* | **Release hbase-2.0.0-alpha1**
15216
15217 Pushed the release. For detail: http://apache-hbase.679495.n3.nabble.com/ANNOUNCE-Apache-HBase-2-0-0-alpha-1-is-now-available-for-download-td4088484.html
15218
15219
15220 ---
15221
15222 * [HBASE-18137](https://issues.apache.org/jira/browse/HBASE-18137) | *Critical* | **Replication gets stuck for empty WALs**
15223
15224 0-length WAL files can potentially cause the replication queue to get stuck.  A new config "replication.source.eof.autorecovery" has been added: if set to true (default is false), the 0-length WAL file will be skipped after 1) the max number of retries has been hit, and 2) there are more WAL files in the queue.  The risk of enabling this is that there is a chance the 0-length WAL file actually has some data (e.g. block went missing and will come back once a datanode is recovered).
15225
15226
15227 ---
15228
15229 * [HBASE-18192](https://issues.apache.org/jira/browse/HBASE-18192) | *Blocker* | **Replication drops recovered queues on region server shutdown**
15230
15231 If a region server that is processing recovered queue for another previously dead region server is gracefully shut down, it can drop the recovered queue under certain conditions. Running without this fix on a 1.2+ release means possibility of continuing data loss in replication, irrespective of which WALProvider is used.
15232 If a single WAL group (or DefaultWALProvider) is used, running without this fix will always cause dataloss in replication whenever a region server processing recovered queues is gracefully shutdown.
15233
15234
15235 ---
15236
15237 * [HBASE-18109](https://issues.apache.org/jira/browse/HBASE-18109) | *Critical* | **Assign system tables first (priority)**
15238
15239 Adds a sort of procedures before submission so system tables are queued first (which will help ensure they go out first). This should be good enough along w/ existing scheduling mechanisms to ensure system/meta are assigned first (See reasoning below). Open new issue if insufficient.
15240
15241
15242 ---
15243
15244 * [HBASE-18008](https://issues.apache.org/jira/browse/HBASE-18008) | *Major* | **Any HColumnDescriptor we give out should be immutable**
15245
15246 1) The HColumnDescriptor got from Admin, AsyncAdmin, and Table is immutable.
15247 2) HColumnDescriptor have been marked as "Deprecated" and user should substituted
15248      ColumnFamilyDescriptor for HColumnDescriptor.
15249 3) ColumnFamilyDescriptor is constructed through ColumnFamilyDescriptorBuilder and it contains all of the read-only methods from HColumnDescriptor
15250 4) The value to which the IS\_MOB/MOB\_THRESHOLD is mapped is stored as String rather than Boolean/Long. The MOB is an new feature to 2.0 so this change should be acceptable
15251
15252
15253 ---
15254
15255 * [HBASE-18149](https://issues.apache.org/jira/browse/HBASE-18149) | *Major* | **The setting rules for table-scope attributes and family-scope attributes should keep consistent**
15256
15257 If the table-scope attributes value is false, you need not to enclose 'false' in single quotation.Both COMPACTION\_ENABLED =\> false and COMPACTION\_ENABLED =\> 'false' will take effect
15258
15259
15260 ---
15261
15262 * [HBASE-17849](https://issues.apache.org/jira/browse/HBASE-17849) | *Major* | **PE tool random read is not totally random**
15263
15264 When randomRead and randomSeekScan is used with PE tool, now we allow using both --size and --rows. The --size specifies the total size of the data (the range) on which the reads should be performed and --rows specifies the number of rows to be read by each client with in that range.
15265
15266
15267 ---
15268
15269 * [HBASE-15576](https://issues.apache.org/jira/browse/HBASE-15576) | *Major* | **Scanning cursor to prevent blocking long time on ResultScanner.next()**
15270
15271 If you don't like scanning being blocked too long because of heartbeat and partial result, you can use Scan#setNeedCursorResult(true) to get a special result within scanning timeout setting time which will tell you where row the server is scanning. See its javadoc for more details.
15272
15273
15274 ---
15275
15276 * [HBASE-16549](https://issues.apache.org/jira/browse/HBASE-16549) | *Major* | **Procedure v2 - Add new AM metrics**
15277
15278 Following AMv2 procedures are modified to override onSubmit(), onFinish() hooks provided by HBASE-17888 to do
15279 metrics calculations when procedures are submitted and finshed:
15280 \* AssignProcedure
15281 \* UnassignProcedure
15282 \* MergeTableRegionProcedure
15283 \* SplitTableRegionProcedure
15284 \* ServerCrashProcedure
15285
15286 Following metrics is collected for each of the above procedure during lifetime of a process:
15287 \* Total number of requests submitted for a type of procedure
15288 \* Histogram of runtime in milliseconds for successfully completed procedures
15289 \* Total number of failed procedures
15290
15291 As we are moving away from Hadoop's metric2, hbase-metrics-api module is used for newly added metrics.
15292
15293
15294 ---
15295
15296 * [HBASE-9393](https://issues.apache.org/jira/browse/HBASE-9393) | *Critical* | **Hbase does not closing a closed socket resulting in many CLOSE\_WAIT**
15297
15298 To handle this issue client need to have Hadoop client 2.6.4 or 2.7.0+ Hadoop version as CanUnBuffer interface which was added as part of HDFS-7694 is available in only those versions.
15299
15300
15301 ---
15302
15303 * [HBASE-18038](https://issues.apache.org/jira/browse/HBASE-18038) | *Critical* | **Rename StoreFile to HStoreFile and add a StoreFile interface for CP**
15304
15305 StoreFile is now changed to an interface. This is an incompatible change. The coprocessors which implement RegionObserver may need to modify their code.
15306
15307
15308 ---
15309
15310 * [HBASE-16196](https://issues.apache.org/jira/browse/HBASE-16196) | *Critical* | **Update jruby to a newer version.**
15311
15312 The bundled JRuby 1.6.8 has been updated to version 9.1.9.0. The represents a change from Ruby 1.8 to Ruby 2.3.3, which introduces non-compatible language changes for user scripts.
15313
15314 This JRuby version update required an update to joni-2.1.11 and jcodings-1.0.18, used for regular expression matching, as well as several transitive dependency updates that should not be user-visible.
15315
15316
15317 ---
15318
15319 * [HBASE-14614](https://issues.apache.org/jira/browse/HBASE-14614) | *Major* | **Procedure v2: Core Assignment Manager**
15320
15321 Replaces the AssignmentManager with a new procedurev2-based AssignmentManager
15322
15323 h1. AMv2
15324 Puts AssignmentManager up on top of the ProcedureV2 state machine with persistence engine. Each assignment atom is now a Procedure implementation; e.g. an AssignProcedure and an UnassignProcedure. Molecules of aggregated Procedures are used to do more involved assignment steps: e.g. the move region procedure is made of an Unassign followed by an Assign subprocedure.
15325
15326 AMv2 is 1500 lines. Old AM was near 4000. Functionality has been moved out to Procedures. In-memory states of regions and servers has been cleaned up stored in new RegionStates implementation. RegionStateStore takes care of publishing final region state out to the hbase:meta table.
15327
15328 New RemoteProcedureDispatcher/RSProcedureDispatcher runs the Procedure-based assignments ‘remotely’. Knows about ‘servers’. Does aggregation of assignments by time on a time/count basis so can send procedures in batches rather than one per RPC. Procedure status comes back on the back of the RegionServer heartbeat reporting online regions. The response is passed to the AMv2 to ‘process’. It will check against the in-memory state. If there is a mismatch, it fences out the RegionServer on the assumption that something went wrong on the RS side.Timeouts trigger retries. The Procedure machine ensures only one operation at a time on any one region/table using locking and smarts about what is serial and what can be run concurrently.
15329
15330 New accounting of RegionServer version will be used running rolling restarts.
15331
15332 ‘States’ -- OPENING, CLOSING, etc. -- are now in-memory in-the-master only serialized out to the ProcedureV2 WAL. They are no longer persisted to ZooKeeper.
15333
15334 h2. Assign Detail
15335 The Assign starts by pushing the "assign" operation to the AssignmentManager and then will go into a “waiting" state. The AM will batch the "assign" requests and ask the Balancer where to put the region (the various policies will be respected: retain, round-robin, random). Once the AM and the balancer have found a place for the region, the procedure will be resumed and an "open region" request will be placed in the Remote Dispatcher queue, and the procedure once again will go into a "waiting state".  The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the assignment by publishing to new state on hbase:meta or it will retry the assignment.
15336
15337 h3. Unassign Detail
15338  The Unassign starts by placing a "close region" request in the Remote Dispatcher queue, and the procedure will then go into a "waiting state". The Remote Dispatcher will batch the various requests for that server and they will be sent to the RS for execution. The RS will complete the open operation by calling master.reportRegionStateTransition(). The AM will intercept the transition report, and notify the procedure. The procedure will finish the unassign by publishing its new state on meta or it will retry the unassign.
15339
15340 h1. New Configs
15341  \* "hbase.procedure.remote.dispatcher.threadpool.size" defaults 128
15342  \* "hbase.procedure.remote.dispatcher.delay.msec" default 150ms
15343  \* "hbase.procedure.remote.dispatcher.max.queue.size" with default 32
15344  \* "hbase.regionserver.rpc.startup.waittime" with default 60 seconds.
15345 h1. TODO
15346 As of this writing.
15347
15348 Put up a model diagram.
15349
15350  \* Handle region migration
15351  \* Handle meta assignment first
15352  \* Handle sys table assignment first (e.g. acl, namespace)
15353  \* Handle table priorities
15354  \* Do we report same AM metrics as we used too? We do it all in here now.
15355
15356 INCOMPATIBLE
15357 A known incompatible is that because splits and merges are now run from the master, Coprocessors that used to watch for merge/split from a RegionObserver now no longer work; to watch split/merges, you need to have an observer on the Master instead.
15358
15359
15360 ---
15361
15362 * [HBASE-3462](https://issues.apache.org/jira/browse/HBASE-3462) | *Major* | **Fix table.jsp in regards to splitting a region/table with an optional splitkey**
15363
15364 UI pages for splitting/merging now operate by taking a row key prefix from the user rather than a full region name.
15365
15366
15367 ---
15368
15369 * [HBASE-18129](https://issues.apache.org/jira/browse/HBASE-18129) | *Major* | **truncate\_preserve fails when the truncate method doesn't exists on the master**
15370
15371 The command truncate\_preserve will be fine when the truncate method doesn't exist on the master
15372
15373
15374 ---
15375
15376 * [HBASE-18122](https://issues.apache.org/jira/browse/HBASE-18122) | *Major* | **Scanner id should include ServerName of region server**
15377
15378 The scanner id is not from 1 anymore.
15379 The first 32 bits are MurmurHash32 of ServerName string "host,port,ts". The ServerName contains both host, port, and start timestamp so it can prevent collision. The lowest 32bit is generated by atomic int.
15380
15381
15382 ---
15383
15384 * [HBASE-17997](https://issues.apache.org/jira/browse/HBASE-17997) | *Major* | **In dev environment, add jruby-complete jar to classpath only when jruby is needed**
15385
15386 When JRUBY\_HOME is specified, if the command is "hbase shell" or "hbase org.jruby.Main", CLASSPATH and HBASE\_OPTS will be updated according to JRUBY\_HOME specified
15387 \* Jar under JRUBY\_HOME is added to CLASSPATH
15388 \* The following will be added into HBASE\_OPTS
15389
15390 -Djruby.home=$JRUBY\_HOME -Djruby.lib=$JRUBY\_HOME/lib
15391
15392
15393 That is, as long as JRUBY\_HOME is specified, JRUBY\_HOME specified will take precedence.
15394 \* In dev env, the jar recorded in cached\_classpath\_jruby.txt will be ignored
15395 \* In non dev env, jruby-complete jar packaged with HBase will be ignored
15396
15397
15398 ---
15399
15400 * [HBASE-15616](https://issues.apache.org/jira/browse/HBASE-15616) | *Major* | **Allow null qualifier for all table operations**
15401
15402 After this issue, all table operations will support null qualifier, such as put/get/scan/increment/append/checkAndMutate/checkAndPut/checkAndDelete.
15403
15404
15405 ---
15406
15407 * [HBASE-18035](https://issues.apache.org/jira/browse/HBASE-18035) | *Critical* | **Meta replica does not give any primaryOperationTimeout to primary meta region**
15408
15409 When a client is configured to use meta replica, it sends scan request to all meta replicas almost at the same time. Since meta replica contains stale data, if result from one of replica comes back first, the client may get wrong region locations. To fix this, "hbase.client.meta.replica.scan.timeout" is introduced, a client will always send to primary meta region first, wait the configured timeout for reply. If no result is received, it will send request to replica meta regions. The unit for "hbase.client.meta.replica.scan.timeout"  is microsecond, the default value is 1000000 (1 second).
15410
15411
15412 ---
15413
15414 * [HBASE-11013](https://issues.apache.org/jira/browse/HBASE-11013) | *Major* | **Clone Snapshots on Secure Cluster Should provide option to apply Retained User Permissions**
15415
15416 While creating a snapshot, it will save permissions of the original table into .snapshotinfo file(Backward compatibility) , which is in the snapshot root directory.  For clone\_snapshot/restore\_snapshot command, we provide an additional option( RESTORE\_ACL) to decide whether we will grant permissons of the origin table to the newly created table.
15417
15418
15419 ---
15420
15421 * [HBASE-18018](https://issues.apache.org/jira/browse/HBASE-18018) | *Major* | **Support abort for all procedures by default**
15422
15423 The default behavior for abort() method of StateMachineProcedure class is changed to support aborting all procedures irrespective of if procedure supports rollback or not.
15424
15425
15426 ---
15427
15428 * [HBASE-16851](https://issues.apache.org/jira/browse/HBASE-16851) | *Major* | **User-facing documentation for the In-Memory Compaction feature**
15429
15430 Two blog posts on Apache HBase blog: user manual and programmer manual.
15431 Ref. guide draft published: https://docs.google.com/document/d/1Xi1jh\_30NKnjE3wSR-XF5JQixtyT6H\_CdFTaVi78LKw/edit
15432
15433
15434 ---
15435
15436 * [HBASE-17343](https://issues.apache.org/jira/browse/HBASE-17343) | *Blocker* | **Make Compacting Memstore default in 2.0 with BASIC as the default type**
15437
15438  This JIRA changes the default MemStore to be CompactingMemStore instead of DefaultMemStore. In-memory compaction of CompactingMemStore demonstrated sizable improvement in HBase’s write amplification and read/write performance.
15439
15440 CompactingMemStore achieves these gains through smart use of RAM. The algorithm periodically re-organizes the in-memory data in efficient data structures and reduces redundancies. The  HBase server’s memory footprint therefore periodically expands and contracts. The outcome is longer lifetime of data in memory, less I/O, and overall faster performance. More details about the algorithm and its use appear in the Apache HBase Blog: https://blogs.apache.org/hbase/
15441
15442 How To Use:
15443 The in-memory compaction level can be configured both globally and per column family. The supported levels are none (DefaultMemStore), basic, and eager.
15444
15445 By default, all tables apply basic in-memory compaction. This global configuration can be overridden in hbase-site.xml, as follows:
15446
15447 \<property\>
15448  \<name\>hbase.hregion.compacting.memstore.type\</name\>
15449  \<value\>\<none\|basic\|eager\>\</value\>
15450  \</property\>
15451
15452 The level can also be configured in the HBase shell per column family, as follows:
15453
15454 create ‘\<tablename\>’,
15455 {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
15456
15457
15458 ---
15459
15460 * [HBASE-17786](https://issues.apache.org/jira/browse/HBASE-17786) | *Major* | **Create LoadBalancer perf-tests (test balancer algorithm decoupled from workload)**
15461
15462 $ bin/hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation -help
15463 usage: hbase org.apache.hadoop.hbase.master.balancer.LoadBalancerPerformanceEvaluation \<options\>
15464 Options:
15465  -regions \<arg\>         Number of regions to consider by load balancer. Default: 1000000
15466  -servers \<arg\>         Number of servers to consider by load balancer. Default: 1000
15467  -load\_balancer \<arg\>   Type of Load Balancer to use. Default:
15468                         org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer
15469
15470
15471 ---
15472
15473 * [HBASE-17887](https://issues.apache.org/jira/browse/HBASE-17887) | *Blocker* | **Row-level consistency is broken for read**
15474
15475 Now we pass on list of memstoreScanners to the StoreScanner along with the new files to ensure that the StoreScanner sees the latest memstore after flush.
15476
15477
15478 ---
15479
15480 * [HBASE-15296](https://issues.apache.org/jira/browse/HBASE-15296) | *Major* | **Break out writer and reader from StoreFile**
15481
15482 \<!-- mardown --\>
15483 Refactor that breaks out StoreFile Reader and Writer inner classes as StoreFileReader and StoreFileWriter.
15484
15485 NOTE! Changes RegionObserver Coprocessor Interface so incompatible change (Discussed on dev list in thread "[Note breaking change on RegionObserver in hbase-2.0.0](https://s.apache.org/hbase-dev-note-about-HBASE-15296)"
15486
15487
15488 ---
15489
15490 * [HBASE-15199](https://issues.apache.org/jira/browse/HBASE-15199) | *Critical* | **Move jruby jar so only on hbase-shell module classpath; currently globally available**
15491
15492 The JRuby jar is no longer automatically included in classpaths for HBase server processes nor clients. It is still included in the classpath for the HBase shell and for invocations of org.jruby.Main, which should cover HBase provided support scripts.
15493
15494
15495 ---
15496
15497 * [HBASE-18009](https://issues.apache.org/jira/browse/HBASE-18009) | *Major* | **Move RpcServer.Call to a separated file**
15498
15499 The return value of CallRunner.getCall is changed so this is an incompatible change as CallRunner is declared as IA.LimitedPrivate. CallRunner is declared as IS.Evolving so we do not break the rule. And we still keep the getCall method to reduce the impact to user code.
15500
15501
15502 ---
15503
15504 * [HBASE-14925](https://issues.apache.org/jira/browse/HBASE-14925) | *Major* | **Develop HBase shell command/tool to list table's region info through command line**
15505
15506 Added a shell command 'list\_regions' for displaying the table's region info through command line.
15507
15508         List all regions for a particular table as an array and also filter them by server name (optional) as prefix
15509         and maximum locality (optional). By default, it will return all the regions for the table with any locality.
15510         The command displays server name, region name, start key, end key, size of the region in MB, number of requests
15511         and the locality. The information can be projected out via an array as third parameter. By default all these information
15512         is displayed. Possible array values are SERVER\_NAME, REGION\_NAME, START\_KEY, END\_KEY, SIZE, REQ and LOCALITY. Values
15513         are not case sensitive. If you don't want to filter by server name, pass an empty hash / string as shown below.
15514
15515         Examples:
15516         hbase\> list\_regions 'table\_name'
15517         hbase\> list\_regions 'table\_name', 'server\_name'
15518         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}
15519         hbase\> list\_regions 'table\_name', {SERVER\_NAME =\> 'server\_name', LOCALITY\_THRESHOLD =\> 0.8}, ['SERVER\_NAME']
15520         hbase\> list\_regions 'table\_name', {}, ['SERVER\_NAME', 'start\_key']
15521         hbase\> list\_regions 'table\_name', '', ['SERVER\_NAME', 'start\_key']
15522
15523
15524 ---
15525
15526 * [HBASE-17471](https://issues.apache.org/jira/browse/HBASE-17471) | *Critical* | **Region Seqid will be out of order in WAL if using mvccPreAssign**
15527
15528 MVCCPreAssign is added by HBASE-16698, but pre-assign mvcc is only used in put/delete path. Other write paths like increment/append still assign mvcc in ringbuffer's consumer thread. If put and increment are used parallel. Then seqid in WAL may not increase monotonically. Disorder in wals will lead to data loss.This patch bring all mvcc/seqid event in wal.append, and synchronize wal append and mvcc acquirement. No disorder in wal will happen. Performance test shows no regression with this patch.
15529
15530
15531 ---
15532
15533 * [HBASE-16466](https://issues.apache.org/jira/browse/HBASE-16466) | *Major* | **HBase snapshots support in VerifyReplication tool to reduce load on live HBase cluster with large tables**
15534
15535 Support for snapshots in VerifyReplication tool i.e. verifyrep can compare source table snapshot against peer table snapshot which reduces load on RS by reading data from HDFS directly using Snapshot scanners.
15536 Instead of comparing against live tables whose state changes due to writes and compactions its better to compare HBase  snapshots which are immutable in nature.
15537
15538
15539 ---
15540
15541 * [HBASE-17263](https://issues.apache.org/jira/browse/HBASE-17263) | *Major* | **  Netty based rpc server impl**
15542
15543 A new RPC server based on Netty4 which can improve random read (get) performance. By default, it is off. To use this feature, please set “hbase.rpc.server.impl" to “org.apache.hadoop.hbase.ipc.NettyRpcServer”.
15544
15545 In one deploy, doubled the throughput and lowered the latency significantly: see https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
15546
15547
15548 ---
15549
15550 * [HBASE-17957](https://issues.apache.org/jira/browse/HBASE-17957) | *Minor* | ** Custom metrics of replicate endpoints don't prepend "source." to global metrics**
15551
15552 Global custom metrics names follow the "source.metricsName" format.
15553
15554
15555 ---
15556
15557 * [HBASE-17757](https://issues.apache.org/jira/browse/HBASE-17757) | *Major* | **Unify blocksize after encoding to decrease memory fragment**
15558
15559 Blocksize is set in columnfamily's atrributes. It is used to control block sizes when generating blocks. But, it doesn't take encoding into count. If you set encoding to blocks, after encoding, the block size varies. Since blocks will be cached in memory after encoding (default), it will cause memory fragment if using blockcache, or decrease the pool efficiency if using bucketCache. This issue introduced a new config named 'hbase.writer.unified.encoded.blocksize.ratio'. The default value of this config is 1, meaning doing nothing. If this value is set to a smaller value like 0.5, and the blocksize is set to 64KB(default value of blocksize). It will unify the blocksize after encoding to 64KB \* 0.5 = 32KB. Unified blocksize will releaf the memory problems mentioned above.
15560
15561
15562 ---
15563
15564 * [HBASE-14286](https://issues.apache.org/jira/browse/HBASE-14286) | *Trivial* | **Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile**
15565
15566 HBASE-14286 Correct typo in argument name for WALSplitter.writeRegionSequenceIdFile
15567
15568
15569 ---
15570
15571 * [HBASE-17817](https://issues.apache.org/jira/browse/HBASE-17817) | *Major* | **Make Regionservers log which tables it removed coprocessors from when aborting**
15572
15573 Add table name to exception logging when a coprocessor is removed from a table by the region server
15574
15575
15576 ---
15577
15578 * [HBASE-17877](https://issues.apache.org/jira/browse/HBASE-17877) | *Major* | **Improve HBase's byte[] comparator**
15579
15580 updated the lexicographic byte array comparator to use a slightly more optimized version similar to the one available in the guava library that compares only the first index where left[index] != right[index]. The comparator also returns the diff directly instead of mapping it to -1, 0, +1 range as was being done in the earlier version. We have seen significant performance gains, calculated in terms of throughput (ops/ms) with these changes ranging from approx 20% for smaller byte arrays upto 200 bytes and almost 100% for large byte array sizes that are in few KB's. We benchmarked with upto 16KB arrays and the general trend indicates that the performance improvement increases as the size of the byte array increases.
15581
15582
15583 ---
15584
15585 * [HBASE-9899](https://issues.apache.org/jira/browse/HBASE-9899) | *Major* | **for idempotent operation dups, return the result instead of throwing conflict exception**
15586
15587 Non-idempotent operations (increment/append/checkAndPut/...) may throw OperationConflictException even though the increment/append succeeded. For example (client rpc retries number set to 3):
15588
15589 1. first increment rpc request success
15590 2. client timeout and send second rpc request, but nonce is same and save in server. The server found that it has already succeed, so return a OperationConflictException to make sure that increment operation only be applied once in server.
15591
15592 This patch will solve this problem by read the previous result when receive a duplicate rpc request.
15593 1. Store the mvcc to OperationContext. When first rpc request succeed, store the mvcc for this operation nonce.
15594 2. When there are duplicate rpc request, convert to read result by the mvcc.
15595
15596
15597 ---
15598
15599 * [HBASE-15583](https://issues.apache.org/jira/browse/HBASE-15583) | *Minor* | **Any HTableDescriptor we give out should be immutable**
15600
15601 # The HTD got from Admin, AsyncAdmin, and Table is immutable.
15602 # DEFERRED\_LOG\_FLUSH is removed.
15603 # cleanup the deprecated construction of HTD
15604
15605
15606 ---
15607
15608 * [HBASE-17956](https://issues.apache.org/jira/browse/HBASE-17956) | *Major* | **Raw scan should ignore TTL**
15609
15610 Now raw scan can also read expired cells.
15611
15612
15613 ---
15614
15615 * [HBASE-15143](https://issues.apache.org/jira/browse/HBASE-15143) | *Minor* | **Procedure v2 - Web UI displaying queues**
15616
15617 Adds a new Admin#listLocks, a panel on the procedures page to list procedure locks, and a list\_locks command to the shell. Use it to see current state of procedure locking in Master process.
15618
15619
15620 ---
15621
15622 * [HBASE-17514](https://issues.apache.org/jira/browse/HBASE-17514) | *Minor* | **Warn when Thrift Server 1 is configured for proxy users but not the HTTP transport**
15623
15624 If users of the Thrift 1 Server enable proxy user support without enabling the prerequisite HTTP transport, we now log a WARN message about the mismatch.
15625
15626
15627 ---
15628
15629 * [HBASE-17914](https://issues.apache.org/jira/browse/HBASE-17914) | *Major* | **Create a new reader instead of cloning a new StoreFile when compaction**
15630
15631 StoreFile.createReader method is gone. Call initReader and then getReader instead.
15632
15633
15634 ---
15635
15636 * [HBASE-16477](https://issues.apache.org/jira/browse/HBASE-16477) | *Major* | **Remove Writable interface and related code from WALEdit/WALKey**
15637
15638 Removes the Writables, and related code from WALEdit class. HBase-2.0 will not be able to read WAL files written with 0.94.x and before.
15639
15640
15641 ---
15642
15643 * [HBASE-17858](https://issues.apache.org/jira/browse/HBASE-17858) | *Major* | **Update refguide about the IS annotation if necessary**
15644
15645 Updated refguide to tell users that IS annotation is only valid for IA.LimitedPrivate classes.
15646
15647
15648 ---
15649
15650 * [HBASE-17857](https://issues.apache.org/jira/browse/HBASE-17857) | *Major* | **Remove IS annotations from IA.Public classes**
15651
15652 Now we do not have InterfaceStability annotations for IA,Public API. The stability of these classes will follow the rule of 'Semantic Versioning'.
15653
15654
15655 ---
15656
15657 * [HBASE-17215](https://issues.apache.org/jira/browse/HBASE-17215) | *Major* | **Separate small/large file delete threads in HFileCleaner to accelerate archived hfile cleanup speed**
15658
15659 After HBASE-17215 we change to use two threads for (archived) hfile cleaning. The size throttling for large/small files could be set through "hbase.regionserver.thread.hfilecleaner.throttle" and default to 67108864 (64M). It supports online configuration change, just find the active master address through zookeeper dump and use it in update\_config command, e.g. update\_config 'hbasem1.et2.tbsite.net,60100,1488038696741'
15660
15661
15662 ---
15663
15664 * [HBASE-16780](https://issues.apache.org/jira/browse/HBASE-16780) | *Critical* | **Since move to protobuf3.1, Cells are limited to 64MB where previous they had no limit**
15665
15666 Upgrade internal pb to 3.2 from 3.1. 3.2 has fix for 64MB limit.
15667
15668
15669 ---
15670
15671 * [HBASE-17287](https://issues.apache.org/jira/browse/HBASE-17287) | *Blocker* | **Master becomes a zombie if filesystem object closes**
15672
15673 If filesystem is not available during log split, abort master server.
15674
15675
15676 ---
15677
15678 * [HBASE-17765](https://issues.apache.org/jira/browse/HBASE-17765) | *Major* | **Reviving the merge possibility in the CompactingMemStore**
15679
15680 Reviving the merge of the compacting pipeline: making the limit on the number of the segments in the pipeline configurable and adding the merge test.
15681
15682 In order to customize the pipeline size limit change the value of the "hbase.hregion.compacting.pipeline.segments.limit" in the hbase-site.xml
15683
15684 Value 1 means to merge the segments on any flush-in-memory. Value higher than 16 means no merge.
15685
15686
15687 ---
15688
15689 * [HBASE-13395](https://issues.apache.org/jira/browse/HBASE-13395) | *Major* | **Remove HTableInterface**
15690
15691 HTableInterface was deprecated in 0.21.0 and is removed in 2.0.0. Use org.apache.hadoop.hbase.client.Table instead.
15692
15693
15694 ---
15695
15696 * [HBASE-17595](https://issues.apache.org/jira/browse/HBASE-17595) | *Critical* | **Add partial result support for small/limited scan**
15697
15698 Now small scan and limited scan could also return partial results.
15699
15700
15701 ---
15702
15703 * [HBASE-16014](https://issues.apache.org/jira/browse/HBASE-16014) | *Major* | **Get and Put constructor argument lists are divergent**
15704
15705 Add 2 constructors fot API Get
15706 1. Get(byte[], int, int)
15707 2. Get(ByteBuffer)
15708
15709
15710 ---
15711
15712 * [HBASE-17584](https://issues.apache.org/jira/browse/HBASE-17584) | *Major* | **Expose ScanMetrics with ResultScanner rather than Scan**
15713
15714 Now you can use ResultScanner.getScanMetrics to get the scan metrics at any time during the scan operation. The old Scan.getScanMetrics is deprecated and still work, but if you use ResultScanner.getScanMetrics to get the scan metrics and reset it, then the metrics published to the Scan instaince will be messed up.
15715
15716
15717 ---
15718
15719 * [HBASE-17802](https://issues.apache.org/jira/browse/HBASE-17802) | *Major* | **Add note that minor versions can add methods to Interfaces**
15720
15721 Update our semver section to include a note on our allowing ourselves the right to add methods to an Interface over a minor version as agreed to up on the dev list:  "If a Client implements an HBase Interface, a recompile MAY be required upgrading to a newer minor version (See release notes for warning about incompatible changes). All effort will be made to provide a default implementation so this case should not arise."
15722
15723
15724 ---
15725
15726 * [HBASE-17426](https://issues.apache.org/jira/browse/HBASE-17426) | *Major* | **Inconsistent environment variable names for enabling JMX**
15727
15728 In bin/hbase-config.sh,
15729 if value for HBASE\_JMX\_BASE is empty, keep current behavior.
15730 if HBASE\_JMX\_OPTS is not empty, keep current behavior.
15731 otherwise use the value of HBASE\_JMX\_BASE
15732
15733
15734 ---
15735
15736 * [HBASE-17740](https://issues.apache.org/jira/browse/HBASE-17740) | *Critical* | **Correct the semantic of batch and partial for async client**
15737
15738 Now async client has the same semantic with sync client for batch and partial.
15739 '''
15740 Now setBatch doesn't mean setAllowPartialResult(true)
15741 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15742 '''
15743
15744 Also a minor API change:
15745 Result#createCompleteResult(List\<Result\>) is changed to Result#createCompleteResult(Iterable\<Result\>).
15746
15747
15748 ---
15749
15750 * [HBASE-17746](https://issues.apache.org/jira/browse/HBASE-17746) | *Major* | **TestSimpleRpcScheduler.testCoDelScheduling is broken**
15751
15752 The executor for CoDel is changed to FastPathBalancedQueueRpcExecutor
15753
15754
15755 ---
15756
15757 * [HBASE-17712](https://issues.apache.org/jira/browse/HBASE-17712) | *Major* | **Remove/Simplify the logic of RegionScannerImpl.handleFileNotFound**
15758
15759 Add a config named 'hbase.hregion.unassign.for.fnfe'. It is used to control whether to reopen a region when hitting FileNotFoundException. The default value is true.
15760
15761
15762 ---
15763
15764 * [HBASE-15941](https://issues.apache.org/jira/browse/HBASE-15941) | *Major* | **HBCK repair should not unsplit healthy splitted region**
15765
15766 A new option -removeParents is now available that will remove an old parent when two valid daughters for that parent exist and -fixHdfsOverlaps is used. If there is an issue trying to remove the parent from META or sidelining the parent from HDFS we will fallback to do a regular merge. For now this option only works when the overlap group consists only of 3 regions (a parent, daughter A and daughter B)
15767
15768
15769 ---
15770
15771 * [HBASE-17737](https://issues.apache.org/jira/browse/HBASE-17737) | *Major* | **Thrift2 proxy should support scan timeRange per column family**
15772
15773 Thrift2 proxy supports scan timeRange per column family
15774
15775
15776 ---
15777
15778 * [HBASE-17718](https://issues.apache.org/jira/browse/HBASE-17718) | *Major* | **Difference between RS's servername and its ephemeral node cause SSH stop working**
15779
15780 Fix our accidentally registering a RegionServer's ephermal znode BEFORE we checked in with the master.
15781
15782
15783 ---
15784
15785 * [HBASE-17717](https://issues.apache.org/jira/browse/HBASE-17717) | *Critical* | **Incorrect ZK ACL set for HBase superuser**
15786
15787 In previous versions of HBase, the system intended to set a ZooKeeper ACL on all "sensitive" ZNodes for the user specified in the hbase.superuser configuration property. Unfortunately, the ACL was malformed which resulted in the hbase.superuser being unable to access the sensitive ZNodes that HBase creates. This JIRA issue fixes this bug. HBase will automatically correct the ACLs on start so users do not need to manually correct the ACLs.
15788
15789
15790 ---
15791
15792 * [HBASE-17716](https://issues.apache.org/jira/browse/HBASE-17716) | *Minor* | **Formalize Scan Metric names**
15793
15794 HBASE-17716 breaks compatibility of ServerSideScanMetrics by changing public field names, and the issue is fixed through HBASE-17886
15795
15796
15797 ---
15798
15799 * [HBASE-15484](https://issues.apache.org/jira/browse/HBASE-15484) | *Blocker* | **Correct the semantic of batch and partial**
15800
15801 Now setBatch doesn't mean setAllowPartialResult(true)
15802 If user setBatch(5) and rpc returns 3+5+5+5+3 cells, we should return 5+5+5+5+1 to user.
15803 Scan#setBatch is helpful in paging queries, if you just want to prevent OOM at client, use setAllowPartialResults(true) is better.
15804 We deprecated isPartial and use mayHaveMoreCellsInRow. If it returns false, current Result must be the last one of this row.
15805
15806
15807 ---
15808
15809 * [HBASE-17312](https://issues.apache.org/jira/browse/HBASE-17312) | *Major* | **[JDK8] Use default method for Observer Coprocessors**
15810
15811 Deletes BaseMasterAndRegionObserver, BaseMasterObserver, BaseRegionObserver, BaseRegionServerObserver and BaseWALObserver.
15812 Their corresponding interface classes now use JDK8's 'default' keyword to provide empty/no-op implementations so that:
15813 1. Derived class don't break when more coprocessor hooks are added in future.
15814 2. Derived classes don't have to redundantly override functions they don't care about with empty implementations.
15815
15816 Earlier, BaseXXXObserver classes provided these exact two benefits, but with 'default' keyword in JDK8, they are not needed anymore.
15817
15818 To fix the breakages because of this change, simply change "Foo extends BaseXXXObserver" to "Foo implements XXXObserver".
15819
15820
15821 ---
15822
15823 * [HBASE-17647](https://issues.apache.org/jira/browse/HBASE-17647) | *Major* | **OffheapKeyValue#heapSize() implementation is wrong**
15824
15825 **WARNING: No release note provided for this change.**
15826
15827
15828 ---
15829
15830 * [HBASE-13718](https://issues.apache.org/jira/browse/HBASE-13718) | *Minor* | **Add a pretty printed table description to the table detail page of HBase's master**
15831
15832 <!-- markdown -->
15833
15834
15835 The table information page in the Master UI now includes a schema section that describes the column families defined for that table as well as any column family specific properties that are set.
15836
15837
15838 ---
15839
15840 * [HBASE-17472](https://issues.apache.org/jira/browse/HBASE-17472) | *Major* | **Correct the semantic of  permission grant**
15841
15842 Before this patch, later granted permissions will override previous granted permissions, and previous granted permissions LOST. this issue re-define grant semantic: for master branch, later granted permissions will merge with previous granted permissions.  for branch-1.4, grant keep override behavior for compatibility purpose, and a grant with mergeExistingPermission flag provided.
15843
15844
15845 ---
15846
15847 * [HBASE-17583](https://issues.apache.org/jira/browse/HBASE-17583) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan for sync client**
15848
15849 Now you can include/exlude the startRow and stopRow for a scan. And the new methods to specify startRow and stopRow are withStartRow and withStopRow. The old methods to specify startRow and Row(include constructors) are marked as deprecated as in the old time if startRow and stopRow are equal then we will consider it as a get scan and include the stopRow implicitly. This is strange after we can set inclusiveness explicitly so we add new methods and depredate the old methods. The deprecated methods will be removed in the future.
15850
15851
15852 ---
15853
15854 * [HBASE-9702](https://issues.apache.org/jira/browse/HBASE-9702) | *Major* | **Change unittests that use "table" or "testtable" to use method names.**
15855
15856 Changes all tests to use the TestName JUnit Rule everywhere rather than hardcode table/region/store names.
15857
15858
15859 ---
15860
15861 * [HBASE-17280](https://issues.apache.org/jira/browse/HBASE-17280) | *Minor* | **Add mechanism to control hbase cleaner behavior**
15862
15863 The HBase cleaner chore process cleans up old WAL files and archived HFiles. Cleaner operation can affect query performance when running heavy workloads, so disable the cleaner during peak hours. The cleaner has the following HBase shell commands:
15864
15865 - cleaner\_chore\_enabled: Queries whether cleaner chore is enabled/ disabled.
15866 - cleaner\_chore\_run: Manually runs the cleaner to remove files.
15867 - cleaner\_chore\_switch: enables or disables the cleaner and returns the previous state of the cleaner. For example, cleaner-switch true enables the cleaner.
15868
15869 Following APIs are added in Admin:
15870 - setCleanerChoreRunning(boolean on): Enable/Disable the cleaner chore
15871 - runCleanerChore(): Ask for cleaner chore to run
15872 - isCleanerChoreEnabled(): Query whether cleaner chore is enabled/ disabled.
15873
15874
15875 ---
15876
15877 * [HBASE-17599](https://issues.apache.org/jira/browse/HBASE-17599) | *Major* | **Use mayHaveMoreCellsInRow instead of isPartial**
15878
15879 The word 'isPartial' is ambiguous so we introduce a new method 'mayHaveMoreCellsInRow' to replace it. And the old meaning of 'isPartial' is not the same with 'mayHaveMoreCellsInRow' as for batched scan, if the number of returned cells equals to the batch, isPartial will be false. After this change the meaning of 'isPartial' will be same with 'mayHaveMoreCellsInRow'. This is an incompatible change but it is not likely to break a lot of things as for batched scan the old 'isPartial' is just a redundant information, i.e, if the number of returned cells reaches the batch limit. You have already know the number of returned cells and the value of batch.
15880
15881
15882 ---
15883
15884 * [HBASE-17437](https://issues.apache.org/jira/browse/HBASE-17437) | *Major* | **Support specifying a WAL directory outside of the root directory**
15885
15886 This patch adds support for specifying a WAL directory outside of the HBase root directory.
15887
15888 Multiple configuration variables were added to accomplish this:
15889 hbase.wal.dir: used to configure where the root WAL directory is located. Could be on a different FileSystem than the root directory. WAL directory can not be set to a subdirectory of the root directory. The default value of this is the root directory if unset.
15890
15891 hbase.rootdir.perms: Configures FileSystem permissions to set on the root directory. This is '700' by default.
15892
15893 hbase.wal.dir.perms: Configures FileSystem permissions to set on the WAL directory FileSystem. This is '700' by default.
15894
15895
15896 ---
15897
15898 * [HBASE-17350](https://issues.apache.org/jira/browse/HBASE-17350) | *Critical* | **Fixup of regionserver group-based assignment**
15899
15900 A few bug fixes and tweaks to the fsgroup feature.
15901
15902 Renamed shell command move\_rsgroup\_servers as move\_servers\_rsgroup
15903 Renamed shell comand move\_rsgroup\_tables as move\_tables\_rsgroup
15904
15905 Made the 'default' group more 'dynamic'; i.e. dead servers no longer show in the 'default' group.
15906
15907
15908 ---
15909
15910 * [HBASE-17578](https://issues.apache.org/jira/browse/HBASE-17578) | *Major* | **Thrift per-method metrics should still update in the case of exceptions**
15911
15912 In prior versions, the HBase Thrift handlers failed to increment per-method metrics when an exception was encountered.  These metrics will now always be incremented, whether an exception is encountered or not.  This change also adds exception-type metrics, similar to those exposed in regionservers, for individual exceptions which are received by the Thrift handlers.
15913
15914
15915 ---
15916
15917 * [HBASE-17508](https://issues.apache.org/jira/browse/HBASE-17508) | *Major* | **Unify the implementation of small scan and regular scan for sync client**
15918
15919 Now the scan.setSmall method is deprecated. Consider using scan.setLimit and scan.setReadType in the future. And we will open scanner lazily when you call scanner.next. This is an incompatible change which delays the table existence check and permission check.
15920
15921
15922 ---
15923
15924 * [HBASE-16981](https://issues.apache.org/jira/browse/HBASE-16981) | *Major* | **Expand Mob Compaction Partition policy from daily to weekly, monthly**
15925
15926 Mob compaction partition policy can be set by
15927 hbase\> create 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'weekly'}
15928
15929 or
15930
15931 hbase\> alter 't1', {NAME =\> 'f1', IS\_MOB =\> true, MOB\_THRESHOLD =\> 1000000, MOB\_COMPACT\_PARTITION\_POLICY =\> 'monthly'}
15932
15933 Available MOB\_COMPACT\_PARTITION\_POLICY options are "daily", "weekly" and "monthly", the default is "daily".
15934
15935 When it is "weekly" policy, the mob compaction will try to compact files within one calendar week into one for a specific partition, similar for "daily" and "monthly".
15936
15937 With "weekly" policy, one mob file normally is compacted twice during its lifetime (that is first on daily basis and then all such daily based compacted files belonging to a week at the weekly interval), for one region, there normally are 52 files for one year. With "Monthly" policy, one mob file normally is compacted 3 times during its lifetime (First daily and then weekly followed by monthly at end of every month) and normally there are 12 files for one year.
15938
15939
15940 ---
15941
15942 * [HBASE-17197](https://issues.apache.org/jira/browse/HBASE-17197) | *Major* | **hfile does not work in 2.0**
15943
15944 The -f argument is no longer required specifying target file; just pass the file as an argument.
15945
15946
15947 ---
15948
15949 * [HBASE-16812](https://issues.apache.org/jira/browse/HBASE-16812) | *Minor* | **Clean up the locks in MOB**
15950
15951 In MOB-enabled column family, the lock in the major compaction is removed. All the delete markers are retained in the major compaction, and a MOB reference tag is appended to each of the retained delete markers.
15952
15953
15954 ---
15955
15956 * [HBASE-12894](https://issues.apache.org/jira/browse/HBASE-12894) | *Critical* | **Upgrade Jetty to 9.2.6**
15957
15958 Upgrades Jetty to 9.x from 6.x (Jetty9 is in different namespace from Jetty6). Also updated Jersey to 2.x and Servlet to 3.x.
15959
15960
15961 ---
15962
15963 * [HBASE-17566](https://issues.apache.org/jira/browse/HBASE-17566) | *Major* | **Jetty upgrade fixes**
15964
15965 Fix inability at finding static content post push of parent issue moving us to jetty9.
15966
15967
15968 ---
15969
15970 * [HBASE-9774](https://issues.apache.org/jira/browse/HBASE-9774) | *Major* | **HBase native metrics and metric collection for coprocessors**
15971
15972 This issue adds two new modules, hbase-metrics and hbase-metrics-api which define and implement the "new" metric system used internally within HBase. These two modules (and some other code in hbase-hadoop2-compat) module are referred as "HBase metrics framework" which is HBase-specific and independent of any other metrics library (including Hadoop metrics2 and dropwizards metrics).
15973
15974 HBase Metrics API (hbase-metrics-api) contains the interface that HBase exposes internally and to third party code (including coprocessors). It is a thin
15975 abstraction over the actual implementation for backwards compatibility guarantees. The metrics API in this hbase-metrics-api module is inspired by the Dropwizard metrics 3.1 API, however, the API is completely independent.
15976
15977 hbase-metrics module contains implementation of the "HBase Metrics API", including MetricRegistry, Counter, Histogram, etc. These are highly concurrent implementations of the Metric interfaces. Metrics in HBase are grouped into different sets (like WAL, RPC, RegionServer, etc). Each group of metrics should be tracked via a MetricRegistry specific to that group.
15978
15979 Historically, HBase has been using Hadoop's Metrics2 framework [3] for collecting and reporting the metrics internally. However, due to the difficultly of dealing with the Metrics2 framework, HBase is moving away from Hadoop's metrics implementation to its custom implementation. The move will happen incrementally, and during the time, both Hadoop Metrics2-based metrics and hbase-metrics module based classes will be in the source code. All new implementations for metrics SHOULD use the new API and framework.
15980
15981 This jira also introduces the metrics API to coprocessor implementations. Coprocessor writes can export custom metrics using the API and have those collected via metrics2 sinks, as well as exported via JMX in regionserver metrics.
15982
15983 More documentation available at: hbase-metrics-api/README.txt
15984
15985
15986 ---
15987
15988 * [HBASE-17491](https://issues.apache.org/jira/browse/HBASE-17491) | *Major* | **Remove all setters from HTable interface and introduce a TableBuilder to build Table instance**
15989
15990 After HBASE-17491 all setter methods in HTable are marked as deprecated, moved into TableBuilder, and will be removed later.
15991
15992
15993 ---
15994
15995 * [HBASE-17067](https://issues.apache.org/jira/browse/HBASE-17067) | *Major* | **Procedure v2 - remove tryAcquire\*Lock and use wait/wake to make framework event based**
15996
15997 Make the framework more 'lively'; undo 'suspend' notion in Procedure, rely on eventing mechanism instead. Lets us remove no longer needed synchronizations. Framework can now do more ops per second.
15998
15999
16000 ---
16001
16002 * [HBASE-16698](https://issues.apache.org/jira/browse/HBASE-16698) | *Major* | **Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload**
16003
16004 Assign sequenceid to an edit before we go on the ringbuffer; undoes contention on WALKey latch. Adds a new config "hbase.hregion.mvcc.preassign" which defaults to true: i.e. this speedup is enabled.
16005
16006 User could set this per-table level, like:
16007 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hregion.mvcc.preassign'=\>'false'}}
16008
16009
16010 ---
16011
16012 * [HBASE-17488](https://issues.apache.org/jira/browse/HBASE-17488) | *Trivial* | **WALEdit should be lazily instantiated**
16013
16014 prevent creating unused objects in the WALEdit's construction.
16015 +If the cp#preBatchMutate returns true, the WALEdit is useless. So we should create the WALEdit after step 2.
16016 +The cells came from cp should be counted because they are added into the WALEdit . The use case is the local index of phoenix
16017 +If the mutation contains the SKIP\_WAL property, its cells aren't added into the WALEdit. So these cells shouldn't be counted.
16018
16019
16020 ---
16021
16022 * [HBASE-16831](https://issues.apache.org/jira/browse/HBASE-16831) | *Minor* | **Procedure V2 - Remove org.apache.hadoop.hbase.zookeeper.lock**
16023
16024 Purges code that did zk-hosted locks for table ops (we do procedure-based locks now)
16025
16026
16027 ---
16028
16029 * [HBASE-16867](https://issues.apache.org/jira/browse/HBASE-16867) | *Major* | **Procedure V2 - Check ACLs for remote HBaseLock**
16030
16031 Add checking ACL when taking locks.
16032
16033
16034 ---
16035
16036 * [HBASE-16786](https://issues.apache.org/jira/browse/HBASE-16786) | *Major* | **Procedure V2 - Move ZK-lock's uses to Procedure framework locks (LockProcedure)**
16037
16038 Move locking to be procedure (Pv2) rather than zookeeper based. All locking moved over to new infrastructure including MOBing locking.
16039
16040
16041 ---
16042
16043 * [HBASE-17470](https://issues.apache.org/jira/browse/HBASE-17470) | *Major* | **Remove merge region code from region server**
16044
16045 In 1.x branches, Admin.mergeRegions calls MASTER via dispatchMergingRegions RPC; when executing dispatchMergingRegions RPC, MASTER calls RS via MergeRegions to complete the merge in RS-side.
16046
16047 With HBASE-16119, the merge logic moves to master-side.  This JIRA cleans up unused RPCs (dispatchMergingRegions and MergeRegions) , removes dangerous tools such as Merge and HMerge, and deletes unused RegionServer-side merge region logic in 2.0 release.
16048
16049
16050 ---
16051
16052 * [HBASE-16744](https://issues.apache.org/jira/browse/HBASE-16744) | *Major* | **Procedure V2 - Lock procedures to allow clients to acquire locks on tables/namespaces/regions**
16053
16054  Lock for HBase Entity either a Table, a Namespace, or Regions.
16055
16056 These are remote locks which live on master, and need periodic heartbeats to keep them alive. (Once we request the lock, internally an heartbeat thread will be started). If master doesn't receive the heartbeat in time, it'll release the lock and make it available to other users.
16057
16058 Use {@link LockServiceClient} to build instances. Then call {@link #requestLock()}. {@link #requestLock} will contact master to queue the lock and start the heartbeat thread which will check lock's status periodically and once the lock is acquired, it will send the heartbeats to the master.
16059
16060 Use {@link #await} or {@link #await(long, TimeUnit)} to wait for the lock to be acquired. Always call {@link #unlock()} irrespective of whether lock was acquired or not. If the lock was acquired, it'll be released. If it was not acquired, it is possible that master grants the lock in future and the heartbeat thread keeps it alive forever by sending heartbeats. Calling {@link #unlock()} will stop the heartbeat thread and cancel the lock queued on master.
16061
16062 There are 4 ways in which these remote locks may be released/can be lost:
16063   \* Call {@link #unlock}.
16064   \* Lock times out on master: Can happen because of network issues, GC pauses, etc. Worker thread will call the given abortable as soon as it detects such a situation. Fail to contact master: If worker thread can not contact mater and thus fails to send heartbeat before the timeout expires, it assumes that lock is lost and calls the
16065  \*     abortable.
16066 Worker thread is interrupted.
16067
16068 Use example:
16069
16070  EntityLock lock = lockServiceClient.\*Lock(...., "exampled lock", abortable);
16071   lock.requestLock();
16072   ....
16073    ....can do other initializations here since lock is 'asynchronous'...
16074  ....
16075  if (lock.await(timeout)) {
16076     ....logic requiring mutual exclusion
16077   }
16078    lock.unlock();
16079
16080
16081 ---
16082
16083 * [HBASE-14061](https://issues.apache.org/jira/browse/HBASE-14061) | *Major* | **Support CF-level Storage Policy**
16084
16085 After HBASE-14061 we support to set storage policy for HFile through "hbase.hstore.block.storage.policy" configuration, and we support CF-level setting to override the settings from configuration file. Currently supported storage policies include ALL\_SSD/ONE\_SSD/HOT/WARM/COLD, refer to http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html for more details
16086
16087 For example, to create a table with two families: "cf1" with "ALL\_SSD" storage policy and "cf2" with "ONE\_SSD", we could use below command in hbase shell:
16088 create 'table',{NAME=\>'f1',STORAGE\_POLICY=\>'ALL\_SSD'},{NAME=\>'f2',STORAGE\_POLICY=\>'ONE\_SSD'}
16089
16090 We could also set the configuration in table attribute like all other configurations:
16091 create 'table',{NAME=\>'f1',CONFIGURATION=\>{'hbase.hstore.block.storage.policy'=\>'ONE\_SSD'}}
16092
16093
16094 ---
16095
16096 * [HBASE-17337](https://issues.apache.org/jira/browse/HBASE-17337) | *Major* | **list replication peers request should be routed through master**
16097
16098 List replication peers request will be roughed through master.
16099
16100
16101 ---
16102
16103 * [HBASE-15172](https://issues.apache.org/jira/browse/HBASE-15172) | *Major* | **Support setting storage policy in bulkload**
16104
16105 After HBASE-15172/HBASE-19016 we could set storage policy through "hbase.hstore.block.storage.policy" property for bulkload, or "hbase.hstore.block.storage.policy.\<family\_name\>" for a specified family. Supported storage policy includes: ALL\_SSD, ONE\_SSD, HOT, WARM, COLD, etc.
16106
16107
16108 ---
16109
16110 * [HBASE-17336](https://issues.apache.org/jira/browse/HBASE-17336) | *Major* | **get/update replication peer config requests should be routed through master**
16111
16112 Get/update replication peer config requests will be routed through master.
16113
16114
16115 ---
16116
16117 * [HBASE-17320](https://issues.apache.org/jira/browse/HBASE-17320) | *Major* | **Add inclusive/exclusive support for startRow and endRow of scan**
16118
16119 Now you can specific the inclusive of startRow and stopRow for a scan using the new methods withStartRow(byte[] startRow, boolean inclusive) and withStopRow(byte[] stopRow, boolean inclusive). The old setStartRow and setStopRow methods, and the constructors are marked as deprecated because of an strange behavior that we will include the stopRow implicitly if startRow equals to stopRow. This is used to support get scan in the old time. Use withStartRow and withStopRow instead.
16120
16121 For developers, the ConnectionUtils.createClosestRowBefore is also marked as deprecated as the row returned by this method is only very very close to the current row, not closest. Avoid using this method in the future.
16122
16123
16124 ---
16125
16126 * [HBASE-17314](https://issues.apache.org/jira/browse/HBASE-17314) | *Major* | **Limit total buffered size for all replication sources**
16127
16128 Add a conf "replication.total.buffer.quota" to limit total size of buffered entries in all replication peers. It will prevent server getting OOM if there are many peers. Default value is 256MB.
16129
16130
16131 ---
16132
16133 * [HBASE-17174](https://issues.apache.org/jira/browse/HBASE-17174) | *Minor* | **Refactor the AsyncProcess, BufferedMutatorImpl, and HTable**
16134
16135 + cleanup some unused code
16136 + allow being able to share pool between BufferedMutatorImpl
16137 + setting "hbase.client.request.controller.impl" to the name of the alternate RequestController (traffic control) implementation class in Configuration
16138 + The default RequestController implementation is SimpleRequestController
16139 + setting "hbase.client.log.detail.period.ms" to call logger on a period when waiting for tasks to complete
16140
16141
16142 ---
16143
16144 * [HBASE-17335](https://issues.apache.org/jira/browse/HBASE-17335) | *Major* | **enable/disable replication peer requests should be routed through master**
16145
16146 Enable/Disable replication peer requests will be routed through master.
16147
16148
16149 ---
16150
16151 * [HBASE-5401](https://issues.apache.org/jira/browse/HBASE-5401) | *Major* | **PerformanceEvaluation generates 10x the number of expected mappers**
16152
16153 Changes how many tasks PE runs when clients are mapreduce. Now tasks == client count. Previous we hardcoded ten tasks per client instance.
16154
16155
16156 ---
16157
16158 * [HBASE-11392](https://issues.apache.org/jira/browse/HBASE-11392) | *Critical* | **add/remove peer requests should be routed through master**
16159
16160 Add/Remove replication peer requests will be routed through master. And make ReplicationAdmin as Deprecated.
16161
16162
16163 ---
16164
16165 * [HBASE-15924](https://issues.apache.org/jira/browse/HBASE-15924) | *Major* | **Enhance hbase services autorestart capability to hbase-daemon.sh**
16166
16167 Now one can start hbase services with enabled "autostart/autorestart" feature in controlled fashion with the help of "--autostart-window-size" to define the window period and the "--autostart-window-retry-limit" to define the number of times the hbase services have to be restarted upon being killed/terminated abnormally within the provided window perioid.
16168
16169 The following cases are supported with "autostart/autorestart":
16170
16171 a) --autostart-window-size=0 and --autostart-window-retry-limit=0, indicates infinite window size and no retry limit
16172 b) not providing the args, will default to a)
16173 c) --autostart-window-size=0 and --autostart-window-retry-limit=\<positive value\> indicates the autostart process to bail out if the retry limit exceeds irrespective of window period
16174 d) --autostart-window-size=\<x\> and --autostart-window-retry-limit=\<y\> indicates the autostart process to bail out if the retry limit "y" is exceeded for the last window period "x".
16175
16176
16177 ---
16178
16179 * [HBASE-17331](https://issues.apache.org/jira/browse/HBASE-17331) | *Minor* | **Avoid busy waiting in ThrottledInputStream**
16180
16181 For each read(), old ThrottledInputStream sleeps/wakes/checks for many times for controlling the throughput. After this patch, ThrottledInputStream sleeps/wakes/checks only once. So we can reduce CPU usage.
16182
16183
16184 ---
16185
16186 * [HBASE-17296](https://issues.apache.org/jira/browse/HBASE-17296) | *Major* | **Provide per peer throttling for replication**
16187
16188 Provide per peer throttling for replication. Add the bandwidth upper limit to ReplicationPeerConfig and a new shell cmd set\_peer\_bandwidth to update the bandwidth in need.
16189
16190
16191 ---
16192
16193 * [HBASE-17277](https://issues.apache.org/jira/browse/HBASE-17277) | *Major* | **Allow alternate BufferedMutator implementation**
16194
16195 Specify the name of an alternate BufferedMutator implementation by either:
16196
16197  \* Setting "hbase.client.bufferedmutator.classname" to the name of the alternate implementation class in Configuration
16198  \* Or, by setting BufferedMutatorParams#implementationClassName and passing the amended BufferedMutatorParams when calling Connection#getBufferedMutator.
16199
16200
16201 ---
16202
16203 * [HBASE-17294](https://issues.apache.org/jira/browse/HBASE-17294) | *Major* | **External Configuration for Memory Compaction**
16204
16205 This patch provides a single external knob to control memstore compaction. It also inmemory compaction with BASIC policy as our default (AFTERWORD: inmemory compaction as default was undone in HBASE-17333 because of test failures; will be reenabled in later, dedicated issue)
16206
16207 Possible memstore compaction policies are:
16208 (1) None - no memory compaction, when size threshold is exceeded data is flushed to disk
16209 (2) Basic policy applies optimizations which modify the index to a more compacted representation. This is beneficial in all access patterns. The smaller the cells are the greater the benefit of this policy. This is the default policy.
16210 (3) Eager - in addition to compacting the index representation as the basic policy, eager policy eliminates duplication while the data is still in memory (much like the on-disk compaction does after the data is flushed to disk). This policy is most useful for applications with high data churn or small working sets.
16211
16212 Memory compaction policeman be set at the column family level at table creation time:
16213 {code}
16214 create ‘\<tablename\>’,
16215    {NAME =\> ‘\<cfname\>’,
16216     IN\_MEMORY\_COMPACTION =\> ‘\<NONE\|BASIC\|EAGER\>’}
16217 {code}
16218 or as a property at the global configuration level by setting the property in hbase-site.xml, with BASIC being the default value:
16219 {code}
16220 \<property\>
16221         \<name\>hbase.hregion.compacting.memstore.type\</name\>
16222         \<value\>\<NONE\|BASIC\|EAGER\>\</value\>
16223 \</property\>
16224 {code}
16225 The values used in this property can change as memstore compaction policies evolve over time.
16226
16227
16228 ---
16229
16230 * [HBASE-16336](https://issues.apache.org/jira/browse/HBASE-16336) | *Major* | **Removing peers seems to be leaving spare queues**
16231
16232 Add a ReplicationZKNodeCleaner periodically check and delete the useless replication queue zk node belong to the peer which is not exist.
16233
16234
16235 ---
16236
16237 * [HBASE-17272](https://issues.apache.org/jira/browse/HBASE-17272) | *Major* | **Doc how to run Standalone HBase over an HDFS instance; all daemons in one JVM but persisting to an HDFS instance**
16238
16239 Adds section at http://hbase.apache.org/book.html#standalone.over.hdfs on how to make standalone persist to an hdfs instance (where standalone is all daemons in the one jvm).
16240
16241
16242 ---
16243
16244 * [HBASE-16700](https://issues.apache.org/jira/browse/HBASE-16700) | *Minor* | **Allow for coprocessor whitelisting**
16245
16246 Provides ability to restrict table coprocessors based on HDFS path whitelist. (Particularly useful for allowing Phoenix coprocessors but not arbitrary user created coprocessors.)
16247
16248
16249 ---
16250
16251 * [HBASE-17221](https://issues.apache.org/jira/browse/HBASE-17221) | *Major* | **Abstract out an interface for RpcServer.Call**
16252
16253 Provide an interface RpcCall on the server side.
16254 RpcServer.Call now is marked as @InterfaceAudience.Private, and implements the interface RpcCall,
16255
16256
16257 ---
16258
16259 * [HBASE-16119](https://issues.apache.org/jira/browse/HBASE-16119) | *Major* | **Procedure v2 - Reimplement merge**
16260
16261 The merge region logic is controlled by master in 2.0.0 (in 1.x, the core merge region logic is in the region server side).  The coprocessors related to merge region in RS-side would be no-op in 2.0.0 and later release.  Therefore, this is an incompatible change.  Users needs to move the CP logic to new master CP and registers them.
16262
16263 A new mergeRegionsAsync() API is added in client.  The existing mergeRegions() API will call the new API so client does not have to change its code.
16264
16265
16266 ---
16267
16268 * [HBASE-17112](https://issues.apache.org/jira/browse/HBASE-17112) | *Major* | **Prevent setting timestamp of delta operations the same as previous value's**
16269
16270 Before this issue, two concurrent Increments/Appends done in same millisecond or RS's clock going back will result in two results have same TS, which is not friendly to versioning and will get wrong result in slave cluster if the replication is disordered.
16271 After this issue, the result of Increment/Append will always have an incremental TS. There is no any inconsistent in replication for these operations. But there is a rare case that if there is a Delete in same millisecond, the later result can not be masked by this Delete. This can be fixed after we have new semantics that previous Delete will never mask later Put even its timestamp is higher.
16272
16273
16274 ---
16275
16276 * [HBASE-17181](https://issues.apache.org/jira/browse/HBASE-17181) | *Minor* | **Let HBase thrift2 support TThreadedSelectorServer**
16277
16278 Add TThreadedSelectorServer support for HBase Thrift2
16279
16280
16281 ---
16282
16283 * [HBASE-17178](https://issues.apache.org/jira/browse/HBASE-17178) | *Major* | **Add region balance throttling**
16284
16285 Add region balance throttling. Master execute every region balance plan per balance interval, which is equals to divide max balancing time by the size of region balance plan. And Introduce a new config hbase.master.balancer.maxRitPercent to protect availability. If config this to 0.01, then the max percent of regions in transition is 1% when balancing. Then the cluster's availability is at least 99% when balancing.
16286
16287
16288 ---
16289
16290 * [HBASE-15786](https://issues.apache.org/jira/browse/HBASE-15786) | *Major* | **Create DBB backed MSLAB pool**
16291
16292 Added a new config hbase.regionserver.offheap.global.memstore.size using which one can specify the global off heap limit that all memstores can use.  When this config is in MSLAB should be turned ON and we will use the entire size for the MSLAB pool. It will make off heap chunks and pool then. It will behave as if we are working with off heap memstores.  When this config is having a valid value and MSLAB is turned OFF, the system will just ignore the offheap size and continue to use global max heap space % for memstores and work with on heap memstores.
16293
16294
16295 ---
16296
16297 * [HBASE-17132](https://issues.apache.org/jira/browse/HBASE-17132) | *Major* | **Cleanup deprecated code for WAL**
16298
16299 Remove HLogKey and related classes and methods. Remove SequenceFile based log reader and writer. WALObserver and RegionObserver are changed so this is an incompatible change.
16300
16301
16302 ---
16303
16304 * [HBASE-16169](https://issues.apache.org/jira/browse/HBASE-16169) | *Major* | **Make RegionSizeCalculator scalable**
16305
16306 Added couple of API's to Admin.java:
16307
16308 Returns region load map of all regions hosted on a region server
16309 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn) throws IOException;
16310
16311 Returns region load map of all regions of a table hosted on a region server
16312 Map\<byte[], RegionLoad\> getRegionLoad(ServerName sn, TableName tableName) throws IOException
16313
16314 Added an API to region server:
16315
16316 public GetRegionLoadResponse getRegionLoad(RpcController controller,
16317     GetRegionLoadRequest request) throws ServiceException;
16318
16319 Primary intention is to use this API for RegionSizeCalculator and not rely on Master for ClusterStatus. On large clusters, ClusterStatus() can take a long time. IfMaster is down/busy, then some of the jobs timeout/fail. Other possible uses:
16320 1. If there is a lighter version of GetClusterStatus API (i.e without the ServerLoad for each RS), then custom maintenance tools can be better. In current world ClusterStatus is heavy. With the new APIs, each API's payload is smaller and distributed. So custom tools can call getRegionLoad() when needed, it will be more accurate. This helps with large clusters. For tools that don't need RegionLoad, the lighter version of API is fine enough.
16321 2. Another use case is a tool like RSTop - since we can see selective metrics at RegionLevel (possibly even deltas between each RPC to the server).
16322
16323
16324 ---
16325
16326 * [HBASE-15788](https://issues.apache.org/jira/browse/HBASE-15788) | *Major* | **Use Offheap ByteBuffers from BufferPool to read RPC requests.**
16327
16328 Using the ByteBuffers from ByteBufferPool to read the request bytes at server.  When the size of the request is smaller than 1/6th size of a BB in the pool, we will not use that but read into an on demand created, proper sized on heap ByteBuffer.
16329
16330
16331 ---
16332
16333 * [HBASE-17046](https://issues.apache.org/jira/browse/HBASE-17046) | *Major* | **Add 1.1 doc to hbase.apache.org**
16334
16335 Adds a 1.1. item to our 'Documentation and API' tab. Gives access to 1.1 APIs, XRef, etc.
16336
16337
16338 ---
16339
16340 * [HBASE-16962](https://issues.apache.org/jira/browse/HBASE-16962) | *Major* | **Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API**
16341
16342 The following RegionObserver methods are deprecated
16343
16344 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16345     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s)
16346     throws IOException;
16347
16348 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16349     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16350     final long earliestPutTs, final InternalScanner s, CompactionRequest request)
16351
16352 Instead, use the following methods:
16353
16354 InternalScanner preFlushScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16355     final Store store, final KeyValueScanner memstoreScanner, final InternalScanner s,
16356     final long readPoint) throws IOException;
16357
16358 InternalScanner preCompactScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
16359     final Store store, List\<? extends KeyValueScanner\> scanners, final ScanType scanType,
16360     final long earliestPutTs, final InternalScanner s, final CompactionRequest request,
16361     final long readPoint) throws IOException
16362
16363
16364 ---
16365
16366 * [HBASE-17017](https://issues.apache.org/jira/browse/HBASE-17017) | *Major* | **Remove the current per-region latency histogram metrics**
16367
16368 Removes per-region level (get size, get time, scan size and scan time histogram) metrics that was exposed before. Per-region histogram metrics with 1000+ regions causes millions of objects to be allocated on heap. The patch introduces getCount and scanCount as counters rather than histograms. Other per-region level metrics are kept as they are.
16369
16370
16371 ---
16372
16373 * [HBASE-16955](https://issues.apache.org/jira/browse/HBASE-16955) | *Major* | **Fixup precommit protoc check to do new distributed protos and pb 3.1.0 build**
16374
16375 Test that environment no longer has to have protoc (2.5 and 3.1) available. Needed small adjustment in yetus protoc build but otherwise all works.
16376
16377
16378 ---
16379
16380 * [HBASE-17050](https://issues.apache.org/jira/browse/HBASE-17050) | *Minor* | **Upgrade Apache CLI version from 1.2 to 1.3.1**
16381
16382 Upgrade Apache CLI version from 1.2 to 1.3.1.
16383
16384 These are few good/important changes included in this update:
16385 - HelpFormatter now prints command-line options in the same order as they
16386   have been added. Fixes CLI-212.
16387 - Standard help text now shows mandatory arguments also for the first
16388   option. Fixes CLI-186.
16389 - A new parser is available: DefaultParser. It combines the features of the
16390   GnuParser and the PosixParser. It also provides additional features like
16391   partial matching for the long options, and long options without separator
16392   (i.e like the JVM memory settings: -Xmx512m). This new parser deprecates
16393   the previous ones. Fixes CLI-161,CLI-167,CLI-181.
16394
16395 For full list of changes:
16396   https://commons.apache.org/proper/commons-cli/changes-report.html#a1.3
16397
16398
16399 ---
16400
16401 * [HBASE-15513](https://issues.apache.org/jira/browse/HBASE-15513) | *Major* | **hbase.hregion.memstore.chunkpool.maxsize is 0.0 by default**
16402
16403 MSLAB chunk pool is on by default in hbase-2.0.0.
16404
16405
16406 ---
16407
16408 * [HBASE-16972](https://issues.apache.org/jira/browse/HBASE-16972) | *Major* | **Log more details for Scan#next request when responseTooSlow**
16409
16410 **WARNING: No release note provided for this change.**
16411
16412
16413 ---
16414
16415 * [HBASE-17014](https://issues.apache.org/jira/browse/HBASE-17014) | *Minor* | **Add clearly marked starting and shutdown log messages for all services.**
16416
16417 Delimit START, STOP, and ABORT messages with '\*\*\*\*\*' so denote.
16418
16419
16420 ---
16421
16422 * [HBASE-16765](https://issues.apache.org/jira/browse/HBASE-16765) | *Critical* | **New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables.**
16423
16424 Introduces a new split policy: SteppingSplitPolicy
16425 This will use a simple step function to split a region at (by default) 2  xflushSize when no other region of the same table is seen on the region server, or max-file-size when one or more other regions of the same table is seen.
16426
16427 In HBase 2.0 this is going to be the default. In previous versions it can be configured.
16428
16429
16430 ---
16431
16432 * [HBASE-16608](https://issues.apache.org/jira/browse/HBASE-16608) | *Major* | **Introducing the ability to merge ImmutableSegments without copy-compaction or SQM usage**
16433
16434 The index-compation and data-compaction variants of CompactingMemStore are introduced. In both types the active (mutable) segment is periodically flushed-in-memory and is added as immutable segment in the compaction pipeline. The CompactingMemStore of index-compaction type is merging all immutable segments of the compacting pipeline into one. The merging of N segments is explained below. The CompactingMemStore of data-compaction type is compacting all immutable segments of the compacting pipeline into one. After the merge/compaction the old segments in the compacting pipeline are replaced with one new.
16435
16436 Before explaining the process of merging N old segments into new one, note that segment structure includes ordered index that allows traversing the cells data efficiently. The merge is copying the ordered indexes of the old segments into one ordered index of new segment. No data is copied, no cells are filtered. Alternatively, in the process of compacting N old segments into new one, both data and index are copied. The old cells are filtered, meaning upon compaction unused versions of the cells are not copied so the new segment has less data then all old ones.
16437
16438 This issue introduces only the merging ability and simplifies the user intervention for switching between types. The previous CompactingMemStore structure was added by HBASE-16420 and HBASE-16421. The future refinements of the policy or merging/compacting will come in HBASE-16417.
16439
16440 In order to create a table with CompactingMemStore as a MemStore one should use:
16441 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16442 IN\_MEMORY\_COMPACTION default is false, so table created as following will have the known DefaultMemStore as a MemStore.
16443 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’}
16444
16445 The default type of CompactingMemStore is index-compaction. In order to change it to data-compaction one should add to the hbase-site.xml
16446 \<property\>
16447     \<name\>hbase.hregion.compacting.memstore.type\</name\>
16448     \<value\>data-compaction\</value\>
16449   \</property\>
16450
16451 in addition to creating the table as following
16452 create ‘\<tablename\>’, {NAME =\> ‘\<cfname\>’, IN\_MEMORY\_COMPACTION =\> true}
16453
16454
16455 ---
16456
16457 * [HBASE-16747](https://issues.apache.org/jira/browse/HBASE-16747) | *Major* | **Track memstore data size and heap overhead separately**
16458
16459 Marking it as incompatible change as there is a change in behavior for region flush decision. The default flush size of 128 MB per region was tracked against both actual data bytes size + overhead of these cells in memstore memory (Overhead because of Cell java objects and CSLM entry).  As part of this jira we will keep track of cell data size only in region level.  So 128 MB flush size means, 128 MB of cell data bytes (key+ value+..)
16460
16461 Globally we will track cell data size and heap overhead separately and will consider both for forced flushes. We will not allow over consume of heap memory by all memstore. This is as old case. Only tracking way is changed.
16462
16463
16464 ---
16465
16466 * [HBASE-16974](https://issues.apache.org/jira/browse/HBASE-16974) | *Minor* | **Update os-maven-plugin to 1.4.1.final+ for building shade file on RHEL/CentOS**
16467
16468 Upgrade os-maven-plugin mvn extension which figures the os we are running on from 1.4 to 1.5.
16469
16470
16471 ---
16472
16473 * [HBASE-16952](https://issues.apache.org/jira/browse/HBASE-16952) | *Major* | **Replace hadoop-maven-plugins with protobuf-maven-plugin for building protos**
16474
16475 Simplifies .proto manipulations. One step only now -- no need to keep pom.xml listing up to date with the protobuf protos directory content -- and no need to preinstall protoc; mvn does it all for you now.
16476
16477
16478 ---
16479
16480 * [HBASE-14551](https://issues.apache.org/jira/browse/HBASE-14551) | *Minor* | **Procedure v2 - Reimplement split**
16481
16482 Moved the Split Region logic to Master and most of split region coprocessor is in master now.  Need to change dependency such as Phoenix.
16483
16484
16485 ---
16486
16487 * [HBASE-15789](https://issues.apache.org/jira/browse/HBASE-15789) | *Major* | **PB related changes to work with offheap**
16488
16489 This issue adds a patch to our checked in internal, shaded protobuf, but it also adds a general means of apply patches to our version of protobuf. Patches found in the new src/main/patches directory are all applied as the last task when you run a build with the -Pcompile-protobuf profile under the hbase-protocol-shaded module. This commit also includes our first patch to protobuf; it adds ByteInput to mimic pb3.1's ByteOutput (src/main/patches/HBASE-15789\_V2.patch attached here).
16490
16491
16492 ---
16493
16494 * [HBASE-16930](https://issues.apache.org/jira/browse/HBASE-16930) | *Major* | **AssignmentManager#checkWals() function can recur infinitely**
16495
16496 Fixed potential infinite recursion in AssignmentManager.checkWals().
16497
16498
16499 ---
16500
16501 * [HBASE-16463](https://issues.apache.org/jira/browse/HBASE-16463) | *Major* | **Improve transparent table/CF encryption with Commons Crypto**
16502
16503 Improve transparent table/CF encryption with Commons Crypto. The change introduces a new optional CryptoCipherProvider (CommonsCryptoAES) for transparent table/CF encryption. And the encryption performance would be accelerated by hardware in modern CPU (AES-NI). This feature could be enabled by updating the configuration "hbase.crypto.cipherprovider" to "org.apache.hadoop.hbase.io.crypto.CryptoCipherProvider" in hbase-site.xml. For detailed information about transparent table/CF encryption including configuration examples see the Security section of the HBase manual.
16504
16505
16506 ---
16507
16508 * [HBASE-16414](https://issues.apache.org/jira/browse/HBASE-16414) | *Major* | **Improve performance for RPC encryption with Apache Common Crypto**
16509
16510 With the security RPC and encryption enabled, introduce Apache Commons Crypto to do the encryption/decryption which supports both supports both JCE Cipher and OpenSSL Cipher. Adds new configs "hbase.rpc.crypto.encryption.aes.enabled" which defaults to false, and "hbase.rpc.crypto.encryption.aes.cipher.class" which defaults to "org.apache.commons.crypto.cipher.JceCipher" to support JCE Cipher, it also can be set as "org.apache.hadoop.crypto.OpensslCipher" to support Openssl Cipher.
16511
16512
16513 ---
16514
16515 * [HBASE-16721](https://issues.apache.org/jira/browse/HBASE-16721) | *Critical* | **Concurrency issue in WAL unflushed seqId tracking**
16516
16517 Fixed a bug in sequenceId tracking for the WALs that caused WAL files to accumulate without being deleted due to a rare race condition.
16518
16519
16520 ---
16521
16522 * [HBASE-16834](https://issues.apache.org/jira/browse/HBASE-16834) | *Major* | **Add AsyncConnection support for ConnectionFactory**
16523
16524 Add createAsyncConnection method to ConnectionFactory for creating AsyncConnection. The default implementation is org.apache.hadoop.hbase.client.AsyncConnectionImpl. You can use 'hbase.client.async.connection.impl' to plug in your own AsyncConnection implementation.
16525
16526
16527 ---
16528
16529 * [HBASE-16729](https://issues.apache.org/jira/browse/HBASE-16729) | *Trivial* | **Define the behavior of (default) empty FilterList**
16530
16531 Empty filter list will behave as when there is no filter added. This change is a behavioral change for those who rely on Empty filter list.
16532
16533
16534 ---
16535
16536 * [HBASE-16799](https://issues.apache.org/jira/browse/HBASE-16799) | *Major* | **CP exposed Store should not expose unwanted APIs**
16537
16538 Below APIs from CP exposed Store interface are removed
16539 upsert(Iterable\<Cell\> cells, long readpoint)
16540 add(Cell cell)
16541 add(Iterable\<Cell\> cells)
16542 replayCompactionMarker(CompactionDescriptor compaction, boolean pickCompactionFiles,  boolean removeFiles)
16543 assertBulkLoadHFileOk(Path srcPath)
16544 bulkLoadHFile(String srcPathStr, long sequenceId)
16545 bulkLoadHFile(StoreFileInfo fileInfo)
16546
16547
16548 ---
16549
16550 * [HBASE-15921](https://issues.apache.org/jira/browse/HBASE-15921) | *Major* | **Add first AsyncTable impl and create TableImpl based on it**
16551
16552 Add AsyncConnection, AsyncTable and AsyncTableRegionLocator. Now the AsyncTable only support get, put and delete. And the implementation of AsyncTableRegionLocator is synchronous actually.
16553
16554
16555 ---
16556
16557 * [HBASE-16664](https://issues.apache.org/jira/browse/HBASE-16664) | *Major* | **Timeout logic in AsyncProcess is broken**
16558
16559 This issue fix three bugs:
16560 1.  rpcTimeout configuration not work for one rpc call in AP
16561 2.  operationTimeout configuration not work for multi-request (batch, put) in AP
16562 3.  setRpcTimeout and setOperationTimeout in HTable is not worked for AP and BufferedMutator.
16563
16564
16565 ---
16566
16567 * [HBASE-16661](https://issues.apache.org/jira/browse/HBASE-16661) | *Minor* | **Add last major compaction age to per-region metrics**
16568
16569 This adds a new per-region metric named "lastMajorCompactionAge" for tracking time since the last major compaction ran on a given region.  If a major compaction has never run, the age will be equal to the current timestamp.
16570
16571
16572 ---
16573
16574 * [HBASE-16117](https://issues.apache.org/jira/browse/HBASE-16117) | *Major* | **Fix Connection leak in mapred.TableOutputFormat**
16575
16576 (This change will be irrelevant after HBASE-16774 lands).
16577 There is a subtle change with error handling when a connection is not able to connect to ZK.  Attempts to create a connection when ZK is not up will now fail immediately instead of silently creating and then failing on a subsequent HBaseAdmin call.
16578
16579
16580 ---
16581
16582 * [HBASE-15984](https://issues.apache.org/jira/browse/HBASE-15984) | *Critical* | **Given failure to parse a given WAL that was closed cleanly, replay the WAL.**
16583
16584 In some particular deployments, the Replication code believes it has
16585 reached EOF for a WAL prior to successfully parsing all bytes known to
16586 exist in a cleanly closed file.
16587
16588 If an EOF is detected due to parsing or other errors while there are still unparsed bytes before the end-of-file trailer, we now reset the WAL to the very beginning and attempt a clean read-through. Because we will retry these failures indefinitely, two additional changes are made to help with diagnostics:
16589
16590 \* On each retry attempt, a log message like the below will be emitted at the WARN level:
16591
16592       Processing end of WAL file '{}'. At position {}, which is too far away
16593       from reported file length {}. Restarting WAL reading (see HBASE-15983
16594       for details).
16595
16596 \*  additional metrics measure the use of this recovery mechanism. they are described in the reference guide.
16597
16598
16599 ---
16600
16601 * [HBASE-16753](https://issues.apache.org/jira/browse/HBASE-16753) | *Minor* | **There is a mismatch between suggested Java version in hbase-env.sh**
16602
16603 Updates the comments and default values in a few scripts and docs to reflect our Java 1.8+ requirement.
16604
16605
16606 ---
16607
16608 * [HBASE-16567](https://issues.apache.org/jira/browse/HBASE-16567) | *Critical* | **Upgrade to protobuf-3.1.x**
16609
16610 Core is now up on protobuf 3.1.0 (Coprocessor Endpoints and REST are still on protobuf 2.5.0).
16611
16612
16613 ---
16614
16615 * [HBASE-15638](https://issues.apache.org/jira/browse/HBASE-15638) | *Critical* | **Shade protobuf**
16616
16617 Shade/relocate and include the protobuf we use internally. See protobuf chapter in the refguide for more on how we protobuf in hbase-.2.0.0 and going forward.
16618
16619 See https://docs.google.com/document/d/1H4NgLXQ9Y9KejwobddCqaVMEDCGbyDcXtdF5iAfDIEk/edit# for how we arrived at this approach.
16620
16621 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201610.mbox/%3C07850EDD-7230-431B-9AB0-C5C91B105EEC%40gmail.com%3E for discussion around merging this change and of how we might revert if an alternative to this awkward patch presents itself; e.g. an hadoop with CLASSPATH isolation (and means of dealing with Sparks use of protobuf 2.5.0, etc.)
16622
16623
16624 ---
16625
16626 * [HBASE-16264](https://issues.apache.org/jira/browse/HBASE-16264) | *Critical* | **Figure how to deal with endpoints and shaded pb**
16627
16628 Shade/relocate the protobuf hbase uses internally. All core now refers to new module added in this patch, hbase-protocol-shaded. Coprocessor Endpoints carry-on with references to the original hbase-protocol module. See new chapter in book on protobufs on how-to going forward.
16629
16630
16631 ---
16632
16633 * [HBASE-16672](https://issues.apache.org/jira/browse/HBASE-16672) | *Major* | **Add option for bulk load to always copy hfile(s) instead of renaming**
16634
16635 This issue adds a config, always.copy.files, to LoadIncrementalHFiles.
16636 When set to true, source hfiles would be copied. Meaning source hfiles would be kept after bulk load is done.
16637 Default value is false.
16638
16639
16640 ---
16641
16642 * [HBASE-16660](https://issues.apache.org/jira/browse/HBASE-16660) | *Critical* | **ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction**
16643
16644 "Please do not use DateTieredCompaction with Major Compaction unless you have a version with this. Otherwise your cluster will not compact any store files and you can end up running out of file descriptors." @churro morales
16645
16646
16647 ---
16648
16649 * [HBASE-16257](https://issues.apache.org/jira/browse/HBASE-16257) | *Blocker* | **Move staging dir to be under hbase root dir**
16650
16651 The HBase property 'hbase.bulkload.staging.dir' is deprecated and is ignored from HBase 2.0.  It will defaults to hbase.rootdir/staging automatically with the correct permissions.
16652
16653
16654 ---
16655
16656 * [HBASE-16650](https://issues.apache.org/jira/browse/HBASE-16650) | *Major* | **Wrong usage of BlockCache eviction stat for heap memory tuning**
16657
16658 Changed tracking of evictedBlocks count NOT to include evictions of blocks for a removed HFile. HFiles gets removed after compaction
16659
16660
16661 ---
16662
16663 * [HBASE-16294](https://issues.apache.org/jira/browse/HBASE-16294) | *Minor* | **hbck reporting "No HDFS region dir found" for replicas**
16664
16665 Fixed warning error message displayed for region directory not found for non-default/ non-primary replicas in hbck
16666
16667
16668 ---
16669
16670 * [HBASE-16540](https://issues.apache.org/jira/browse/HBASE-16540) | *Major* | **Scan should do additional validation on start and stop row**
16671
16672 Scan#setStartRow() and Scan#setStopRow() now validate the argument passed for each row key.  If the length of the byte[] passed exceeds Short.MAX\_VALUE, an IllegalArgumentException will be thrown.
16673
16674
16675 ---
16676
16677 * [HBASE-7612](https://issues.apache.org/jira/browse/HBASE-7612) | *Trivial* | **[JDK8] Replace use of high-scale-lib counters with intrinsic facilities**
16678
16679 org.apache.hadoop.hbase.util.Counter is deprecated now and will be removed in 3.0. Use LongAdder instead.
16680
16681
16682 ---
16683
16684 * [HBASE-16447](https://issues.apache.org/jira/browse/HBASE-16447) | *Critical* | **Replication by namespaces config in peer**
16685
16686 Support replication by namespaces config in peer.
16687 1. Set a namespace in peer config means that all tables in this namespace will be replicated.
16688 2. If the namespaces config is null, then the table-cfs config decide which table's edit can be replicated. If the table-cfs config is null, then the namespaces config decide which table's edit can be replicated.
16689 3. If you already have set a namespace in the peer config, then you can't set any table of this namespace to the peer config. If you already have set a table in the peer config, then you can't set this table's namespace to the peer config.
16690
16691
16692 ---
16693
16694 * [HBASE-16598](https://issues.apache.org/jira/browse/HBASE-16598) | *Major* | **Enable zookeeper useMulti always and clean up in HBase code**
16695
16696 Deprecate the configuration property 'hbase.zookeeper.useMulti'.
16697 useMulti will always be enabled. ZooKeeper 3.4.x and newer is required.
16698
16699 Internal:
16700
16701 The ZKUtil#multiOrSequential(ZooKeeperWatcher zkw, List\<ZKUtilOp\> ops, boolean runSequentialOnMultiFailure) will not check 'hbase.zookeeper.useMulti' anymore, and will always use multi.
16702 It can still fall back to sequential operations if:
16703
16704 RunSequentialOnMultiFailure is true
16705 On calling multi, we get a ZooKeeper exception that can be handled by a sequential call.
16706
16707
16708 ---
16709
16710 * [HBASE-16388](https://issues.apache.org/jira/browse/HBASE-16388) | *Major* | **Prevent client threads being blocked by only one slow region server**
16711
16712 Add a new configuration, hbase.client.perserver.requests.threshold, to limit the max number of concurrent request to one region server. If the user still create new request after reaching the limit, client will throw ServerTooBusyException and do not send the request to the server. This is a client side feature and can prevent client's threads being blocked by one slow region server resulting in the availability of client is much lower than the availability of region servers.
16713
16714 For completeness, here extract on new config from hbase-default.xml:
16715
16716 Property: hbase.client.perserver.requests.threshold
16717 Default: 2147483647
16718 Description: The max number of concurrent pending requests for one server in all client threads (process level). Exceeding requests will be thrown ServerTooBusyException immediately to prevent user's threads being occupied and blocked by only one slow region server. If you use a fix number of threads to access HBase in a synchronous way, set this to a suitable value which is  related to the number of threads will help you. See https://issues.apache.org/jira/browse/HBASE-16388 for details.
16719
16720
16721 ---
16722
16723 * [HBASE-15297](https://issues.apache.org/jira/browse/HBASE-15297) | *Minor* | **error message is wrong when a wrong namspace is specified in grant in hbase shell**
16724
16725 The security admin instance available within the HBase shell now returns "false" from the namespace\_exists? method for non-existent namespaces rather than raising a wrapped NamespaceNotFoundException.
16726
16727 As a side effect, when the "grant" and "revoke" commands in the HBase shell are invoked with a non-existent namespace the resulting error message now properly refers to said namespace rather than to the user.
16728
16729
16730 ---
16731
16732 * [HBASE-16086](https://issues.apache.org/jira/browse/HBASE-16086) | *Major* | **TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells.**
16733
16734 push to branch-1.3+
16735
16736
16737 ---
16738
16739 * [HBASE-16340](https://issues.apache.org/jira/browse/HBASE-16340) | *Critical* | **ensure no Xerces jars included**
16740
16741 HBase no longer includes Xerces implementation jars that were previously included via transitive dependencies. Downstream users relying on HBase for these artifacts will need to update their dependencies.
16742
16743
16744 ---
16745
16746 * [HBASE-16213](https://issues.apache.org/jira/browse/HBASE-16213) | *Major* | **A new HFileBlock structure for fast random get**
16747
16748 HBASE-16213 introduced a new DataBlockEncoding in name of ROW\_INDEX\_V1, which could improve random read (get) performance especially when the average record size (key-value size per row) is small. To use this feature, please set DATA\_BLOCK\_ENCODING to ROW\_INDEX\_V1 for CF of newly created table, or change existing CF with below command:
16749 alter 'table\_name',{NAME =\> 'cf', DATA\_BLOCK\_ENCODING =\> 'ROW\_INDEX\_V1'}.
16750
16751 Please note that if we turn this DBE on, HFile block will be bigger than NONE encoding because it adds some meta infos for binary search:
16752 /\*\*
16753  \* Store cells following every row's start offset, so we can binary search to a row's cells.
16754  \*
16755  \* Format:
16756  \* flat cells
16757  \* integer: number of rows
16758  \* integer: row0's offset
16759  \* integer: row1's offset
16760  \* ....
16761  \* integer: dataSize
16762  \*
16763 \*/
16764
16765 Seek in row when random reading is one of the main consumers of CPU. This helps. See slide #7 here https://www.slideshare.net/HBaseCon/lift-the-ceiling-of-hbase-throughputs?qid=597ee2fa-8125-4faa-bb3b-2bf1ba9ccafb&v=&b=&from\_search=6
16766
16767
16768 ---
16769
16770 * [HBASE-16409](https://issues.apache.org/jira/browse/HBASE-16409) | *Minor* | **Row key for bad row should be properly delimited in VerifyReplication**
16771
16772 --delimiter= option is added to verifyrep.
16773 The delimiter would wrap bad rows in log output.
16774
16775
16776 ---
16777
16778 * [HBASE-14921](https://issues.apache.org/jira/browse/HBASE-14921) | *Major* | **Inmemory Compaction Optimizations; Segment Structure**
16779
16780 A long, working issue that discussed Segment formats introducing CellArrayMap (delivered as the patch attached to this issue) and CellChunkMap (to be delivered later in HBASE-16421 but see patch v02 for an embryonic form named CellBlockSerialized); when to copy Segment data (and when not too); and then what to include at flush time (the suffix Segment or all Segments). Designs that evolved as discussion went on are attached. Outstanding issues turned up here, not including a CellChunkMap implementation, are listed below but are to be addressed in follow-ons (See HBASE-16417):
16781
16782 1. The flattening without compaction is causing many small segments in pipeline, and they are not flushed all together.
16783 2. The issue of compaction prediction cost.
16784
16785
16786 ---
16787
16788 * [HBASE-16450](https://issues.apache.org/jira/browse/HBASE-16450) | *Major* | **Shell tool to dump replication queues**
16789
16790 New tool to dump existing replication peers, configurations and queues when using HBase Replication. The tool provides two flags:
16791
16792  --distributed  This flag will poll each RS for information about the replication queues being processed on this RS.
16793 By default this is not enabled and the information about the replication queues and configuration will be obtained from ZooKeeper.
16794  --hdfs   When --distributed is used, this flag will attempt to calculate the total size of the WAL files used by the replication queues. Since its possible that multiple peers can be configured this value can be overestimated.
16795
16796
16797 ---
16798
16799 * [HBASE-16422](https://issues.apache.org/jira/browse/HBASE-16422) | *Major* | **Tighten our guarantees on compatibility across patch versions**
16800
16801 Adds below change to our compat guarantees:
16802
16803 {code}
16804 -\* Example: A user using a newly deprecated api does not need to modify application code with hbase api calls until the next major version.
16805  10 +\* New APIs introduced in a patch version will only be added in a source compatible way footnote:[See 'Source Compatibility' https://blogs.oracle.com/darcy/entry/kinds\_of\_compatibility]: i.e.     code that implements public APIs will continue to compile.
16806 {code}
16807
16808
16809 ---
16810
16811 * [HBASE-7621](https://issues.apache.org/jira/browse/HBASE-7621) | *Major* | **REST client (RemoteHTable) doesn't support binary row keys**
16812
16813 RemoteHTable now supports binary row keys with any character or byte by properly encoding request URLs. This is a both a behavioral change from earlier versions and an important fix for protocol correctness.
16814
16815
16816 ---
16817
16818 * [HBASE-12721](https://issues.apache.org/jira/browse/HBASE-12721) | *Major* | **Create Docker container cluster infrastructure to enable better testing**
16819
16820 Downstream users wishing to test HBase in a "distributed" fashion (multiple "nodes" running as separate containers on the same host) can now do so in an automated fashion while leveraging Docker for process isolation via the clusterdock project.
16821
16822 For details see the README.md in the dev-support/apache\_hbase\_topology folder.
16823
16824
16825 ---
16826
16827 * [HBASE-16267](https://issues.apache.org/jira/browse/HBASE-16267) | *Critical* | **Remove commons-httpclient dependency from hbase-rest module**
16828
16829 This issue upgrades httpclient to 4.5.2 and httpcore to 4.4.4 which are the versions used by hadoop-2.
16830 This is to handle the following CVE's.
16831
16832 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2015-5262 : http/conn/ssl/SSLConnectionSocketFactory.java in Apache HttpComponents HttpClient before 4.3.6 ignores the http.socket.timeout configuration setting during an SSL handshake, which allows remote attackers to cause a denial of service (HTTPS call hang) via unspecified vectors.
16833
16834 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-6153
16835 https://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2012-5783
16836 Apache Commons HttpClient 3.x, as used in Amazon Flexible Payments Service (FPS) merchant Java SDK and other products, does not verify that the server hostname matches a domain name in the subject's Common Name (CN) or subjectAltName field of the X.509 certificate, which allows man-in-the-middle attackers to spoof SSL servers via an arbitrary valid certificate.
16837
16838 Downstream users who are exposed to commons-httpclient via the HBase classpath will have to similarly update their dependency.
16839
16840
16841 ---
16842
16843 * [HBASE-16308](https://issues.apache.org/jira/browse/HBASE-16308) | *Major* | **Contain protobuf references**
16844
16845 Undo protobuf references through the codebase so protobuf references are contained rather than spread about the codebase. For example, moved protobuff-ing up into the various Callables rather than repeat on each method invocation cleaning up boilerplate around rpc calls. Having a few protobuf reference locations only simplifies the parent issue shading project.
16846
16847
16848 ---
16849
16850 * [HBASE-16321](https://issues.apache.org/jira/browse/HBASE-16321) | *Blocker* | **Ensure findbugs jsr305 jar isn't present**
16851
16852 HBase now ensures the jsr305 implementation from the findbugs project is not included in its binary artifacts or the compile / runtime dependencies of its user facing modules. Downstream users that rely on this jar will need to update their dependencies.
16853
16854
16855 ---
16856
16857 * [HBASE-8386](https://issues.apache.org/jira/browse/HBASE-8386) | *Major* | **deprecate TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)**
16858
16859 The MapReduce helper function \`TableMapReduce.addDependencyJars(Configuration, class\<?\> ...)\` has been deprecated since it is easy to use incorrectly. Most users should rely on addDependencyJars(Job) instead.
16860
16861
16862 ---
16863
16864 * [HBASE-16287](https://issues.apache.org/jira/browse/HBASE-16287) | *Major* | **LruBlockCache size should not exceed acceptableSize too many**
16865
16866 In order to avoid blockcache size exceed acceptable size too much, we add one configuration "hbase.lru.blockcache.hard.capacity.limit.factor" to decide whether the block could be put into LruBlockCache or not.  This factor defaults to 1.2
16867 If blockcache size \>= factor\*acceptableSize, we will reject the block into cache.
16868
16869
16870 ---
16871
16872 * [HBASE-16355](https://issues.apache.org/jira/browse/HBASE-16355) | *Major* | **hbase-client dependency on hbase-common test-jar should be test scope**
16873
16874 The HBase client artifact previously incorrectly included the hbase-common test jar as a runtime dependency. With this change, that dependency has been moved to test scope. Downstream users are not expected to be impacted, unless they relied on the transitive dependency for these HBase internal test classes.
16875
16876
16877 ---
16878
16879 * [HBASE-16317](https://issues.apache.org/jira/browse/HBASE-16317) | *Blocker* | **revert all ESAPI changes**
16880
16881 This issue reverts fixes designed to prevent malicious content from rendering in HBase's UIs. Specifically, these changes shipped in 1.1.4+ and 1.2.0+. They were removed due to licensing issues discovered in the dependencies they introduced. Their implementation and those dependencies have been removed from HBase! Removal of these dependencies is against the strict definition of our version compatibility guidelines. However, inclusion of non-Apache approved licenses cannot be tolerated. Implementation of these fixes using an Apache-appropriate means is tracked in HBASE-16328.
16882
16883
16884 ---
16885
16886 * [HBASE-16288](https://issues.apache.org/jira/browse/HBASE-16288) | *Critical* | **HFile intermediate block level indexes might recurse forever creating multi TB files**
16887
16888 A new hfile configuration "hfile.index.block.min.entries" which defaults to 16 determines how many entries the hfile index block can have at least. The configuration which determines how large the index block can be at max (hfile.index.block.max.size) is ignored as long as we have fewer than hfile.index.block.min.entries entries. This ensures that multi-level index does not build up with too many levels.
16889
16890
16891 ---
16892
16893 * [HBASE-16186](https://issues.apache.org/jira/browse/HBASE-16186) | *Major* | **Fix AssignmentManager MBean name**
16894
16895 The AssignmentManager MBean was named AssignmentManger (note misspelling). This patch fixed the misspelling.
16896
16897
16898 ---
16899
16900 * [HBASE-16289](https://issues.apache.org/jira/browse/HBASE-16289) | *Critical* | **AsyncProcess stuck messages need to print region/server**
16901
16902 Adds logging of region and server. Helpful debugging. Logging now looks like this:
16903 {code}
16904 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess$AsyncRequestFutureImpl(1601): #1, waiting for 1  actions to finish on table: DUMMY\_TABLE
16905 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1720): Left over 1 task(s) are processed on server(s): [s1:1,1,1]
16906 2016-06-23 17:07:18,759 INFO  [Thread-1] client.AsyncProcess(1728): Regions against which left over task(s) are processed: [DUMMY\_TABLE,DUMMY\_BYTES\_1,1.3fd12ea80b4df621fb15497ba75f7368.,DUMMY\_TABLE,DUMMY\_BYTES\_2,2.924207e242e313d2e5491c625e0a296e.]
16907 {code}
16908
16909
16910 ---
16911
16912 * [HBASE-14743](https://issues.apache.org/jira/browse/HBASE-14743) | *Minor* | **Add metrics around HeapMemoryManager**
16913
16914 A memory metrics reveals situations happened in both MemStores and BlockCache in RegionServer. Through this metrics, users/operators can know
16915 1). Current size of MemStores and BlockCache in bytes.
16916 2). Occurrence for Memstore minor and major flush. (named unblocked flush and blocked flush respectively, shown in histogram)
16917 3). Dynamic changes in size between MemStores and BlockCache. (with Increase/Decrease as prefix, shown in histogram). And a counter for no changes, named DoNothingCounter.
16918 4). Occurrence for memory usage alarm (used more than 95% by default) in RegionServer. (named AboveHeapOccupancyLowWatermarkCounter)
16919
16920
16921 ---
16922
16923 * [HBASE-13701](https://issues.apache.org/jira/browse/HBASE-13701) | *Major* | **Consolidate SecureBulkLoadEndpoint into HBase core as default for bulk load**
16924
16925 SecureBulkLoadEndpoint  has been integrated into HBase core as default bulk load mechanism. It is no longer needed to install it as a coprocessor endpoint.
16926 The new server is backward compatible, accommodating non-secure old client and secure old client requesting SecureBulkLoadEndpoint service.
16927 SecureBulkLoadEndpoint is deprecated. The backward compatibility support may be removed in future releases.
16928
16929
16930 ---
16931
16932 * [HBASE-16244](https://issues.apache.org/jira/browse/HBASE-16244) | *Major* | **LocalHBaseCluster start timeout should be configurable**
16933
16934 When LocalHBaseCluster is started from the command line the Master would give up after 30 seconds due to a hardcoded timeout meant for unit tests. This change allows the timeout to be configured via hbase-site as well as sets it to 5 minutes when LocalHBaseCluster is started from the command line.
16935
16936
16937 ---
16938
16939 * [HBASE-16052](https://issues.apache.org/jira/browse/HBASE-16052) | *Major* | **Improve HBaseFsck Scalability**
16940
16941 HBASE-16052 improves the performance and scalability of HBaseFsck, especially for large clusters with a small number of large tables.
16942
16943 Searching for lingering reference files is now a multi-threaded operation.  Loading HDFS region directory information is now multi-threaded at the region-level instead of the table-level to maximize concurrency.  A performance bug in HBaseFsck that resulted in redundant I/O and RPCs was fixed by introducing a FileStatusFilter that filters FileStatus objects directly.
16944
16945
16946 ---
16947
16948 * [HBASE-16144](https://issues.apache.org/jira/browse/HBASE-16144) | *Major* | **Replication queue's lock will live forever if RS acquiring the lock has died prematurely**
16949
16950 If zk based replication queue is used and useMulti is false, we will schedule a chore to clean up the orphan replication queue lock on zk.
16951
16952
16953 ---
16954
16955 * [HBASE-3727](https://issues.apache.org/jira/browse/HBASE-3727) | *Minor* | **MultiHFileOutputFormat**
16956
16957 MultiHFileOutputFormat support output of HFiles from multiple tables. It will output directories and hfiles as follow,
16958      --table1
16959        --family1
16960        --family2
16961          --Hfiles
16962      --table2
16963        --family3
16964          --hfiles
16965        --family4
16966
16967 family directory and its hfiles match the output of HFileOutputFormat2
16968
16969
16970 ---
16971
16972 * [HBASE-16231](https://issues.apache.org/jira/browse/HBASE-16231) | *Major* | **Integration tests should support client keytab login for secure clusters**
16973
16974 Prior to this change, the integration test clients (IntegrationTest\*) relied on the Kerberos credential cache for authentication against secured clusters.  This could lead to the tests failing due to authentication failures when the tickets in the credential cache expired.  With this change, the integration test clients will make use of the configuration properties for "hbase.client.keytab.file" and "hbase.client.kerberos.principal", when available.  This will perform a login from the configured keytab file and automatically refresh the credentials in the background for the process lifetime.
16975
16976
16977 ---
16978
16979 * [HBASE-13823](https://issues.apache.org/jira/browse/HBASE-13823) | *Major* | **Procedure V2: unnecessaery operations on AssignmentManager#recoverTableInDisablingState() and recoverTableInEnablingState()**
16980
16981 For cluster upgraded from 1.0.x or older releases, master startup would not continue the in-progress enable/disable table process.  If orphaned znode with ENABLING/DISABLING state exists in the cluster, run hbck or manually fix the issue.
16982
16983 For new cluster or cluster upgraded from 1.1.x and newer release, there is no issue to worry about.
16984
16985
16986 ---
16987
16988 * [HBASE-16095](https://issues.apache.org/jira/browse/HBASE-16095) | *Major* | **Add priority to TableDescriptor and priority region open thread pool**
16989
16990 Adds a PRIORITY property to the HTableDescriptor. PRIORITY should be in the same range as the RpcScheduler defines it (HConstants.XXX\_QOS).
16991
16992 Table priorities are only used for region opening for now. There can be other uses later (like RpcScheduling).
16993
16994 Regions of high priority tables (priority \>= than HIGH\_QOS) are opened from a different thread pool than the regular region open thread pool. However, table priorities are not used as a global order for region assigning or opening.
16995
16996
16997 ---
16998
16999 * [HBASE-16081](https://issues.apache.org/jira/browse/HBASE-16081) | *Blocker* | **Replication remove\_peer gets stuck and blocks WAL rolling**
17000
17001 When a replication endpoint is sent a shutdown request by the replication source in situations like removing a peer, we now try to gracefully shut it down by draining the items already sent for replication to the peer cluster. If the drain does not complete in the specified time (hbase.rpc.timeout \* replication.source.maxterminationmultiplier), the regionserver is aborted to avoid blocking the WAL roll.
17002
17003
17004 ---
17005
17006 * [HBASE-16087](https://issues.apache.org/jira/browse/HBASE-16087) | *Major* | **Replication shouldn't start on a master if if only hosts system tables**
17007
17008 Masters will no longer start any replication threads if they are hosting only system tables.
17009
17010 In order to change this add something to the config for tables on master that doesn't start with "hbase:" ( Replicating system tables is something that's currently unsupported and can open up security holes, so do this at your own peril)
17011
17012
17013 ---
17014
17015 * [HBASE-14548](https://issues.apache.org/jira/browse/HBASE-14548) | *Major* | **Expand how table coprocessor jar and dependency path can be specified**
17016
17017 Allow a directory containing the jars or some wildcards to be specified, such as: hdfs://namenode:port/user/hadoop-user/
17018 or
17019 hdfs://namenode:port/user/hadoop-user/\*.jar
17020
17021 Please note that if a directory is specified, all jar files(.jar) directly in the directory are added, but it does not search files in the subtree rooted in the directory.
17022 Do not contain any wildcard if you would like to specify a directory.
17023
17024
17025 ---
17026
17027 * [HBASE-15925](https://issues.apache.org/jira/browse/HBASE-15925) | *Blocker* | **compat-module maven variable not evaluated**
17028
17029 Downstream users of HBase dependencies that do not properly activate Maven profiles should now see a correct transitive dependency on the default hadoop-compatibility-module.
17030
17031
17032 ---
17033
17034 * [HBASE-16140](https://issues.apache.org/jira/browse/HBASE-16140) | *Major* | **bump owasp.esapi from 2.1.0 to 2.1.0.1**
17035
17036 The dependency owasp.esapi had a compatible change from 2.1.0 to 2.1.0.1. As a result, the transitive dependency commons-fileupload had a change from 1.2 to 1.3.1, which has some minor class changes that impact binary compatibility. Interested users should check the release notes of commons-fileupload to see if any of the incompatible changes impact them.
17037
17038 http://commons.apache.org/proper/commons-fileupload/changes-report.html
17039
17040
17041 ---
17042
17043 * [HBASE-16147](https://issues.apache.org/jira/browse/HBASE-16147) | *Major* | **Shell command for getting compaction state**
17044
17045 compaction\_state shell command would return compaction state in String form:
17046 NONE, MINOR, MAJOR, MAJOR\_AND\_MINOR
17047
17048
17049 ---
17050
17051 * [HBASE-14878](https://issues.apache.org/jira/browse/HBASE-14878) | *Major* | **maven archetype: client application with shaded jars**
17052
17053 Adds new hbase-shaded-client archetype; also corrects an omission found in hbase-archetypes/README.md in the section headed "How to add a new archetype".
17054
17055
17056 ---
17057
17058 * [HBASE-14877](https://issues.apache.org/jira/browse/HBASE-14877) | *Major* | **maven archetype: client application**
17059
17060 This patch introduces a new infrastructure for creation and maintenance of Maven archetypes in the context of the hbase project, and it also introduces the first archetype, which end-users may utilize to generate a simple hbase-client dependent project.
17061
17062 NOTE that this patch should introduce two new WARNINGs ("Using platform encoding ... to copy filtered resources") into the hbase install process. These warnings are hard-wired into the maven-archetype-plugin:create-from-project goal. See hbase/hbase-archetypes/README.md, footnote [6] for details.
17063
17064 After applying the patch, see hbase/hbase-archetypes/README.md for details regarding the new archetype infrastructure introduced by this patch. (The README text is also conveniently positioned at the top of the patch itself.)
17065
17066 Here is the opening paragraph of the README.md file:
17067 =================
17068 The hbase-archetypes subproject of hbase provides an infrastructure for creation and maintenance of Maven archetypes pertinent to HBase. Upon deployment to the archetype catalog of the central Maven repository, these archetypes may be used by end-user developers to autogenerate completely configured Maven projects (including fully-functioning sample code) through invocation of the archetype:generate goal of the maven-archetype-plugin.
17069 ========
17070 The README.md file also contains several paragraphs under the heading, "Notes for contributors and committers to the HBase project", which explains the layout of 'hbase-archetypes', and how archetypes are created and installed into the local Maven repository, ready for deployment to the central Maven repository. It also outlines how new archetypes may be developed and added to the collection in the future.
17071
17072
17073 ---
17074
17075 * [HBASE-15977](https://issues.apache.org/jira/browse/HBASE-15977) | *Major* | **Failed variable substitution on home page**
17076
17077 Done. Thanks, Dima, Andrew!
17078
17079
17080 ---
17081
17082 * [HBASE-5291](https://issues.apache.org/jira/browse/HBASE-5291) | *Major* | **Add Kerberos HTTP SPNEGO authentication support to HBase web consoles**
17083
17084 HBase Web UIs can be secured from general public access using SPNEGO to require a valid Kerberos ticket.
17085
17086 Setting 'hbase.security.authentication.ui' to 'kerberos' in hbase-site.xml is a global switch to have all Web UIs allow only authenticated clients via Kerberos. 'hbase.security.authentication.spnego.kerberos.principal' and 'hbase.security.authentication.spnego.kerberos.keytab' are two other required properties in hbase-site.xml, the Kerberos principal and keytab to use for the server to use to log in. The primary in the Kerberos principal must be 'HTTP' as required by the SPNEGO mechanism, e.g. 'HTTP/host.domain.com@DOMAIN.COM'.
17087
17088
17089 ---
17090
17091 * [HBASE-15950](https://issues.apache.org/jira/browse/HBASE-15950) | *Major* | **Fix memstore size estimates to be more tighter**
17092
17093 The estimates of heap usage by the memstore objects (KeyValue, object and array header sizes, etc) have been made more accurate for heap sizes up to 32G (using CompressedOops), resulting in them dropping by 10-50% in practice. This also results in less number of flushes and compactions due to "fatter" flushes. YMMV. As a result, the actual heap usage of the memstore before being flushed may increase by up to 100%. If configured memory limits for the region server had been tuned based on observed usage, this change could result in worse GC behavior or even OutOfMemory errors. Set the environment property (not hbase-site.xml) "hbase.memorylayout.use.unsafe" to false to disable.
17094
17095
17096 ---
17097
17098 * [HBASE-16023](https://issues.apache.org/jira/browse/HBASE-16023) | *Major* | **Fastpath for the FIFO rpcscheduler**
17099
17100 Adds a 'fastpath' when using the default FIFO rpc scheduler ('fifo'). Does direct handoff from Reader thread to Handler if there is one ready and willing. Will shine best when high random read workload (YCSB workloadc for instance)
17101
17102
17103 ---
17104
17105 * [HBASE-15971](https://issues.apache.org/jira/browse/HBASE-15971) | *Critical* | **Regression: Random Read/WorkloadC slower in 1.x than 0.98**
17106
17107 Change the default rpc scheduler from 'deadline' to 'fifo' instead so it is the same as in branch 0.98. 'deadline' was of questionable benefit but with a high cost scheduling. To re-enable 'deadline', set hbase.ipc.server.callqueue.type to 'deadline' in your hbase-site.xml.
17108
17109
17110 ---
17111
17112 * [HBASE-15525](https://issues.apache.org/jira/browse/HBASE-15525) | *Critical* | **OutOfMemory could occur when using BoundedByteBufferPool during RPC bursts**
17113
17114 Added a new ByteBufferPool which pools N ByteBuffers. By default it makes off heap ByteBuffers when getBuffer() is called. The size of each buffer defaults to 64KB. This can be configured using 'hbase.ipc.server.reservoir.initial.buffer.size'.   The max number of buffers which can be pooled defaults to twice the number of handler threads in RS. This can be configured with key 'hbase.ipc.server.reservoir.initial.max'.  While responding to read requests and client support Codec, we will create CellBlocks and directly return it as PB payload. For making this block, we will use N ByteBuffers from pool as per the total size of the response cells. The default size of 64 KB for the buffer is inline with the number of bytes written to RPC layer in one short.(That is also 64KB).  When at point of time, the calle not able to get a free buffer from the pool (it returns null then), it will make on heap Buffer of same size (as that of Buffers in pool) and use that to create cell block.
17115
17116
17117 ---
17118
17119 * [HBASE-15994](https://issues.apache.org/jira/browse/HBASE-15994) | *Major* | **Allow selection of RpcSchedulers**
17120
17121 Adds a FifoRpcSchedulerFactory so you can try the FifoRpcScheduler by setting  "hbase.region.server.rpc.scheduler.factory.class"
17122
17123
17124 ---
17125
17126 * [HBASE-15989](https://issues.apache.org/jira/browse/HBASE-15989) | *Major* | **Remove hbase.online.schema.update.enable**
17127
17128 Removes the "hbase.online.schema.update.enable" property.
17129 from now, every operation that alter the schema (e.g. modifyTable, addFamily, removeFamily, ...) will use the online schema update. there is no need to disable/enable the table.
17130
17131
17132 ---
17133
17134 * [HBASE-15981](https://issues.apache.org/jira/browse/HBASE-15981) | *Minor* | **Stripe and Date-tiered compactions inaccurately suggest disabling table in docs**
17135
17136 Removes reference to disabling table in docs for stripe and date-tiered compactions
17137
17138
17139 ---
17140
17141 * [HBASE-15931](https://issues.apache.org/jira/browse/HBASE-15931) | *Critical* | **Add log for long-running tasks in AsyncProcess**
17142
17143 After HBASE-15931, we will log more details for long-running tasks in AsyncProcess#waitForMaximumCurrentTasks every 10 seconds, including:
17144 1. Table name will be included in the tasks status log
17145 2. On which regionserver(s) the tasks are runnning will be logged when less than hbase.client.threshold.log.details tasks left, by default 10.
17146 3. Against which regions the tasks are running will be logged when less than 2 tasks left.
17147
17148
17149 ---
17150
17151 * [HBASE-15907](https://issues.apache.org/jira/browse/HBASE-15907) | *Major* | **Missing documentation of create table split options**
17152
17153 documentation changes only - added section to Shell tricks and cross reference from region splitting section
17154
17155
17156 ---
17157
17158 * [HBASE-15915](https://issues.apache.org/jira/browse/HBASE-15915) | *Major* | **Set timeouts on hanging tests**
17159
17160 Use @ClassRule to set timeout on test case level (instead of @Rule which sets timeout for the test methods). CategoryBasedTimeout.forClass(..) determines the timeout value based on category annotation (small/medium/large) on the test case.
17161
17162
17163 ---
17164
17165 * [HBASE-15875](https://issues.apache.org/jira/browse/HBASE-15875) | *Major* | **Remove HTable references and HTableInterface**
17166
17167 **WARNING: No release note provided for this change.**
17168
17169
17170 ---
17171
17172 * [HBASE-15610](https://issues.apache.org/jira/browse/HBASE-15610) | *Blocker* | **Remove deprecated HConnection for 2.0 thus removing all PB references for 2.0**
17173
17174 **WARNING: No release note provided for this change.**
17175
17176
17177 ---
17178
17179 * [HBASE-15890](https://issues.apache.org/jira/browse/HBASE-15890) | *Major* | **Allow thrift to set/unset "cacheBlocks" for Scans**
17180
17181 Adds cacheBlocks to Scan
17182
17183
17184 ---
17185
17186 * [HBASE-15876](https://issues.apache.org/jira/browse/HBASE-15876) | *Blocker* | **Remove doBulkLoad(Path hfofDir, final HTable table) though it has not been through a full deprecation cycle**
17187
17188 Removes a doBulkLoad method though it has not been through a full deprecation cycle (but it is 'damaged' because it has a parameter that has been properly deprecated). Use the alternative {code}public void doBulkLoad(Path hfofDir, final Admin admin, Table table, RegionLocator regionLocator){code}
17189
17190 See http://mail-archives.apache.org/mod\_mbox/hbase-dev/201605.mbox/%3CCAMUu0w-ZiLoLBLO3D76=n3AjUr=VMtTUeYA28weLHYeq8+e3bQ@mail.gmail.com%3E for NOTICE on this 'premature' removal.
17191
17192
17193 ---
17194
17195 * [HBASE-15228](https://issues.apache.org/jira/browse/HBASE-15228) | *Major* | **Add the methods to RegionObserver to trigger start/complete restoring WALs**
17196
17197 Added two hooks around WAL restore.
17198 preReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17199 and
17200 postReplayWALs(final ObserverContext\<? extends RegionCoprocessorEnvironment\> ctx,  HRegionInfo info, Path edits)
17201
17202 Will be called at start and end of restore of a WAL file.
17203 The other hook around WAL restore (preWALRestore ) will be called before restore of every entry within the WAL file.
17204
17205
17206 ---
17207
17208 * [HBASE-15856](https://issues.apache.org/jira/browse/HBASE-15856) | *Critical* | **Cached Connection instances can wind up with addresses never resolved**
17209
17210 During periods where DNS resolution was not available or not working correctly, we could previously cache unresolved hostnames forever, in some cases preventing further connections to these hosts even when DNS service was restored.  With this change, unresolved hostnames will no longer be cached, and will instead throw an UnknownHostException during connection setup.
17211
17212
17213 ---
17214
17215 * [HBASE-15593](https://issues.apache.org/jira/browse/HBASE-15593) | *Major* | **Time limit of scanning should be offered by client**
17216
17217 Add a new configuration: hbase.ipc.min.client.request.timeout
17218 Minimum allowable timeout (in milliseconds) in rpc request's header. This configuration exists to prevent the rpc service regarding this request as timeout immediately.
17219
17220
17221 ---
17222
17223 * [HBASE-15784](https://issues.apache.org/jira/browse/HBASE-15784) | *Major* | **Misuse core/maxPoolSize of LinkedBlockingQueue in ThreadPoolExecutor**
17224
17225 The core pool size and max pool size of ThreadPoolExecutor should be the same when LinkedBlockingQueue is used. Thus the configurations hbase.hconnection.threads.max, hbase.hconnection.meta.lookup.threads.max, hbase.region.replica.replication.threads.max and hbase.multihconnection.threads.max are used as the number of the core threads, and the related configurations \*.thread.core are not used any more.
17226
17227
17228 ---
17229
17230 * [HBASE-15651](https://issues.apache.org/jira/browse/HBASE-15651) | *Major* | **Add report-flakies.py to use jenkins api to get failing tests**
17231
17232 To find recent set of flakies, run the script added by this patch. Run it to get usage information passing -h:
17233
17234 {code}
17235 $ ./dev-support/report-flakies.py -h
17236 {code}
17237
17238 If you get the below:
17239
17240 {code}
17241 $ python ./dev-support/report-flakies.py
17242 Traceback (most recent call last):
17243   File "./dev-support/report-flakies.py", line 25, in \<module\>
17244     import requests
17245 ImportError: No module named requests
17246 {code}
17247
17248 ... install the requests module:
17249
17250 {code}
17251 $ sudo pip install requests
17252 {code}
17253
17254
17255 ---
17256
17257 * [HBASE-15780](https://issues.apache.org/jira/browse/HBASE-15780) | *Critical* | **Expose AuthUtil as IA.Public**
17258
17259 Downstream users with long lived applications that need to communicate with secure HBase instances can now rely on the AuthUtil class to handle authenticating via keytab.
17260
17261 For more information, see the javadoc for the org.apache.hadoop.hbase.AuthUtil class.
17262
17263
17264 ---
17265
17266 * [HBASE-15811](https://issues.apache.org/jira/browse/HBASE-15811) | *Blocker* | **Batch Get after batch Put does not fetch all Cells**
17267
17268 We were not waiting on all executors in a batch to complete which meant a read-your-own-writes could sometimes fail -- especially if client is loaded; i.e. putting to multiple machines in a cluster. The test for no-more-executors was damaged by the 0.99/0.98.4 fix "HBASE-11403 Fix race conditions around Object#notify"
17269
17270
17271 ---
17272
17273 * [HBASE-15801](https://issues.apache.org/jira/browse/HBASE-15801) | *Major* | **Upgrade checkstyle for all branches**
17274
17275 All active branches now use maven-checkstyle-plugin 2.17 and checkstyle 6.18.
17276
17277
17278 ---
17279
17280 * [HBASE-15236](https://issues.apache.org/jira/browse/HBASE-15236) | *Major* | **Inconsistent cell reads over multiple bulk-loaded HFiles**
17281
17282 This jira fixes that following bug:
17283 During bulkloading, if there are multiple hfiles corresponding to same region, and if they have same timestamps (which may have been set using importtsv.timestamp) and duplicate keys across them, then get and scan may return values coming from different hfiles.
17284
17285
17286 ---
17287
17288 * [HBASE-15740](https://issues.apache.org/jira/browse/HBASE-15740) | *Major* | **Replication source.shippedKBs metric is undercounting because it is in KB**
17289
17290 Removed Replication source.shippedKBs metric in favor of source.shippedBytes
17291
17292
17293 ---
17294
17295 * [HBASE-15773](https://issues.apache.org/jira/browse/HBASE-15773) | *Major* | **CellCounter improvements**
17296
17297 The CellCounter map reduce job now supports additional configuration options on the Scan instance it creates, using the org.apache.hadoop.hbase.mapreduce.TableInputFormat defined property names.  For a full list of the options, run ./hbase org.apache.hadoop.hbase.mapreduce.CellCounter with no arguments.
17298
17299 CellCounter also no longer creates job counters for per-rowkey and per-rowkey/qualifier cell counts.  For most tables, these counters would cause the job to fail due to mapreduce job counter limits.
17300
17301
17302 ---
17303
17304 * [HBASE-15759](https://issues.apache.org/jira/browse/HBASE-15759) | *Minor* | **RegionObserver.preStoreScannerOpen() doesn't have acces to current readpoint**
17305
17306 The following RegionObserver method is deprecated and would no longer be called in hbase 2.0:
17307
17308   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17309       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17310       final KeyValueScanner s) throws IOException {
17311
17312 Instead, override this method:
17313
17314   public KeyValueScanner preStoreScannerOpen(final ObserverContext\<RegionCoprocessorEnvironment\> c,
17315       final Store store, final Scan scan, final NavigableSet\<byte[]\> targetCols,
17316       final KeyValueScanner s, final long readPt) throws IOException {
17317
17318
17319 ---
17320
17321 * [HBASE-15743](https://issues.apache.org/jira/browse/HBASE-15743) | *Major* | **Add Transparent Data Encryption support for FanOutOneBlockAsyncDFSOutput**
17322
17323 Now the AsyncFSWAL can write data to a encryption zone on HDFS.
17324
17325
17326 ---
17327
17328 * [HBASE-15767](https://issues.apache.org/jira/browse/HBASE-15767) | *Major* | **Upgrade httpclient dependency**
17329
17330 HBase now relies on version 4.3.6 of the Apache Commons HTTPClient library. Downstream users who are exposed to it via the HBase classpath will have to similarly update their dependency.
17331
17332
17333 ---
17334
17335 * [HBASE-15575](https://issues.apache.org/jira/browse/HBASE-15575) | *Minor* | **Rename table DDL \*Handler methods in MasterObserver to more meaningful names**
17336
17337 **WARNING: No release note provided for this change.**
17338
17339
17340 ---
17341
17342 * [HBASE-15720](https://issues.apache.org/jira/browse/HBASE-15720) | *Major* | **Print row locks at the debug dump page**
17343
17344 Adds a section to the debug dump page listing current row locks held.
17345
17346
17347 ---
17348
17349 * [HBASE-15703](https://issues.apache.org/jira/browse/HBASE-15703) | *Critical* | **Deadline scheduler needs to return to the client info about skipped calls, not just drop them**
17350
17351 With previous deadline mode of RPC scheduling (the implementation in SimpleRpcScheduler, which is basically a FIFO except that long-running scans are de-prioritized) and FIFO-based RPC scheduler clients are getting CallQueueTooBigException when RPC call queue is full.
17352
17353 With this patch and when hbase.ipc.server.callqueue.type property is set to "codel" mode, clients will also be getting CallDroppedException, which means that the request was discarded by the server as it considers itself to be overloaded and starts to drop requests to avoid going down under the load. The clients will retry upon receiving this exception. It doesn't clear MetaCache with region locations.
17354
17355
17356 ---
17357
17358 * [HBASE-15281](https://issues.apache.org/jira/browse/HBASE-15281) | *Major* | **Allow the FileSystem inside HFileSystem to be wrapped**
17359
17360 This patch adds new configuration property - hbase.fs.wrapper. If provided, it should be fully qualified class name of the class used as a pluggable wrapper for HFileSystem. This may be useful for specific debugging/tracing needs.
17361
17362
17363 ---
17364
17365 * [HBASE-15551](https://issues.apache.org/jira/browse/HBASE-15551) | *Minor* | **Make call queue too big exception use servername**
17366
17367 Fixes issue when CallQueueTooBig exception returned to the client could print useless address info (like 0.0.0.0) if RPC server is listening on something other than the host name, making troubleshooting inconvenient.
17368
17369
17370 ---
17371
17372 * [HBASE-15711](https://issues.apache.org/jira/browse/HBASE-15711) | *Major* | **Add client side property to allow logging details for batch errors**
17373
17374 In HBASE-15711 a new client side property hbase.client.log.batcherrors.details is introduced to allow logging full stacktrace of exceptions for batch error. It's disabled by default and set the property to true will enable it.
17375
17376
17377 ---
17378
17379 * [HBASE-15686](https://issues.apache.org/jira/browse/HBASE-15686) | *Major* | **Add override mechanism for the exempt classes when dynamically loading table coprocessor**
17380
17381 New coprocessor table descriptor attribute, hbase.coprocessor.classloader.included.classes, is added.
17382 User can specify class name prefixes (semicolon separated) which should be loaded by CoprocessorClassLoader through this attribute using the following syntax:
17383 {code}
17384   hbase\> alter 't1',    'coprocessor'=\>'hdfs:///foo.jar\|com.foo.FooRegionObserver\|1001\|arg1=1,arg2=2'
17385 {code}
17386
17387
17388 ---
17389
17390 * [HBASE-15645](https://issues.apache.org/jira/browse/HBASE-15645) | *Critical* | **hbase.rpc.timeout is not used in operations of HTable**
17391
17392 Fixes regression where hbase.rpc.timeout configuration was ignored in branch-1.0+
17393
17394 Adds new methods setOperationTimeout, getOperationTimeout, setRpcTimeout, and getRpcTimeout to Table. In branch-1.3+ they are public interfaces and in 1.0-1.2 they are labeled as @InterfaceAudience.Private.
17395
17396 Adds hbase.client.operation.timeout to hbase-default.xml with default of 1200000
17397
17398
17399 ---
17400
17401 * [HBASE-15477](https://issues.apache.org/jira/browse/HBASE-15477) | *Major* | **Do not save 'next block header' when we cache hfileblocks**
17402
17403 Fix over-persisting in blockcache; no longer save the block PLUS the header of the next block (33 bytes) when writing the cache.
17404
17405 Also removes support for hfileblock v1; hfile block v1 was used writing hfile v1. hfile v1 was the default in hbase before hbase-0.92. hbase.96 would not start unless all v1 hfiles had been compacted out of the cluster.
17406
17407
17408 ---
17409
17410 * [HBASE-15628](https://issues.apache.org/jira/browse/HBASE-15628) | *Major* | **Implement an AsyncOutputStream which can work with any FileSystem implementation**
17411
17412 Introduce an AsyncFSOutput interface which is an abstraction of the original FanOutOneBlockAsyncDFSOutput. Now you can create AsyncFSOutput on any FileSystem using the method AsyncFSOutputHelper.createOutput. The returned AsyncFSOutput will be FanOutOneBlockAsyncDFSOutput if the given FileSystem is a DistributedFileSystem.
17413
17414
17415 ---
17416
17417 * [HBASE-15392](https://issues.apache.org/jira/browse/HBASE-15392) | *Major* | **Single Cell Get reads two HFileBlocks**
17418
17419 When an explicit Get with a one or more columns specified, we at a minimum, were overseeking, reading until we tripped over the next row, regardless, and only then returning. If the next row was in-block, we'd just do too much seeking but if the next row was in the next (or in the next block beyond that), we would keep seeking and loading blocks until we found the next row before we'd return.
17420
17421 There remains one case where we will still 'overread'. It is when the row end aligns with the end of the block. In this case we will load the next block just to find that there are no more cells in the current row. See HBASE-15457.
17422
17423
17424 ---
17425
17426 * [HBASE-15671](https://issues.apache.org/jira/browse/HBASE-15671) | *Major* | **Add per-table metrics on memstore, storefile and regionsize**
17427
17428 Adds storeFileSize, memstoreSize and tableSize to the per-table metrics.
17429
17430
17431 ---
17432
17433 * [HBASE-15366](https://issues.apache.org/jira/browse/HBASE-15366) | *Major* | **Add doc, trace-level logging, and test around hfileblock**
17434
17435 No functional change. Added javadoc, comments, and extra trace-level logging to make clear what is happening around the reading and caching of hfile blocks.
17436
17437
17438 ---
17439
17440 * [HBASE-15368](https://issues.apache.org/jira/browse/HBASE-15368) | *Major* | **Add pluggable window support**
17441
17442 Use 'hbase.hstore.compaction.date.tiered.window.factory.class' to specify the window implementation you like for date tiered compaction. Now the only and default implementation is org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory.
17443
17444 {code}
17445 \<property\>
17446 \<name\>hbase.hstore.compaction.date.tiered.window.factory.class\</name\>
17447 \<value\>org.apache.hadoop.hbase.regionserver.compactions.ExponentialCompactionWindowFactory\</value\>
17448 \</property\>
17449 \<property\>
17450 {code}
17451
17452
17453 ---
17454
17455 * [HBASE-15518](https://issues.apache.org/jira/browse/HBASE-15518) | *Major* | **Add Per-Table metrics back**
17456
17457 Adds per-table metrics aggregated from per-region metrics in region server metrics. New metrics are available under JMX section "Hadoop:service=HBase,name=RegionServer,sub=Tables" and they are available via hadoop metrics2 collectors.
17458
17459
17460 ---
17461
17462 * [HBASE-15640](https://issues.apache.org/jira/browse/HBASE-15640) | *Major* | **L1 cache doesn't give fair warning that it is showing partial stats only when it hits limit**
17463
17464 The blockcache UI tab would stop refreshing at 100k blocks (configurable, see "hbase.ui.blockcache.by.file.max"), which isn't very many blocks when doing a big cache, giving a misleading picture of the content of L1 and/or L2 cache. Up the default limit to 1M blocks (UI takes a while but just a few seconds counting over 1M blocks).
17465
17466 Also, when beyond the limit give the user a noticeable WARNING in the UI.
17467
17468
17469 ---
17470
17471 * [HBASE-15386](https://issues.apache.org/jira/browse/HBASE-15386) | *Major* | **PREFETCH\_BLOCKS\_ON\_OPEN in HColumnDescriptor is ignored**
17472
17473 This was a non-issue. The PREFETCH\_... flag actually works. While here though made the following additions.
17474
17475 Changes the prefetch TRACE-level loggings to include the word 'Prefetch' in them so you know what they are about.
17476
17477 Changes the cryptic logging of the CacheConfig#toString to have some preamble saying why and what column family is responsible (helps figure what is going on)
17478
17479 Add test that verifies setting flag on HColumnDescriptor actually works.
17480
17481
17482 ---
17483
17484 * [HBASE-13372](https://issues.apache.org/jira/browse/HBASE-13372) | *Major* | **Unit tests for SplitTransaction and RegionMergeTransaction listeners**
17485
17486 HBASE-13372 Add unit tests for SplitTransaction and RegionMergeTransaction listeners
17487
17488
17489 ---
17490
17491 * [HBASE-15187](https://issues.apache.org/jira/browse/HBASE-15187) | *Major* | **Integrate CSRF prevention filter to REST gateway**
17492
17493 Protection against CSRF attack can be turned on with config parameter, hbase.rest.csrf.enabled - default value is false.
17494
17495 The custom header to be sent can be changed via config parameter, hbase.rest.csrf.custom.header whose default value is "X-XSRF-HEADER".
17496
17497 Config parameter, hbase.rest.csrf.methods.to.ignore , controls which HTTP methods are not associated with customer header check.
17498
17499 Config parameter, hbase.rest-csrf.browser-useragents-regex , is a comma-separated list of regular expressions used to match against an HTTP request's User-Agent header when protection against cross-site request forgery (CSRF) is enabled for REST server by setting hbase.rest.csrf.enabled to true.
17500
17501 The implementation came from hadoop/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/http/RestCsrfPreventionFilter.java
17502
17503 We should periodically update the RestCsrfPreventionFilter.java in hbase codebase to include fixes to the hadoop implementation.
17504
17505
17506 ---
17507
17508 * [HBASE-15481](https://issues.apache.org/jira/browse/HBASE-15481) | *Trivial* | **Add pre/post roll to WALObserver**
17509
17510 <!-- markdown -->
17511
17512
17513 WALObserver coprocessors now can receive notifications of WAL rolling via the new methods `preWALRoll` and `postWALRoll`.
17514
17515 This change is incompatible due to the addition of these methods to the `WALObserver` interface. Downstream users are encouraged to instead extend the `BaseWALObserver` class, which remains compatible through this change.
17516
17517
17518 ---
17519
17520 * [HBASE-15507](https://issues.apache.org/jira/browse/HBASE-15507) | *Major* | **Online modification of enabled ReplicationPeerConfig**
17521
17522 Added update\_peer\_config to the HBase shell and ReplicationAdmin, and provided a callback for custom replication endpoints to be notified of changes to their configuration and peer data
17523
17524
17525 ---
17526
17527 * [HBASE-15537](https://issues.apache.org/jira/browse/HBASE-15537) | *Major* | **Make multi WAL work with WALs other than FSHLog**
17528
17529 Add the delegate config for multiwal back. Now you can use 'hbase.wal.regiongrouping.delegate.provider' to specify the wal provider you want to use for multiwal. For example:
17530 {code}
17531 \<property\>
17532 \<name\>hbase.wal.regiongrouping.delegate.provider\</name\>
17533 \<value\>asyncfs\</value\>
17534 \</property\>
17535 {code}
17536 And the default value is filesystem which is the alias of DefaultWALProvider, i.e., the FSHLog.
17537
17538
17539 ---
17540
17541 * [HBASE-15400](https://issues.apache.org/jira/browse/HBASE-15400) | *Major* | **Use DateTieredCompactor for Date Tiered Compaction**
17542
17543 With this patch combined with HBASE-15389, when we compact, we can output multiple files along the current window boundaries. There are two use cases:
17544 1. Major compaction: We want to output date tiered store files with data older than max age archived in trunks of the window size on the higher tier. Once a window is old enough, we don't combine the windows to promote to the next tier any further. So files in these windows retain the same timespan as they were minor-compacted last time, which is the window size of the highest tier. Major compaction will touch these files and we want to maintain the same layout. This way, TTL and archiving will be simpler and more efficient.
17545 2. Bulk load files and the old file generated by major compaction before upgrading to DTCP.
17546
17547 This will change the way to enable date tiered compaction.
17548 To turn it on:
17549 hbase.hstore.engine.class: org.apache.hadoop.hbase.regionserver.DateTieredStoreEngine
17550
17551 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17552 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17553 hbase.hstore.compaction.throughput.higher.bound and hbase.hstore.compaction.throughput.lower.bound need to be set for desired throughput range as uncompressed rates.
17554
17555 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17556 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17557
17558 Because major compaction is turned on now, we also need to adjust the configuration for max file to compact according to the larger file count:
17559 hbase.hstore.compaction.max: set to the same number as hbase.hstore.blockingStoreFiles.
17560
17561 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17562
17563
17564 ---
17565
17566 * [HBASE-15592](https://issues.apache.org/jira/browse/HBASE-15592) | *Major* | **Print Procedure WAL content**
17567
17568 Use hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter
17569 to print the content of a Procedure WAL.
17570 e.g.
17571 hbase org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALPrettyPrinter -f /hbase/MasterProcWALs/state-00000000000000002571.log
17572
17573
17574 ---
17575
17576 * [HBASE-15396](https://issues.apache.org/jira/browse/HBASE-15396) | *Minor* | **Enhance mapreduce.TableSplit to add encoded region name**
17577
17578 To aid troubleshooting of MapReduce job that rely on the HBase provided input format, splits now include the encoded region name they cover.
17579
17580
17581 ---
17582
17583 * [HBASE-15568](https://issues.apache.org/jira/browse/HBASE-15568) | *Major* | **Procedure V2 - Remove CreateTableHandler in HBase Apache 2.0 release**
17584
17585 **WARNING: No release note provided for this change.**
17586
17587
17588 ---
17589
17590 * [HBASE-15521](https://issues.apache.org/jira/browse/HBASE-15521) | *Major* | **Procedure V2 - RestoreSnapshot and CloneSnapshot**
17591
17592 **WARNING: No release note provided for this change.**
17593
17594
17595 ---
17596
17597 * [HBASE-15538](https://issues.apache.org/jira/browse/HBASE-15538) | *Major* | **Implement secure async protobuf wal writer**
17598
17599 Add the following config in hbase-site.xml if you want to use secure protobuf wal writer together with AsyncFSWAL
17600 {code}
17601 \<property\>
17602 \<name\>hbase.regionserver.hlog.async.writer.impl\</name\>
17603 \<value\>org.apache.hadoop.hbase.regionserver.wal.SecureAsyncProtobufLogWriter\</value\>
17604 \</property\>
17605 \<property\>
17606 {code}
17607
17608
17609 ---
17610
17611 * [HBASE-11393](https://issues.apache.org/jira/browse/HBASE-11393) | *Major* | **Replication TableCfs should be a PB object rather than a string**
17612
17613 **WARNING: No release note provided for this change.**
17614
17615
17616 ---
17617
17618 * [HBASE-15265](https://issues.apache.org/jira/browse/HBASE-15265) | *Major* | **Implement an asynchronous FSHLog**
17619
17620 To enable, set the WALProvider as follows:
17621
17622 {code}
17623 \<property\>
17624 \<name\>hbase.wal.provider\</name\>
17625 \<value\>asyncfs\</value\>
17626 \</property\>
17627 \<property\>
17628 {code}
17629
17630 To check which provider is active, look for the log line:
17631
17632 LOG.info("Instantiating WALProvider of type " + clazz);
17633
17634
17635 ---
17636
17637 * [HBASE-14256](https://issues.apache.org/jira/browse/HBASE-14256) | *Major* | **Flush task message may be confusing when region is recovered**
17638
17639 HBASE-14256 Correct confusing flush task message
17640
17641
17642 ---
17643
17644 * [HBASE-15212](https://issues.apache.org/jira/browse/HBASE-15212) | *Major* | **RPCServer should enforce max request size**
17645
17646 Adds a configuration parameter "hbase.ipc.max.request.size" which defaults to 256MB to protect the server against very large incoming RPC requests. All requests larger than this size will be immediately rejected before allocating any resources (memory allocation, etc).
17647
17648
17649 ---
17650
17651 * [HBASE-15412](https://issues.apache.org/jira/browse/HBASE-15412) | *Major* | **Add average region size metric**
17652
17653 Adds a new metric for called "averageRegionSize" that is emitted as a regionserver metric. Metric description:
17654 Average region size over the region server including memstore and storefile sizes
17655
17656
17657 ---
17658
17659 * [HBASE-15479](https://issues.apache.org/jira/browse/HBASE-15479) | *Major* | **No more garbage or beware of autoboxing**
17660
17661 This fix decreases client's memory allocation during writes by more than 50%.
17662
17663
17664 ---
17665
17666 * [HBASE-15322](https://issues.apache.org/jira/browse/HBASE-15322) | *Critical* | **Operations using Unsafe path broken for platforms not having sun.misc.Unsafe**
17667
17668 **WARNING: No release note provided for this change.**
17669
17670
17671 ---
17672
17673 * [HBASE-12940](https://issues.apache.org/jira/browse/HBASE-12940) | *Major* | **Expose listPeerConfigs and getPeerConfig to the HBase shell**
17674
17675 Adds get\_peer\_config and list\_peer\_configs to the hbase shell.
17676
17677
17678 ---
17679
17680 * [HBASE-15430](https://issues.apache.org/jira/browse/HBASE-15430) | *Critical* | **Failed taking snapshot - Manifest proto-message too large**
17681
17682 Failed taking snapshot - Manifest proto-message too large. add property ("snapshot.manifest.size.limit")  to change max size of proto-message
17683
17684
17685 ---
17686
17687 * [HBASE-15323](https://issues.apache.org/jira/browse/HBASE-15323) | *Major* | **Hbase Rest CheckAndDeleteAPi should be able to delete more cells**
17688
17689 Fixed an issue in REST server checkAndDelete operation where the remaining cells other than the to-be-checked column are also applied in the Delete operation. Also fixed an issue in RemoteHTable where the Delete object was not passed correctly to the REST server side.
17690
17691
17692 ---
17693
17694 * [HBASE-15377](https://issues.apache.org/jira/browse/HBASE-15377) | *Major* | **Per-RS Get metric is time based, per-region metric is size-based**
17695
17696 Per-region metrics related to Get histograms are changed from being response size based into being latency based similar to the per-regionserver metrics of the same name.
17697
17698 Added GetSize histogram metrics at the per-regionserver and per-region level for the response sizes.
17699
17700
17701 ---
17702
17703 * [HBASE-6721](https://issues.apache.org/jira/browse/HBASE-6721) | *Major* | **RegionServer Group based Assignment**
17704
17705 [ADVANCED USERS ONLY] This patch adds a new experimental module hbase-rsgroup. It is an advanced feature for partitioning regionservers into distinctive groups for strict isolation, and should only be used by users who are sophisticated enough to understand the full implications and have a sufficient background in managing HBase clusters.
17706
17707 RSGroups can be defined and managed with shell commands or corresponding Java APIs. A server can be added to a group with hostname and port pair, and tables can be moved to this group so that only regionservers in the same rsgroup can host the regions of the table. RegionServers and tables can only belong to 1 group at a time. By default, all tables and regionservers belong to the "default" group. System tables can also be put into a group using the regular APIs. A custom balancer implementation tracks assignments per rsgroup and makes sure to move regions to the relevant regionservers in that group. The group information is stored in a regular HBase table, and a zookeeper-based read-only cache is used at the cluster bootstrap time.
17708
17709 To enable, add the following to your hbase-site.xml and restart your Master:
17710
17711
17712  \<property\>
17713    \<name\>hbase.coprocessor.master.classes\</name\>
17714    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupAdminEndpoint\</value\>
17715  \</property\>
17716  \<property\>
17717    \<name\>hbase.master.loadbalancer.class\</name\>
17718    \<value\>org.apache.hadoop.hbase.rsgroup.RSGroupBasedLoadBalancer\</value\>
17719  \</property\>
17720
17721
17722 Then use the shell 'rsgroup' commands to create and manipulate regionserver groups: e.g. to add a group and then add a server to it, do as follows:
17723
17724  hbase(main):008:0\> add\_rsgroup 'my\_group'
17725  Took 0.5610 seconds
17726
17727 This adds a group to the 'hbase:rsgroup' system table. Add a server (hostname + port) to the group using the 'move\_rsgroup\_servers' command as follows:
17728
17729  hbase(main):010:0\> move\_rsgroup\_servers 'my\_group',['k.att.net:51129']
17730
17731
17732 ---
17733
17734 * [HBASE-15435](https://issues.apache.org/jira/browse/HBASE-15435) | *Major* | **Add WAL (in bytes) written metric**
17735
17736 Adds a new metric named "writtenBytes" as a per-regionserver metric. Metric Description:
17737 Size (in bytes) of the data written to the WAL.
17738
17739
17740 ---
17741
17742 * [HBASE-13963](https://issues.apache.org/jira/browse/HBASE-13963) | *Critical* | **avoid leaking jdk.tools**
17743
17744 HBase now ensures that the JDK tools jar used during the build process is not exposed to downstream clients as a transitive dependency of hbase-annotations.
17745
17746 If you need to have the JDK tools jar in your classpath, you should add a system dependency on it. See the hbase-annotations pom for an example of the necessary pom additions.
17747
17748
17749 ---
17750
17751 * [HBASE-15271](https://issues.apache.org/jira/browse/HBASE-15271) | *Major* | **Spark Bulk Load: Need to write HFiles to tmp location then rename to protect from Spark Executor Failures**
17752
17753 When using the bulk load helper provided by the hbase-spark module, output files will now be written into temporary files and only made available when the executor has successfully completed.
17754
17755 Previously, failed executors would leave their files in place in a way that would be picked up by a bulk load command. This caused retried failures to include spurious copies of some cells.
17756
17757
17758 ---
17759
17760 * [HBASE-15364](https://issues.apache.org/jira/browse/HBASE-15364) | *Major* | **Fix unescaped \< characters in Javadoc**
17761
17762 HBASE-15364 Fix unescaped \< and \> characters in Javadoc
17763
17764
17765 ---
17766
17767 * [HBASE-15243](https://issues.apache.org/jira/browse/HBASE-15243) | *Major* | **Utilize the lowest seek value when all Filters in MUST\_PASS\_ONE FilterList return SEEK\_NEXT\_USING\_HINT**
17768
17769 When all filters in a MUST\_PASS\_ONE FilterList return a SEEK\_USING\_NEXT\_HINT code, we return SEEK\_NEXT\_USING\_HINT from the FilterList#filterKeyValue() to utilize the lowest seek value.
17770
17771
17772 ---
17773
17774 * [HBASE-15354](https://issues.apache.org/jira/browse/HBASE-15354) | *Major* | **Use same criteria for clearing meta cache for all operations**
17775
17776 This patch fixes some issues when MetaCache (region location cache) gets unnecessarily dropped on the client.
17777
17778 On master branch we now in RegionServerCallable and RegionServerAdminCallable pass the actual exception down to Connection#updateCachedLocation, so we could check there if the exception is "meta-clearing" or not.
17779
17780 on branch-1, branch-1.2 and branch 1.3 we now check if the exception is meta-clearing or not in AsyncProcess (this check was there on master, but not on earlier branches)
17781
17782
17783 ---
17784
17785 * [HBASE-15376](https://issues.apache.org/jira/browse/HBASE-15376) | *Major* | **ScanNext metric is size-based while every other per-operation metric is time based**
17786
17787 Removed ScanNext histogram metrics as regionserver level and per-region level metrics since the semantics is not compatible with other similar metrics (size histogram vs latency histogram).
17788
17789 Instead, this patch adds ScanTime and ScanSize histogram metrics at the regionserver and per-region level.
17790
17791
17792 ---
17793
17794 * [HBASE-15338](https://issues.apache.org/jira/browse/HBASE-15338) | *Minor* | **Add a option to disable the data block cache for testing the performance of underlying file system**
17795
17796 Add a new config: hbase.block.data.cacheonread, which is a global switch for caching data blocks on read. The default value of this switch is true, and data blocks will be cached on read if the block cache is enabled for the family and cacheBlocks flag is set to be true for get and scan operations. If this global switch is set to false, data blocks won't be cached even if the block cache is enabled for the family and the cacheBlocks flag of Gets or Scans are sets as true. Bloom blocks and index blocks are always be cached if the block cache of the regionserver is enabled. One usage of this switch is for the performance tests for the extreme case that  the cache for data blocks all missed and all data blocks are read from underlying file system.
17797
17798
17799 ---
17800
17801 * [HBASE-15136](https://issues.apache.org/jira/browse/HBASE-15136) | *Critical* | **Explore different queuing behaviors while busy**
17802
17803 Previously RPC request scheduler in HBase had 2 modes in could operate in:
17804
17805  - simple FIFO
17806  - "partial" deadline, where deadline constraints are only imposed on long-running scan requests.
17807
17808 This patch adds new type of scheduler to HBase, based on the research around controlled delay (CoDel) algorithm [1], used in networking to combat bufferbloat, as well as some analysis on generalizing it to generic request queues [2]. The purpose of that work is to prevent long standing call queues caused by discrepancy between request rate and available throughput, caused by kernel/disk IO/networking stalls.
17809
17810 New RPC scheduler could be enabled by setting hbase.ipc.server.callqueue.type=codel in configuration. Several additional params allow to configure algorithm behavior -
17811
17812 hbase.ipc.server.callqueue.codel.target.delay
17813 hbase.ipc.server.callqueue.codel.interval
17814 hbase.ipc.server.callqueue.codel.lifo.threshold
17815
17816 [1] Controlling Queue Delay / A modern AQM is just one piece of the solution to bufferbloat. http://queue.acm.org/detail.cfm?id=2209336
17817 [2] Fail at Scale / Reliability in the face of rapid change. http://queue.acm.org/detail.cfm?id=2839461
17818
17819
17820 ---
17821
17822 * [HBASE-15181](https://issues.apache.org/jira/browse/HBASE-15181) | *Major* | **A simple implementation of date based tiered compaction**
17823
17824 Date tiered compaction policy is a date-aware store file layout that is beneficial for time-range scans for time-series data.
17825
17826 When it performs well:
17827
17828     reads for limited time ranges, especially scans of recent data
17829
17830 When it doesn't perform as well:
17831
17832     random gets without a time range
17833     frequent deletes and updates
17834     out of order data writes, especially writes with timestamps in the future
17835     bulk loads of historical data
17836
17837 Recommended configuration:
17838 To turn on Date Tiered Compaction (It is not recommended to turn on for the whole cluster because that will put meta table on it too and random get on meta table will be impacted):
17839 hbase.hstore.compaction.compaction.policy: org.apache.hadoop.hbase.regionserver.compactions.DateTieredCompactionPolicy
17840
17841 Parameters for Date Tiered Compaction:
17842 hbase.hstore.compaction.date.tiered.max.storefile.age.millis: Files with max-timestamp smaller than this will no longer be compacted.Default at Long.MAX\_VALUE.
17843 hbase.hstore.compaction.date.tiered.base.window.millis: base window size in milliseconds. Default at 6 hours.
17844 hbase.hstore.compaction.date.tiered.windows.per.tier: number of windows per tier. Default at 4.
17845 hbase.hstore.compaction.date.tiered.incoming.window.min: minimal number of files to compact in the incoming window. Set it to expected number of files in the window to avoid wasteful compaction. Default at 6.
17846 hbase.hstore.compaction.date.tiered.window.policy.class: the policy to select store files within the same time window. It doesn’t apply to the incoming window. Default at exploring compaction. This is to avoid wasteful compaction.
17847
17848 With tiered compaction all servers in the cluster will promote windows to higher tier at the same time, so using a compaction throttle is recommended:
17849 hbase.regionserver.throughput.controller:org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController
17850
17851 Because there will most likely be more store files around, we need to adjust the configuration so that flush won't be blocked and compaction will be properly throttled:
17852 hbase.hstore.blockingStoreFiles: change to 50 if using all default parameters when turning on date tiered compaction. Use 1.5~2 x projected file count if changing the parameters, Projected file count = windows per tier x tier count + incoming window min + files older than max age
17853
17854 For more details, please refer to the design spec at https://docs.google.com/document/d/1\_AmlNb2N8Us1xICsTeGDLKIqL6T-oHoRLZ323MG\_uy8/edit#
17855
17856
17857 ---
17858
17859 * [HBASE-15290](https://issues.apache.org/jira/browse/HBASE-15290) | *Major* | **Hbase Rest CheckAndAPI should save other cells along with compared cell**
17860
17861 Fixed an issue in REST server checkAndPut operation where the remaining cells other than the to-be-checked column are also applied in the put operation .
17862
17863
17864 ---
17865
17866 * [HBASE-15264](https://issues.apache.org/jira/browse/HBASE-15264) | *Major* | **Implement a fan out HDFS OutputStream**
17867
17868 Implement a fan-out asynchronous DFSOutputStream for implementing new WAL writer.
17869
17870
17871 ---
17872
17873 * [HBASE-13259](https://issues.apache.org/jira/browse/HBASE-13259) | *Critical* | **mmap() based BucketCache IOEngine**
17874
17875 mmap() based bucket cache can be configured by specifying the property
17876 {code}
17877 \<property\>
17878   \<name\>hbase.bucketcache.ioengine\</name\>
17879   \<value\> mmap://filepath \</value\>
17880 \</property\>
17881 {code}
17882 This mode of bucket cache is ideal when your file based bucket cache size is lesser than then available RAM. When the cache is bigger than the available RAM then the kernel page faults will make this cache perform lesser particularly in case of scans.
17883
17884
17885 ---
17886
17887 * [HBASE-11927](https://issues.apache.org/jira/browse/HBASE-11927) | *Major* | **Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)**
17888
17889 Checksumming is cpu intensive. HBase computes additional checksums for HFiles (hdfs does checksums too) and stores them inline with file data. During reading, these checksums are verified to ensure data is not corrupted. This patch tries to use Hadoop Native Library for checksum computation, if it’s available, otherwise falls back to standard Java libraries. Instructions to load NHL in HBase can be found here (http://hbase.apache.org/book.html#hadoop.native.lib).
17890
17891 Default checksum algorithm has been changed from CRC32 to CRC32C primarily because of two reasons: 1) CRC32C has better error detection properties, and 2) New Intel processors have a dedicated instruction for crc32c computation (SSE4.2 instruction set)\*. This change is fully backward compatible. Also, users should not see any differences except decrease in cpu usage. To keep old settings, set configuration ‘hbase.hstore.checksum.algorithm’ to ‘CRC32’.
17892
17893 \* On linux, run 'cat /proc/cpuinfo’ and look for sse4\_2 in list of flags to see if your processor supports SSE4.2.
17894
17895
17896 ---
17897
17898 * [HBASE-15219](https://issues.apache.org/jira/browse/HBASE-15219) | *Critical* | **Canary tool does not return non-zero exit code when one of regions is in stuck state**
17899
17900 A new flag is added for Canary tool: -treatFailureAsError
17901 When this flag is specified, read / write failure would result in Canary tool exit code of 5.
17902
17903
17904 ---
17905
17906 * [HBASE-14949](https://issues.apache.org/jira/browse/HBASE-14949) | *Major* | **Resolve name conflict when splitting if there are duplicated WAL entries**
17907
17908 Now we can write duplicated WAL entries into different WAL files. This feature is required by the replication consistency fix and new implementation of WAL writer.
17909
17910
17911 ---
17912
17913 * [HBASE-15100](https://issues.apache.org/jira/browse/HBASE-15100) | *Blocker* | **Master WALProcs still never clean up**
17914
17915 The constructor for o.a.h.hbase.ProcedureInfo was mistakenly labeled IA.Public in previous releases and has now changed to IA.Private. Downstream users are safe to consume ProcedureInfo objects returned from HBase public interfaces, but should not expect to be able to reliably create new instances themselves.
17916
17917 The method ProcedureInfo.setNonceKey has been removed, because it should not have been exposed to clients.
17918
17919
17920 ---
17921
17922 * [HBASE-14355](https://issues.apache.org/jira/browse/HBASE-14355) | *Major* | **Scan different TimeRange for each column family**
17923
17924 Adds being able to Scan each column family with a different time range. Adds new methods setColumnFamilyTimeRange and getColumnFamilyTimeRange to Scan.
17925
17926
17927 ---
17928
17929 * [HBASE-14460](https://issues.apache.org/jira/browse/HBASE-14460) | *Critical* | **[Perf Regression] Merge of MVCC and SequenceId (HBASE-8763) slowed Increments, CheckAndPuts, batch operations**
17930
17931 This release note tries to tell the general story. Dive into sub-tasks for more specific release noting.
17932
17933 Increments, appends, checkAnd\* have been slow since hbase-.1.0.0. The unification of mvcc and sequence id done by HBASE-8763 was responsible.
17934
17935 A ‘fast-path’ workaround was added by HBASE-15031 “Fix merge of MVCC and SequenceID performance regression in branch-1.0 for Increments”. It became available in 1.0.3 and 1.1.3. To enable the fast path, set "hbase.increment.fast.but.narrow.consistency" and then rolling restart. The workaround was for increments only (appends, checkAndPut, etc., were not addressed. See HBASE-15031 release note for more detail).
17936
17937 Subsequently, the regression was properly identified and fixed in HBASE-15213 and the fix applied to branch-1.0 and branch-1.1. As it happens, hbase-1.2.0 does not suffer from the performance regression (though the thought was that it did -- and so it got the fast-path patch too via HBASE-15092) nor does the master branch. HBASE-15213 identified that HBASE-12751 (as a side effect) had cured the regression.
17938
17939 hbase-1.0.4 (if it is ever released -- 1.0 has been end-of-lifed) and hbase-1.1.4 will have the HBASE-15213 fix.  If you are suffering from the increment regression and you are on 1.0.3 or 1.1.3, you can enable the work around to get back your increment performance but you should upgrade.
17940
17941
17942 ---
17943
17944 * [HBASE-15046](https://issues.apache.org/jira/browse/HBASE-15046) | *Major* | **Perf test doing all mutation steps under row lock**
17945
17946 In here we perf tested a realignment of the write pipeline and mvcc handling.  Thought was that this work was a predicate for a general fix of HBASE-14460 (turns out, realignment of write path was not needed to fix the increment perf regression). The perf testing here made it so we were able to simplify writing. HBASE-15158 was just committed. This work is done.
17947
17948
17949 ---
17950
17951 * [HBASE-15158](https://issues.apache.org/jira/browse/HBASE-15158) | *Major* | **Change order in which we do write pipeline operations; do all under row locks!**
17952
17953 Changed the write pipeline order; made it more rational, easier-to-reason-about doing all updates to WA, MemStore, and mvcc while read/write rowlock is held where before we'd release after WAL append and then do sync and mvcc.
17954
17955
17956 ---
17957
17958 * [HBASE-15157](https://issues.apache.org/jira/browse/HBASE-15157) | *Major* | **Add \*PerformanceTest for Append, CheckAnd\***
17959
17960 Add append, increment, checkAndMutate, checkAndPut, and checkAndDelete tests to PerformanceEvaluation tool. Below are excerpts from new usage from PE:
17961
17962 ....
17963 Command:
17964  append          Append on each row; clients overlap on keyspace so some concurrent operations
17965  checkAndDelete  CheckAndDelete on each row; clients overlap on keyspace so some concurrent operations
17966  checkAndMutate  CheckAndMutate on each row; clients overlap on keyspace so some concurrent operations
17967  checkAndPut     CheckAndPut on each row; clients overlap on keyspace so some concurrent operations
17968  filterScan      Run scan test using a filter to find a specific row based on it's value (make sure to use --rows=20)
17969  increment       Increment on each row; clients overlap on keyspace so some concurrent operations
17970  randomRead      Run random read test
17971 ....
17972 Examples:
17973 ...
17974  To run 10 clients doing increments over ten rows:
17975  $ bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation --rows=10 --nomapred increment 10
17976
17977 Removed IncrementPerformanceTest. It is not as configurable as the additions made here.
17978
17979
17980 ---
17981
17982 * [HBASE-15218](https://issues.apache.org/jira/browse/HBASE-15218) | *Blocker* | **On RS crash and replay of WAL, loosing all Tags in Cells**
17983
17984 This issue fixes
17985 - In case of normal WAL (Not encrypted) we were loosing all cell tags on WAL replay after an RS crash
17986 - In case of encrypted WAL we were not even persisting Cell tags in WAL.  Tags from all unflushed (to HFile) Cells will get lost even after WAL replay recovery is done.
17987
17988 As we use tags for Cell level security, this fixes 2 security issues
17989  - Cell level visibility labels security breach . Making a visibility restricted cell global readable
17990  - Cell level ACL availability issue.  A user who is cell level authorized to read this cell can not read it. It is a data loss for him.
17991
17992
17993 ---
17994
17995 * [HBASE-15129](https://issues.apache.org/jira/browse/HBASE-15129) | *Major* | **Set default value for hbase.fs.tmp.dir rather than fully depend on hbase-default.xml**
17996
17997 Before HBASE-15129, if somehow hbase-default.xml is not on classpath, default values for hbase.fs.tmp.dir and hbase.bulkload.staging.dir are left empty. After HBASE-15129,  default values of both properties are set to "/user/\<user.name\>/hbase-staging".
17998
17999
18000 ---
18001
18002 * [HBASE-14969](https://issues.apache.org/jira/browse/HBASE-14969) | *Major* | **Add throughput controller for flush**
18003
18004 Adds means of throttling flush throughput. By default there is no limit; we use NoLimitThroughputController. An alternative controller, PressureAwareFlushThroughputController, allows specifying throughput bounds. A new simple factor, flush pressure, influences throughput. See PressureAwareFlushThroughputController.java class for detail.
18005
18006
18007 ---
18008
18009 * [HBASE-11425](https://issues.apache.org/jira/browse/HBASE-11425) | *Major* | **Cell/DBB end-to-end on the read-path**
18010
18011 For E2E off heaped read path, first of all there should be an off heap backed BucketCache(BC). Configure 'hbase.bucketcache.ioengine' to offheap in hbase-site.xml. Also specify the total capacity of the BC using hbase.bucketcache.size config.  Please remember to adjust value of 'HBASE\_OFFHEAPSIZE' in hbase-env.sh as per this capacity. Here-by we specify the max possible off-heap memory allocation for the RS java process. So this should be bigger than the off-heap BC size. Please keep in mind that there is no default for hbase.bucketcache.ioengine which means the BC is turned OFF by default.
18012
18013 Next thing to tune is the ByteBuffer pool in the RPC server side. The buffers from this pool will be used to accumulate the cell bytes and create a result cell block to send back to the client side. 'hbase.ipc.server.reservoir.enabled' can be used to turn this pool ON or OFF. By default this pool is ON and available. HBase will create off heap ByteBuffers and pool them. Please make sure not to turn this OFF if you want E2E off heaping in read path. If this pool is turned off, the server will create temp buffers on heap to accumulate the cell bytes and make a result cell block. This can impact the GC on a highly read loaded server.  The user can tune this pool with respect to how many buffers are in the pool and what should be the size of each ByteBuffer.
18014 Use the config 'hbase.ipc.server.reservoir.initial.buffer.size' to tune each of the buffer sizes. Defaults is 64 KB.
18015
18016 When the read pattern is a random row read and each of the rows are smaller in size compared to this 64 KB, try reducing this. When the result size is larger than one ByteBuffer size, the server will try to grab more than one buffer and make a result cell block out of these.  When the pool is running out of buffers, the server will end up creating temporary on-heap buffers.
18017
18018 The maximum number of ByteBuffers in the pool can be tuned using the config 'hbase.ipc.server.reservoir.initial.max'. Its value defaults to 64 \* region server handlers configured (See the config 'hbase.regionserver.handler.count'). The math is such that by default we consider 2 MB as the result cell block size per read result and each handler will be handling a read. For 2 MB size, we need 32 buffers each of size 64 KB (See default buffer size in pool).  So per handler 32 ByteBuffers(BB). We allocate twice this size as the max BBs count such that one handler can be creating the response and handing it to the RPC Responder thread and then handling a new request creating a new response cell block (using pooled buffers). Even if the responder could not send back the first TCP reply immediately, our count should allow that we should still have enough buffers in our pool without having to make temporary buffers on the heap.  Again for smaller sized random row reads, tune this max count. There are lazily created buffers and the count is the max count to be pooled.
18019
18020 The setting for HBASE\_OFFHEAPSIZE in hbase-env.sh should consider this off heap buffer pool at the RPC side also.  We need to config this max off heap size for RS as a bit higher than the sum of this max pool size and the off heap cache size. The TCP layer will also need to create direct bytebuffers for TCP communication. Also the DFS client will need some off-heap to do its workings especially if short-circuit reads are configured. Allocating an extra of 1 - 2 GB for the max direct memory size has worked in tests.
18021
18022 If you still see GC issues even after making E2E read path off heap, look for issues in the appropriate buffer pool. Check the below RS log with INFO level:
18023
18024   "Pool already reached its max capacity : XXX and no free buffers now. Consider increasing the value for 'hbase.ipc.server.reservoir.initial.max' ?"
18025
18026 If you are using co processors and refer the Cells in the read results, DO NOT store reference to these Cells out of the scope of the CP hook methods. Some times the CPs need store info about the cell (Like its row key) for considering in the next CP hook call etc. For such cases, pls clone the required fields of the entire Cell as per the use cases.  [ See CellUtil#cloneXXX(Cell) APIs ]
18027
18028
18029 ---
18030
18031 * [HBASE-15145](https://issues.apache.org/jira/browse/HBASE-15145) | *Major* | **HBCK and Replication should authenticate to zookepeer using server principal**
18032
18033 Added a new command line argument: --auth-as-server to enable authenticating to ZooKeeper as the HBase Server principal. This is required for secure clusters for doing replication operations like add\_peer, list\_peers, etc until HBASE-11392 is fixed. This advanced option can also be used for manually fixing secure znodes.
18034
18035 Commands can now be invoked like:
18036 hbase --auth-as-server shell
18037 hbase --auth-as-server zkcli
18038
18039 HBCK in secure setup also needs to authenticate to ZK using servers principals.This is turned on by default (no need to pass additional argument).
18040
18041 When authenticating as server, HBASE\_SERVER\_JAAS\_OPTS is concatenated to HBASE\_OPTS if defined in hbase-env.sh. Otherwise, HBASE\_REGIONSERVER\_OPTS is concatenated.
18042
18043
18044 ---
18045
18046 * [HBASE-15125](https://issues.apache.org/jira/browse/HBASE-15125) | *Major* | **HBaseFsck's adoptHdfsOrphan function creates region with wrong end key boundary**
18047
18048 **WARNING: No release note provided for this change.**
18049
18050
18051 ---
18052
18053 * [HBASE-13082](https://issues.apache.org/jira/browse/HBASE-13082) | *Major* | **Coarsen StoreScanner locks to RegionScanner**
18054
18055 After this JIRA we will not be doing any scanner reset after compaction during a course of a scan. The files that were compacted will still be continued to be used in the scan process. The compacted files will be archived by a background thread that runs every 2 mins by default only when there are no active scanners on those comapcted files. The above duration can be controlled using the knob 'hbase.hfile.compactions.cleaner.interval'.
18056
18057
18058 ---
18059
18060 * [HBASE-14865](https://issues.apache.org/jira/browse/HBASE-14865) | *Major* | **Support passing multiple QOPs to SaslClient/Server via hbase.rpc.protection**
18061
18062 With this patch, hbase.rpc.protection can now take multiple comma-separate QOP values. Accepted QOP values remain unchanged and are 'authentication', 'integrity', and 'privacy'. Server or client can use this configuration to specify their preference (in decreasing order) while negotiating QOP.
18063 This feature can be used to upgrade or downgrade QOP in an online cluster without compromising availability (i.e. taking cluster offline). For e.g. to change qop from A to B, typical steps would be:
18064 "A" --\> "B,A" --\> rolling restart --\> "B" --\> rolling restart
18065
18066 Sidenote: Based on experimentation, server's choice is given higher preference than client's choice. i.e. if server's choices are "A,B,C" and client's choices are "B,C,A", both A and B are acceptable, but A is chosen.
18067
18068
18069 ---
18070
18071 * [HBASE-15098](https://issues.apache.org/jira/browse/HBASE-15098) | *Blocker* | **Normalizer switch in configuration is not used**
18072
18073 The config parameter, hbase.normalizer.enabled, has been dropped since it is not used in the code base.
18074
18075
18076 ---
18077
18078 * [HBASE-15111](https://issues.apache.org/jira/browse/HBASE-15111) | *Trivial* | **"hbase version" should write to stdout**
18079
18080 The \`hbase version\` command now outputs directly to stdout rather than to a logger. This change allows the version information to be output consistently regardless of logger configuration. Naturally, this also means the command output ignores all logger configuration. Furthermore, the move from loggers to direct output changes the output of the command to omit metadata commonly included in logger ouput such as a timestamp, log level, and logger name.
18081
18082
18083 ---
18084
18085 * [HBASE-15027](https://issues.apache.org/jira/browse/HBASE-15027) | *Major* | **Refactor the way the CompactedHFileDischarger threads are created**
18086
18087 The property 'hbase.hfile.compactions.discharger.interval' has been renamed to 'hbase.hfile.compaction.discharger.interval' that describes the interval after which the compaction discharger chore service should run.
18088 The property 'hbase.hfile.compaction.discharger.thread.count' describes the thread count that does the compaction discharge work.
18089 The CompactedHFilesDischarger is a chore service now started as part of the RegionServer and this chore service iterates over all the onlineRegions in that RS and uses the RegionServer's executor service to launch a set of threads that does this job of compaction files clean up.
18090
18091
18092 ---
18093
18094 * [HBASE-14468](https://issues.apache.org/jira/browse/HBASE-14468) | *Major* | **Compaction improvements: FIFO compaction policy**
18095
18096 FIFO compaction policy selects only files which have all cells expired. The column family MUST have non-default TTL.
18097 Essentially, FIFO compactor does only one job: collects expired store files.
18098
18099 Because we do not do any real compaction, we do not use CPU and IO (disk and network), we do not evict hot data from a block cache. The result: improved throughput and latency both write and read.
18100 See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style
18101
18102
18103 ---
18104
18105 * [HBASE-14888](https://issues.apache.org/jira/browse/HBASE-14888) | *Major* | **ClusterSchema: Add Namespace Operations**
18106
18107 This patch changes the semantic around namespace create/delete/modify when coprocessor asks that the invocation be by-passed. Previous the by-pass was done silently -- the method would just return with no indication as to whether by-pass route had been taken or not.  This patch adds throwing of a BypassCoprocessorException which is thrown if we have been asked to bypass a call.
18108
18109 The bypass facility has been in place since hbase 1.0.0 when namespace creation/deletion, etc.., was originally added in HBASE-8408 (HBASE-15071 is about addressing bypass handling in a general way)
18110
18111
18112 ---
18113
18114 * [HBASE-15018](https://issues.apache.org/jira/browse/HBASE-15018) | *Major* | **Inconsistent way of handling TimeoutException in the rpc client implementations**
18115
18116 When using the new AsyncRpcClient introduced in HBase 1.1.0 (HBASE-12684), time outs now result in an IOException wrapped around a CallTimeoutException instead of a bare CallTimeoutException. This change makes the AsyncRpcClient behave the same as the default HBase 1.y RPC client implementation.
18117
18118
18119 ---
18120
18121 * [HBASE-14796](https://issues.apache.org/jira/browse/HBASE-14796) | *Minor* | **Enhance the Gets in the connector**
18122
18123 spark.hbase.bulkGetSize  in HBaseSparkConf is for grouping bulkGet, and default value is 1000.
18124
18125
18126 ---
18127
18128 * [HBASE-14976](https://issues.apache.org/jira/browse/HBASE-14976) | *Minor* | **Add RPC call queues to the web ui**
18129
18130 Adds column displaying current aggregated call queues size in region server queues tab UI.
18131
18132
18133 ---
18134
18135 * [HBASE-14822](https://issues.apache.org/jira/browse/HBASE-14822) | *Major* | **Renewing leases of scanners doesn't work**
18136
18137 And 1.1, 1.0, and 0.98.
18138
18139
18140 ---
18141
18142 * [HBASE-14205](https://issues.apache.org/jira/browse/HBASE-14205) | *Critical* | **RegionCoprocessorHost System.nanoTime() performance bottleneck**
18143
18144 **WARNING: No release note provided for this change.**
18145
18146
18147 ---
18148
18149 * [HBASE-14978](https://issues.apache.org/jira/browse/HBASE-14978) | *Blocker* | **Don't allow Multi to retain too many blocks**
18150
18151 Limiting the amount of memory resident for any one request allows the server to handle concurrent requests smoothly. To this end we added the ability to limit the size of responses to a multi request. That worked well however it correctly represent the amount of memory resident. So this issue adds on a an approximation of the number of blocks held for a request.
18152
18153 All clients before 1.2.0 will not get this multi request chunking based upon blocks kept. All clients 1.2.0 and after will.
18154
18155
18156 ---
18157
18158 * [HBASE-14951](https://issues.apache.org/jira/browse/HBASE-14951) | *Minor* | **Make hbase.regionserver.maxlogs obsolete**
18159
18160 Rolling WAL events across a cluster can be highly correlated, hence flushing memstores, hence triggering minor compactions, that can be promoted to major ones. These events are highly correlated in time if there is a balanced write-load on the regions in a table. Default value for maximum WAL files (\* hbase.regionserver.maxlogs\*), which controls WAL rolling events - 32 is too small for many modern deployments.
18161 Now we calculate this value dynamically (if not defined by user), using the following formula:
18162
18163 maxLogs = Math.max( 32, HBASE\_HEAP\_SIZE \* memstoreRatio \* 2/ LogRollSize), where
18164
18165 memstoreRatio is \*hbase.regionserver.global.memstore.size\*
18166 LogRollSize is maximum WAL file size (default 0.95 \* HDFS block size)
18167
18168 We need to make sure that we avoid fully or minimize events when RS has to flush memstores prematurely only because it reached artificial limit of hbase.regionserver.maxlogs, this is why we put this 2 x multiplier in equation, this gives us maximum WAL capacity of 2 x RS memstore-size.
18169
18170 Runaway WAL files.
18171
18172 The default log rolling period (1h) allows to accumulate up to 2 X Memstore Size data in a WAL. For heap size - 32G and all other default setting, this gives ~ 26GB of data. Under heavy write load, the number of WAL files can increase dramatically. RegionServer LogRoller will be archiving old WALs periodically. User has three options, either override default hbase.regionserver.maxlogs or override default hbase.regionserver.logroll.period (decrease), or both to control runaway WALs.
18173
18174 For system with bursty write load,  the hbase.regionserver.logroll.period can be decreased to lower value. In this case the maximum number of wal files will be defined by the total size of memstore (unflushed data), not by the hbase.regionserver.maxlogs. But for majority of applications there will be no issues with defaults. Data will be flushed periodically from memstore, the LogRoller will archive old wal files and the system will never reach the new defaults for hbase.regionserver.maxlogs, unless the system is under extreme load for prolonged period of time, but in this case, decreasing hbase.regionserver.logroll.period allows us to control runaway wal files.
18175
18176 The following table gives the new default maximum log files values for several different Region Server heap sizes:
18177
18178 heap    memstore perc   maxLogs
18179 1G              40%                             32
18180 2G              40%                             32
18181 10G             40%                             80
18182 20G             40%                             160
18183 32G             40%                             256
18184
18185
18186 ---
18187
18188 * [HBASE-14984](https://issues.apache.org/jira/browse/HBASE-14984) | *Major* | **Allow memcached block cache to set optimze to false**
18189
18190 Setting hbase.cache.memcached.spy.optimze to true will allow the spy memcached client to try and optimize for the number of requests outstanding. This can increase throughput but can also increase variance for request times.
18191
18192 Setting it to true will help when round trip times are longer.
18193 Setting it to false ( the default ) will help ensure a more even distribution of response times.
18194
18195
18196 ---
18197
18198 * [HBASE-14534](https://issues.apache.org/jira/browse/HBASE-14534) | *Minor* | **Bump yammer/coda/dropwizard metrics dependency version**
18199
18200 Updated yammer metrics to version 3.1.2 (now it's been renamed to dropwizard). API has changed quite a bit, consult https://dropwizard.github.io/metrics/3.1.0/manual/core/ for additional information.
18201
18202 Note that among other things, in yammer 2.2.0 histograms were by default created in non-biased mode (uniform sampling), while in 3.1.0 histograms created via MetricsRegistry.histogram(...) are by default exponentially decayed. This shouldn't affect end users, though.
18203
18204
18205 ---
18206
18207 * [HBASE-14960](https://issues.apache.org/jira/browse/HBASE-14960) | *Major* | **Fallback to using default RPCControllerFactory if class cannot be loaded**
18208
18209 If the configured RPC controller factory (via hbase.rpc.controllerfactory.class) cannot be found in the classpath or loaded, we fall back to using the default RPC controller factory in HBase.
18210
18211
18212 ---
18213
18214 * [HBASE-14946](https://issues.apache.org/jira/browse/HBASE-14946) | *Critical* | **Don't allow multi's to over run the max result size.**
18215
18216 The HBase region server will now send a chunk of get responses to a client if the total response size is too large. This will only be done for clients 1.2.0 and beyond. Older clients by default will have the old behavior.
18217
18218 This patch is for the case where the basic flow is like this:
18219
18220 I want to get a single column from lots of rows. So I create a list of gets. Then I send them to table.get(List\<Get\>). If the regions for that table are spread out then those requests get chunked out to all the region servers. No one regionserver gets too many. However if one region server contains lots of regions for that table then a multi action can contain lots of gets. No single get is too onerous. However the regionserver won't return until every get is complete. So if there are thousands of gets that are sent in one multi then the regionserver can retain lots of data in one thread.
18221
18222
18223 ---
18224
18225 * [HBASE-14906](https://issues.apache.org/jira/browse/HBASE-14906) | *Major* | **Improvements on FlushLargeStoresPolicy**
18226
18227 In HBASE-14906 we use "hbase.hregion.memstore.flush.size/column\_family\_number" as the default threshold for memstore flush instead of the fixed value through "hbase.hregion.percolumnfamilyflush.size.lower.bound" property, which makes  the default threshold more flexible to various use case. We also introduce a new property in name of "hbase.hregion.percolumnfamilyflush.size.lower.bound.min" with 16M as the default value to avoid small flush in cases like hundreds of column families.
18228
18229 After this change setting "hbase.hregion.percolumnfamilyflush.size.lower.bound" in hbase-site.xml won't take effect anymore, but expert users could still set this property in table descriptor to override the default value just as before
18230
18231
18232 ---
18233
18234 * [HBASE-14769](https://issues.apache.org/jira/browse/HBASE-14769) | *Major* | **Remove unused functions and duplicate javadocs from HBaseAdmin**
18235
18236 - Removes functions from HBaseAdmin which require table name parameter as either byte[] or String. Use their counterparts which take TableName instead.
18237 - Removes redundant javadocs from HBaseAdmin as they will be automatically inherited from Admin interface.
18238 - HBaseAdmin is marked Audience.private so it should have been straight forward okay to remove the functions. But HBaseTestingUtility, which is marked Audience.public had a public function returning its instance, which moved this decision into gray area. Discussing in the community, it was decided that it would be okay to do so in this particular case.
18239
18240
18241 ---
18242
18243 * [HBASE-13153](https://issues.apache.org/jira/browse/HBASE-13153) | *Major* | **Bulk Loaded HFile Replication**
18244
18245 This enhances the HBase replication to support replication of bulk loaded data. This is configurable, by default it is set to false which means it will not replicate the bulk loaded data to its peer(s). To enable it set "hbase.replication.bulkload.enabled" to true.
18246
18247 Following are the additional configurations added for this enhancement,
18248  a. hbase.replication.cluster.id - This is manadatory to configure in cluster where replication for bulk loaded data is enabled. A source cluster is uniquely identified by sink cluster using this id. This should be configured in the source cluster configuration file for all the RS.
18249  b. hbase.replication.conf.dir - This represents the directory where all the active cluster's file system client configurations are defined in subfolders corresponding to their respective replication cluster id in peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is HBASE\_CONF\_DIR.
18250  c. hbase.replication.source.fs.conf.provider - This represents the class which provides the source cluster file system client configuration to peer cluster. This should be configured in the peer cluster configuration file for all the RS. Default is org.apache.hadoop.hbase.replication.regionserver.DefaultSourceFSConfigurationProvider
18251
18252  For example: If source cluster FS client configurations are copied in peer cluster under directory /home/user/dc1/ then  hbase.replication.cluster.id should be configured as dc1 and hbase.replication.conf.dir as /home/user
18253
18254 Note:
18255  a. Any modification to source cluster FS client configuration files in peer cluster side replication configuration directory then it needs to restart all its peer(s) cluster RS with default hbase.replication.source.fs.conf.provider.
18256  b. Only 'xml' type files will be loaded by the default hbase.replication.source.fs.conf.provider.
18257
18258 As part of this we have made following changes to LoadIncrementalHFiles class which is marked as Public and Stable class,
18259  a. Raised the visibility scope of LoadQueueItem class from package private to public.
18260  b. Added a new method loadHFileQueue, which loads the queue of LoadQueueItem into the table as per the region keys provided.
18261
18262
18263 ---
18264
18265 * [HBASE-7171](https://issues.apache.org/jira/browse/HBASE-7171) | *Major* | **Initial web UI for region/memstore/storefiles details**
18266
18267 HBASE-7171 adds 2 new pages to the region server Web UI to ease debugging and provide greater insight into the physical data layout.
18268
18269 Region names in UI table listing all regions (on the RS status page) are now hyperlinks leading to region detail page which shows some aggregate memstore information (currently just memory used) along with the list of all Store Files (HFiles) in the region. Names of Store Files are also hyperlinks leading to Store File detail page, which currently runs 'hbase hfile' command behind the scene and displays statistics about store file.
18270
18271
18272 ---
18273
18274 * [HBASE-14655](https://issues.apache.org/jira/browse/HBASE-14655) | *Blocker* | **Narrow the scope of doAs() calls to region observer notifications for compaction**
18275
18276 Region observer notifications w.r.t. compaction request are now audited with request user through proper scope of doAs() calls.
18277
18278
18279 ---
18280
18281 * [HBASE-14631](https://issues.apache.org/jira/browse/HBASE-14631) | *Blocker* | **Region merge request should be audited with request user through proper scope of doAs() calls to region observer notifications**
18282
18283 Region observer notifications w.r.t. merge request are now audited with request user through proper scope of doAs() calls.
18284
18285
18286 ---
18287
18288 * [HBASE-14605](https://issues.apache.org/jira/browse/HBASE-14605) | *Blocker* | **Split fails due to 'No valid credentials' error when SecureBulkLoadEndpoint#start tries to access hdfs**
18289
18290 When split is requested by non-super user, split related notifications for Coprocessor are executed using the login of the request user.
18291 Previously the notifications were carried out as super user.
18292
18293
18294 ---
18295
18296 * [HBASE-14926](https://issues.apache.org/jira/browse/HBASE-14926) | *Major* | **Hung ThriftServer; no timeout on read from client; if client crashes, worker thread gets stuck reading**
18297
18298 Adds a timeout to server read from clients. Adds new configs hbase.thrift.server.socket.read.timeout for setting read timeout on server socket in milliseconds. Default is 60000;
18299
18300
18301 ---
18302
18303 * [HBASE-14825](https://issues.apache.org/jira/browse/HBASE-14825) | *Minor* | **HBase Ref Guide corrections of typos/misspellings**
18304
18305 Corrections to content of "book.html", which is pulled from various \*.adoc files and \*.xml files.
18306 -- corrects typos/misspellings
18307 -- corrects incorrectly formatted links
18308
18309
18310 ---
18311
18312 * [HBASE-14821](https://issues.apache.org/jira/browse/HBASE-14821) | *Major* | **CopyTable should allow overriding more config properties for peer cluster**
18313
18314 Configuration properties for org.apache.hadoop.hbase.mapreduce.TableOutputFormat can now be overridden by prefixing the property keys with "hbase.mapred.output.".  When the configuration is applied to TableOutputFormat, these entries will be rewritten with the prefix removed -- ie. "hbase.mapred.output.hbase.security.authentication" becomes "hbase.security.authentication".  This can be useful when directing output to a peer cluster with different security configuration, for example.
18315
18316
18317 ---
18318
18319 * [HBASE-14799](https://issues.apache.org/jira/browse/HBASE-14799) | *Critical* | **Commons-collections object deserialization remote command execution vulnerability**
18320
18321 This issue resolves a potential security vulnerability. For all versions we update our commons-collections dependency to the release that fixes the reported vulnerability in that library. In 0.98 we additionally disable by default a feature of code carried from 0.94 for backwards compatibility that is not needed.
18322
18323
18324 ---
18325
18326 * [HBASE-12751](https://issues.apache.org/jira/browse/HBASE-12751) | *Major* | **Allow RowLock to be reader writer**
18327
18328 Locks on row are now reader/writer rather than exclusive.
18329
18330 Moves sequenceid out of HRegion and into MVCC class; MVCC is now in charge. A WAL append is still stamped in same way (we pass MVCC context in a few places where we previously we did not).
18331
18332 MVCC methods cleaned up. Make a bit more sense now. Less of them.
18333
18334 Simplifies our update of MemStore/WAL. Now we update memstore AFTER we add to WAL (but before we sync). This fixes possible dataloss when two edits came in with same coordinates; we could order the edits in memstore differently to how they arrived in the WAL.
18335
18336 Marked as an incompatible change because it breaks Distributed Log Replay, a feature we'd determined already was unreliable and to be removed.
18337
18338
18339 ---
18340
18341 * [HBASE-14793](https://issues.apache.org/jira/browse/HBASE-14793) | *Major* | **Allow limiting size of block into L1 block cache.**
18342
18343 Very large blocks can fragment the heap and cause bad issues for the garbage collector, especially the G1GC. Now there is a maximum size that a block can be and still stick in the LruBlockCache. That size defaults to 16mb but can be controlled by changing "hbase.lru.max.block.size"
18344
18345
18346 ---
18347
18348 * [HBASE-14387](https://issues.apache.org/jira/browse/HBASE-14387) | *Major* | **Compaction improvements: Maximum off-peak compaction size**
18349
18350 New configuration option: hbase.hstore.compaction.max.size.offpeak - maximum selection size eligible for minor compaction during off peak hours.
18351 hbase.hstore.compaction.max.size - this is default maximum if no off-peak hours are defined or if no maximum off-peak maximum size is defined.
18352
18353
18354 ---
18355
18356 * [HBASE-12822](https://issues.apache.org/jira/browse/HBASE-12822) | *Minor* | **Option for Unloading regions through region\_mover.rb without Acknowledging**
18357
18358 Incorporated in HBASE-13014.
18359
18360
18361 ---
18362
18363 * [HBASE-14700](https://issues.apache.org/jira/browse/HBASE-14700) | *Major* | **Support a "permissive" mode for secure clusters to allow "simple" auth clients**
18364
18365 Secure HBase now supports a permissive mode to allow mixed secure and insecure clients.  This allows clients to be incrementally migrated over to a secure configuration.  To enable clients to continue to connect using SIMPLE authentication when the cluster is configured for security, set "hbase.ipc.server.fallback-to-simple-auth-allowed" equal to "true" in hbase-site.xml.  NOTE: This setting should ONLY be used as a temporary measure while converting clients over to secure authentication.  It MUST BE DISABLED for secure operation.
18366
18367
18368 ---
18369
18370 * [HBASE-14257](https://issues.apache.org/jira/browse/HBASE-14257) | *Major* | **Periodic flusher only handles hbase:meta, not other system tables**
18371
18372 Memstore periodic flusher used to flush META table every 5 minutes but not any other system tables. This jira extends it to flush all system tables within this time period.
18373
18374
18375 ---
18376
18377 * [HBASE-14658](https://issues.apache.org/jira/browse/HBASE-14658) | *Major* | **Allow loading a MonkeyFactory by class name**
18378
18379 You can specify one of the predefined set of Monkeys when you run Integration Tests by passing the -m\|--monkey arguments on the command line; e.g -m CALM or -m SLOW\_DETERMINISTIC
18380
18381 This patch  makes it so you can pass the name of a class as the monkey to run: e.g. -m org.example.KingKong
18382
18383
18384 ---
18385
18386 * [HBASE-14521](https://issues.apache.org/jira/browse/HBASE-14521) | *Major* | **Unify the semantic of hbase.client.retries.number**
18387
18388 After this change, hbase.client.reties.number universally means the number of retry which is one less than total tries number,  for both non-batch operations like get/scan/increment etc. which uses RpcRetryingCallerImpl#callWithRetries to submit the call or batch operations like put through AsyncProcess#submit.
18389
18390 Note that previously this property means total tries number for puts, so please adjust the setting of its value if necessary. Please also be cautious when setting it to zero since retry is necessary for client cache update when region move happens.
18391
18392
18393 ---
18394
18395 * [HBASE-13819](https://issues.apache.org/jira/browse/HBASE-13819) | *Major* | **Make RPC layer CellBlock buffer a DirectByteBuffer**
18396
18397 For master branch(2.0 version), the BoundedByteBufferPool always create Direct (off heap) ByteBuffers and return that.
18398 For branch-1(1.3 version), byte default the buffers returned will be off heap. This can be changed to return on heap ByteBuffers by configuring 'hbase.ipc.server.reservoir.direct.buffer' to false.
18399
18400
18401 ---
18402
18403 * [HBASE-14517](https://issues.apache.org/jira/browse/HBASE-14517) | *Minor* | **Show regionserver's version in master status page**
18404
18405 Adds server version to the listing of regionservers on the master home page.
18406
18407 if a cluster where the versions deviate, at the bottom of the 'Version' column on the master home page listing of 'Region Servers', you will see a note in red that says something like: 'Total:10              9 nodes with inconsistent version'
18408
18409
18410 ---
18411
18412 * [HBASE-12911](https://issues.apache.org/jira/browse/HBASE-12911) | *Major* | **Client-side metrics**
18413
18414 Introduces collection and reporting of various client-perceived metrics. Metrics are exposed via JMX under "org.apache.hadoop.hbase.client.MetricsConnection". Metrics are scoped according to connection instance, so multiple connection objects (ie, to different clusters) will report their metrics separately. Metrics are disabled by default, must be enabled by configuring "hbase.client.metrics.enable=true".
18415
18416
18417 ---
18418
18419 * [HBASE-14529](https://issues.apache.org/jira/browse/HBASE-14529) | *Major* | **Respond to SIGHUP to reload config**
18420
18421 HBase daemons can now be signaled to reload their config by sending SIGHUP to the java process. Not all config parameters can be reloaded.
18422
18423 In order for this new feature to work the hbase-daemon.sh script was changed to use disown rather than nohup. Functionally this shouldn't change anything but the processes will have a different parent when being run from a connected login shell.
18424
18425
18426 ---
18427
18428 * [HBASE-14502](https://issues.apache.org/jira/browse/HBASE-14502) | *Major* | **Purge use of jmock and remove as dependency**
18429
18430 HBASE-14502 Purge use of jmock and remove as dependency
18431
18432
18433 ---
18434
18435 * [HBASE-14544](https://issues.apache.org/jira/browse/HBASE-14544) | *Major* | **Allow HConnectionImpl to not refresh the dns on errors**
18436
18437 By setting hbase.resolve.hostnames.on.failure to false you can reduce the number of dns name resolutions that a client will do. However if machines leave and come back with different ip's the changes will not be noticed by the clients. So only set hbase.resolve.hostnames.on.failure to false if your cluster dns is not changing while clients are connected.
18438
18439
18440 ---
18441
18442 * [HBASE-14367](https://issues.apache.org/jira/browse/HBASE-14367) | *Major* | **Add normalization support to shell**
18443
18444 This patch adds shell support for region normalizer (see HBASE-13103).
18445
18446 3 commands have been added to hbase shell 'tools' command group (modeled on how the balancer works):
18447
18448  - 'normalizer\_enabled' checks whether region normalizer is turned on
18449  - 'normalizer\_switch' allows user to turn normalizer on and off
18450  - 'normalize' runs region normalizer if it's turned on.
18451
18452 Also 'alter' command has been extended to allow user to enable/disable region normalization per table (disabled by default). Use it as
18453
18454 alter 'testtable', {NORMALIZATION\_MODE =\> 'true'}
18455
18456 Here is the help for the normalize command:
18457
18458 {code}
18459 hbase(main):008:0\> help 'normalize'
18460 Trigger region normalizer for all tables which have NORMALIZATION\_MODE flag set. Returns true
18461  if normalizer ran successfully, false otherwise. Note that this command has no effect
18462  if region normalizer is disabled (make sure it's turned on using 'normalizer\_switch' command).
18463
18464  Examples:
18465
18466    hbase\> normalize
18467 {code}
18468
18469
18470 ---
18471
18472 * [HBASE-14475](https://issues.apache.org/jira/browse/HBASE-14475) | *Major* | **Region split requests are always audited with "hbase" user rather than request user**
18473
18474 Region observer notifications w.r.t. split request are now audited with request user through proper scope of doAs() calls.
18475
18476
18477 ---
18478
18479 * [HBASE-14230](https://issues.apache.org/jira/browse/HBASE-14230) | *Minor* | **replace reflection in FSHlog with HdfsDataOutputStream#getCurrentBlockReplication()**
18480
18481 Remove calling getNumCurrentReplicas on HdfsDataOutputStream via reflection. getNumCurrentReplicas showed up in hadoop 1+ and hadoop 0.2x. In hadoop-2 it was deprecated.
18482
18483
18484 ---
18485
18486 * [HBASE-14495](https://issues.apache.org/jira/browse/HBASE-14495) | *Major* | **TestHRegion#testFlushCacheWhileScanning goes zombie**
18487
18488 The WAL append was changed by HBASE-12751. Every append now sets a latch on an edit. The latch needs to be cleared or else the WAL will hang. The original failures in TestHRegion turned up 'holes' where we were failing to throw the latch if we skipped out early because we were interrupted. Other 'holes' were found where we had mocked up a WAL so the latch would just stay in place.  Futher holes were found appending WAL markers... here we were skipping the mvcc completely for a few edits.  A clean up of WALUtils made all markers take the same code paths.
18489
18490
18491 ---
18492
18493 * [HBASE-14280](https://issues.apache.org/jira/browse/HBASE-14280) | *Minor* | **Bulk Upload from HA cluster to remote HA hbase cluster fails**
18494
18495 Patch will effectively work with Hadoop version 2.6 or greater with a launch of "internal.nameservices".
18496 There will be no change in versions older than 2.6.
18497
18498
18499 ---
18500
18501 * [HBASE-14334](https://issues.apache.org/jira/browse/HBASE-14334) | *Major* | **Move Memcached block cache in to it's own optional module.**
18502
18503 Move external block cache to it's own module. This  will reduce dependencies for people who use hbase-server.
18504 Currently Memcached is the reference implementation for external block cache. External block caches allow HBase to take advantage of other more complex caches that can live longer than the HBase regionserver process and are not necessarily tied to a single computer
18505     life time. However external block caches add in extra operational overhead.
18506
18507
18508 ---
18509
18510 * [HBASE-14433](https://issues.apache.org/jira/browse/HBASE-14433) | *Major* | **Set down the client executor core thread count from 256 in tests**
18511
18512 Tests run with client executors that have core thread count of 4 and a keepalive of 3 seconds. They used to default to 256 core threads and 60 seconds  for keepalive.
18513
18514
18515 ---
18516
18517 * [HBASE-14400](https://issues.apache.org/jira/browse/HBASE-14400) | *Critical* | **Fix HBase RPC protection documentation**
18518
18519 To use rpc protection in HBase, set the value of 'hbase.rpc.protection' to:
18520 'authentication' : simple authentication using kerberos
18521 'integrity' : authentication and integrity
18522 'privacy' : authentication and confidentiality
18523
18524 Earlier, HBase reference guide erroneously mentioned in some places to set the value to 'auth-conf'. This patch fixes the guide and adds temporary support for erroneously recommended values.
18525
18526
18527 ---
18528
18529 * [HBASE-14306](https://issues.apache.org/jira/browse/HBASE-14306) | *Major* | **Refine RegionGroupingProvider: fix issues and make it more scalable**
18530
18531 In HBASE-14306 we've changed default strategy of RegionGroupingProvider from "identify" to "bounded", so it's required to explicitly set "hbase.wal.regiongrouping.strategy" to "identify" if user still wants to use one WAL per region
18532
18533 Please also notice that in the new framework there will be one WAL per group, and the region-group mapping is decided by RegionGroupingStrategy. Accordingly, we've removed BoundedRegionGroupingProvider and added BoundedRegionGroupingStrategy as a replacement. If you already have a customized class for hbase.wal.regiongrouping.strategy, please check the new logic and make updates if necessary.
18534
18535
18536 ---
18537
18538 * [HBASE-6617](https://issues.apache.org/jira/browse/HBASE-6617) | *Major* | **ReplicationSourceManager should be able to track multiple WAL paths**
18539
18540 ReplicationSourceManager now could track multiple wal paths. Notice that although most changes are internal and all metrics names remain the same, signature of below methods in MetricsSource are changed:
18541
18542 1. refreshAgeOfLastShippedOp now requires a String parameter which indicates the wal group id of the reporter
18543 2. setAgeOfLastShippedOp also adds a String parameter for wal group id
18544
18545
18546 ---
18547
18548 * [HBASE-14314](https://issues.apache.org/jira/browse/HBASE-14314) | *Major* | **Metrics for block cache should take region replicas into account**
18549
18550 The following metrics for primary region replica are added:
18551
18552 blockCacheHitCountPrimary
18553 blockCacheMissCountPrimary
18554 blockCacheEvictionCountPrimary
18555
18556
18557 ---
18558
18559 * [HBASE-14317](https://issues.apache.org/jira/browse/HBASE-14317) | *Blocker* | **Stuck FSHLog: bad disk (HDFS-8960) and can't roll WAL**
18560
18561 Tighten up WAL-use semantic.
18562
18563 1. If an append or a sync throws an exception, all subsequent attempts at using the log will also throw this same exception. The WAL is now a lame-duck until you roll it.
18564 2. If a successful append, and then we fail to sync the append, this is a fatal exception. The container must abort to replay the WAL logs even though we have told the client that the appends failed.
18565
18566 The above rules have been applied laxly up to this; it used to be possible to get a good sync to go in over the top of a failed append. This has been fixed in this patch.
18567
18568 Also fixed a hang in the WAL subsystem if a request to pause the write pipeline took on a failed sync. before the roll requests sync got scheduled.
18569
18570
18571 TODO: Revisit our WAL system. HBASE-12751 helps rationalize our write pipeline. In particular, it manages sequenceid inside mvcc which should make it so we can purge mechanism that writes empty, unflushed appends just to get the next sequenceid... problematic when WAL goes lame-duck. Lets get it in.
18572 TODO: A successful append followed by a failed sync probably only needs us replace the WAL (if we have signalled the client that the appends failed). Bummer is that replicating, these last appends might make it to the sink cluster or get replayed during recovery. HBase should keep its own WAL length? Or sequenceid of last successful sync should be passed when doing recovery and replication?
18573
18574
18575 ---
18576
18577 * [HBASE-14261](https://issues.apache.org/jira/browse/HBASE-14261) | *Major* | **Enhance Chaos Monkey framework by adding zookeeper and datanode fault injections.**
18578
18579 This change augments existing chaos monkey framework with actions for restarting underlying zookeeper quorum and hdfs nodes of distributed hbase cluster. One assumption made while creating zk actions are that zookeper ensemble is an independent external service and won't be managed by hbase cluster.  For these actions to work as expected, the following parameters need to be configured appropriately.
18580
18581 {code}
18582 \<property\>
18583   \<name\>hbase.it.clustermanager.hadoop.home\</name\>
18584   \<value\>$HADOOP\_HOME\</value\>
18585 \</property\>
18586 \<property\>
18587   \<name\>hbase.it.clustermanager.zookeeper.home\</name\>
18588   \<value\>$ZOOKEEPER\_HOME\</value\>
18589 \</property\>
18590 \<property\>
18591   \<name\>hbase.it.clustermanager.hbase.user\</name\>
18592   \<value\>hbase\</value\>
18593 \</property\>
18594 \<property\>
18595   \<name\>hbase.it.clustermanager.hadoop.hdfs.user\</name\>
18596   \<value\>hdfs\</value\>
18597 \</property\>
18598 \<property\>
18599   \<name\>hbase.it.clustermanager.zookeeper.user\</name\>
18600   \<value\>zookeeper\</value\>
18601 \</property\>
18602 {code}
18603
18604 The service user related configurations are newly introduced since in prod/test environments each service is managed by different user. Once the above parameters are configured properly, you can start using them as needed. An example usage for invoking these new actions is:
18605
18606 {{./hbase org.apache.hadoop.hbase.IntegrationTestAcidGuarantees -m serverAndDependenciesKilling}}
18607
18608
18609 ---
18610
18611 * [HBASE-14309](https://issues.apache.org/jira/browse/HBASE-14309) | *Major* | **Allow load balancer to operate when there is region in transition by adding force flag**
18612
18613 This issue adds boolean parameter, force, to 'balancer' command so that admin can force region balancing even when there is region (other than hbase:meta) in transition - assuming RIT being transient.
18614 If hbase:meta is in transition, balancer command returns false.
18615
18616 WARNING: For experts only. Forcing a balance may do more damage than repair when assignment is confused
18617 Note: enclose the force parameter in double quotes
18618
18619
18620 ---
18621
18622 * [HBASE-14313](https://issues.apache.org/jira/browse/HBASE-14313) | *Critical* | **After a Connection sees ConnectionClosingException it never recovers**
18623
18624 HConnection could get stuck when talking to a host that went down and then returned. This has been fixed by closing the connection in all paths.
18625
18626
18627 ---
18628
18629 * [HBASE-13339](https://issues.apache.org/jira/browse/HBASE-13339) | *Blocker* | **Update default Hadoop version to latest for master**
18630
18631 Master/2.0.0 now builds on the latest stable hadoop by default.
18632
18633
18634 ---
18635
18636 * [HBASE-14224](https://issues.apache.org/jira/browse/HBASE-14224) | *Critical* | **Fix coprocessor handling of duplicate classes**
18637
18638 Prevent Coprocessors being doubly-loaded; a particular coprocessor can only be loaded once.
18639
18640
18641 ---
18642
18643 * [HBASE-13127](https://issues.apache.org/jira/browse/HBASE-13127) | *Major* | **Add timeouts on all tests so less zombie sightings**
18644
18645 Use junit facility to impose timeout on test. Use test category to chose which timeout to apply: small tests timeout after 30 seconds, medium tests after 180 seconds, and large tests after ten minutes.
18646
18647 Updated junit version from 4.11 to 4.12. 4.12 has support for feature used here.
18648
18649 Add this at the head of your junit4 class to add a category-based timeout:
18650
18651 {code}
18652 @Rule public final TestRule timeout =   CategoryBasedTimeout.builder().withTimeout(this.getClass()).
18653       withLookingForStuckThread(true).build();
18654 {code}
18655
18656 For example:
18657
18658
18659 ---
18660
18661 * [HBASE-14148](https://issues.apache.org/jira/browse/HBASE-14148) | *Major* | **Web UI Framable Page**
18662
18663 Security fix: Adds protection from clickjacking using X-Frame-Options header.
18664 This will prevent use of HBase UI in frames. To disable this feature, set the configuration 'hbase.http.filter.xframeoptions.mode' to 'ALLOW' (default is 'DENY').
18665
18666
18667 ---
18668
18669 * [HBASE-10844](https://issues.apache.org/jira/browse/HBASE-10844) | *Major* | **Coprocessor failure during batchmutation leaves the memstore datastructs in an inconsistent state**
18670
18671 Promotes an -ea assert to logged FATAL and RS abort when memstore is found to be in an inconsistent state.
18672
18673
18674 ---
18675
18676 * [HBASE-13966](https://issues.apache.org/jira/browse/HBASE-13966) | *Minor* | **Limit column width in table.jsp**
18677
18678 Wraps region, start key, end key columns if too long.
18679
18680
18681 ---
18682
18683 * [HBASE-13706](https://issues.apache.org/jira/browse/HBASE-13706) | *Minor* | **CoprocessorClassLoader should not exempt Hive classes**
18684
18685 Starting from HBase 2.0, CoprocessorClassLoader will not exempt hadoop classes or zookeeper classes.  This means that if the custom coprocessor jar contains hadoop or zookeeper packages and classes, they will be loaded by the CoprocessorClassLoader.  Only hbase packages and classes  are exempted from the CoprocessorClassLoader. They (and their dependencies) are loaded by the parent server class loader.
18686
18687
18688 ---
18689
18690 * [HBASE-14054](https://issues.apache.org/jira/browse/HBASE-14054) | *Major* | **Acknowledged writes may get lost if regionserver clock is set backwards**
18691
18692 In {{checkAndPut}} write path use max(max timestamp for the row, System.currentTimeMillis()) in the, instead of blindly taking System.currentTimeMillis() to ensure that checkAndPut() cannot do writes which is already eclipsed. This is similar to what has been done in HBASE-12449 for increment and append.
18693
18694
18695 ---
18696
18697 * [HBASE-13985](https://issues.apache.org/jira/browse/HBASE-13985) | *Minor* | **Add configuration to skip validating HFile format when bulk loading**
18698
18699 A new config, hbase.loadincremental.validate.hfile , is introduced - default to true
18700 When set to false, checking hfile format is skipped during bulkloading.
18701
18702
18703 ---
18704
18705 * [HBASE-14201](https://issues.apache.org/jira/browse/HBASE-14201) | *Major* | **hbck should not take a lock unless fixing errors**
18706
18707 HBCK no longer takes a lock until there are changes to the cluster being made.
18708
18709 The old behavior can be achieved by passing the -exclusive flag.
18710
18711
18712 ---
18713
18714 * [HBASE-14081](https://issues.apache.org/jira/browse/HBASE-14081) | *Minor* | **(outdated) references to SVN/trunk in documentation**
18715
18716 HBASE-14081 Remove (outdated) references to SVN/trunk from documentation
18717
18718
18719 ---
18720
18721 * [HBASE-13865](https://issues.apache.org/jira/browse/HBASE-13865) | *Trivial* | **Increase the default value for hbase.hregion.memstore.block.multipler from 2 to 4 (part 2)**
18722
18723 Increase default hbase.hregion.memstore.block.multiplier from 2 to 4 in the code to match the default value in the config files.
18724
18725
18726 ---
18727
18728 * [HBASE-12295](https://issues.apache.org/jira/browse/HBASE-12295) | *Major* | **Prevent block eviction under us if reads are in progress from the BBs**
18729
18730 We try to delay the eviction of the block till the cellblocks are formed at the Rpc layer. A simple reference counting mechanism is introduced when ever a block is accessed from the Bucket cache.  Once a scanner completes using a block the reference count is decremented.  The eviction of the block happens only when the reference count of that block is 0.
18731 We also introduce a concept of ShareableMemory based on the type of blocks we create from the Block cache. The blocks from the ByteBufferIOEngine directly refer to the buckets in offheap and such blocks are marked SHARED memory type. The blocks from LRU, HDFS and file mode of Bucket cache are all marked EXCLUSIVE because these blocks have their own exclusive memory.
18732 For the CP case, any cell coming out of SHARED memory block is copied before returning the results, because CPs can use the results as its state so that eviction cannot corrupt the results.
18733
18734
18735 ---
18736
18737 * [HBASE-11339](https://issues.apache.org/jira/browse/HBASE-11339) | *Major* | **HBase MOB**
18738
18739 The Moderate Object Storage (MOB) feature (HBASE-11339[1]) is modified I/O and compaction path that allows individual moderately sized values (100KB-10MB) to be stored in a way that write amplification is reduced when compared to the normal I/O path. MOB is defined in the column family and it is almost isolated with other components, the features and performance cannot be effected in normal columns.
18740
18741 For more details on how to use the feature please consult the HBase Reference Guide
18742
18743
18744 ---
18745
18746 * [HBASE-13954](https://issues.apache.org/jira/browse/HBASE-13954) | *Major* | **Remove HTableInterface#getRowOrBefore related server side code**
18747
18748 Removed Table#getRowOrBefore, Region#getClosestRowBefore, Store#getRowKeyAtOrBefore, RemoteHTable#getRowOrBefore apis and Thrift support for getRowOrBefore.
18749 Also removed two coprocessor hooks preGetClosestRowBefore and postGetClosestRowBefore.
18750 User using this api can instead use reverse scan something like below,
18751 {code}
18752  Scan scan = new Scan(row);
18753   scan.setSmall(true);
18754   scan.setCaching(1);
18755   scan.setReversed(true);
18756   scan.addFamily(family);
18757 {code}
18758 pass this scan object to the scanner and retrieve the first Result from scanner output.
18759
18760
18761 ---
18762
18763 * [HBASE-12296](https://issues.apache.org/jira/browse/HBASE-12296) | *Major* | **Filters should work with ByteBufferedCell**
18764
18765 Change to support offheaping.
18766
18767 Incompatible change for filters ColumnPrefixFilter and MultipleColumnPrefixFilter
18768
18769 Changes parameters to filterColumn so takes a Cell rather than a byte [].
18770
18771 hbase-client-1.2.7-SNAPSHOT.jar, ColumnPrefixFilter.class
18772 package org.apache.hadoop.hbase.filter
18773 ColumnPrefixFilter.filterColumn ( byte[ ] buffer, int qualifierOffset, int qualifierLength )  :  Filter.ReturnCode
18774 org/apache/hadoop/hbase/filter/ColumnPrefixFilter.filterColumn:([BII)Lorg/apache/hadoop/hbase/filter/Filter$ReturnCode;
18775
18776 Ditto for filterColumnValue in SingleColumnValueFilter. Takes a Cell instead of byte array.
18777
18778
18779 ---
18780
18781 * [HBASE-14045](https://issues.apache.org/jira/browse/HBASE-14045) | *Major* | **Bumping thrift version to 0.9.2.**
18782
18783 This changes upgrades thrift dependency of HBase to 0.9.2. Though this doesn't break any HBase compatibility promises, it might impact any downstream projects that share thrift dependency with HBase.
18784
18785
18786 ---
18787
18788 * [HBASE-14027](https://issues.apache.org/jira/browse/HBASE-14027) | *Major* | **Clean up netty dependencies**
18789
18790 HBase's convenience binary artifact no longer contains the netty 3.2.4 jar . This jar was not directly used by HBase, but may have been relied on by downstream applications.
18791
18792
18793 ---
18794
18795 * [HBASE-7782](https://issues.apache.org/jira/browse/HBASE-7782) | *Minor* | **HBaseTestingUtility.truncateTable() not acting like CLI**
18796
18797 HBaseTestingUtility now uses the truncate API added in HBASE-8332 so that calls to HBTU.truncateTable will behave like the shell command: effectively dropping the table and recreating a new one with the same split points.
18798
18799 Previously, HBTU.truncateTable instead issued deletes for all the data already in the table. If you wish to maintain the same behavior, you should use the newly added HBTU.deleteTableData method.
18800
18801
18802 ---
18803
18804 * [HBASE-14047](https://issues.apache.org/jira/browse/HBASE-14047) | *Major* | **Cleanup deprecated APIs from Cell class**
18805
18806 The following API from Cell (which were deprecated since past few major versions) are removed now.
18807 getRow
18808 getFamily
18809 getQualifier
18810 getValue
18811 getMvccVersion
18812 The above apis can be replaced with their respective CellUtil#cloneXXX (allocates a copy) or Cell#getXXXArray (essentially just returns a pointer) based on the use case.
18813
18814
18815 ---
18816
18817 * [HBASE-14029](https://issues.apache.org/jira/browse/HBASE-14029) | *Major* | **getting started for standalone still references hadoop-version-specific binary artifacts**
18818
18819 HBASE-14029 Correct documentation for Hadoop version specific artifacts
18820
18821
18822 ---
18823
18824 * [HBASE-13849](https://issues.apache.org/jira/browse/HBASE-13849) | *Major* | **Remove restore and clone snapshot from the WebUI**
18825
18826 The HBase master status web page no longer allows operators to clone snapshots nor restore snapshots.
18827
18828
18829 ---
18830
18831 * [HBASE-13646](https://issues.apache.org/jira/browse/HBASE-13646) | *Major* | **HRegion#execService should not try to build incomplete messages**
18832
18833 When RegionServerCoprocessors throw an exception we will no longer attempt to build an incomplete RPC response message. Instead, the response message will be null.
18834
18835
18836 ---
18837
18838 * [HBASE-13639](https://issues.apache.org/jira/browse/HBASE-13639) | *Major* | **SyncTable - rsync for HBase tables**
18839
18840 Tool to sync two tables that tries to send the differences only like rsync.
18841
18842 Adds two new MapReduce jobs, SyncTable and HashTable. See usage for these jobs on how to use. See design doc for generally overview: https://docs.google.com/document/d/1-2c9kJEWNrXf5V4q\_wBcoIXfdchN7Pxvxv1IO6PW0-U/edit
18843
18844 From comments below, "It can be challenging to run against a table getting live writes, if those writes are updates/overwrites. In general, you can run it against a time range to ignore new writes, but if those writes update existing cells, then the time range scan may or may not see older versions of those cells depending on whether major compaction has happened, which may be different in remote clusters."
18845
18846
18847 ---
18848
18849 * [HBASE-13895](https://issues.apache.org/jira/browse/HBASE-13895) | *Critical* | **DATALOSS: Region assigned before WAL replay when abort**
18850
18851 If the master went to assign a region concurrent with a RegionServer abort, the returned RegionServerAbortedException was being handled as though the region had been cleanly offlined so assign was allowed proceed. If the region was opened in its new location before WAL replay completion, the replayed edits were ignored, worst case, or were later played over the top of edits that had come in since open and so susceptible to overwrite. In either case, DATALOSS.
18852
18853
18854 ---
18855
18856 * [HBASE-13983](https://issues.apache.org/jira/browse/HBASE-13983) | *Minor* | **Doc how the oddball HTable methods getStartKey, getEndKey, etc. will be removed in 2.0.0**
18857
18858 Adds extra doc on getStartKeys, getEndKeys, and getStartEndKeys in HTable explaining that they will be removed in 2.0.0 (these methods did not get the proper full major version deprecation cycle).
18859
18860 In this issue, we actually also remove these methods in master/2.0.0 branch.
18861
18862
18863 ---
18864
18865 * [HBASE-13747](https://issues.apache.org/jira/browse/HBASE-13747) | *Critical* | **Promote Java 8 to "yes" in support matrix**
18866
18867 Java 8 is considered supported and tested as of HBase 1.2+
18868
18869
18870 ---
18871
18872 * [HBASE-13959](https://issues.apache.org/jira/browse/HBASE-13959) | *Critical* | **Region splitting uses a single thread in most common cases**
18873
18874 The performance of region splitting has been improved by using a thread pool to split the store files concurrently. Prior to this change, the store files were always split sequentially in a single thread, so a region with multiple store files ended up taking several seconds. The thread pool is sized dynamically with the aim of getting maximum concurrency, without exceeding the number of cores available for HBase Java process. A lower limit for the thread pool can be explicitly set using the property hbase.regionserver.region.split.threads.max.
18875
18876
18877 ---
18878
18879 * [HBASE-13930](https://issues.apache.org/jira/browse/HBASE-13930) | *Major* | **Exclude Findbugs packages from shaded jars**
18880
18881 Exclude Findbugs packages from shaded jars
18882
18883
18884 ---
18885
18886 * [HBASE-13214](https://issues.apache.org/jira/browse/HBASE-13214) | *Major* | **Remove deprecated and unused methods from HTable class**
18887
18888 **WARNING: No release note provided for this change.**
18889
18890
18891 ---
18892
18893 * [HBASE-13869](https://issues.apache.org/jira/browse/HBASE-13869) | *Trivial* | **Fix typo in HBase book**
18894
18895 Fix typo in HBase book
18896
18897
18898 ---
18899
18900 * [HBASE-13938](https://issues.apache.org/jira/browse/HBASE-13938) | *Major* | **Deletes done during the region merge transaction may get eclipsed**
18901
18902 Use the master's timestamp when sending hbase:meta edits on region merge to ensure proper ordering of new region addition and old region deletes.
18903
18904
18905 ---
18906
18907 * [HBASE-13898](https://issues.apache.org/jira/browse/HBASE-13898) | *Minor* | **correct additional javadoc failures under java 8**
18908
18909 Correct Javadoc generation errors
18910
18911
18912 ---
18913
18914 * [HBASE-13103](https://issues.apache.org/jira/browse/HBASE-13103) | *Major* | **[ergonomics] add region size balancing as a feature of master**
18915
18916 This patch adds optional ability for HMaster to normalize regions in size (disabled by default, change hbase.normalizer.enabled property to true to turn it on). If enabled, HMaster periodically (every 30 minutes by default) monitors tables for which normalization is enabled in table configuration and performs splits/merges as seems appropriate. Users may implement their own normalization strategies by implementing RegionNormalizer interface and configuring it in hbase-site.xml.
18917
18918
18919 ---
18920
18921 * [HBASE-13900](https://issues.apache.org/jira/browse/HBASE-13900) | *Minor* | **duplicate methods between ProtobufMagic and ProtobufUtil**
18922
18923 Use ProtobufMagic methods in ProtobufUtil
18924
18925
18926 ---
18927
18928 * [HBASE-13843](https://issues.apache.org/jira/browse/HBASE-13843) | *Trivial* | **Fix internal constant text in ReplicationManager.java**
18929
18930 In previous versions of HBase, the ReplicationAdmin utility erroneously used the string key "columnFamlyName" when listing replicated column families. It now uses the corrected spelling of "columnFamilyName" (note the added "i").
18931
18932 Downstream code that parsed the replication entries returned from listReplicated will need to be updated to use the new key. Previously compiled code that relied on the static CFNAME member of ReplicationAdmin will need to be recompiled in order to see the updated value.
18933
18934
18935 ---
18936
18937 * [HBASE-13886](https://issues.apache.org/jira/browse/HBASE-13886) | *Major* | **Return empty value when the mob file is corrupt instead of throwing exceptions**
18938
18939 By default the Get/Scan will throw Exception when it is not able to find a mob cell because the mob file is missing/corrupted. This jira adds a facility to continue scan/get and get other cells with mob cell value as empty. Set an attribute MobConstants.EMPTY\_VALUE\_ON\_MOBCELL\_MISS = true in Scan/Get for getting this behaviour
18940
18941
18942 ---
18943
18944 * [HBASE-13686](https://issues.apache.org/jira/browse/HBASE-13686) | *Major* | **Fail to limit rate in RateLimiter**
18945
18946 As per this jira contribution. We now support two kinds of RateLimiter.
18947 1) org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter : This limiter will refill resources at every TimeUnit/resources interval.
18948 Example: For a limiter configured with 10resources/second, then 1resource will be refilled after every 100ms.
18949
18950 2) org.apache.hadoop.hbase.quotas.FixedIntervalRateLimiter: This limiter will refill resources only after a given fixed interval of time.
18951
18952 Client can configure anyone of this rate limiter for the cluster by setting the value for the property "hbase.quota.rate.limiter" in the hbase-site.xml. org.apache.hadoop.hbase.quotas.AverageIntervalRateLimiter is the default value.
18953 Note: Client needs to restart the cluster for the configuration to take into effect.
18954
18955
18956 ---
18957
18958 * [HBASE-13816](https://issues.apache.org/jira/browse/HBASE-13816) | *Major* | **Build shaded modules only in release profile**
18959
18960 hbase-shaded-client and hbase-shaded-server modules will not build the actual jars unless -Prelease is supplied in mvn.
18961
18962
18963 ---
18964
18965 * [HBASE-13754](https://issues.apache.org/jira/browse/HBASE-13754) | *Major* | **Allow non KeyValue Cell types also to oswrite**
18966
18967 This jira has removed the already deprecated method
18968 KeyValue#oswrite(final KeyValue kv, final OutputStream out)
18969
18970
18971 ---
18972
18973 * [HBASE-13375](https://issues.apache.org/jira/browse/HBASE-13375) | *Major* | **Provide HBase superuser higher priority over other users in the RPC handling**
18974
18975 This JIRA modifies the signature of PriorityFunction#getPriority() method to also take request user as a parameter; all RPC requests sent by super users (as determined by cluster configuration) are executed with Admin QoS.
18976
18977
18978 ---
18979
18980 * [HBASE-5980](https://issues.apache.org/jira/browse/HBASE-5980) | *Minor* | **Scanner responses from RS should include metrics on rows/KVs filtered**
18981
18982 Adds scan metrics to the result. In the shell, set the ALL\_METRICS attribute to true on your scan to see dump of metrics after results (see the scan help for examples).
18983
18984 If you would prefer to see only a subset of the metrics, the METRICS array can be defined to include the names of only the metrics you care about.
18985
18986
18987 ---
18988
18989 * [HBASE-13698](https://issues.apache.org/jira/browse/HBASE-13698) | *Major* | **Add RegionLocator methods to Thrift2 proxy.**
18990
18991 Added getRegionLocation and getAllRegionLocations to the thrift2 interface.
18992
18993
18994 ---
18995
18996 * [HBASE-13636](https://issues.apache.org/jira/browse/HBASE-13636) | *Major* | **Remove deprecation for HBASE-4072 (Reading of zoo.cfg)**
18997
18998 Purge support for parsing zookeepers zoo.cfg deprecated since hbase-0.96.0
18999
19000
19001 ---
19002
19003 * [HBASE-13071](https://issues.apache.org/jira/browse/HBASE-13071) | *Major* | **Hbase Streaming Scan Feature**
19004
19005 MOTIVATION
19006
19007 A pipelined scan API is introduced for speeding up applications that combine massive data traversal with compute-intensive processing. Traditional HBase scans save network trips through prefetching the data to the client side cache. However, they prefetch synchronously: the fetch request to regionserver is invoked only when the entire cache is consumed. This leads to a stop-and-wait access pattern, in which the client stalls until the next chunk of data is fetched. Applications that do significant processing can benefit from background data prefetching, which eliminates this bottleneck. The pipelined scan implementation overlaps the cache population at the client side with application processing. Namely, it issues a new scan RPC when the iteration retrieves 50% of the cache. If the application processing (that is, the time between invocations of next()) is substantial, the new chunk of data will be available before the previous one is exhausted, and the client will not experience any delay. Ideally, the prefetch and the processing times should be balanced.
19008
19009 API AND CONFIGURATION
19010
19011 Asynchronous scanning can be configured either globally for all tables and scans, or on per-scan basis via a new Scan class API.
19012
19013 Configuration in hbase-site.xml: hbase.client.scanner.async.prefetch, default false:
19014
19015  \<property\>
19016    \<name\>hbase.client.scanner.async.prefetch\</name\>
19017    \<value\>true\</value\>
19018  \</property\>
19019
19020 API - Scan#setAsyncPrefetch(boolean)
19021
19022       Scan scan = new Scan();
19023       scan.setCaching(1000);
19024       scan.setMaxResultSize(BIG\_SIZE);
19025       scan.setAsyncPrefetch(true);
19026         ...
19027       ResultScanner scanner = table.getScanner(scan);
19028
19029 IMPLEMENTATION NOTES
19030
19031 Pipelined scan is implemented by a new ClientAsyncPrefetchScanner class, which is fully API-compatible with the synchronous ClientSimpleScanner. ClientAsyncPrefetchScanner is not instantiated in case of small (Scan#setSmall) and reversed (Scan#setReversed) scanners. The application is responsible for setting the prefetch size in a way that the prefetch time and the processing times are balanced. Note that due to double buffering, the client side cache can use twice as much memory as the synchronous scanner.
19032
19033 Generally, this feature will put more load on the server (higher fetch rate -- which is the whole point).  Also, YMMV.
19034
19035
19036 ---
19037
19038 * [HBASE-13533](https://issues.apache.org/jira/browse/HBASE-13533) | *Trivial* | **section on configuring ~/.m2/settings.xml has no anchor**
19039
19040 Correct setting.xml anchor in book
19041
19042
19043 ---
19044
19045 * [HBASE-13625](https://issues.apache.org/jira/browse/HBASE-13625) | *Major* | **Use HDFS for HFileOutputFormat2 partitioner's path**
19046
19047 Introduces a new config hbase.fs.tmp.dir which is a directory in HDFS (or default file system) to use as a staging directory for HFileOutputFormat2. This is also used as the default for hbase.bulkload.staging.dir
19048
19049
19050 ---
19051
19052 * [HBASE-10800](https://issues.apache.org/jira/browse/HBASE-10800) | *Major* | **Use CellComparator instead of KVComparator**
19053
19054 From 2.0 branch onwards KVComparator and its subclasses MetaComparator, RawBytesComparator are all deprecated.
19055 All the comparators are moved to CellComparator.  MetaCellComparator, a subclass of CellComparator, will be used to compare hbase:meta cells.
19056 Previously exposed static instances KeyValue.COMPARATOR, KeyValue.META\_COMPARATOR and KeyValue.RAW\_COMPARATOR are deprecated instead use CellComparator.COMPARATOR and CellComparator.META\_COMPARATOR.
19057 Also note that there will be no RawBytesComparator.  Where ever we need to compare raw bytes use Bytes.BYTES\_RAWCOMPARATOR.
19058 CellComparator will always operate on cells and its components, abstracting the fact that a cell can be backed by a single byte[] as opposed to how KVComparators were working.
19059
19060
19061 ---
19062
19063 * [HBASE-13333](https://issues.apache.org/jira/browse/HBASE-13333) | *Major* | **Renew Scanner Lease without advancing the RegionScanner**
19064
19065 Adds a renewLease call to ClientScanner
19066
19067
19068 ---
19069
19070 * [HBASE-13564](https://issues.apache.org/jira/browse/HBASE-13564) | *Major* | **Master MBeans are not published**
19071
19072 To use the coprocessor-based JMX implementation provided by HBase for Master.
19073 Add below property in hbase-site.xml file:
19074
19075 \<property\>
19076   \<name\>hbase.coprocessor.master.classes\</name\>
19077   \<value\>org.apache.hadoop.hbase.JMXListener\</value\>
19078 \</property\>
19079
19080 NOTE: DO NOT set \`com.sun.management.jmxremote.port\` for Java VM at the same time.
19081
19082 By default, the JMX listens on TCP port 10101 for Master, we can further configure the port using below properties:
19083
19084 \<property\>
19085   \<name\>master.rmi.registry.port\</name\>
19086   \<value\>61110\</value\>
19087 \</property\>
19088 \<property\>
19089   \<name\>master.rmi.connector.port\</name\>
19090   \<value\>61120\</value\>
19091 \</property\>
19092 ----
19093
19094 The registry port can be shared with connector port in most cases, so you only need to configure master.rmi.registry.port.
19095 However if you want to use SSL communication, the 2 ports must be configured to different values.
19096
19097
19098 ---
19099
19100 * [HBASE-13537](https://issues.apache.org/jira/browse/HBASE-13537) | *Major* | **Procedure V2 - Change the admin interface for async operations to return Future (incompatible with branch-1.x)**
19101
19102 As we made changes to return types in asynchronous methods of Admin API, this change is going to break binary compatibility. The source compatibility is kept intact though. The applications running against this change needs to be recompiled to keep things working.
19103
19104
19105 ---
19106
19107 * [HBASE-13517](https://issues.apache.org/jira/browse/HBASE-13517) | *Major* | **Publish a client artifact with shaded dependencies**
19108
19109 HBase now provides added convenience artifacts that shade most dependencies. These jars hbase-shaded-client and hbase-shaded-server are meant to be used when dependency conflicts can not be solved any other way. The normal jars hbase-client and hbase-server should still be preferred when possible.
19110
19111 Do not use hbase-shaded-server or hbase-shaded-client inside of a co-processor as bad things will happen.
19112
19113
19114 ---
19115
19116 * [HBASE-13149](https://issues.apache.org/jira/browse/HBASE-13149) | *Blocker* | **HBase MR is broken on Hadoop 2.5+ Yarn**
19117
19118 In HBase 1.1.0 and above we have upgraded the version of Jackson dependencies (jackson-core-asl, jackson-mapper-asl, jackson-jaxrs and jackson-xc) from 1.8.8 to 1.9.13. This is to follow the upgrade to Jackson 1.9.13 in Hadoop 2.5 and above which causes Jackson class incompatibility for HBase as reported in HBASE-13149.  Refer to HADOOP-10104 and YARN-2092 for additional information. Jackson1.9.13 is not completely backward compatible with the prior version 1.8.8 used in HBase. See the Compatibility reports attached in HBASE-13149 and http://svn.codehaus.org/jackson/trunk/release-notes/VERSION for more information.
19119
19120 This upgrade does not have direct impact on HBase users and HBase applications in most cases. In the rare case where your HBase application uses Jackson directly AND your application has compatibility issue with Jackson 1.9.13, you can do the following to mitigate the problem.
19121
19122 1. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, we recommend you update your application to use Jackson 1.9.13. You may be able to explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you, but the general recommendation is that you upgrade to Jackson 1.9.13.
19123 2. You may choose to continue using Jackson 1.8.8 and not to use Jackson 1.9.13 in your classpath.  You can also choose to replace the Jackson 1.9.13 jars in $HBASE\_HOME/lib with 1.8.8 jars.  It can work for you in the following cases:
19124 a) You are on a Hadoop version earlier than Hadoop 2.5,  or
19125 b) You are on Hadoop 2.5 or above, but your HBase application does not involve running Yarn jobs.
19126 3. You may experiment with further isolation using the shaded jars introduced with 1.1.0 via HBASE-13517.
19127
19128 Note that it may not be tested or guaranteed that using Jackson 1.8.8 in $HBASE\_HOME/lib will work in future HBase releases.
19129 It is recommended that your HBase application matches the Jackson version provided in HBase.
19130
19131 In HBase 0.98.x and HBase 1.0.x, we have NOT upgraded the version of Jackson dependencies. If you are on Hadoop 2.5 or above, and your HBase application involves running Yarn jobs, you may encounter Jackson class incomparability issue, as reported in HBASE-13149.
19132
19133 You can do the following to mitigate the problem:
19134 1. Use 'hadoop jar' command to run your HBase jobs.
19135 2. Explore classpath isolation options (e.g. HADOOP-10893) or have your own classpath isolation strategy that works for you.
19136 3. You can also choose to replace the Jackson 1.8.8 jars in $HBASE\_HOME/lib with 1.9.13 jars from your Hadoop lib directory. We have tested HBase 0.98 with Jackson 1.9.13.
19137
19138
19139 ---
19140
19141 * [HBASE-13481](https://issues.apache.org/jira/browse/HBASE-13481) | *Major* | **Master should respect master (old) DNS/bind related configurations**
19142
19143 Master now honors configuration options as was before 1.0.0 releases:
19144 hbase.master.ipc.address
19145 hbase.master.dns.interface
19146 hbase.master.dns.nameserver
19147 hbase.master.info.bindAddress
19148 This jira also adds hbase.master.hostname parameter as an extension to HBASE-12954.
19149
19150
19151 ---
19152
19153 * [HBASE-13090](https://issues.apache.org/jira/browse/HBASE-13090) | *Major* | **Progress heartbeats for long running scanners**
19154
19155 Previously, there was no way to enforce a time limit on scan RPC requests. The server would receive a scan RPC request and take as much time as it needed to accumulate enough results to reach a limit or exhaust the region. The problem with this approach was that, in the case of a very selective scan, the processing of the scan could take too long and cause timeouts client side.
19156
19157 With this fix, the server will now enforce a time limit on the execution of scan RPC requests. When a scan RPC request arrives to the server, a time limit is calculated to be half of whichever timeout value is more restictive between the configurations ("hbase.client.scanner.timeout.period" and "hbase.rpc.timeout"). When the time limit is reached, the server will return whatever results it has accumulated up to that point. The results may be empty.
19158
19159 To ensure that timeout checks do not occur too often (which would hurt the performance of scans), the configuration "hbase.cells.scanned.per.heartbeat.check" has been introduced. This configuration controls how often System.currentTimeMillis() is called to update the progress towards the time limit. Currently, the default value of this configuration value is 10000. Specifying a smaller value will provide a tighter bound on the time limit, but may hurt scan performance due to the higher frequency of calls to System.currentTimeMillis().
19160
19161 Protobuf models for ScanRequest and ScanResponse have been updated so that heartbeat support can be communicated. Support for heartbeat messages is specified in the request sent to the server via ScanRequest.Builder#setClientHandlesHeartbeats. Only when the server sees that ScanRequest#getClientHandlesHeartbeats() is true will it send heartbeat messages back to the client. A response is marked as a heartbeat message via the boolean flag ScanResponse#getHeartbeatMessage
19162
19163
19164 ---
19165
19166 * [HBASE-13307](https://issues.apache.org/jira/browse/HBASE-13307) | *Major* | **Making methods under ScannerV2#next inlineable, faster**
19167
19168 Made methods smaller under Scanner#next so inlinable and compilable (was getting 'too big to compile' from hotspot). Use of unsafe to parse shorts rather than use BB#getShort... faster, etc.
19169
19170
19171 ---
19172
19173 * [HBASE-13453](https://issues.apache.org/jira/browse/HBASE-13453) | *Critical* | **Master should not bind to region server ports**
19174
19175 In 1.0.x, master by default binds to the region server ports (both rpc and info). This change brings back the usage of old master rpc and info ports in 1.1+ and master (2.0) branches. The motivation for this change is to ease the life of the user so that he does not need to do anything to bring up a RS on the same host and also to make the migration from 0.98 to 1.1  hassle free.  However, the users going from 1.0 to 1.1 would see the change in the master ports.
19176
19177
19178 ---
19179
19180 * [HBASE-13419](https://issues.apache.org/jira/browse/HBASE-13419) | *Major* | **Thrift gateway should propagate text from exception causes.**
19181
19182 Compose thrift exception text from the text of the entire cause chain of the underlying exception.
19183
19184
19185 ---
19186
19187 * [HBASE-13275](https://issues.apache.org/jira/browse/HBASE-13275) | *Major* | **Setting hbase.security.authorization to false does not disable authorization**
19188
19189 Prior to this change the configuration setting 'hbase.security.authorization' had no effect if security coprocessor were installed. The act of installing the security coprocessors was assumed to indicate active authorizaton was desired and required. Now it is possible to install the security coprocessors yet have them operate in a passive state with active authorization disabled by setting 'hbase.security.authorization' to false. This can be useful but is probably not what you want. For more information, consult the Security section of the HBase online manual.
19190
19191 'hbase.security.authorization' defaults to true for backwards comptatible behavior.
19192
19193
19194 ---
19195
19196 * [HBASE-13118](https://issues.apache.org/jira/browse/HBASE-13118) | *Major* | **[PE] Add being able to write many columns**
19197
19198 Adds a --columns option to PE so you can write more than one column (changes default qualifier from 'data' to '0').
19199
19200
19201 ---
19202
19203 * [HBASE-13270](https://issues.apache.org/jira/browse/HBASE-13270) | *Major* | **Setter for Result#getStats is #addResults; confusing!**
19204
19205 Deprecates Result#addResults in favor of Result#setStatistics
19206
19207
19208 ---
19209
19210 * [HBASE-13362](https://issues.apache.org/jira/browse/HBASE-13362) | *Major* | **Set max result size from client only (like scanner caching).**
19211
19212 This introduces a new config option: hbase.server.scanner.max.result.size
19213 This setting enforces a maximum result size (in bytes), when reached the server will return the results is has so far.
19214 This is a safety setting and should be kept large. The default is inifinite in 0.98 and 1.0.x and 100mb in 1.1 and later.
19215
19216 Use hbase.client.scanner.max.result.size instead to enforce practical chunk sizes of a few mb (defaults to 2mb)
19217
19218
19219 ---
19220
19221 * [HBASE-11544](https://issues.apache.org/jira/browse/HBASE-11544) | *Critical* | **[Ergonomics] hbase.client.scanner.caching is dogged and will try to return batch even if it means OOME**
19222
19223 Results returned from RPC calls may now be returned as partials
19224
19225 When is a Result marked as a partial?
19226 When the server must stop the scan because the max size limit has been reached. Means that the LAST Result returned within the ScanResult's Result array may be marked as a partial if the scan's max size limit caused it to stop in the middle of a row.
19227
19228 Incompatible Change: The return type of InternalScanners#next and RegionScanners#nextRaw has been changed to NextState from boolean
19229 The previous boolean return value can be accessed via NextState#hasMoreValues()
19230 Provides more context as to what happened inside the scanner
19231
19232 Scan caching default has been changed to Integer.Max\_Value
19233 This value works together with the new maxResultSize value from HBASE-12976 (defaults to 2MB)
19234 Results returned from server on basis of size rather than number of rows
19235 Provides better use of network since row size varies amongst tables
19236
19237 Protobuf models have changed for Result, ScanRequest, and ScanResponse to support new partial Results
19238
19239 Partial Results should be invisible to application layer unless Scan#setAllowPartials is set
19240
19241 Scan#setAllowPartials has been added to allow the application to request to see the partial Results returned by the server rather than have the ClientScanner form the complete Result prior to returning it to the application
19242
19243 To disable the use of partial Results on the server, set ScanRequest.Builder#setClientHandlesPartials() to be false in the ScanRequest issued to server
19244
19245 Partial Results should allow the server to return large rows in parts rather than accumulate all the cells for that particular row and run out of memory
19246
19247
19248 ---
19249
19250 * [HBASE-11864](https://issues.apache.org/jira/browse/HBASE-11864) | *Minor* | **Enhance HLogPrettyPrinter to print information from WAL Header**
19251
19252 Enhance WALPrettyPrinter to print information (writer classnames and cell codec classname) from WAL Header
19253
19254
19255 ---
19256
19257 * [HBASE-13289](https://issues.apache.org/jira/browse/HBASE-13289) | *Major* | **typo in splitSuccessCount  metric**
19258
19259 In hbase 1.0.0, 0.98.10, 0.98.10.1, 0.98.11, and 0.98.12 'splitSuccessCount' was misspelled as 'splitSuccessCounnt'
19260
19261
19262 ---
19263
19264 * [HBASE-12990](https://issues.apache.org/jira/browse/HBASE-12990) | *Major* | **MetaScanner should be replaced by MetaTableAccessor**
19265
19266 Removes MetaScanner. Use MetaTableAccessor instead.
19267
19268
19269 ---
19270
19271 * [HBASE-13373](https://issues.apache.org/jira/browse/HBASE-13373) | *Major* | **Squash HFileReaderV3 together with HFileReaderV2 and AbstractHFileReader; ditto for Scanners and BlockReader, etc.**
19272
19273 Marking as incompatible change. Requires hfiles be major version \>= 2 and \>= minor version 3.  Version 3 files are enabled by default in 1.0.  0.98 writes version 2 minor version 3.  You cannot go to 1.0 from anything before 0.98.
19274
19275
19276 ---
19277
19278 * [HBASE-13252](https://issues.apache.org/jira/browse/HBASE-13252) | *Major* | **Get rid of managed connections and connection caching**
19279
19280 For a long time, HBase supported 2 types of connections - managed, which were cached and closed automatically when not needed, and unmanaged, where user is responsible for closing the connections by calling #close() on them.
19281
19282 The concept of managed connections in HBase (deprecated before) has now been extinguished completely, and now all callers are responsible for managing the lifecycle of connections they acquire.
19283
19284
19285 ---
19286
19287 * [HBASE-12954](https://issues.apache.org/jira/browse/HBASE-12954) | *Minor* | **Ability impaired using HBase on multihomed hosts**
19288
19289 The following config is added by this JIRA:
19290
19291 hbase.regionserver.hostname
19292
19293 This config is for experts: don't set its value unless you really know what you are doing.
19294 When set to a non-empty value, this represents the (external facing) hostname for the underlying server.
19295 See https://issues.apache.org/jira/browse/HBASE-12954 for details.
19296
19297 Caution: please make sure rolling upgrade succeeds before turning on this feature.
19298
19299
19300 ---
19301
19302 * [HBASE-13187](https://issues.apache.org/jira/browse/HBASE-13187) | *Critical* | **Add ITBLL that exercises per CF flush**
19303
19304 Pass the -D flag generator.multiple.columnfamilies on the command-line if you want the generator to write three column families rather than the default one. When set, we will write the usual 'meta' column family and use it checking linked-list is wholesome but we will also write a 'tiny' column family and a 'big' column family to provoke uneven flushing; good for testing the flush-by-columnfamily feature.
19305
19306
19307 ---
19308
19309 * [HBASE-13361](https://issues.apache.org/jira/browse/HBASE-13361) | *Minor* | **Remove or undeprecate {get\|set}ScannerCaching in HTable**
19310
19311 Removed getScannerCaching and setScannerCaching from Table
19312
19313
19314 ---
19315
19316 * [HBASE-10728](https://issues.apache.org/jira/browse/HBASE-10728) | *Major* | **get\_counter value is never used.**
19317
19318 for 0.98 and 1.0 changes are compatible (due to mitigation by HBASE-13433):
19319
19320 \* The "get\_counter" command no longer requires a dummy 4th argument. Downstream users are encouraged to migrate code to not pass this argument because it will result in an error for HBase 1.1+.
19321 \* The "incr" command now outputs the current value of the counter to stdout.
19322 ex:
19323 {code}
19324 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19325 COUNTER VALUE = 1772
19326 0 row(s) in 0.1180 seconds
19327 {code}
19328
19329 for 1.1+ changes are incompatible:
19330
19331 \* The "get\_counter" command no longer accepts a dummy 4th argument. Downstream users will need to update their code to not pass this argument.
19332 ex:
19333 {code}
19334 jruby-1.6.8 :006 \> get\_counter 'counter\_example', 'r1', 'cf1:foo'
19335 COUNTER VALUE = 1772
19336
19337 {code}
19338 \* The "incr" command now outputs the current value of the counter to stdout.
19339 ex:
19340 {code}
19341 jruby-1.6.8 :005 \> incr 'counter\_example', 'r1', 'cf1:foo', 10
19342 COUNTER VALUE = 1772
19343 0 row(s) in 0.1180 seconds
19344 {code}
19345
19346
19347 ---
19348
19349 * [HBASE-13170](https://issues.apache.org/jira/browse/HBASE-13170) | *Major* | **Allow block cache to be external**
19350
19351 HBase can use memcached as an external block cache. To use this change your config to set hbase.blockcache.use.external to true and hbase.cache.memcached.servers to contain the list of memcached servers to use.
19352
19353
19354 ---
19355
19356 * [HBASE-13316](https://issues.apache.org/jira/browse/HBASE-13316) | *Minor* | **Reduce the downtime on planned moves of regions**
19357
19358 When issuing an Admin.move command the RegionServer that receive the region will try and open the StoreFiles of that region to prime the block cache with index blocks.
19359
19360
19361 ---
19362
19363 * [HBASE-13298](https://issues.apache.org/jira/browse/HBASE-13298) | *Critical* | **Clarify if Table.{set\|get}WriteBufferSize() is deprecated or not**
19364
19365 Deprecate said methods. They were mistakenly included in Table Interface.
19366
19367
19368 ---
19369
19370 * [HBASE-13248](https://issues.apache.org/jira/browse/HBASE-13248) | *Major* | **Make HConnectionImplementation top-level class.**
19371
19372 **WARNING: No release note provided for this change.**
19373
19374
19375 ---
19376
19377 * [HBASE-13331](https://issues.apache.org/jira/browse/HBASE-13331) | *Blocker* | **Exceptions from DFS client can cause CatalogJanitor to delete referenced files**
19378
19379 Fixes an issue where files from a split region that were still referenced were erroneously deleted leading to data loss.
19380
19381
19382 ---
19383
19384 * [HBASE-13273](https://issues.apache.org/jira/browse/HBASE-13273) | *Major* | **Make Result.EMPTY\_RESULT read-only; currently it can be modified**
19385
19386 The Result.EMPTY\_RESULT object is now immutable. In previous releases, the object could be modified by a caller to no longer be empty. Code that relies on this behavior will now receive an UnsupportedOperationException.
19387
19388
19389 ---
19390
19391 * [HBASE-12867](https://issues.apache.org/jira/browse/HBASE-12867) | *Major* | **Shell does not support custom replication endpoint specification**
19392
19393 Adds support to add\_peer in hbase shell to add a custom replication endpoint from HBASE-12254.
19394
19395
19396 ---
19397
19398 * [HBASE-13198](https://issues.apache.org/jira/browse/HBASE-13198) | *Major* | **Remove HConnectionManager**
19399
19400 **WARNING: No release note provided for this change.**
19401
19402
19403 ---
19404
19405 * [HBASE-12586](https://issues.apache.org/jira/browse/HBASE-12586) | *Major* | **Task 6 & 7 from HBASE-9117,  delete all public HTable constructors and delete ConnectionManager#{delete,get}Connection**
19406
19407 HTable class has been marked as private API before, and now it's no longer directly instantiable from client code (all public constructors have been removed). All clients should use Connection#getTable() and Connection#getRegionLocator() when appropriate to obtain Table and RegionLocator implementations to work with.
19408
19409
19410 ---
19411
19412 * [HBASE-13171](https://issues.apache.org/jira/browse/HBASE-13171) | *Minor* | **Change AccessControlClient methods to accept connection object to reduce setup time.**
19413
19414 **WARNING: No release note provided for this change.**
19415
19416
19417 ---
19418
19419 * [HBASE-12706](https://issues.apache.org/jira/browse/HBASE-12706) | *Critical* | **Support multiple port numbers in ZK quorum string**
19420
19421 hbase.zookeeper.quorum configuration now allows servers together with client ports consistent with the way Zookeeper java client accepts the quorum string. In this case, using hbase.zookeeper.clientPort is not needed. eg.  hbase.zookeeper.quorum=myserver1:2181,myserver2:20000,myserver3:31111
19422
19423
19424 ---
19425
19426 * [HBASE-13142](https://issues.apache.org/jira/browse/HBASE-13142) | *Major* | **[PERF] Reuse the IPCUtil#buildCellBlock buffer**
19427
19428 Adds buffer reuse sending Cell results. It is on by default and should not need configuration. Improves GC profile and ups throughput. The benefit gets better the larger the row size returned.
19429
19430 The buffer reservoir is bounded at a maximum count after which we will start logging at WARN level that the reservoir is running at capacity (returned buffers will be discarded and not added back to the reservoir pool). Default maximum is twice the handler count: i.e. 2 \* hbase.regionserver.handler.count. This should be more than enough. Set the maximum with the new configuration: hbase.ipc.server.reservoir.max
19431
19432 The reservoir will not cache buffers in excess of hbase.ipc.server.reservoir.max.buffer.size  The default is 10MB. This means that if a row is very large, then we will allocate a buffer of the average size that is currently in the pool and we will then resize it till we can accommodate the return. These resizes are expensive. The resultant buffer will be used and then discarded.
19433
19434 To check how the reservoir is doing, enable trace level logging for a few seconds on a regionserver. You can do this from the regionserver UI. See 'Log Level'. Set org.apache.hadoop.hbase.io.BoundedByteBufferPool to TRACE. The BoundedByteBufferPool will spew report to the log. Disable the TRACE level and then check the log. You'll see allocation rate, size of pool, size of buffers in pool, etc.
19435
19436
19437 ---
19438
19439 * [HBASE-13012](https://issues.apache.org/jira/browse/HBASE-13012) | *Major* | **Add shell commands to trigger the mob file compactor**
19440
19441 This adds two new shell commands -- compact\_mob and major\_compact\_mob to the hbase shell.
19442
19443 Run compaction on a mob enabled column family or all mob enabled column families within a table
19444           Examples:
19445           Compact a column family within a table:
19446           hbase\> compact\_mob 't1', 'c1'
19447           Compact all mob enabled column families
19448           hbase\> compact\_mob 't1'
19449
19450 Run major compaction on a mob enabled column family or all mob enabled column families within a table
19451           Examples:
19452           Compact a column family within a table:
19453           hbase\> major\_compact\_mob 't1', 'c1'
19454           Compact all mob enabled column families within a table
19455           hbase\> major\_compact\_mob 't1'
19456
19457
19458 ---
19459
19460 * [HBASE-12869](https://issues.apache.org/jira/browse/HBASE-12869) | *Major* | **Add a REST API implementation of the ClusterManager interface**
19461
19462 Adds an implementation of ClusterManager to control REST API-managed HBase clusters.
19463
19464
19465 ---
19466
19467 * [HBASE-13047](https://issues.apache.org/jira/browse/HBASE-13047) | *Trivial* | **Add "HBase Configuration" link missing on the table details pages**
19468
19469 Add a '/conf' link to UI
19470
19471
19472 ---
19473
19474 * [HBASE-13044](https://issues.apache.org/jira/browse/HBASE-13044) | *Minor* | **Configuration option for disabling coprocessor loading**
19475
19476 This change adds two new configuration options:
19477 - "hbase.coprocessor.enabled" controls globally if any coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19478 - "hbase.coprocessor.user.enabled" controls if any user (aka table) coprocessors will be loaded. Set to "false" to disable. Defaults to "true" for compatibility with previous releases.
19479
19480
19481 ---
19482
19483 * [HBASE-12961](https://issues.apache.org/jira/browse/HBASE-12961) | *Minor* | **Negative values in read and write region server metrics**
19484
19485 Change read and write request count in ServerLoad from int to long
19486
19487
19488 ---
19489
19490 * [HBASE-7332](https://issues.apache.org/jira/browse/HBASE-7332) | *Minor* | **[webui] HMaster webui should display the number of regions a table has.**
19491
19492 Adds counts for various regions states to the table listing on main page. See attached screenshot.
19493
19494
19495 ---
19496
19497 * [HBASE-8329](https://issues.apache.org/jira/browse/HBASE-8329) | *Major* | **Limit compaction speed**
19498
19499 Adds compaction throughput limit mechanism(the word "throttle" is already used when choosing compaction thread pool, so use a different word here to avoid ambiguity). Default is org.apache.hadoop.hbase.regionserver.compactions.PressureAwareCompactionThroughputController, will limit throughput as follow:
19500 1. In off peak hours, use a fixed limitation "hbase.hstore.compaction.throughput.offpeak" (default is Long.MAX\_VALUE which means no limitation).
19501 2. In normal hours, the limitation is tuned between "hbase.hstore.compaction.throughput.lower.bound"(default 10MB/sec) and "hbase.hstore.compaction.throughput.higher.bound"(default 20MB/sec), using the formula "lower + (higer - lower) \* param" where param is in range [0.0, 1.0] and calculate based on store files count on this regionserver.
19502 3. If some stores have too many store files(storefilesCount \> blockingFileCount), then there is no limitation no matter peak or off peak.
19503 You can set "hbase.regionserver.throughput.controller" to org.apache.hadoop.hbase.regionserver.throttle.NoLimitThroughputController to disable throughput controlling.
19504 And we have implemented ConfigurationObserver which means you can change all configurations above and do not need to restart cluster.
19505
19506 The throttle is on by default in hbase-2.0.0. There is no limit in hbase-1.x.
19507
19508
19509 ---
19510
19511 * [HBASE-6778](https://issues.apache.org/jira/browse/HBASE-6778) | *Major* | **Deprecate Chore; its a thread per task when we should have one thread to do all tasks**
19512
19513 Corresponding usages for new ScheduledChore vs. Deprecated Chore:
19514 Chore.interrupt() -\> ScheduledChore.cancel(mayInterruptWhileRunning = true)
19515 Threads.setDaemonThreadRunning(Chore) -\> ChoreService.scheduleChore(ScheduledChore)
19516 Chore.isAlive -\> ScheduledChore.isScheduled()
19517 Chore.getSleeper().skipSleepCycle() -\> ScheduledChore.triggerNow()
19518
19519
19520 ---
19521
19522 * [HBASE-11574](https://issues.apache.org/jira/browse/HBASE-11574) | *Major* | **hbase:meta's regions can be replicated**
19523
19524 On the server side, set hbase.meta.replica.count to the number of replicas of meta that you want to have in the cluster (defaults to 1). hbase.regionserver. meta.storefile.refresh.period should be set to a non-zero number in milliseconds - something like 30000 (defaults to 0).
19525 On the client/user side, set hbase.meta.replicas.use to true.
19526
19527
19528 ---
19529
19530 * [HBASE-12808](https://issues.apache.org/jira/browse/HBASE-12808) | *Major* | **Use Java API Compliance Checker for binary/source compatibility**
19531
19532 Adds a dev-support/check\_compatibility.sh script for comparing versions. Run the script to see usage.
19533
19534
19535 ---
19536
19537 * [HBASE-12684](https://issues.apache.org/jira/browse/HBASE-12684) | *Major* | **Add new AsyncRpcClient**
19538
19539 Retrofit a new, netty-based rpc transport on the client. This client is slightly slower if little contention given the extra tier or so that netty adds and that we block on a Future waiting on the call to finish.  This client opens the way for HBase having a native Async API.
19540
19541 This client is on by default in master branch (2.0 hbase). It is off in branch-1.0 (hbase-1.1.x).  To enable it, set "hbase.rpc.client.impl" to "org.apache.hadoop.hbase.ipc.AsyncRpcClient"
19542
19543
19544 ---
19545
19546 * [HBASE-8410](https://issues.apache.org/jira/browse/HBASE-8410) | *Major* | **Basic quota support for namespaces**
19547
19548 Namespace auditor provides basic quota support for namespaces in terms of number of tables and number of regions. In order to use namespace quotas, quota support must be enabled by setting
19549 "hbase.quota.enabled" property to true in hbase-site.xml file.
19550
19551 The users can add quota information to namespace, while creating new namespaces or by altering existing ones.
19552
19553 Examples:
19554 1. create\_namespace 'ns1', {'hbase.namespace.quota.maxregions'=\>'10'}
19555 2. create\_namespace 'ns2', {'hbase.namespace.quota.maxtables'=\>'2','hbase.namespace.quota.maxregions'=\>'5'}
19556 3. alter\_namespace 'ns3', {METHOD =\> 'set', 'hbase.namespace.quota.maxtables'=\>'5','hbase.namespace.quota.maxregions'=\>'25'}
19557
19558 The quotas can be modified/added to namespace at any point of time. To remove quotas, the following command can be used:
19559
19560 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxtables'}
19561 alter\_namespace 'ns3', {METHOD =\> 'unset', NAME =\> 'hbase.namespace.quota.maxregions'}
19562
19563
19564 ---
19565
19566 * [HBASE-12902](https://issues.apache.org/jira/browse/HBASE-12902) | *Major* | **Post-asciidoc conversion fix-ups**
19567
19568 Pushed to master. Shout if there are any issues.
19569
19570
19571 ---
19572
19573 * [HBASE-12848](https://issues.apache.org/jira/browse/HBASE-12848) | *Major* | **Utilize Flash storage for WAL**
19574
19575 For users on a version of Hadoop that supports tiered storage policies (i.e. Apache Hadoop 2.6.0+), HBase now allows users to opt-in to having the write ahead log placed on the SSD tier. Users on earlier versions of Hadoop will be unable to take advantage of this feature.
19576
19577 Use of tiered storage is controlled by a new RegionServer config, hbase.wal.storage.policy. It defaults to the value 'NONE', which will rely on HDFS defaults for a policy decision.
19578
19579 User can specify ONE\_SSD or ALL\_SSD as the value:
19580 ONE\_SSD: place only one replica of WAL files in SSD and the remaining in default storage
19581 ALL\_SSD: all replica for WAL files are placed on SSD
19582
19583 See [the HDFS docs on storage policy\|http://hadoop.apache.org/docs/r2.6.0/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html]
19584
19585
19586 ---
19587
19588 * [HBASE-11144](https://issues.apache.org/jira/browse/HBASE-11144) | *Major* | **Filter to support scanning multiple row key ranges**
19589
19590 MultiRowRangeFilter is a filter to support scanning multiple row key ranges. If the number of the ranges is small, using multiple scans can also do the same thing and can work well. But when the number of ranges are quite big (e.g. millions), use the MultiRowRangeFilter will be nice. In this filter, the ranges will be sorted and merged, so users do not have to take care of ranges are not continuous. And if users are using something like rest, thrift or pig to access the data the filter might be the practical solution.
19591
19592
19593 ---
19594
19595 * [HBASE-12268](https://issues.apache.org/jira/browse/HBASE-12268) | *Major* | **Add support for Scan.setRowPrefixFilter to shell**
19596
19597 Added new option, ROWPREFIXFILTER, to the scan command in the HBase shell to easily scan for a specific row prefix.
19598
19599
19600 ---
19601
19602 * [HBASE-12775](https://issues.apache.org/jira/browse/HBASE-12775) | *Major* | **CompressionTest ate my HFile (sigh!)**
19603
19604 CompressionTest will now abort when the target path exists.
19605
19606
19607 ---
19608
19609 * [HBASE-12695](https://issues.apache.org/jira/browse/HBASE-12695) | *Critical* | **JDK 1.8 compilation broken**
19610
19611 Use the -Pjavac maven profile in order to compile HBase using the compiler provided by the JDK instead of the default error-prone compiler plugin. This is useful for now if you are building HBase with JDK 1.8 or a JDK that doesn't support error-prone.
19612
19613
19614 ---
19615
19616 * [HBASE-10201](https://issues.apache.org/jira/browse/HBASE-10201) | *Major* | **Port 'Make flush decisions per column family' to trunk**
19617
19618 Adds new flushing policy mechanism. Default, org.apache.hadoop.hbase.regionserver.FlushLargeStoresPolicy, will try to avoid flushing out the small column families in a region, those whose memstores are \< hbase.hregion.percolumnfamilyflush.size.lower.bound. To restore the old behavior of flushes writing out all column families, set hbase.regionserver.flush.policy to org.apache.hadoop.hbase.regionserver.FlushAllStoresPolicy either in hbase-default.xml or on a per-table basis by setting the policy to use with HTableDescriptor.getFlushPolicyClassName().
19619
19620
19621 ---
19622
19623 * [HBASE-12559](https://issues.apache.org/jira/browse/HBASE-12559) | *Major* | **Provide LoadBalancer with online configuration capability**
19624
19625 updateConfiguration(ServerName server) method of Admin now updates config for HMaster as well.
19626 Specifically, config update would be taken by load balancer.
19627
19628
19629 ---
19630
19631 * [HBASE-10378](https://issues.apache.org/jira/browse/HBASE-10378) | *Major* | **Divide HLog interface into User and Implementor specific interfaces**
19632
19633 HBase internals for the write ahead log have been refactored. Advanced users of HBase should be aware of the following changes.
19634
19635 Public Audience
19636   - The Admin API for asking a region server to roll WAL files has changed from a synchronous command that returns a set of regions the WAL implementation would like flushed into an asynchronous command that returns nothing. Older clients relying on the former behavior will still be able to interact with newer servers, but the response body will always contain an empty list of regions to flush.
19637   - The shell command "hlog\_roll" has been deprecated. Operators should use the "wal\_roll" command instead. This command is subject to the changes described above for the Admin API to roll WAL files.
19638   - The command for analyzing write ahead logs has been renamed from 'hlog' to 'wal'. The old usage is deprecated and will be removed in a future version.
19639   - Some utility methods in the HBaseTesetingUtility related to testing write-ahead-logs were changed in incompatible ways. No functionality has been removed, but method names and arguments have changed. See the HBaseTestingUtility javadoc for details.
19640   - The WALPlayer utility has deprecated the configuration keys used for advanced customization. Users should switch to the updated configuration keys. See the usage information on the WALPlayer tool for details.
19641   - The HLogInputFormat utility class for processing logs with MapReduce has been deprecated and will be removed in a future version. Users should switch to the WALInputFormat.
19642   - The labeling of server metrics on the region server status pages changed. Previously, the number of backing files for the write ahead log was labeled 'Num. HLog Files'. If you wish to see this statistic now, please look for the label 'Num. WAL Files.'  If you rely on JMX for these metrics, their location has not changed.
19643
19644 LimitedPrivate(COPROC) Audience, LimitedPrivate(PHOENIX)
19645   - The RegionObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseRegionObserver class. For those that implement RegionObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the RegionObserver javadoc for details.
19646   - Classes related to reading WAL entries (ReaderBase, ProtobufLogReader, SequenceFileLogReader) have changed in a backwards incompatible way. Users who referenced HLog.Reader directly or HLog.Entry will have to update. These changes do not impact compatibility with extant wal files.
19647   - The WALObserver API has been updated. The changes are both binary and source backwards compatible for coprocessors that use the BaseWALObserver class. For those that implement WALObserver directly the changes are binary backwards compatible. Depending on the internals of future HBase versions, coprocessors using the deprecated API may not see all WAL related events. Users are strongly encouraged to update their use of the API; see the WALObserver javadoc for details.
19648  - The WALCoprocessorEnvironment  has changed in a backwards incompatible way. WALObserver coprocessors that relied on retrieving an object representing the write ahead log instance will have to be updated.
19649
19650 LimitedPrivate(REPLICATION) Audience
19651  - The WALEntryFilter API has changed in a backwards incompatible way. Implementers will have to be updated.
19652  - The ReplicationEndpoint.ReplicateContext API has changed in a backwards incompatible way. Implementers who use this interface will have to be updated. These changes do not impact wire compatibility for replicating between clusters.
19653  - The HLogKey API is deprecated in favor of the WALKey API. Additionally, the HLogKey API has changed in a backwards incompatible way by changing from implementing WriteableComparable\<HLogKey\> to implementing Writeable and Comparable\<WALKey\>.
19654
19655
19656 ---
19657
19658 * [HBASE-11683](https://issues.apache.org/jira/browse/HBASE-11683) | *Major* | **Metrics for MOB**
19659
19660 Adds new mob related metrics:
19661
19662 mobCompactedIntoMobCellsCount
19663 mobCompactedIntoMobCellsSize
19664 mobCompactedFromMobCellsCount
19665 mobCompactedFromMobCellsSize
19666 mobFlushCount
19667 mobFlushedCellsCount
19668 mobFlushedCellsSize
19669 mobScanCellsCount
19670 mobScanCellsSize
19671 mobFileCacheAccessCount
19672 mobFileCacheMissCount
19673 mobFileCacheHitPercent
19674 mobFileCacheEvictedCount
19675 mobFileCacheCount
19676
19677
19678 ---
19679
19680 * [HBASE-11912](https://issues.apache.org/jira/browse/HBASE-11912) | *Major* | **Catch some bad practices at compile time with error-prone**
19681
19682 Errors from error-prone will fail the build in the compile phase. Warnings look like Javac warnings and are counted as such by test-patch etc
19683
19684
19685 ---
19686
19687 * [HBASE-12220](https://issues.apache.org/jira/browse/HBASE-12220) | *Major* | **Add hedgedReads and hedgedReadWins metrics**
19688
19689 Adds metrics hedgedReads and hedgedReadWins counts.
19690
19691
19692 ---
19693
19694 * [HBASE-6290](https://issues.apache.org/jira/browse/HBASE-6290) | *Minor* | **Add a function a mark a server as dead and start the recovery the process**
19695
19696 Adds a script to mark a server as dead.
19697
19698 Usage: considerAsDead.sh --hostname serverName
19699
19700
19701 ---
19702
19703 * [HBASE-12111](https://issues.apache.org/jira/browse/HBASE-12111) | *Major* | **Remove deprecated APIs from Mutation(s)**
19704
19705 Removed the below from hbase-2 (were deprecated on release of hbase-1.0.0)
19706
19707 Mutation setWriteToWAL(boolean)
19708 boolean getWriteToWAL()
19709 Mutation setFamilyMap(NavigableMap\<byte [], List\<KeyValue\>\>)
19710 NavigableMap\<byte [], List\<KeyValue\>\> getFamilyMap()
19711
19712
19713 ---
19714
19715 * [HBASE-12084](https://issues.apache.org/jira/browse/HBASE-12084) | *Major* | **Remove deprecated APIs from Result**
19716
19717 The below KeyValue based APIs are removed from Result
19718 KeyValue[] raw()
19719 List\<KeyValue\> list()
19720 List\<KeyValue\> getColumn(byte [] family, byte [] qualifier)
19721 KeyValue getColumnLatest(byte [] family, byte [] qualifier)
19722 KeyValue getColumnLatest(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19723
19724 They are replaced with
19725 Cell[] rawCells()
19726 List\<Cell\> listCells()
19727 List\<Cell\> getColumnCells(byte [] family, byte [] qualifier)
19728 Cell getColumnLatestCell(byte [] family, byte [] qualifier)
19729 Cell getColumnLatestCell(byte [] family, int foffset, int flength, byte [] qualifier, int qoffset, int qlength)
19730 respectively
19731
19732 Also the constructors which were taking KeyValues also removed
19733 Result(KeyValue [] cells)
19734 Result(List\<KeyValue\> kvs)
19735
19736
19737 ---
19738
19739 * [HBASE-12048](https://issues.apache.org/jira/browse/HBASE-12048) | *Major* | **Remove deprecated APIs from Filter**
19740
19741 The following APIs are removed from Filter
19742 KeyValue transform(KeyValue)
19743 KeyValue getNextKeyHint(KeyValue)
19744 and replaced with
19745 Cell transformCell(Cell)
19746 Cell getNextCellHint(Cell)
19747 respectively.
19748 If a custom Filter implementation have overridden any of these methods, we will no longer call them. User has to change the custom Filter to override cell based methods as shown above
19749
19750
19751 ---
19752
19753 * [HBASE-7767](https://issues.apache.org/jira/browse/HBASE-7767) | *Major* | **Get rid of ZKTable, and table enable/disable state in ZK**
19754
19755 Keeps table enabled/disabled state in HDFS rather than up in ZooKeeper.  Auto-migrates any existing zk state.
19756
19757
19758 ---
19759
19760 * [HBASE-11911](https://issues.apache.org/jira/browse/HBASE-11911) | *Major* | **Break up tests into more fine grained categories**
19761
19762 Adds new test categories besides the class smalltests, mediumtests, and largetests.  Adds:
19763
19764 ClientTests
19765 CoprocessorTests
19766 FilterTests
19767 FlakeyTests
19768 IOTests
19769 MapReduceTests
19770 MasterTests
19771 MiscTests
19772 RegionServerTests
19773 ReplicationTests
19774 RestTests
19775 SecurityTests
19776 VerySlowMapReduceTests
19777 VerySlowRegionServerTests
19778
19779 See description for examples on how to use them.
19780
19781
19782 ---
19783
19784 * [HBASE-11658](https://issues.apache.org/jira/browse/HBASE-11658) | *Major* | **Piped commands to hbase shell should return non-zero if shell command failed.**
19785
19786 Adds a noninteractive mode (-n or --noninteractive) to the hbase shell that exits with a non-zero error code on failed or invalid shell command executions, and exits with a zero error code upon successful execution.
19787
19788
19789 ---
19790
19791 * [HBASE-11640](https://issues.apache.org/jira/browse/HBASE-11640) | *Major* | **Add syntax highlighting support to HBase Ref Guide programlistings**
19792
19793 This got committed, so I guess it is safe to resolve it?
19794
19795
19796 ---
19797
19798 * [HBASE-11606](https://issues.apache.org/jira/browse/HBASE-11606) | *Minor* | **Enable ZK-less region assignment by default**
19799
19800 By default, we don't use ZK for region assignment now. To fall back to the old way, you can set hbase.assignment.usezk to true.
19801
19802
19803 ---
19804
19805 * [HBASE-3135](https://issues.apache.org/jira/browse/HBASE-3135) | *Major* | **Make our MR jobs implement Tool and use ToolRunner so can do -D trickery, etc.**
19806
19807 All MR jobs implement Tool Interface, http://hadoop.apache.org/docs/current/api/org/apache/hadoop/util/Tool.html, so now you can pass properties on command line with the -D flag, etc.
19808
19809
19810 ---
19811
19812 * [HBASE-11556](https://issues.apache.org/jira/browse/HBASE-11556) | *Major* | **Move HTablePool to hbase-thrift module.**
19813
19814 HTablePool was deprecated in 0.98.1 but was still present and usable by apps built against versions before HBase 2.0.  It has been moved and is not intended to be used by user applications, and is now an internal part of the thrift2 proxy server only.
19815
19816
19817 ---
19818
19819 * [HBASE-11548](https://issues.apache.org/jira/browse/HBASE-11548) | *Trivial* | **[PE] Add 'cycling' test N times and unit tests for size/zipf/valueSize calculations**
19820
19821 Adds --cycles=N argument.
19822
19823
19824 ---
19825
19826 * [HBASE-11344](https://issues.apache.org/jira/browse/HBASE-11344) | *Major* | **Hide row keys and such from the web UIs**
19827
19828 Configure "hbase.display.keys" to false (default: true) in the master/regionservers if the row-keys should be hidden in the webUIs (like in the webUI for table details).
19829
19830
19831 ---
19832
19833 * [HBASE-6580](https://issues.apache.org/jira/browse/HBASE-6580) | *Major* | **Deprecate HTablePool in favor of HConnection.getTable(...)**
19834
19835 This issue introduces a few new APIs:
19836 \* HConnectionManager:
19837 {code}
19838     public static HConnection createConnection(Configuration conf)
19839     public static HConnection createConnection(Configuration conf, ExecutorService pool)
19840 {code}
19841 \* HConnection:
19842 {code}
19843     public HTableInterface getTable(String tableName) throws IOException
19844     public HTableInterface getTable(byte[] tableName) throws IOException
19845     public HTableInterface getTable(String tableName, ExecutorService pool) throws IOException
19846     public HTableInterface getTable(byte[] tableName, ExecutorService pool) throws IOException
19847 {code}
19848
19849 By default HConnectionImplementation will create an ExecutorService when needed. The ExecutorService can optionally passed be passed in.
19850 HTableInterfaces are retrieved from the HConnection. By default the HConnection's ExecutorService is used, but optionally that can be overridden for each HTable.
19851
19852
19853 ---
19854
19855 * [HBASE-8450](https://issues.apache.org/jira/browse/HBASE-8450) | *Critical* | **Update hbase-default.xml and general recommendations to better suit current hw, h2, experience, etc.**
19856
19857 Changed defaults:
19858
19859 + max versions now 1 instead of 3
19860 + row blooms on by default (except on .META. table)
19861 + handlers 30 instead of 10
19862 + upped memstore lower limit from .35 to .38
19863 + zookeeper timeout default is 90seconds instead of 180
19864 + client pause is 100ms instead of 1000ms
19865 + retries are now 20 instead of 10 (so overall we still wait same amount of time)
19866 + bulkload retries is 10 instead of infinite
19867 + major compactions are now once a week instead of once every 24 hours; they are staggered so all regionservers do not start compacting at the same time
19868 + blockingstorefiles is 10 instead of 7
19869 + block cache is 0.4 instead of 0.25
19870 + Previous, default for hbase.rootdir was /tmp/hbase-${user.name}.  Now it is ${java.io.tmpdir}/hbase-${user.name} which is usually the same location but may not be (on macos, it points to /var/tmp....).
19871
19872
19873 ---
19874
19875 * [HBASE-4072](https://issues.apache.org/jira/browse/HBASE-4072) | *Major* | **Deprecate/disable and remove support for reading ZooKeeper zoo.cfg files from the classpath**
19876
19877 The Apache ZooKeeper config file zoo.cfg will no longer be read when instantiating a HBaseConfiguration object, as it causes various inconsistency issues. Instead, users have to specify all HBase-relevant ZooKeeper properties in the hbase-site.xml using the various "hbase.zookeeper" prefixed properties. For example, specify "hbase.zookeeper.quorum" to provide a ZK quorum server list.
19878
19879 To enable zoo.cfg reading, for which support may be removed in a future release, set the property "hbase.config.read.zookeeper.config" to true in the hbase-site.xml at the client and servers like so:
19880
19881 \<property\>
19882   \<name\>hbase.config.read.zookeeper.config\</name\>
19883   \<value\>true\</value\>
19884   \<description\>
19885         Set to true to allow HBaseConfiguration to read the
19886         zoo.cfg file for ZooKeeper properties. Switching this to true
19887         is not recommended, since the functionality of reading ZK
19888         properties from a zoo.cfg file has been deprecated.
19889   \</description\>
19890 \</property\>
19891
19892
19893