当前位置:网站首页>图书 > 正文 >>

HBase Essentials

图书信息

作者Nishant Garg
出版社Packt Publishing
ISBN9781783987252
出版时间2014-11-14
字数93.0万
分类进口书,外文原版书,电脑,网络

读书简介

This book is intended for developers and Big Data engineers who want to know all about HBase at a hands-on level. For in-depth understanding, it would be helpful to have a bit of familiarity with HDFS and MapReduce programming concepts with no prior experience with HBase or similar technologies. This book is also for Big Data enthusiasts and database developers who have worked with other NoSQL databases and now want to explore HBase as another futuristic, scalable database solution in the Big Data space.

目录

HBase Essentials

Table of Contents

HBase Essentials

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Introducing HBase

The world of Big Data

The origin of HBase

Use cases of HBase

Installing HBase

Installing Java 1.7

The local mode

The pseudo-distributed mode

The fully distributed mode

Understanding HBase cluster components

Start playing

Summary

2. Defining the Schema

Data modeling in HBase

Designing tables

Accessing HBase

Establishing a connection

CRUD operations

Writing data

Reading data

Updating data

Deleting data

Summary

3. Advanced Data Modeling

Understanding keys

HBase table scans

Implementing filters

Utility filters

Comparison filters

Custom filters

Summary

4. The HBase Architecture

Data storage

HLog (the write-ahead log – WAL)

HFile (the real data storage file)

Data replication

Securing HBase

Enabling authentication

Enabling authorization

Configuring REST clients

HBase and MapReduce

Hadoop MapReduce

Running MapReduce over HBase

HBase as a data source

HBase as a data sink

HBase as a data source and sink

Summary

5. The HBase Advanced API

Counters

Single counters

Multiple counters

Coprocessors

The observer coprocessor

The endpoint coprocessor

The administrative API

The data definition API

Table name methods

Column family methods

Other methods

The HBaseAdmin API

Summary

6. HBase Clients

The HBase shell

Data definition commands

Data manipulation commands

Data-handling tools

Kundera – object mapper

CRUD using Kundera

Query HBase using Kundera

Using filters within query

REST clients

Getting started

The plain format

The XML format

The JSON format (defined as a key-value pair)

The REST Java client

The Thrift client

Getting started

The Hadoop ecosystem client

Hive

Summary

7. HBase Administration

Cluster management

The Start/stop HBase cluster

Adding nodes

Decommissioning a node

Upgrading a cluster

HBase cluster consistency

HBase data import/export tools

Copy table

Cluster monitoring

The HBase metrics framework

Master server metrics

Region server metrics

JVM metrics

Info metrics

Ganglia

Nagios

JMX

File-based monitoring

Performance tuning

Compression

Available codecs

Load balancing

Splitting regions

Merging regions

MemStore-local allocation buffers

JVM tuning

Other recommendations

Troubleshooting

Summary

Index