HBase high performance cookbook : exciting projects that will teach you how complex data can be exploited to gain maximum insights / Ruchir Choudhry.

Exciting projects that will teach you how complex data can be exploited to gain maximum insights About This Book Architect a good HBase cluster for a very large distributed system Get to grips with the concepts of performance tuning with HBase A practical guide full of engaging recipes and attractiv...

Full description

Saved in:
Bibliographic Details
Online Access: Full Text (via O'Reilly/Safari)
Main Author: Choudhry, Ruchir (Author)
Format: eBook
Language:English
Published: Birmingham, UK : Packt Publishing, 2017.
Subjects:
Table of Contents:
  • Cover
  • Copyright
  • Credits
  • About the Author
  • About the Reviewer
  • www.PacktPub.com
  • Customer Feedback
  • Table of Contents
  • Preface
  • Chapter 1: Configuring HBase
  • Introduction
  • Configuring and deploying HBase
  • Using the filesystem
  • Administering clusters
  • Managing clusters
  • Chapter 2: Loading Data from Various DBs
  • Introduction
  • Extracting data from Oracle
  • Loading data using Oracle Big data connector
  • Bulk utilities
  • Using Hive with Apache HBase
  • Using Sqoop
  • Chapter 3: Working with Large Distributed Systems Part I
  • Introduction
  • Scaling elastically or Auto Scaling with built-in fault tolerance
  • Auto Scaling HBase using AWS
  • Works on different VM/physical, cloud hardware
  • Chapter 4: Working with Large Distributed Systems Part II
  • Introduction
  • Read path
  • Write Path
  • Snappy
  • LZO compression
  • LZ4 compressor
  • Replication
  • Chapter 5: Working with Scalable Structure of tables
  • Introduction
  • HBase data model part 1
  • HBase data model part 2
  • How HBase truly scales on key and schema design
  • Chapter 6: HBase Clients
  • Introduction
  • HBase REST and Java Client
  • Working with Apache Thrift
  • Working with Apache Avro
  • Working with Protocol buffer
  • Working with Pig and using Shell
  • Chapter 7: Large-Scale MapReduce
  • Introduction
  • Chapter 8: HBase Performance Tuning
  • Introduction
  • Working with infrastructure/operating systems
  • Working with Java virtual machines
  • Changing the configuration of components
  • Working with HDFS
  • Chapter 9: Performing Advanced Tasks on Hbase
  • Machine learning using Hbase
  • Real-time data analysis using Hbase and Mahout
  • Full text indexing using Hbase
  • Chapter 10: Optimizing Hbase for Cloud
  • Introduction
  • Configuring Hbase for the Cloud
  • Connecting to an Hbase cluster using the command line.
  • Backing up and restoring Hbase
  • Terminating an HBase cluster
  • Accessing HBase data with hive
  • Viewing the Hbase user interface
  • Monitoring HBase with CloudWatch
  • Monitoring Hbase with Ganglia
  • Chapter 11: Case Study
  • Introduction
  • Configuring Lily Platform
  • Integrating elastic search with Hbase
  • Configuring
  • Index.