Instant Apache Hive Essentials How-to

Author: Darren Lee
Summary: Instant Apache Hive Essentials How-to
Publisher: Packt Publishing
Published: 2013-06

About this book

Description
Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. This book provides quick recipes for using Hive to read data in various formats, query that data efficiently, and extend Hive with any custom functions you may need to insert your own logic into the data pipeline. This book is written for data analysts and developers who want to use their current knowledge of SQL to be more productive with Hadoop. It assumes that readers are comfortable writing SQL queries and are familiar with Hadoop at the level of the classic WordCount example.
About the author
Darren Lee studied mathematics at Stanford and New York University. He is currently the lead engineer on the data team at Bizo in San Francisco, CA. He has been using Hive since 2009 and occasionally blogs about it at //dev.bizo.com.
Table of contents
Instant Apache Hive Essentials How-to
Credits
About the Author
About the Reviewer
www.PacktPub.com
  Support files, eBooks, discount offers and more
    Why Subscribe?
    Free Access for Packt account holders
Preface
  What this book covers
  What you need for this book
  Who this book is for
  Conventions
  Reader feedback
  Customer support
    Downloading the example code
    Errata
    Piracy
    Questions
1. Instant Apache Hive Essentials How-to
  Tables and queries (Simple)
    How to do it...
    How it works...
    There's more...
  Understanding complex data types (Simple)
    How to do it...
    How it works...
    There's more...
  Using Hive non-interactively (Simple)
    Getting ready
    How to do it...
    How it works...
  Join optimizations (Medium)
    How to do it...
    How it works...
      Map join
      Bucketed tables
    There's more...
      Multiple joins
      Skew joins
      Multiple selects
      List bucketing and skew joins
  Setting the file format (Simple)
    How to do it...
    How it works...
  Writing a custom SerDe (Intermediate)
    How to do it...
    How it works...
      Object inspectors
      Initialization
      Serialization
      Deserialization
  Using static partitions (Intermediate)
    Getting ready
    How to do it...
    How it works...
    There's more...
      Loading data into partitioned internal tables
      Writing data into specific partitions from queries
  Using dynamic partitions (Intermediate)
    How to do it...
    How it works...
  Using functions (Simple)
    How to do it...
    How it works...
  Adding custom logic with streaming (Intermediate)
    Getting ready
    How to do it...
    How it works...
  Simple user-defined functions (Intermediate)
    Getting ready
    How to do it...
    How it works...
  Advanced user-defined functions (Advanced)
    How to do it...
    How it works...
      Initialization
      Evaluation
      The display string
  User-defined table-generating functions (Advanced)
    How to do it...
    How it works...
      Initialization
      Processing inputs
      Final output
  User-defined aggregation functions (Advanced)
    Getting ready
    How to do it...
    How it works...
      The resolver
      The modes of evaluation
      Initializing the evaluator
      Aggregation buffers
      The logic of aggregation
    There's more...