书籍详情
《Instant Apache Hive Essentials How-to》[28M]百度网盘|亲测有效|pdf下载
  • Instant Apache Hive Essentials How-to

  • 出版时间:2013-06
  • 热度:8466
  • 上架时间:2024-06-30 08:52:20
  • 价格:0.0
书籍下载
书籍预览
免责声明

本站支持尊重有效期内的版权/著作权,所有的资源均来自于互联网网友分享或网盘资源,一旦发现资源涉及侵权,将立即删除。希望所有用户一同监督并反馈问题,如有侵权请联系站长或发送邮件到ebook666@outlook.com,本站将立马改正

内容介绍

内容简介
Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks.This book provides quick recipes for using Hive to read data in various formats, efficiently querying this data, and extending Hive with any custom functions you may need to insert your own logic into the data pipeline.This book is written for data analysts and developers who want to use their current knowledge of SQL to be more productive with Hadoop. It assumes that readers are comfortable writing SQL queries and are familiar with Hadoop at the level of the classic WordCount example.
作者简介
Darren Lee studied mathematics at Stanford and New York University. He is currently the lead engineer on the data team at Bizo in San Francisco, CA. He has been using Hive since 2009 and occasionally blogs about it at //dev.bizo.com.
目录
Instant Apache Hive Essentials How-to
Credits
About the Author
About the Reviewer
www.PacktPub.com
Support files, eBooks, discount offers and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Errata
Piracy
Questions
1. Instant Apache Hive Essentials How-to
Tables and queries (Simple)
How to do it...
How it works...
There's more...
Understanding complex data types (Simple)
How to do it...
How it works...
There's more...
Using Hive non-interactively (Simple)
Getting ready
How to do it...
How it works...
Join optimizations (Medium)
How to do it...
How it works...
Map join
Bucketed tables
There's more...
Multiple joins
Skew joins
Multiple selects
List bucketing and skew joins
Setting the file format (Simple)
How to do it...
How it works...
Writing a custom SerDe (Intermediate)
How to do it...
How it works...
Object inspectors
Initialization
Serialization
Deserialization
Using static partitions (Intermediate)
Getting ready
How to do it...
How it works...
There's more...
Loading data into partitioned internal tables
Writing data into specific partitions from queries
Using dynamic partitions (Intermediate)
How to do it...
How it works...
Using functions (Simple)
How to do it...
How it works...
Adding custom logic with streaming (Intermediate)
Getting ready
How to do it...
How it works...
Simple user-defined functions (Intermediate)
Getting ready
How to do it...
How it works...
Advanced user-defined functions (Advanced)
How to do it...
How it works...
Initialization
Evaluation
The display string
User-defined table-generating functions (Advanced)
How to do it...
How it works...
Initialization
Processing inputs
Final output
User-defined aggregation functions (Advanced)
Getting ready
How to do it...
How it works...
The resolver
The modes of evaluation
Initializing the evaluator
Aggregation buffers
The logic of aggregation
There's more...