MapReduce Design Patterns by Donald Miner & Adam Shook

Author: Donald Miner & Adam Shook
Language: eng
Format: epub, pdf
Tags: COMPUTERS / Data Modeling & Design
ISBN: 9781449341978
Publisher: O'Reilly Media
Published: 2012-11-25T16:00:00+00:00


Caution

Keep follow-on data parsing in mind and make sure the output uses proper field delimiters. Outputting an empty text object is unwise. Instead, generate a record that has the proper structure but null fields. This ensures that follow-on analytics can parse the output correctly.
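To make the caution concrete, here is a minimal sketch in plain Python (not the Hadoop API). The tab delimiter, the four-field layout, and the helper names `format_record` and `parse_record` are assumptions chosen for illustration only. It shows that a structured record with null fields keeps its column count after a round trip, while an empty string does not.

```python
# Hypothetical layout: four tab-delimited fields per output record.
DELIMITER = "\t"
NUM_FIELDS = 4


def format_record(fields, num_fields=NUM_FIELDS, delimiter=DELIMITER):
    """Emit a record with a fixed number of fields, writing nulls as empty columns."""
    padded = list(fields) + [None] * (num_fields - len(fields))
    return delimiter.join("" if f is None else str(f) for f in padded[:num_fields])


def parse_record(line, delimiter=DELIMITER):
    """Split a record back into fields, mapping empty columns to None."""
    return [f if f != "" else None for f in line.split(delimiter)]


# A record with no matching data still carries every delimiter.
structured_null = format_record([])

# An empty output object loses the structure entirely.
empty_output = ""

# The structured record parses back to four (null) fields ...
fields_from_structured = parse_record(structured_null)

# ... whereas the empty output parses to a single field.
fields_from_empty = parse_record(empty_output)
```

A downstream job that indexes columns by position can read `fields_from_structured` safely, but would fail or misread `fields_from_empty` because the expected columns are missing.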

Combiner optimization

Because the join logic is performed on the reduce side, a combiner will not provide much optimization in this example.

Reduce Side Join with Bloom Filter
