Notice: This website is an unofficial Microsoft Knowledge Base (hereinafter KB) archive and is intended to provide a reliable access to deleted content from Microsoft KB. All KB articles are owned by Microsoft Corporation. Read full disclaimer for more details.

Hadoop Sort / Merge / By-Group Processing


View products that this article applies to.

Hadoop Sort / Merge / By-Group Processing

Workarounds

Even if you pre-sort in Hadoop and then import to RRE there is no guarantee that the splits will contain whole by-groups or be processed in the correct order. Hence the options narrow to CSV input and:

1)            Hive or Pig for Sort, merge, and by-group processing. 

2)            rmr2 or plyrmr for by-group processing in R.   

↑ Back to the top


Keywords: kb

↑ Back to the top

Article Info
Article ID : 3104162
Revision : 1
Created on : 1/7/2017
Published on : 11/1/2015
Exists online : False
Views : 76