Transitive data skew

In distributed computing problems, transitive data skew is an issue of data synchronization.

It arises with the uneven distribution of otherwise evenly distributed data across a number of devices while the data is in transition. If sorted data is being distributed across multiple devices and the column on which that data is sorted is the "key" used to identify the target device, the resulting transitive data skew may be self-correcting.


This article is issued from Wikipedia - version of the 1/7/2015. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.