Dirty data

Dirty data is inaccurate, incomplete or erroneous data, especially in a computer system or database.[1]

In reference to databases, this is data that contain errors. Unclean data can contain such mistakes as spelling or punctuation errors, incorrect data associated with a field, incomplete or outdated data, or even data that has been duplicated in the database. It can be cleaned through a process known as data cleansing.[2]

See also

References

  1. Margaret Chu (2004), "What Are Dirty Data?", Blissful Data, p. 71 et seq., ISBN 9780814407806
  2. Wu, S. (2013), "A review on coarse warranty data and analysis", Reliability Engineering and System, 114: 1–11, doi:10.1016/j.ress.2012.12.021
This article is issued from Wikipedia - version of the 10/29/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.