Statistics for data quality in a big data context