Data Quality

David Hearle
Data quality refers to the ability to use a dataset for its intended purpose. It rests on four criteria:

availability,
relevance,
cleanliness,
and usability.

Data availability means that the data is ready for use and up to date.

Relevance implies that the data should be clear, not confusing, and should answer the research questions being asked. Irrelevant data is of no use to a data analyst.

Clean, complete datasets are free of errors and have minimal missing entries.
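As a rough illustration of "minimal missing entries", the share of missing values per column can be measured directly. The sketch below is only one way to do it: the function name `missing_share`, the sample rows, and the choice to count `None` and empty strings as missing are all assumptions made for this example.

```python
from typing import Optional


def missing_share(rows: list[dict[str, Optional[str]]]) -> dict[str, float]:
    """Return, per column, the fraction of rows whose value is missing.

    Assumption for this sketch: None, an empty string, or an absent key
    all count as a missing entry.
    """
    if not rows:
        return {}
    # Collect every column name that appears in any row.
    columns = {key for row in rows for key in row}
    report = {}
    for col in sorted(columns):
        missing = sum(1 for row in rows if row.get(col) in (None, ""))
        report[col] = missing / len(rows)
    return report


# Hypothetical sample data, invented for illustration only.
rows = [
    {"id": "1", "age": "34", "city": "Leeds"},
    {"id": "2", "age": "", "city": "York"},
    {"id": "3", "age": "41", "city": None},
    {"id": "4", "age": "", "city": "Hull"},
]

report = missing_share(rows)
print(report)  # {'age': 0.5, 'city': 0.25, 'id': 0.0}
```

What counts as an acceptable share of missing entries depends on the analysis, so any threshold applied to this report is a judgment call.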

Usability is the ease with which an analysis can be conducted to uncover useful findings from the dataset.

Proper formatting and a well-organized codebook help ensure a high standard of data quality.
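To make the role of a codebook concrete, here is a minimal sketch of checking one record against one. The codebook layout (a `type` plus optional `allowed`, `min`, and `max` keys), the field names, and the function `violations` are hypothetical and chosen only for this example; real codebooks vary widely in format.

```python
# Hypothetical codebook: each field lists its expected type and limits.
codebook = {
    "age": {"type": int, "min": 0, "max": 120},
    "smoker": {"type": str, "allowed": {"yes", "no"}},
}


def violations(record: dict, codebook: dict) -> list[str]:
    """List the ways a record departs from the codebook (empty if none)."""
    problems = []
    for field, rule in codebook.items():
        if field not in record:
            problems.append(f"{field}: missing")
            continue
        value = record[field]
        if not isinstance(value, rule["type"]):
            problems.append(f"{field}: expected {rule['type'].__name__}")
            continue
        if "allowed" in rule and value not in rule["allowed"]:
            problems.append(f"{field}: value not in codebook")
        if "min" in rule and value < rule["min"]:
            problems.append(f"{field}: below minimum")
        if "max" in rule and value > rule["max"]:
            problems.append(f"{field}: above maximum")
    return problems


good = violations({"age": 34, "smoker": "yes"}, codebook)
bad = violations({"age": 150, "smoker": "maybe"}, codebook)
print(good)  # []
print(bad)   # ['age: above maximum', 'smoker: value not in codebook']
```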

For a dataset of poor quality, a data improvement plan is needed to make the required adjustments.
Published 4 years ago, on 1399/06/03.
Viewed 7,956 times.