Glossary
- analytics-ready data
-
data that has been prepared to meet the requirements of analysis. This means the data is clean, consistent, complete and formatted in a way that allows for easy interpretation. To achieve this state, the data undergoes pre-processing steps, such as cleaning, transformation and validation, to align it with the specific goals of the analysis.
- data
-
(singular: datum) refers to a collection of raw, unprocessed facts, measurements or observations. Data can take many forms, including numbers, text, symbols, images, audio, video, and more.
- data accessibility
-
the ease with which authorised users can locate, retrieve and use data when needed for analysis or decision-making
- data cleaning
-
process of identifying and correcting errors, inconsistencies and missing pieces of information in a dataset to improve its quality and reliability
- data completeness
-
extent to which all required data fields, including the metadata that describes the characteristics and context of the data, are available and properly filled
- data consistency
-
data values remain uniform (identical and accurate) across datasets and systems
- data consolidation
-
in a business setting, data consolidation is the process of combining information collected from different systems or tools into one central place
- data content accuracy
-
data values are correct and reflect the true state of the object or event they represent
- data contextuality
-
refers to whether the data captures the context surrounding an event or transaction, enhancing its interpretability
- data diversity
-
the variety of data types and sources, ensuring a multi-dimensional view
- data granularity
-
refers to the level of detail in the data. High granularity means the data is very detailed, while low granularity provides a summarised view.
- data integrity
-
ensures that relationships between data elements are maintained correctly
- data interoperability
-
ability of data to integrate seamlessly across different platforms, systems and tools
- data interpretability
-
the extent to which the data can be analysed and understood in a meaningful way
- data pre-processing
-
preparing raw data for analysis by applying techniques such as cleaning, transformation, reduction and integration. It ensures that data is in an optimal state for generating reliable and accurate insights.
- data privacy
-
the rights of individuals to control how their personal data is collected, stored and shared in compliance with ethical and legal standards
- data reduction
-
process of making large datasets smaller and simpler without losing important information
- data relevance
-
measures of whether the data is meaningful and applicable to the specific analysis or business question
- data richness
-
refers to the depth, breadth and diversity of data available for analysis
- data scalability
-
ensures that the data infrastructure can handle growth in data volume, variety and complexity without degrading performance
- data security
-
refers to the measures and technologies used to protect data from unauthorised access, breaches, theft or corruption
- data timeliness
-
also referred to as data currency ensures that data is up-to-date and available at the moment when it is needed for decision-making
- data traceability
-
ability to trace data back to its source and track its transformations during processing
- data transformation
-
process of converting raw data into a format suitable for analysis. It involves modifying the structure, format or values of data to meet specific requirements of the analytics task.
- data uniqueness
-
no duplicate records exist within a dataset
- data usability
-
the ease with which data can be accessed, understood and utilised by end-users
- data validity
-
data follows predefined formats, types or business rules
- DIKW hierarchy
-
Data-Information-Knowledge-Wisdom (DIKW) hierarchy illustrates how raw data is transformed into meaningful insights and, ultimately, into wise decision-making.
- information
-
data that has been processed, organised or structured to provide meaning and context, making it more understandable and relevant to specific purposes. Unlike raw data, information serves a particular goal, often answering specific questions or helping with decision-making.
- knowledge
-
created by analysing and synthesising information to identify patterns, relationships or insights that contribute to a deeper understanding. It connects various pieces of information, enabling interpretation based on experience, expertise or reasoning. Knowledge goes beyond basic facts, providing meaningful insights that often lead to actionable recommendations for strategic decisions.
- temporal relevance
-
ensures that data corresponds to the appropriate time frame for the specific analysis or decision-making process.
- wisdom
-
represents the highest level in the progression from data to decision-making. It is the ability to make sound decisions and judgements based on accumulated knowledge and experience. Wisdom involves applying knowledge with insight, ethics and a long-term perspective, carefully weighing the potential impacts of decisions.