L: Label
Make sense of what kind of data you have. Organize and categorize your data to make it understandable and usable for AI applications.
The S.O.R.T. Methodology
A framework for structuring and classifying your data for maximum clarity and utility.
Structure types
Group your data by its structure: structured (e.g., tables), semi-structured (e.g., JSON, XML), and unstructured (e.g., text, images).
Organise formats
Standardize file formats and data structures within each category to ensure consistency and ease of processing.
Rank quality
Prioritize datasets based on their completeness, accuracy, and relevance to your business goals.
Tag for use
Label sensitive data (e.g., PII), assign data ownership, and tag datasets with metadata describing their content and intended use.
Next Step: Evaluate
Now that your data is labeled and organized, it's time to assess its quality. Learn how to identify and address issues with the Evaluate dimension.
Continue to E: Evaluate