---
alias: user-guide-data-set-index
tags:
  - data platform
description: "This documentation describes data sets used within a data platform"
---

# Data Set

:lock: Dataset.**Show**

## Overview

A **Data Set** represents a collection of data in tabular format. **Data Sets** can be created automatically when a new IoT Event Definition is created, or defined by a user with a SELECT SQL statement. They can be used for many purposes, such as training ML models or feeding dashboards. Data can also be accessed by any client that supports OData (for example, Excel, Power BI, or Jupyter Notebook).

## Folders

On the **Data Sets** page in the **Data Platform** menu, the **Data Sets** are organized into multiple root folders:

![Data Sets](images/data_set_01.png)

Each folder contains a different type of **Data Set**:

- **Apps** - folder for data sets created and used by MES Apps.
- **CDM** - folder for system data sets containing the Canonical Data Model data.
- **Datasilos** - folder for Data Silos.
- **DWH** - folder for Data Warehouse data sets containing aggregated data.
- **IoTEvents** - folder for data sets associated with IoT Events.
- **MES** - folder for data sets containing data from the ClickHouse MES database.
- **ODS** - folder for data sets containing MES ODS data.
- **UserDefined** - folder for data sets manually created by MES users.

The **IoTEvents** folder contains the data sets that are created automatically whenever a new **IoT Event Definition** is created. These data sets hold the flattened events posted to **Data Platform** through those IoT Events.

Data sets that are created manually are placed in the **UserDefined** root folder, where you can also create sub-folders.

All other folders contain system data sets that hold MES data.

## System Data Sets

Here is the list of system data sets:

!!! warning

    This list does not include all available **Data Sets**, such as those created from each **IoT Event Definition**.

{% include-markdown 'includes/docsync/dataset_list.md' %}
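
As noted in the Overview, data sets can be read by any client that supports OData, such as a Jupyter Notebook. The following is a minimal sketch of how such a client might build a query URL with the standard OData options `$select`, `$filter`, and `$top`. The service root, the data set name, and the column names are hypothetical placeholders and are not taken from this documentation. Use the endpoint and names that apply to your environment.

```python
from urllib.parse import quote, urlencode

# Hypothetical service root; replace with the OData endpoint of your environment.
BASE_URL = "https://example.invalid/odata"


def build_odata_url(base_url, data_set, select=None, filter_expr=None, top=None):
    """Build a query URL using the standard OData options $select, $filter and $top."""
    options = {}
    if select:
        options["$select"] = ",".join(select)
    if filter_expr:
        options["$filter"] = filter_expr
    if top is not None:
        options["$top"] = str(top)

    url = f"{base_url.rstrip('/')}/{quote(data_set)}"
    if options:
        # Keep "$" and "," readable; spaces are encoded as %20 rather than "+".
        url += "?" + urlencode(options, safe="$,", quote_via=quote)
    return url


url = build_odata_url(
    BASE_URL,
    "MyDataSet",                    # hypothetical data set name
    select=["Timestamp", "Value"],  # hypothetical column names
    filter_expr="Value gt 10",
    top=5,
)
print(url)
```

The resulting URL can then be passed to an HTTP client of your choice, together with whatever authentication your environment requires. How the service expects credentials is not covered here and would need to be confirmed for your installation.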