Repository structure

Transformation rules and other artefacts for the TED Open Data Services (TED-ODS) are organised in the ted-rdf-mapping repository.

The repository structure and folder descriptions are shown below:

/mappings
    /package_F03
    /package_F06
    /package_F25
    ...
/src
    /mappings
        *.rml.ttl
/test_data
    /sampling_2014_2022
    /sampling_2021
    /sampling_manual
Makefile
requirements.txt

The /mappings folder contains mapping suite packages organised by standard form number. Each package name is the form number (e.g. F03, F06) prefixed with package_ for readability. When the eForms are mapped, an appropriate organisation will be chosen for them.
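The naming convention above can be sketched in Python as follows. This is only an illustration: the helper names form_number and list_packages are hypothetical and not part of the repository, and the pattern assumes that a form number is the letter F followed by digits, as in the examples given.

```python
import re
from pathlib import Path
from typing import Dict, Optional

# Convention from the text: "package_" followed by the form number, e.g. package_F03.
# Assumption: a form number is "F" followed by one or more digits.
PACKAGE_NAME = re.compile(r"^package_(F\d+)$")


def form_number(package_dir_name: str) -> Optional[str]:
    """Return the form number encoded in a package folder name, or None."""
    match = PACKAGE_NAME.match(package_dir_name)
    return match.group(1) if match else None


def list_packages(mappings_dir: Path) -> Dict[str, Path]:
    """Map each form number to its package folder under the mappings folder."""
    packages: Dict[str, Path] = {}
    for entry in sorted(mappings_dir.iterdir()):
        # Only directories that follow the naming convention are packages.
        form = form_number(entry.name) if entry.is_dir() else None
        if form is not None:
            packages[form] = entry
    return packages
```

Called as list_packages(Path("mappings")) from the repository root, this would return one entry per package folder that follows the convention and skip everything else.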

The /src/mappings folder is the "single source of truth" for the mapping rules across the various mapping suite packages. This is necessary because of the modularisation and reuse approach adopted in this project: the mapping rules are organised in modules, all of which are stored in the source folder. In its conceptual mapping workbook, each mapping suite lists the modules that are used to compose the complete set of transformation rules for the mapped form number.
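A minimal sketch of how the "single source of truth" idea could be checked is given below. It assumes that the list of module file names for a suite has already been read from its conceptual mapping workbook (the workbook format is not covered here) and that modules follow the *.rml.ttl naming shown in the repository tree. The function names are hypothetical.

```python
from pathlib import Path
from typing import Iterable, List


def available_modules(repo_root: Path) -> List[str]:
    """Return the file names of all RML modules stored in src/mappings."""
    modules_dir = repo_root / "src" / "mappings"
    return sorted(path.name for path in modules_dir.glob("*.rml.ttl"))


def missing_modules(repo_root: Path, required: Iterable[str]) -> List[str]:
    """Return the required module names that are absent from src/mappings."""
    present = set(available_modules(repo_root))
    return sorted(name for name in required if name not in present)
```

An empty result from missing_modules means that every module listed by the suite can be found in the source folder.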

The /test_data folder contains sample TED notices generated by different selection methods. Some manually selected notices are available in the /sampling_manual subfolder. The automatically generated notice samples in the /sampling_2021 subfolder are described here. The /sampling_2014_2022 subfolder contains samples generated from all available notices in the 2014-2022 period, covering the various changes in the XML Schema over the years. More automatically generated samples will follow.

Upper level folders of the GitHub ted-rdf-mapping repository

This section describes the upper level of the GitHub repository and aims to give the reader an overview of the repository structure. The next section describes the lower levels, where the more granular artefacts can be found.

Folder Subfolders Description

/docs

/antora

Contains the documentation for ted-rdf-mapping.

/mappings

NB: the example provided here is for Standard Form F03, but it applies in the same way to the mapping suites for the other forms covered by the mappings.

Contains mapping suite packages organised by standard form number. Each package name is the form number (e.g. F03, F06) prefixed with package_ for readability. When the eForms are mapped, an appropriate organisation will be chosen for them.

/mappings continued: subfolders

Contains example notices with real data, extracted from the TED website, for the form concerned (F03 in this example).

The folder ending in F03_2021 contains examples from notices published in 2021.

The folder ending in F03_S01 contains examples from notices published using the XML release R2.0.9.S01.E01.

The folder ending in F03_S02 contains examples from notices published using the XML release R2.0.9.S02.E01, and so forth for S03, S04 and S05.

The latest publication XML can be found at TED schemas - EU Vocabularies - Publications Office of the EU (europa.eu), under Publication Schema. The Excel file in the zip file provides the change log between schema versions.

/src

Contains the "single source of truth" for the mapping rules across various mapping suite packages. This is necessary because of the modularisation and reuse method adopted in this project. The basic idea is that the mapping rules are organised in modules, and all are stored in the source folder.

In its conceptual mapping workbook, each mapping suite lists the modules that are used to compose the complete set of transformation rules for the mapped form number.

/test_data

Covers samples of notices across all form types, whether or not they are covered by the mappings produced to date.

The samples are broken down by a mapping to the eForms sub-type:

/sampling_2014_2022 and /sampling_2021 cover notices from the respective periods and are generated automatically.

/sampling_manual contains notices concerning F03 that have been produced manually.

Test data specific to each mapping suite is also provided in its mapping suite package.
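The folder-name endings described for the F03 example notices (F03_2021, F03_S01 to F03_S05) can be decoded with a small helper. The sketch below is hypothetical and not part of the repository. It assumes that the release name always has the form R2.0.9.Sxx.E01, as in the two releases named in the table, and that a four-digit ending is a publication year, which generalises from the single F03_2021 example.

```python
import re
from typing import Optional

# Endings S01 to S05 correspond to XML releases R2.0.9.S01.E01 to R2.0.9.S05.E01.
_RELEASE_ENDING = re.compile(r"_S(0[1-5])$")
# Assumption: a four-digit ending such as _2021 is the publication year.
_YEAR_ENDING = re.compile(r"_(\d{4})$")


def describe_sample_folder(folder_name: str) -> Optional[str]:
    """Describe what a test-data folder contains, based on its name ending."""
    release = _RELEASE_ENDING.search(folder_name)
    if release:
        return f"XML release R2.0.9.S{release.group(1)}.E01"
    year = _YEAR_ENDING.search(folder_name)
    if year:
        return f"notices published in {year.group(1)}"
    # Unknown ending: make no guess.
    return None
```

For example, describe_sample_folder("F03_S02") yields the release string for S02, while a name with an unrecognised ending yields None.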

The lower level folders of the GitHub Repository

This section provides more detailed information on the content available in the lower levels of the repository:

  • Mapping suite package structure

    • output

    • test-data

    • transformation

    • validation

  • source files

    • mappings

    • validation

