Writing good data definitions
Having a clear and consistent definition of the data you are exchanging will make it easier for the community to send the right data, first time
These principles were influenced by ISO 11179, part 4. You should have a look if you’re interested writing good definitions
Define in the singular
Example: Article Number
Not so good definition | Reference number identifying articles |
Better | A reference number that describes an article |
Why? | The ‘not so good’ definition uses the plural word “articles”. This makes it ambiguous since it could imply that article numbers can apply to more than one article |
Don’t negate
Example: Freight Cost Amount
Not so good definition | Costs which are not related to packing, documentation, loading, unloading and insurance |
Better | The cost incurred by a shipper moving goods from one place to another |
Why? | The ‘not so good’ definition doesn’t specify what is actually in the definition |
Use phrases
Example: Agent Name
Not so good definition | Representative |
Better | The name of a party authorised to act on behalf of another party |
Why? | ‘Representative’ is a near-synonym of the item’s name. It doesn’t actually explain anything. |
Example: Country Name
Not so good definition | Name of a country |
Better | The commonly known, short name that identifies a country |
Why? | Simply restating the name in sentence form doesn’t describe the concept that is a short name, vs. an expanded or long Name |
Example: Mailing Address Country
Not so good definition | The mailing address country name is the name of the country where the mail piece is delivered. |
Better | The name of the country where a mail piece is delivered |
Why? | Repeating the item’s name in a sentence doesn’t add anything that clarifies the item’s intent. There is also no reason to use more words than needed. |
Only use commonly understood abbreviations
Example: Tide Height
Not so good definition | The vertical distance from MSL to a specific tide level |
Better | The vertical distance from mean sea level (MSL) to a specific tide level |
Why? | MSL may not be a commonly understood abbreviation and some members may need to refer to other sources to determine what it represents |
Don’t embed other definitions
Example: Sample Type Code
Not so good definition | The code identifying the kind of sample collected. A sample is a small specimen taken for testing. It can either be an actual sample for testing, or a quality control surrogate sample. A quality control sample is a surrogate sample taken to verify results of actual samples |
Better | The code identifying the kind of sample |
Why? | “quality control surrogate sample” should be defined as a separate item |
Stick to the essentials
Example: Invoice Amount
Not so good definition | The total sum of all chargeable items mentioned on an invoice, taking into account deductions on one hand, such as allowances and discounts, and additions on the other hand, such as charges for insurance, transport, handling, etc |
Better | The total sum charged on an invoice |
Why? | There is a lot on extra information that isn’t relevant |
Don’t overprescribe
Example: Consignment Loading Sequence Number
Not so good definition | The number indicating the sequence in which consignments are loaded in a truck |
Better | The number indicating the sequence in which consignments are loaded for transport |
Why? | Consignments can be transported by more than trucks. |
Be precise
Example: Country Code
Not so good definition | The code that represents a country |
Better | The alphabetic code assigned by the International Standards Organisation (ISO) 3166-1 to represent a country |
Why? | Country Codes are assigned by various standards and organisations. Without specifying which standard is being used, interoperability issues are likely to occur |
Be concise
Example: Character Set Name
Not so good definition | The name given to the set of phonetic or ideographic symbols in which data is encoded, for the purpose of this metadata registry, or, as used elsewhere, the capability of systems hardware and software to process data encoded in one or more scripts |
Better | The name given to the set of phonetic or ideographic symbols in which data is encoded |
Why? | Don’t include unnecessary information that doesn’t actually describe what the item is |
Stand alone
Example: School Location City Name
Not so good definition | See “school site” |
Better | The name of the city where a school is situated |
Why? | Don’t rely on other definitions to understand the meaning of items |
Don’t be restrictive
Example: Income Tax Gross Amount
Not so good definition | The amount of income tax payable on the company’s annual return before the allowance of any rebates/tax offsets or credits |
Better | The amount of income tax payable before the allowance of any rebates/tax offsets or credits |
Why? | By being overly restrictive the item can only be used in one context, forcing other similar items to be created |
Avoid circular reasoning
Example: Employee and Employee ID Number
Employee definition | The number assigned to an Employee |
Employee ID Number definition | The person corresponding to the an employee ID number |
Why? | Each definition refers to the other for it’s meaning. Neither definition actually provides any meaning. |