Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • To be able to list all projects and datasets in a chronological order, all names MUST start with four digit version of the current year eg. 2023.

  • All names MUST be case insensitive. Do not assume case sensitivity across operating systems eg. Project, PROJECT, project are three different names on Linux, but it is the same name on Windows.

  • All names MUST only contain upper/ lower case letters, digits, underscore and hyphen characters to achieve interoperability across the operating systems.

  • All names MUST have between 8 to 64 characters in total. Minimum amount of characters is derived from the current dataset names and maximum of 64 was agreed during a eRI project review on 2 February 2023.

...

We can derive the following regular expression from the common needs:

Code Block
([0-9]{4})([a-zA-Z0z0-9_-]{4,60})

Here is a sample Python code to illustrate the naming convention regular expression in action for existing datasets with shortest and longest names that currently exist in AgResearch HPC:

...