Recommended file formats
All digital file formats risk becoming obsolete over time. If this happens, future software may no longer be able to read or display the file contents correctly. To reduce the risk of files becoming unreadable, researchers should choose file formats that are likely to remain usable in the long term. Suitable formats for sharing and long-term preservation should be:
- common and widely used
- well-documented (meaning: with an available technical specification explaining how the format stores information)
- open or non-proprietary.
In some cases, data may be stored in a format that is not the most open option but that is a recognized standard within a specific research domain – in other words, a format commonly used and supported by most researchers in the field. In such cases, it may be advisable to publish the data both in the format that is widely used within the discipline and, where possible, in a format more suitable for long-term preservation.
The same considerations apply to the long-term preservation of data as to data publication. To reduce the risk of files becoming unreadable, choose formats that are likely to be supported in the future. It is preferable that the format can be opened with a range of different software applications.
The Swedish National Archives has published a set of criteria for archival formats, which can also be used as guidance for selecting formats suitable for long-term preservation. (The information is currently in Swedish only.)
Recommended file formats in DORIS
SND has evaluated and compiled a list of formats suitable for research data intended for publication in a research data catalogue. The recommended formats are divided into two categories: formats for sharing and formats for long-term archival preservation.
The formats recommended for sharing are widely used and compatible with open-source software. This list also includes some proprietary formats that meet these criteria.
The table also contains suggested suitable formats for long-term preservation in archives or similar environments. Preserving data in these formats allows for future analysis using different software, even if all features of the original format may not be retained.
More information and file format guides on Researchdata.se
For more information, see the file format pages on Researchdata.se. There you will also find links to SND’s dedicated file format guides. Where available, the table below includes links to relevant pages on Researchdata.se.
If you are reviewing data provided in formats not listed in the table, please contact the SND office at snd@snd.se for advice on suitable approaches for sharing and long-term preservation.
Suggested formats |
– for sharing |
– for long-term preservation |
|
|
|
|
|
|
Raster images
Vector images
|
Raster images
Vector images
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Markup language |
|
|
RDF |
|