2.3 How will you handle copyright and Intellectual Property Rights issues?
Content to be filled in by: TTO, Library_SEARCH, AD
IK contacte Gabriel Clerc.
3. Data storage and preservation
3.1 How will your data be stored and backed-up during the research?
Content to be filled in by: NL, AS, ME
- Where and on what media will the data be stored?
- How will the back-up be organized (frequency and responsibilities)?
- What are the risks to data security?
- How will data security be guaranteed (e.g. encryption or recovery after an incident)?
- Is the available storage sufficient or will you need to invest in additional services?
Table with storage solutions proposed at EPFL is stored on Switchdrive/DMP FNS folder. Le tableau d'offres de stockage avec leurs caractéristiques est disponible dans le folder DMP SNF de Switchdrive.
3.2 What is your data preservation plan?
Content to be filled in by: RR, JK, NL, AS, ME
Describe the procedure, notably which criteria, is used to select data to be preserved, for how long and whether they must be made public. Note that preservation does not necessarily mean publication (e.g. personal sensitive data may be preserved but never published).
This section should answer the following questions:
- What data will be preserved in the long term: selection criteria. Most important criteria:
- Reusability of the data (quality of metadata, integrity and accessibility of data, license allowing reuse, readability of data (chosen file formats))
- Value of the data (indispensable data, completeness of the data or data set, uniqueness, possibility to reproduce the data in the same conditions, interest of the data, potential of reuse)
- Ethical considerations
- Stakeholders requirements
- Costs (are there additional costs that come with using the repositoryor data archive of your choice? Did you anticipate these costs for using therepositorrepository or data archive? Howwill you cover such costs?)
- Selection basically has to be done together with or by the data producer or someone else with deep specialist knowledge.
- What data curation process(es) will be applied: anonymization (if necessary), improvement of the metadata, format migration, integrity check, measures to ensure accessibiliy, etc.?
- For how long (0, 5, 10, 20... years or unlimited)?
- Will the data be made public?
- Is the data sensitive (i.e. subject to privacy, ethics, or intelectual property laws)?
- Who will be responding to the process and after the project, who will be the respondent of these data?
4. Data sharing and reuse
4.1 How and where will the data be shared?
Content to be filled in by: LS, JK (+KD), AS, ME
On which repository do you plan to share your data?
It is recommended to publish in well established (or even certified) domain specific repositories if available :
- ETH Zurich researchers are encouraged to publish in ETH's own Research Collection repository to ensure full compliance with ETH regulations.
- re3data is a repository directory allowing to select repositories by subject and level of trust (e.g. certifications)
- recommended data repositories by Nature
In domains, for which no suitable subject repositories are available, generalist repositories are available, particularly:
Note: SNSF does not pay for storage in commercial data repositories (even thought data preparation costs are eligible)
How will potential users find out about your data?
You may consider:
- Citing your dataset from your publications (using persistent identifiers such as Digital Objects Identifiers).
- Write a data / software paper describing your data.
- Advertise via social media.
4.2 Are there any necessary limitations to protect sensitive data?
Content to be filled in by: AD
You may mention specifically the conditions under which the data will be made available:
- there is no sensitive data
- the data is not available at the time of publication
- the data is not available before publication
- the data is available after the embargo of ...
- the data is not available because of the patent of ...
AD s'en occupe et à re-confirmer avecReO
4.3 I will choose digital repositories that are conform to the FAIR data principles
Content to be filled in by: LS, JK, AS, ME
A good data management should in fine facilitate discovery, assess, reuse and analysis of data by humans and machines. Following the
Force11 FAIR Data Principles is a good way to achieve this. FAIR stands for :
- Findable: a persistent and unique identifiers is attributed to each dataset, which is described by searchable metadata, i.e. data describing data
- Accessible: the persistent identifier allows to retrieve the datasets via a an open communication protocol
- Metadata and data are accessible
- Interoperable: data and metadata are both expressed in a formal and broadly applicable language or format
- Reusable: metadata are accurate and relevant, follow community standards, specify a clear data usage license and indicate provenance of the data
4.4 I will choose digital repositories maintained by a non-profit organisation
Content to be filled in by: AS, ME
You have to answer here by choosing yes/no on a radio button. If you choose "no", you will have to explain why you cannot share your data on a non-commercial digital repository.
One possible reason would be to ensure the visibility of your research, if your research community is standardly publishing data on a well-established but commercial digital repository.