[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AUDITORY] F.O.S.S auditory data server 20TB - feasibility/utility ?



Nathan,

We’re running a 6TB NAS over local LAN for our lab needs and use OSF repositories (https://osf.io/) for publishing open datasets. I think that the issue here is longevity and as a community we should aim for 50+ years of preservation guarantees. OSF has been setup for exactly that, with an institutional fund to support their servers for 50+ years. 

Personally I’ve been in a situation where I'd needed data that someone had openly shared in the early days of open science (think early 2000s). They'd shared it on their institution’s server, which seemed sensible at the time. However, the server has since been updated a million times and the URL is broken. The corresponding author seems to have left academia, the senior author died, other authors have no clue as to what happened with the data… 

So even though it’s quirky, I prefer to have an OSF DOI and a guarantee that it’s going to point to the dataset for the next 50+ years. 

Storing data that’s not anonymised is another issue altogether, but I’d myself prefer to have something local for that.

Best,
Chris

--
Krzysztof Basiński, PhD
PRINCIPAL INVESTIGATOR

ASSISTANT PROFESSOR
Division of Quality of Life Research
Medical University of Gdańsk
krzysztof.basinski@xxxxxxxxxxxx
zbnjz@xxxxxxxxxxxx
+48 58 349 1569
Tuwima 15, 80-210 Gdańsk 
mug.edu.pl 
farU.edu.pl/en 


attachment.png 


This e-mail and attachments to it may include information legally privileged. It is intended solely for the recipient. If you are not the recipient,please inform the sender accordingly and delete the e-mail with all attachments received. Copying and/or disseminating this e-mail and information included in the e-mail by unauthorised individuals is strictly prohibited. Your personal data is administered by Medical University of Gdańsk. Information on personal data processing is available at gumed.edu.pl/55839.html


On 26 Sep 2024, at 09:38, Nathan Barlow <nb.audiology@xxxxxxxxx> wrote:

Dear AUDITORY List members,

Would there be any interest from members in a collective server with a data array of 20TB; the aim being able to secure store Auditory data from human participants in line with Human ethics boards requirements on data longevity. Think large open data sets. Think being able to browse your dataset for new insights in a new way with their specialism.

Myself,I have found that long-term storage required a specialised array to extend past the 10year mark. This is for example the finalised dataset of semi or even fully anonymised data, or equally the large size original recordings (video, EEG, etc) that span several GB a file.

A description of the server can be found at: https://tinyurl.com/auditory-serve

(note the drives in the actual server are six 5Tb HDD's in an array, with data longetivty exceeding 15 years- but the description shows six 1Tb - an older configuration)
It runs Linux but SMB based (allowing Windows compatibility) and exFAT (limiting to 4GB per file) and using SSH currently for secure connection and data transfer.


I look forward with interest to any correspondence with AUDITORY members and their learned opinions on wether such a open source, non-profit server is of use within our auditory neuroscientific community, or indeed even hearing about your existing options in this space.

nga mihi
Nathan



--
Nathan Barlow
BSc, PGDip, MSc(SpchSci)(Hons), CoP, MSc(Clinical Audiology)(Soton)
@eres_ope