Data Archiving
Google Drive
ASU provides unlimited google drive storage to all the students and faculty members.
We can leverage this opportunity to archive our group research data.
Source vs Data
For source files, we use git mainly for version control purposes. Also, the source files are usually small in size, and github is sufficient for storage.
For data files, we use googledrive for archiving purposes. Also, the data files are usually large in size, and googledrive is perfect for this.
General principles
- For each project, create a subfolder in the shared google drive.
- The subfolder should have a concise but indicative name, for example, jtao-rotation1
- Create sub-subfolders as necessary
- Include a README file to briefly introduce the background of the data and the structure of the data; also in the README file, include a link to the git repo of the source files.
- …
For use with the cluster
- Follow the instructions from Research Computing to configure your
rcloneon agave and linkrcloneto the group shared google drive. - Include a line of
rclone copy commandin the batch file (as shown in the template), and follow the instructions to automate the archiving process immediately after your simulation is completed. - If you prefer to use GUI, you can use Globus.
For use outside of the cluster
- Archive early, archive often.
- Follow the general principles.