Globus is a third-party service for transferring large amounts of data between Globus Data Transfer Nodes (DTNs). With Globus, very high data transfer rates are achievable. This service allows data to be accessible to any person who has a Globus account.
To use Globus on NeSI platforms, you need:
- A Globus account (see Initial Globus Sign-Up and Globus Id)
- An active NeSI account (see Creating a NeSI Account)
- Access privileges on the non-NeSI Globus end point you plan on transferring data from or to. This other end point could be a personal end point on your workstation, or it could be managed by your institution or a third party.
Note that a NeSI user account does not create a Globus account, and similarly a Globus account does not create a NeSI user account. Nor can you, as the end user, link the two through any web site.
Both your accounts (NeSI and Globus) must exist before you try to use our DTN.
The NeSI Wellington DTN endpoint is protected by a second factor authentication (2FA).
The NeSI Data Transfer Node
The NeSI Data Transfer Node (DTN) acts as an interface between our HPC facility storage and a worldwide network of Globus endpoints. This is achieved using Globus.org, a web-based service that solves many of the challenges encountered moving large volumes of data between systems. While NeSI supports use of other data transfer tools and protocols such as
scp, Globus provides the most comprehensive, efficient, and easy to use service for NeSI users who need to move large data sets (more than a few gigabytes at a time).
Types of Globus endpoints or Data Transfer Nodes
Globus data transfers take place between endpoints. An endpoint is nothing more than an operating system (Windows, Linux, etc) that has the Globus endpoint software installed on it. Endpoints come in two kinds: personal and server.
The NeSI DTN is an example of a server endpoint. These type of endpoints are usually configured to access large capacity and high-performance parallel filesystems. Endpoints can be unmanaged or managed by a subscription. NeSI DTN is a server type, managed endpoint (by NeSI subscription) which allows authorised users to provide data transfer and data sharing services on behalf of their Globus accounts.
Your institution may have its own managed server endpoint, and if so we encourage you to use that endpoint for your data transfers between your institution and NeSI. You may need to apply to the person or group administering the managed server endpoint, most likely your IT team, to get access to the endpoint. Your institution may even have several endpoints, in which case we recommend that you consider which one would be best suited for your data transfer requirements. If you need any help in regards to this, get in touch with us via firstname.lastname@example.org, or consult your institution's IT team.
If your institution doesn't have a managed server endpoint, you can set up a personal endpoint using software provided by Globus (see below). Please be aware that even if you set up a personal endpoint, you may still need to consult your IT team in order to make it usable, especially if your institution has an aggressive firewall.
Transferring data using a managed endpoint
As an example, to move files between the NeSI HPC Storage (accessible from Māui and Mahuika) and the Otago University high-capacity central file storage (another managed server endpoint):
NeSI endpoints start with "nesi#":
Select the endpoint "Nesi Wellington DTN", and you will be asked to authenticate your access to the endpoint.
Note: Effective from 15 September 2020, the NeSI Wellington DTN endpoint will be protected by a second factor authentication (2FA). This change will require additional information to be entered in the 'Password' field when you log in. In the 'Username' field, enter your Māui/Mahuika username. In the 'Password' field, your
Password will be equal to
First Factor +
Second Factor e.g.
password123456. (Do not use any additional characters or spaces between your password and the token number.)
On Māui and Mahuika, the NeSI DTN can only see (1) your /home directory, (2) project directory (read-only) /nesi/project/<project_code> and (3) project subdirectories of /nesi/nobackup - see Globus Paths, Permissions, Storage Allocation.
Navigate to your selected directory eg the nobackup filesystem /nesi/nobackup/<project_code> and select the two-endpoint panel for transfer.
Select the target endpoint and authenticate.
When you have activated endpoints in both transfer windows, you can start transferring files between them.
Select files you wish to transfer and select the corresponding "Start" button:
- Sign in to the NESI Globus Web App https://transfer.nesi.org.nz/. You will be taken to the File Manager page https://transfer.nesi.org.nz/file-manager
- If this is your first time, you will need to create a Globus account.
- Open the two-endpoint panel located on the top-right of the File Manager page.
- Select the Endpoints you wish to move files between (start typing "nesi#" to see the list of NeSI DTNs to select from). Authenticate at both endpoints.
- At Globus.org the nesi#hpcf-dtn endpoint defaults to your home directory (represented by "/~/") on Mahuika or Māui. We do not recommend uploading data to your home directory, as home directories are very small. Instead, navigate to an appropriate project directory under /nesi/nobackup (see Globus Paths, Permissions, Storage Allocation).
- Transfer the files by clicking the appropriate button depending on the direction of the transfer.
- Check your email for confirmation about the job completion report.
Transferring data using a personal endpoint
To transfer files into/out of your laptop, desktop computer or any other system you control, configure it as a Globus Personal Endpoint (see Personal Globus Endpoint Configuration for transfers between personal endpoints).
To share files with others outside your filesystem, see https://docs.globus.org/how-to/share-files/