Skip to content

Summary of differences between C-SUB and JADE

This is a list of topics. Click on one to read more about it (click again to close it). We have used three colors of blocks to indicate the likelihood of that topic impacting the typical user. Some items are marked red but only require a one word change in your command or changing directory to a different location to retrieve your incoming files.

  • red, danger - you must deal with/know about
  • orange, warning - you probably should know about
  • light blue, info - less important changes
User Communities on JADE

If you had been using the C-SUB, you are a member of the "CMS community" on JADE.

There are three communities (so far) in JADE. Each one has a unique name and a single character equivalent. You will see both appear in cluster configuration, such as the first letter of a username (c-jhedid-dua) and in paths to files (/transfer/in/cms/).

Character       Name
----------      ---------- 
c               cms
d               dbgap
s               sysadmin
Login node

You must change your settings or commands to use the JADE login server.

JADE: jade01.jhsph.edu

C-SUB: jhpcecms01.jhsph.edu

Unchanged: Usernames,verification codes, and passwords (except 10201)

Most C-SUB users will continue using their usernames, verification codes, and passwords with JADE.

C-SUB users with 10201 in their usernames will switch to new 10401 accounts. For each such user, systems administrators will make a copy of the C-SUB 10201 home directory and place it into the JADE 10401 home directory.

10201 users can then look through "csub-homedir", move files out of it they want to keep, and then delete the rest if they want to save disk space. (Some directories like ~/.cache/ can contain large amounts of material.)

Compute node names have changed

If your batch job scripts specify the name of one or more compute nodes to use, you will need to modify them so they work on JADE.

Compute nodes in C-SUB had the prefix "compute-" and a suffix in the range 132 to 139.

Compute nodes in JADE have the prefix "jcompute-" and a suffix in the range 032 and higher.

There is a direct mapping of C-SUB compute-132 to JADE jcompute-032, etc.

Unix group changes

This change should be invisible.

Each C-SUB user has a 'primary' group and one or more 'secondary' groups. When users are migrated to JADE, we will change both primary & secondary group memberships.

SLURM partition name changes

If your batch job scripts specified the name of the partition to use, you will need to modify them so they work on JADE.

We have a document explaining how to replace strings in documents here.

In C-SUB, the default partition was named "cms" and the SAS one "sas".

In JADE, each community now has its own default partition. The CMS default is named "c-shared". Also, "sas" was renamed to be "c-sas".

If you do not specify a partition when submitting an interactive or batch job, then it is assigned to the cluster's default partition. Because the C-SUB contained very few SLURM partitions, we expect that very few batch jobs explicitly specified "cms".

We have added a modification to your home directory to accomodate a change in the name of the CMS community's default partition. The file ~/slurm/defaults contains a line specifying "c-shared" as the default.

Your ~/.bashrc file and ~/proposed directory

Explanation

Incoming/Outgoing files will not be copied from C-SUB to JADE

Files found in /cms01/incoming/ and /cms01/outgoing/ are not going to be copied over to JADE. Please copy any of them that you want to keep into your home directory before your group is migrated from the C-SUB to JADE.

SFTP data transfers

JADE uses the same TCP/IP networking ports as was used on the C-SUB: Data going into the cluster: 22011 Data coming out of the cluster: 22027

JADE uses different paths to store this kind of data. We cannot create symbolic links to hide this change.

C-SUB

Incoming   `/cms01/incoming/<username>`
Outgoing   `/cms01/outgoing/<username>`

JADE

Incoming   `/transfer/in/cms/c<dua>/<username>`
Outgoing   `/transfer/out/cms/c<dua>/<username>`

Example:
Incoming   `/transfer/in/cms/c55548/c-jxu123-55548`
Outgoing   `/transfer/out/cms/c55548/c-jxu123-55548`

There are two ways you work with this new path - when inside an SFTP transaction and when you are accessing files in the operating system.

When you connect with SFTP to JADE, the SFTP server program places you inside the directory appropriate for your community. You can only see files below that point.

So you "start from" either /transfer/in/cms/ or /transfer/out/cms/

To reach your own files, you need to cd into your group's directory, then into your own personal directory.

For example: cd c55548/c-jxu123-55548

If you use the MobaXterm SFTP program, there is a place in the session's settings where you can specify your directory. If you configure that once, you will automatically "land" in your final destination without having to issue any cd commands every time.

After you transfer files into JADE, you need to use a different path to access them.

On the C-SUB, you used /cms01/incoming/&lt;your_username&gt; On JADE, you will use /transfer/in/cms/c&lt;your_dua&gt;/&lt;your_username&gt;

Data moderation process

The data moderation (DM) team needs to use two new paths - where you find proposed files and where you put them after approving them. We cannot create symbolic links to hide this change.

Please take note of these path element symbols:

<dua> = for example, 59614 or 55548
<cdua> = for example, c59614 or c55548
<username> = for example, c-jli89-10201

Data moderators: After you have reviewed the files users have proposed for egress, you copy those files into the user-specific directory where they can access them from outside.

On the C-SUB, you found user files to review in /users/<dua>/<username>/proposed/
On JADE, you will find user files to review in /proposed/cms/<cdua>/<username>/

On the C-SUB, you placed approved files into /cms01/outgoing/<username>/
On JADE, you will use /transfer/out/cms/<cdua>/<username>/

The data moderation process:

  1. Users create their files
  2. Users copy the ones that they want to export out of JADE to a proposed directory in their home directory, as they did before. However, in JADE we have replaced that actual directory with a symbolic link pointing to the actual location: /proposed/cms/<cdua>/<username>/ Users can just work within ~/proposed/ as long as they don’t delete the link. (DM reviewers cannot use the “proposed” symbolic links, e.g. they cannot say “cd ~c-tbrow261-55548/proposed”. They cannot do that because, in JADE, home directories are unreadable by anyone except the user. Which is the way it should be.)
  3. Users contact DM reviewer team asking for review
  4. DM reviewer (who is a member of the j-c10401 group) changes directory to the "/proposed" directory for that user, and does what they do.
  5. DM reviewer copies the approved files to /transfer/out/cms/<cdua>/<username>/
  6. DM reviewer notifies user
  7. User, from outside of JADE, sftp -P 22027 <username>@jade01.jhsph.edu
  8. User issues command “cd /” and “get file1”
  9. User quits sftp session.
  10. Files older than 30 days are automatically deleted from /transfer/out/cms/<cdua>/<username>/
  11. It is up to users to clean up their ~/proposed/ files.
Mailing lists

Explanation

Operating system version

This change should be invisible.

Both clusters use the Rocky 9.x operating system. The C-SUB uses 9.4 while JADE uses 9.6.

libreoffice program names

Explanation

MATERIAL BELOW HAS NOT YET BEEN INCORPORATED OR DELETED YET

COMMUNITIES

Shorthand variables you will see used below:

<dua#> = alphanumeric string, typically containing only digits e.g. 55548 <cdua> = <community-letter><dua#> e.g. c55548 NOTE the "c" in <cdua> does NOT mean "cms", it means "c"ommunity

<username> = c-<jhedid>-<dua#> (for cms users) <username> = d-<jhedid>-<dua#> (for dbgap users) <username> = s-<jhedid>-<dua#> (for sysadmin users)

LOGIN NODE

C-SUB: jhpcecms01.jhsph.edu JADE: jade01.jhsph.edu

MAILING LISTS

mailing list is jade-cms-users rather than c-sub-users (@lists.jh.edu)

SLURM PARTITION

In the C-SUB, everyone shares a default partition named "cms" In JADE, the default partition for the cms community is c-shared.

Brand new accounts created by my scripts will be given a ~/.slurm/defaults file, the last line of which will be a community-specific default partition. We'll need to Do The Right Thing as we copy over C-SUB home directories of testers and then later migrated users. (First checking that no one has such a file, then, if not, copying a default one into place.)

USERNAME

The userid remains the same for C-SUB users.

UNIX USER GROUPS

We need to add users to new groups.

Existing groups used in the C-SUB will still be defined, at least for a time. I do not know whether having them around will introduce misleading testing results or actual problems, during testing or after a DUA group has migrated to JADE. This needs to be studied and made part of the recipe for copying /cms01/data/dua/ over into JADE.

Add users to: j-c-users This is the group which identifies members of the cms community within JADE C-SUB: c-users meant basically "all normal human users in the cluster" JADE: j-c-users means "cms people"

Add users to: j- This is the group which identifies access to a DUA's files. C-SUB: c- e.g. c-55548 JADE: j- e.g. j-c55548

Add data moderators to j-c-sftp-out This group replaces "c-sftp-out" for data moderators Data moderators have account names in the 10201 DUA, i.e. c--10201

PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS PATHS


HOME DIRECTORIES

C-SUB: /users// /users/10101/c-jtuniso1-10101

JADE: /users/// /users/cms/c77777/c-test1-77777

I have created symbolic links for each of the DUAs, e.g. /users/10101 -> /users/cms/c10101/

That will accomodate references to home directories or per-dua shared directories.

However, I think that any new CMS DUA's that come along should not have a link created, because the other JADE communities will not have them at all.

If users want to refer to home directories, as a general practice it is best to use $HOME or ~

If users want to refer to per-dua shared/ directories, they should use the new correct path.

INCOMING SFTP DATA

The port for incoming CMS data, TCP/IP port 22011, will remain unchanged.

Data will now land in a new path. C-SUB: /cms01/incoming// JADE: /transfer/in///

OUTGOING SFTP DATA (AFTER BEING APPROVED FOR EGRESS)

The port for outgoing CMS data, TCP/IP port 22027, will remain unchanged.

Data will now be available, after data review, in a new path. C-SUB: /cms01/outgoing// JADE: /transfer/out///

PROPOSED OUTGOING SFTP DATA (BEFORE BEING APPROVED FOR EGRESS)

Data needing data review, will go into a new path. C-SUB: /users///proposed JADE: /proposed///

This is being done so user home directories can be simpler and also be standard across all communities.

20251013: I have NOT finished tweaking everything for this to work.

DUA DATA

DUA data has been relocated to /data We are still working out the details of what part of paths under /data/ represent mount points.

C-SUB: /cms01/data/dua/
JADE: /data/cms//cui/ /data/cms//cui/intermediate/ /data/cms//cui-intermediate/

Why cui/intermediate/ ? We want groups to be able to keep CUI out of /users/ as much as possible. Creating a default location helps keep that in mind. When DUAs expire, the CUI is supposed to be deleted. If people have a place to put their "near-raw" data then it will make it easier for the group later to review files before deletion. Some users will may have left -- if they put material in the right locations, then it makes it easier for the PI or power users appointed by the PI as data stewards.

/data/cms/<cdua>/shared/

Why shared? The c57285 group wanted a group-writable location adjacent to their CUI (the "raw" data from CMS) where they could store large amounts of data

(/cms01/data/dua/57285/ is stored on one of our spinning-hard-drive servers, which is cheaper, slower but more capacious. The /users//shared directories I created come from our SSD-equipped server, which is fast but does not have as much space.)

In JADE, /users///shared is stored on our SSD-equipped server.

"Shared" provides a directory under which groups can choose what they want place there. If subdirectories are well organized, we can create whole new file systems if their current one fills up, or if PIs want to pay extra to create a file system on our SSD-equipped server.

We could to create some standard items under shared/, such as /data/cms//shared/ /data/cms//shared/tools/ /data/cms//shared/refdata/ SUBDIRECTORIES ARE REQUIRED IF THE GROUP wants to have different permissions on different types of material !!!!!!