Access to data

Data processing has been moved to the S3DF. This includes access to the FFB and processing from the FFB; see: Running at S3DF.

The FFB batch system has been shut down.

The FFB system is designed to provide dedicated analysis capabilities during the experiment.

The FFB currently offers the fastest file system (WekaIO on NVMe disks via IB HDR100) of all LCLS storage systems; however, its size is only about 900 TB.
The raw data will be kept on the FFB for a week after an experiment ends; however, for data-intensive experiments, files might be purged even before an experiment ends.
Files deleted from the FFB will be available only on one of the offline systems (S3DF or NERSC).
The raw data are copied to the offline storage system and to tape immediately, i.e. in quasi real time during the experiment, not after they have been deleted from the FFB.
~~The user-generated data created in the scratch/ folder are moved to the offline storage when the experiment is deleted from the FFB.~~
When running on the FFB, the xtc/ and scratch/ folders (below /cds/data/drpsrcf/...) should be used for reading and writing. The Lustre ana-filesystems must not be used (the only exception is calib/, see below).
~~The LCLS JupyterHub allows users to start notebooks on the psffb nodes, which will have access to the data and the FFB scratch folder of an experiment.~~

You can access the FFB system from pslogin, psdev or psnx with:

% ssh psffb

The experiment data will be available under:

/cds/data/drpsrcf/<instrument>/<experiment>

There are two options for telling psana which directory the data is in. One can add the "dir=" keyword to the psana DataSource, like this:

dsource = DataSource('exp=cxilu9218:run=20:smd:dir=/cds/data/drpsrcf/cxi/cxilu9218/xtc')

or one can set the following environment variable:

export SIT_PSDM_DATA=/cds/data/drpsrcf

Besides the xtc/ folder for the raw data, the scratch/ folder allows users to write their processing output. This folder will be moved to the offline filesystem after an experiment is done. The calib/ folder is a link to the offline calib folder. A ":live" is often added to the DataSource string in order to process xtc files while they are being written (currently this only works for LCLS1-style DAQ/analysis).
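The two options above can be sketched in Python as follows. This is a minimal sketch, not part of psana: the helper names `ffb_datasource_string` and `ffb_environment` are hypothetical, and the instrument is passed explicitly rather than derived from the experiment name. The resulting string is what would be handed to the psana DataSource.

```python
FFB_ROOT = "/cds/data/drpsrcf"  # FFB data root as given on this page


def ffb_datasource_string(instrument, experiment, run, live=False):
    """Build a psana DataSource string that reads xtc files from the FFB.

    Hypothetical helper: it only assembles the string shown in the example above.
    """
    xtc_dir = f"{FFB_ROOT}/{instrument}/{experiment}/xtc"
    parts = [f"exp={experiment}", f"run={run}", "smd", f"dir={xtc_dir}"]
    if live:
        # ":live" processes xtc files while they are still being written
        # (LCLS1-style DAQ/analysis only, as noted in the text).
        parts.append("live")
    return ":".join(parts)


def ffb_environment(base_env):
    """Return a copy of base_env with SIT_PSDM_DATA pointing at the FFB root.

    Hypothetical helper: the Python equivalent of the `export` line above.
    """
    env = dict(base_env)
    env["SIT_PSDM_DATA"] = FFB_ROOT
    return env


print(ffb_datasource_string("cxi", "cxilu9218", 20))
print(ffb_datasource_string("cxi", "cxilu9218", 20, live=True))
```

Building the string in one place keeps the FFB path out of individual analysis scripts, so switching back to the offline filesystem only requires changing the root.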

FFB SLURM partitions

The FFB SLURM partitions have been removed.

Directories and Lifetime of data on the FFB

xtc folder

xtc files are immediately copied to the offline filesystem.
The lifetime on the FFB is dictated by how much data is generated:
- Typically, files stay on the FFB for a week after an experiment ends.
- However, if space is needed, the oldest files will be purged from the FFB even before an experiment has finished.
- After an experiment is done, the FFB should not be used anymore unless this has been discussed with the POC.
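Because older files can be purged from the FFB at any time, an analysis script may want to fall back to the offline copy. The sketch below is one possible way to do that. It rests on two assumptions that are not stated on this page: that the offline xtc folder is /cds/data/psdm/<instr>/<expt>/xtc, and that xtc file names contain the run number as `-r0020-`. The function name `pick_xtc_dir` is hypothetical.

```python
from pathlib import Path


def pick_xtc_dir(instrument, experiment, run,
                 ffb_root="/cds/data/drpsrcf",
                 offline_root="/cds/data/psdm"):
    """Return the FFB xtc directory if it still holds files for this run,
    otherwise the offline xtc directory."""
    ffb_xtc = Path(ffb_root) / instrument / experiment / "xtc"
    # Assumed offline layout, mirroring the FFB layout.
    offline_xtc = Path(offline_root) / instrument / experiment / "xtc"
    # Assumed file naming: the run number appears as "-rNNNN-" in the name.
    pattern = f"*-r{run:04d}-*"
    if ffb_xtc.is_dir() and any(ffb_xtc.glob(pattern)):
        return ffb_xtc
    return offline_xtc
```

The check is per run rather than per directory because the purge removes the oldest files first, so a non-empty xtc/ folder does not guarantee that a given run is still present.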

scratch folder

From run 21 on no scratch folders are created on the FFB. The documentation below is only valid for old experiments.

Once an experiment has been completed, the FFB scratch folder is moved to the experiment's scratch folder on the offline filesystems. The following rules are applied:

- The scratch folder is made inaccessible to users.
- Files and directories below the FFB scratch/ are moved to scratch/ffb/ on the offline filesystem:
  /cds/data/psdm/<instr>/<expt>/scratch/ffb/
  except for hdf5 files in the smalldata folder (see next).
- Once the data are on the offline scratch, the Data Retention Policy applies. The transfer preserves the files' mtime, which is used by the cleanup.
- hdf5 files below scratch/hdf5/smalldata/ are moved to the hdf5/smalldata/ folder on the offline filesystem, e.g.
  /cds/data/drpsrcf/mfx/mfx123456/scratch/hdf5/smalldata/*.h5 -> /cds/data/psdm/mfx/mfx123456/hdf5/smalldata/
  1. Only h5 files (and directories) below scratch/hdf5/smalldata/ are copied to the hdf5/smalldata/ folder on the offline storage.
  2. Files that don't match rule 1 will be moved to the offline scratch/ffb/hdf5 folder.
  3. If an h5 file already exists on the offline storage and is newer than the one on the FFB, the FFB file will not be copied but just removed.
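The move rules above can be sketched as a small path-mapping function. This is an illustration of the rules as written, not the actual transfer tool: the names `offline_destination` and `should_copy` are hypothetical, input paths are relative to the FFB scratch/ folder, and output paths are relative to the offline experiment folder.

```python
from pathlib import PurePosixPath

SMALLDATA = PurePosixPath("hdf5/smalldata")


def offline_destination(rel_path):
    """Map a path below the FFB scratch/ folder to its offline location."""
    rel = PurePosixPath(rel_path)
    below_smalldata = SMALLDATA in rel.parents
    if below_smalldata and rel.suffix == ".h5":
        # Rule 1: h5 files below scratch/hdf5/smalldata/ go to hdf5/smalldata/.
        return SMALLDATA / rel.relative_to(SMALLDATA)
    # Everything else, including non-h5 files below hdf5/ (rule 2),
    # ends up below scratch/ffb/.
    return PurePosixPath("scratch/ffb") / rel


def should_copy(ffb_mtime, offline_mtime):
    """Rule 3: skip the copy if a newer file already exists offline.

    offline_mtime is None when no offline file exists.
    """
    return offline_mtime is None or offline_mtime <= ffb_mtime


print(offline_destination("hdf5/smalldata/run20.h5"))
print(offline_destination("hdf5/smalldata/notes.txt"))
print(offline_destination("results/peaks.npy"))
```

Treating an offline file with an equal mtime as "not newer" is a choice made for this sketch; the text above only specifies the strictly newer case.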

FFB File Permissions

The same permissions, based on ACLs, that are used for the Lustre analysis file systems are used for the FFB. However, there is an issue with the current version of the file system:

The umask is applied when creating files and directories, which violates the ACL specs. As the default umask is 022, the group write permission will be removed. We recommend setting your umask to:

% umask 0002
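The effect of the two umask values can be checked with a short calculation. The sketch below uses only the Python standard library; the helper names `mode_after_umask` and `create_with_umask` are hypothetical, and 0o666 is the mode most tools request when creating a regular file.

```python
import os
import stat


def mode_after_umask(requested_mode, umask):
    """Permission bits that remain after the umask clears its bits."""
    return requested_mode & ~umask


def create_with_umask(path, umask, requested_mode=0o666):
    """Create an empty file under the given umask and return its permission bits."""
    previous = os.umask(umask)
    try:
        fd = os.open(path, os.O_CREAT | os.O_WRONLY | os.O_EXCL, requested_mode)
        os.close(fd)
    finally:
        os.umask(previous)  # restore the caller's umask
    return stat.S_IMODE(os.stat(path).st_mode)


print(oct(mode_after_umask(0o666, 0o022)))  # 0o644: group write removed
print(oct(mode_after_umask(0o666, 0o002)))  # 0o664: group write kept
```

A processing script that writes shared output could call `os.umask(0o002)` once at startup instead of relying on every user's shell settings.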

FFB setup

The following figure shows the connectivity of the nodes:

Each FFB file server (16 of them) has a 100 Gb/s IB connection and a 100 Gb/s Ethernet connection.
Each batch node has a 100 Gb/s IB connection.
Batch nodes have a 100 Gb/s Ethernet connection.
The figure also shows which network is used for the different file paths.
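As a rough worked example, the link speeds above give a nominal upper bound on the aggregate IB bandwidth of the file servers. This is plain arithmetic on the numbers listed here and ignores protocol overhead and disk limits; the function name `aggregate_bandwidth` is hypothetical.

```python
def aggregate_bandwidth(n_links, gbit_per_link):
    """Return the nominal aggregate bandwidth as (Gb/s, GB/s)."""
    total_gbit = n_links * gbit_per_link
    total_gbyte = total_gbit / 8  # 8 bits per byte
    return total_gbit, total_gbyte


# 16 file servers, each with one 100 Gb/s IB link
total_gbit, total_gbyte = aggregate_bandwidth(16, 100)
print(f"{total_gbit} Gb/s = {total_gbyte} GB/s")
```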
