Association Overview

What are Associations?

Associations are basically just lists of things, mostly exposures, that are somehow related. With respect to JWST and the Data Management System (DMS), associations have the following characteristics:

Relationships between multiple exposures are captured in an association.
An association is a means of identifying a set of exposures that belong together and may be dependent upon one another.
The association concept permits exposures to be calibrated, archived, retrieved, and reprocessed as a set rather than as individual objects.
For each association, DMS will generate the most combined and least combined data products.

Associations and JWST

The basic chunk in which science data arrives from the observatory is termed an exposure. An exposure contains the data from a single set of integrations per detector per instrument. In general, it takes many exposures to make up a single observation, and a whole program is made up of a large number of observations.

On first arrival, an exposure is said to be at Level 1b: the only transformation that has occurred is the extraction of the science data from the observatory telemetry into a FITS file. At this point, the science exposures enter the calibration pipeline.

The pipeline consists of three stages: Stage 1, Stage 2, and Stage 3 processing. Stage 2 processing is the calibration necessary to remove instrumental effects from the data. The resulting files contain flux-calibrated and spatially calibrated data, called Stage 2b data. At this point the information is still organized as individual exposures.

Note

Older documentation and code may refer to the stages as levels. They are synonymous.

To be truly useful, the exposures need to be combined and, in the case of multi-object spectrometry, separated into source-oriented data. This type of calibration is called Stage 3 processing. Due to the nature of the individual instruments, observing modes, and the interruptibility of the observatory itself, grouping the right exposures together is not straightforward.

Enter the Association Generator. Given a set of exposures, called the Association Pool, and a set of rules found in an Association Registry, the generator groups the exposures into individual associations. These associations are then used as input to the Stage 3 calibration steps to perform the transformation from exposure-based data to source-based, high(er) signal-to-noise data.
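The pool-plus-rules idea can be illustrated with a toy sketch. This is not the actual generator and it does not use the real rule classes: the pool columns (filename, target, filter), the same_target_and_filter rule, and the toy_generate function are all hypothetical names chosen only for illustration. The sketch groups a small pool of exposures so that each group becomes one association-like dict.

```python
from collections import defaultdict

# Hypothetical pool: one row per exposure. Column names are illustrative only.
pool = [
    {"filename": "exp_a.fits", "target": "1", "filter": "F200W"},
    {"filename": "exp_b.fits", "target": "1", "filter": "F200W"},
    {"filename": "exp_c.fits", "target": "2", "filter": "F200W"},
]


def same_target_and_filter(row):
    """Hypothetical rule: exposures sharing target and filter belong together."""
    return (row["target"], row["filter"])


def toy_generate(pool, rule):
    """Group pool rows by the key a rule returns; each group is one association."""
    groups = defaultdict(list)
    for row in pool:
        groups[rule(row)].append(row["filename"])
    return [
        {"rule": rule.__name__, "key": key, "members": members}
        for key, members in groups.items()
    ]


associations = toy_generate(pool, same_target_and_filter)
for asn in associations:
    print(asn["key"], asn["members"])
```

The real rules are considerably more involved, since they must account for instruments, observing modes, and interrupted observations, but the input and output have the same shape: a pool of exposures in, a set of associations out.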

In short, Stage 2 and Stage 3 associations are created by running the asn_generate task on an AssociationPool, using the default Stage 2 and Stage 3 association rules to produce the respective associations. When retrieving data from the archive, users will find the list of associated data in JSON files that are delivered together with the requested Stage 2 or Stage 3 data.

Association Pools

Which data will be associated is determined from information derived from the Astronomer's Proposal Tool (APT), together with the rules, defined by the instrument teams, on how data should be associated. All the information from a single proposal is captured in a single file known as the Association Pool.

Usage

Users should not need to run the generator. Instead, users are expected to edit an existing association that accompanies their JWST data.

Care should be taken when editing an association file. Association files must be in valid JSON format, and we encourage use of a JSON linter, for example jsonlint, to validate the file. Keep in mind that, in general, all input files listed in an association file are assumed to be in the same directory as the association file, and no path information is included in expname.
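As a rough alternative to an external linter, Python's standard json module can perform the same validity check, because its parser is strict and rejects trailing commas. The sketch below is a minimal example, not part of the pipeline: check_association_text is a hypothetical helper name, and the sample file names are made up. It reports a JSON syntax error if there is one, and otherwise flags any expname that carries a directory component.

```python
import json
from pathlib import PurePath


def check_association_text(text):
    """Return a list of problems found in the text of an association file."""
    try:
        asn = json.loads(text)  # strict parser: trailing commas are errors
    except json.JSONDecodeError as err:
        return [f"invalid JSON at line {err.lineno}, column {err.colno}: {err.msg}"]

    if not isinstance(asn, dict):
        return ["top-level JSON value is not an object"]

    problems = []
    for product in asn.get("products", []):
        for member in product.get("members", []):
            expname = member.get("expname", "")
            # A bare file name has no directory component.
            if PurePath(expname).name != expname:
                problems.append(f"expname contains path information: {expname}")
    return problems


# Hypothetical file contents, for illustration only.
good = '{"products": [{"name": "p", "members": [{"expname": "a_cal.fits", "exptype": "science"}]}]}'
bad_comma = '{"products": [{"name": "p", "members": [],}]}'
with_path = '{"products": [{"name": "p", "members": [{"expname": "data/a_cal.fits", "exptype": "science"}]}]}'

for label, text in [("good", good), ("bad_comma", bad_comma), ("with_path", with_path)]:
    print(label, check_association_text(text))
```

For a file on disk, the same function can be applied to the result of reading the file as text. The check covers only JSON syntax and the expname convention described above; it does not validate the file against the association schema.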

Note

In previous versions of the pipeline, association files with additional trailing commas that rendered them invalid according to the JSON specification were quietly accepted. This behavior has been deprecated and will be removed in a future release; please ensure that trailing commas are removed from hand-edited association files.

If need be, an association can be created based on the existing Stage 2 or Stage 3 examples, or using the command line tool asn_from_list. If, however, the user does need to run the generator, the Association Generator description and asn_generate documentation will be helpful.
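When working from an existing example, the essential structure to reproduce is a list of products, each with a name and a list of members, where each member has an expname and an exptype. The following sketch builds such a structure by hand using only the standard library. It is an illustration under stated assumptions rather than a replacement for asn_from_list: build_stage3_asn is a hypothetical helper, the product and file names are made up, and only a subset of the top-level fields found in real association files is shown, so the result should be compared against an existing example before use.

```python
import json


def build_stage3_asn(product_name, expnames, asn_type="image3"):
    """Build a minimal Stage 3-style association dict with a single product."""
    return {
        "asn_type": asn_type,
        "products": [
            {
                "name": product_name,
                "members": [
                    {"expname": name, "exptype": "science"} for name in expnames
                ],
            }
        ],
    }


# Hypothetical product and exposure names.
asn = build_stage3_asn("my_combined_image", ["exp1_cal.fits", "exp2_cal.fits"])

# json.dumps always emits strictly valid JSON (no trailing commas).
asn_text = json.dumps(asn, indent=4)
print(asn_text)
```

Serializing through the json module, instead of typing the file by hand, avoids the formatting mistakes that hand-edited association files are prone to.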

Once an association is in hand, one can pass it as input to a pipeline routine. For example:

% strun calwebb_image3 jw12345-o001_20210311t170002_image3_001_asn.json

Programmatically, to read in an Association, one uses the load_asn() function:

from jwst.associations import load_asn

with open('jw12345-o001_20210311t170002_image3_001_asn.json') as fp:
    asn = load_asn(fp)

What exactly is returned depends on what the association is. However, for all Stage 2 and Stage 3 associations, a Python dict is returned and its structure matches that of the JSON file. Continuing from the above example, the following shows how to access the first exposure file name of a Stage 3 association:

exposure = asn['products'][0]['members'][0]['expname']
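Because the association behaves as a dict, ordinary Python iteration is enough to walk every member of every product. The sketch below uses a hand-written dict as a stand-in for the value returned by load_asn() so that it runs without any data files; the file names and the members_by_exptype helper are hypothetical.

```python
from collections import defaultdict

# Stand-in for the dict returned by load_asn(); file names are hypothetical.
asn = {
    "asn_type": "image3",
    "products": [
        {
            "name": "combined",
            "members": [
                {"expname": "exp1_cal.fits", "exptype": "science"},
                {"expname": "exp2_cal.fits", "exptype": "science"},
                {"expname": "exp3_cal.fits", "exptype": "background"},
            ],
        }
    ],
}


def members_by_exptype(asn):
    """Collect the expname of every member, grouped by its exptype."""
    grouped = defaultdict(list)
    for product in asn["products"]:
        for member in product["members"]:
            grouped[member["exptype"]].append(member["expname"])
    return dict(grouped)


for exptype, names in members_by_exptype(asn).items():
    print(exptype, names)
```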

Since most JWST data are some form of JWST Data Model, an association can also be opened with open(), which returns a ModelContainer. All members of the association that can be represented as a JwstDataModel will be available in the ModelContainer as their respective DataModels.

from stdatamodels.jwst.datamodels import open as dm_open
container_model = dm_open('jw12345-o001_20210311t170002_image3_001_asn.json')

Utilities

There are a number of utilities to create user-specific associations that are documented under Association Commands.