Association Overview
What are Associations?
Associations are basically just lists of things, mostly exposures, that are somehow related. With respect to JWST and the Data Management System (DMS), associations have the following characteristics:
Relationships between multiple exposures are captured in an association.
An association is a means of identifying a set of exposures that belong together and may be dependent upon one another.
The association concept permits exposures to be calibrated, archived, retrieved, and reprocessed as a set rather than as individual objects.
For each association, DMS will generate the most combined and least combined data products.
Associations and JWST
The basic chunk in which science data arrives from the observatory is termed an exposure. An exposure contains the data from a single set of integrations per detector per instrument. In general, it takes many exposures to make up a single observation, and a whole program is made up of a large number of observations.
On first arrival, an exposure is termed to be at Level1b: The only transformation that has occurred is the extraction of the science data from the observatory telemetry into a FITS file. At this point, the science exposures enter the calibration pipeline.
The pipeline consists of three stages: Stage 1, Stage 2, and Stage 3 processing. Stage 2 processing is the calibration necessary to remove instrumental effects from the data. The resulting files contain flux and spatially calibrated data, called Stage 2b data. The information is still in individual exposures.
Note
Older documentation and code may refer to the stages as levels. They are synonymous.
To be truly useful, the exposures need to be combined and, in the case of multi-object spectrometry, separated, into data that is source-oriented. This type of calibration is called Stage 3 processing. Due to the nature of the individual instruments, observing modes, and the interruptibility of the observatory itself, how to group the right exposures together is not straight-forward.
Enter the Association Generator. Given a set of exposures, called the Association Pool, and a set of rules found in an Association Registry, the generator groups the exposures into individual associations. These associations are then used as input to the Stage 3 calibration steps to perform the transformation from exposure-based data to source-based, high(er) signal-to-noise data.
In short, Stage 2 and Stage 3 associations are created running the
asn_generate task on an AssociationPool
using the default Level2 and Level
3 association rules to produce Stage 2 and Stage 3
associations. When retrieving the data from the archive, users will find the
list of associated data in JSON files that are submitted together with the
requested Stage 2 or Stage 3 data.
Association Pools
The information about what data will be associated is constructed with the information derived from the Astronomer Proposal Tool and the rules on how data should be associated that are defined by the instrument teams. All the information from a single proposal is captured in a single file known as the Association Pool.
Usage
Users should not need to run the generator. Instead, it is expected that one edits an already existing association that accompanies the user’s JWST data.
Care should be taken if editing an association file. Keep in mind that generally
all input files listed in the association files are assumed to be in the same
directory as the association file and no path information is put in expname
.
If need be, an association can be created based on the existing Stage 2 or Stage 3 examples, or using the command line tool asn_from_list. If, however, the user does need to run the generator, the Association Generator description and asn_generate documentation will be helpful.
Once an association is in-hand, one can pass it as input to a pipeline routine. For example:
% strun calwebb_image3 jw12345-o001_20210311t170002_image3_001_asn.json
Programmatically, to read in an Association, one uses the
load_asn()
function:
from jwst.associations import load_asn
with open('jw12345-o001_20210311t170002_image3_001_asn.json') as fp:
asn = load_asn(fp)
What exactly is returned depends on what the association is. However,
for all Stage 2 and Stage 3 associations, a Python dict
is returned,
whose structure matches that of the JSON or YAML file. Continuing
from the above example, the following shows how to access the first
exposure file name of a Stage 3 associations:
exposure = asn['products'][0]['members'][0]['expname']
Since most JWST data are some form of a
JWST Data Model
an association can be opened with
datamodels.open
which returns a
ModelContainer
. All members of the association that can
be represented as a DataModel
, will be available in the ModelContainer
as their respective DataModels.
from stdatamodels.jwst.datamodels import open as dm_open
container_model = dm_open('jw12345-o001_20210311t170002_image3_001_asn.json')
Utilities
There are a number of utilities to create user-specific associations that are documented under Association Commands.