Transitions#

Psifr has a core set of tools for analyzing transitions in free recall data. These tools focus on measuring what transitions actually occurred, and which transitions were possible given the order in which participants recalled items.

Actual and possible transitions#

Calculating a conditional response probability involves two parts: the frequency at which a given event actually occurred in the data and frequency at which a given event could have occurred. The frequency of possible events is calculated conditional on the recalls that have been made leading up to each transition. For example, a transition between item \(i\) and item \(j\) is not considered “possible” in a CRP analysis if item \(i\) was never recalled. The transition is also not considered “possible” if, when item \(i\) is recalled, item \(j\) has already been recalled previously.

Repeated recall events are typically excluded from the counts of both actual and possible transition events. That is, the transition event frequencies are conditional on the transition not being either to or from a repeated item.

Calculating a CRP measure involves tallying how many transitions of a given type were made during a free recall test. For example, one common measure is the serial position lag between items. For a list of length \(N\), possible lags are in the range \([-N+1, N-1]\). Because repeats are excluded, a lag of zero is never possible. The count of actual and possible transitions for each lag is calculated first, and then the CRP for each lag is calculated as the actual count divided by the possible count.

The transitions masker#

The transitions_masker() is a generator that makes it simple to iterate over transitions while “masking” out events such as intrusions of items not on the list and repeats of items that have already been recalled.

On each step of the iterator, the output position and the previous, current, and possible items are yielded. The output position is the position of the transition in the recall sequence, starting from one, including all repeats and intrusions. The previous item is the item being transitioned from. The current item is the item being transitioned to. The possible items includes an array of all items that were valid to be recalled next, given the recall sequence up to that point (not including the current item).

In [1]: from psifr.maskers import transitions_masker

In [2]: pool = [1, 2, 3, 4, 5, 6]

In [3]: recs = [6, 2, 3, 6, 1, 4]

In [4]: masker = transitions_masker(
   ...:     pool_items=pool, recall_items=recs, pool_output=pool, recall_output=recs
   ...: )
   ...: 

In [5]: for op, prev, curr, poss in masker:
   ...:    print(op, prev, curr, poss)
   ...: 
1 6 2 [1 2 3 4 5]
2 2 3 [1 3 4 5]
5 1 4 [4 5]

Only valid transitions are yielded, so the code for a specific analysis only needs to calculate the transition measure of interest and count the number of actual and possible transitions in each bin of interest.

Four inputs are required:

pool_items

List of identifiers for all items available for recall. Identifiers can be anything that is unique to each item in the list (e.g., serial position, a string representation of the item, an index in the stimulus pool).

recall_items

List of identifiers for the sequence of recalls, in order. Valid recalls must match an item in pool_items. Other items are considered intrusions.

pool_output

Output codes for each item in the pool. This should be whatever you need to calculate your transition measure.

recall_output

Output codes for each recall in the sequence of recalls.

By using different values for these four inputs and defining different transition measures, a wide range of analyses can be implemented. All conditional response probability and rank analyses are implemented using the same core of the transitions masker.

The sequences masker#

Some analyses, such as lag_crp_compound() and distance_rank_shifted(), require examining longer sequences rather than individual one-step transitions. These analyses are implemented using the sequences_masker(), which allows analysis code to iterate over sequences of a specified length. Only contiguous sequences are yielded; repeats or intrusions will interrupt the sequence and start it over at the next recall.

Any other conditions, applied using the test input, must also apply to every transition in the sequence for that sequence to be yielded by the masker. This makes it simple to, for example, run an analysis that only examines sequences of within-category transitions. See the user guide for an example of conditionalizing a sequence analysis.

Similarly to the transitions_masker(), the sequences masker will yield the output position, previous item, current item, and possible items for each transition. Here, however, there is a list of values corresponding to positions within the sequence. For example, curr[-1] is the “current” item for the most recent transition, and prev[-2] is the “previous” item for the prior transition.

In [6]: from psifr.maskers import sequences_masker

In [7]: pool = [1, 2, 3, 4, 5, 6]

In [8]: recs = [6, 2, 3, 6, 1, 4, 5]

In [9]: masker = sequences_masker(
   ...:     2, pool_items=pool, recall_items=recs, pool_output=pool, recall_output=recs
   ...: )
   ...: 

In [10]: for output, prev, curr, poss in masker:
   ....:     print(output, prev, curr, poss)
   ....: 
[1, 2] [6, 2] [2, 3] [array([1, 2, 3, 4, 5]), array([1, 3, 4, 5])]
[5, 6] [1, 4] [4, 5] [array([4, 5]), array([5])]

From these outputs, it is then relatively simple to do things like calculate response probabilities conditionalized on prior transitions (like in the compound lag-CRP analysis) or measure distances to recalls before the just-recalled item (like in the shifted distance rank analysis).