Department of Psychology, University of Zurich;
Peter Shepherdson
Faculty of Psychology, University of Akureyri
Acknowledgement: We thank Atalia Adank and Dawid Strzelczyk for helping with data collection. The data and the analysis scripts can be accessed on the Open Science Framework (
Working memory (WM) is a system comprising a set of mechanisms used to hold selected mental representations for further processing, and one of its core characteristics is its limited capacity (
In contrast to WM’s limited capacity, long-term memory (LTM) is a system with potentially unlimited capacity, and is used for permanently storing, managing, and retrieving information (
A close relation between WM and LTM is also well supported empirically. For instance, measures of WM capacity are highly correlated with the ability to remember over the long term (
Within conceptualizations of LTM, episodic LTM refers to remembering the personal experience of past events. This can include, for example, experiences that individuals have been exposed to in a laboratory setting, including memory for lists of words or pictures presented to them. Semantic LTM refers to our knowledge of facts, including the meaning of words, and representations of well-known concepts. In laboratory tasks, the contents of WM frequently share particular characteristics (e.g., all stimuli are coloured isosceles triangles pointing in various directions) which influence their representations in WM, and knowledge from semantic LTM is assumed to contribute to performance in most tests of WM, as shown in effects like the sentence superiority benefit, the chunking benefit, and the lexicality effect in WM (
In addition, research has shown that WM recall is influenced by lexical properties of the memoranda, such as their concreteness (
Researchers have also suggested that episodic LTM contributes to performance on many WM tests (
Earlier research by
Another way of testing the causal role that episodic LTM plays in performance on WM tasks is to familiarize participants with a subset of the memoranda prior to a separate WM test, thereby making some of the information to-be-held in WM available in LTM as well. For instance,
The question we address here concerns how far active maintenance in WM can be replaced by the reliance on information stored in LTM. A recent study investigated how much information is actively maintained in WM when LTM representations for parts of the to-be-remembered items are available (
In the present study, we aimed to investigate whether the presence and use of LTM representations frees capacity for maintaining additional information in WM. In the first phase of each experiment, we presented participants with word pairs for them to encode into LTM (LTM learning phase). Subsequently, they completed trials of a WM task, also involving word pairs. Crucially, the pairs presented in each WM trial consisted of varying numbers of new pairs (LTM unavailable) and the previously learned LTM pairs. Our main goal was to test the hypothesis that performance in the WM task deteriorates less or not at all if an increase in set size is operationalized through the addition of LTM available pairs, relative to the addition of LTM unavailable (new) pairs. In the first experiment we examined the effect that the number of new and LTM pairs in each trial had on participants’ WM test performance. Consistent with the idea that participants can and do outsource workload to LTM to optimize performance, we found that increasing memory load with LTM pairs had a limited effect on memory performance, whereas increasing load with new pairs showed typical set size effects. In the second experiment we replicated the findings of the first experiment and extended the design by orthogonally varying WM and LTM load across the trials of the WM task. This allowed us to identify a boundary condition, whereby the effects of added LTM load can deleteriously affect memory when WM load is sufficiently low, as would be expected if there is a cost to allowing information exchange between WM and LTM. Finally, in our third experiment we investigated how flexibly individuals can draw on previously learned LTM representations, by making information stored in LTM unreliable and thus introducing conditions of proactive interference (PI) to the WM test phase. Our results showed strong PI effects on the LTM pairs only, whereas WM for new pairs was unaffected by manipulations of PI.
In our first experiment, participants viewed pairs of words (e.g., cat–table) and subsequently had to select the correct associate (e.g., table) from three options when given its partner (e.g., cat) as a cue. On some trials, all word pairs were novel, whereas on others some of the pairs had been prelearned at the beginning of the experiment. If people are able to optimize their use of WM resources, the presence of these LTM pairs in a memory set should allow the storage of novel pairs to be prioritized, enhancing performance in these LTM-available trials (e.g., 4 novel pairs + 2 LTM pairs) relative to trials with the same set size composed of only novel pairs (e.g., six novel pairs). By contrast, if participants need to store all pairs in WM, then the addition of extra LTM pairs to a memory set (e.g., 4 novel pairs + 2 LTM pairs) should result in worse performance than in trials with the same number of novel pairs, but without additional LTM pairs (e.g., 4 novel pairs + 0 LTM pairs). In general, immediate relational recognition tasks, as implemented here, require participants to retain bindings between each item (e.g., a word or an object) and another element, such as another item (e.g., pairings of words with other words) or the item’s context (e.g., the serial position in which they were presented). During the test phase, participants are required to distinguish between the original pairings, recombined pairings, and pairs of new items. In past research these binding tasks have been modified to include more types of pairings and stimuli as well as different test formats (
Preregistration documents, incorporating design, data collection, and analysis plans, are available on the Open Science Framework at
Participants
We collected data from 30 participants (Mage = 22.75 years, 20 female). Based on our preregistered criteria, we replaced two participants as their overall response accuracy across all conditions was greater than 2 standard deviations below the overall mean. All participants’ first language was German or Swiss-German, and they reported normal or corrected-to-normal vision. Participants signed an informed consent form prior to the study and were debriefed at the end. The experimental protocol of this and all following experiments was in accordance with the regulations of the Ethics Committee of the Faculty of Arts and Social Sciences at the University of Zurich.
Materials and Procedure
The WM trials of either four or six word-pairs either consisted of new pairs only (LTM unavailable) or comprised two LTM pairs plus two or four new pairs, adding up to the respective Set Sizes 4 and 6. All pairs presented in each WM trial were tested in random order. In trials including LTM pairs, the within-trial lure randomly came from a within-trial LTM pair or a new pair.
Following these first iterations of the LTM learning and the WM task (first megablock), subjects underwent a second LTM learning phase for another 36 word-pairs, followed yet again by a WM task (second megablock), both of which were procedurally identical to the first instances of each. The final part of the study consisted of an LTM test for all the pairs (both new and LTM) used in the experiment. Here, subjects were presented with the same three-alternative forced choice procedure for every pair of the experiment again. The 90-min experiment comprised 72 WM trials in total, with 18 trials per combination of set size and LTM availability, equally distributed across six blocks (i.e., three blocks in each of the two WM phases). Within a block of 12 trials, the three trials per cell of the design occurred in random order.
Data Analysis
WM Task Performance
The data from the WM task were analyzed using Bayesian generalized linear mixed models (BGLMM) implemented in the R package brms (
As a next step of the analyses, we focused specifically on the performance for new pairs (i.e., those not presented in the LTM learning phase), and investigated whether performance in remembering those new pairs varied between trials which included LTM pairs and trials that did not. This analysis was designed to help us distinguish between possible explanations for a main effect of LTM availability in the overall analysis. If a hypothetical performance advantage in the LTM-available condition in that analysis resulted from WM workload being outsourced to LTM, we would expect to find a similar effect here. On the other hand, if an advantage results from generally better memory for LTM pairs (vs. new pairs), no effect should have been present in this analysis. The fixed-effects in this analysis were again set size (4 vs. 6) and LTM availability (yes [two pairs of the set are taken from the LTM pairs] versus no [all items are new pairs]).
Furthermore, we also separately analyzed performance for the LTM pairs, to see whether performance differed when the response options included a lure taken from a new compared to a lure taken from an LTM pair. This analysis helps us to determine whether the presence of LTM lures—which, by definition, can only occur in LTM-available trials—has a noticeable effect on the manipulations of interest assessed in the previously described analysis. The fixed effects were set size (4 vs. 6) and lure type (new [taken from a new pair of that trial] versus LTM [taken from a LTM pair of that trial]).
For regression coefficients, we used moderately informative Cauchy priors with scales between .3 and 2. These scales define a default prior analogous to that proposed by
We used an MCMC algorithm (implemented in Stan;
LTM Task Performance
As with the WM task, we analyzed the data from the LTM task using BGLMM implemented in the R package brms (
WM Task Performance
Second, we sought to distinguish between possible explanations for a main effect of LTM availability. We therefore focused on WM performance for the new pairs only. If the performance advantage in the LTM-available condition in the previous analysis resulted from WM workload being outsourced to LTM, we would expect to find a similar effect here. On the other hand, if the advantage resulted from generally better memory for LTM pairs (vs. new pairs), no effect should be present in this analysis. This analysis, conditionalized on the new pairs only, revealed the same pattern of results as the analysis including both new and LTM pairs: Evidence against an interaction effect (BF01 = 3.65), evidence for a main effect of set size (BF10 = 3019), and inconclusive evidence for a main effect of LTM availability (BF10 = 1.54).
Pairwise comparisons of the Set Size 4 and 4 + 2 condition showed that memory for four new pairs in WM was impervious to the addition of LTM pairs in the 4 + 2 condition, with substantial evidence against a difference (BF01 = 176.26).
Third, we assessed whether the presence of LTM lures - which, by definition, can only occur in LTM-available trials - had a noticeable effect on the manipulations of interest assessed in the previously described analysis. The BGLMM focusing on the LTM pairs showed that the type of lure had no effect on the probability of choosing the correct item (BF01 = 255.10). Furthermore, there was evidence against an effect of set size (BF01 = 71.43), and the interaction (BF01 = 44.07), emphasizing that the probability of remembering LTM items was unaffected by WMload.
LTM Task Performance
Performance in the LTM test for all the pairs presented across the experiment is shown in
In our first experiment we aimed to investigate whether the presence and use of LTM representations frees capacity for maintaining additional information in WM. This was inspired by a recent study (
Furthermore, our results add to the existing literature showing that those LTM representations are easily learned: Previous studies have shown that episodic memories are formed within a few seconds (
In our second experiment we investigated how robust the effect shown in Experiment 1 is, and under what circumstances it holds, by independently varying WM and LTM load. Therefore, we followed the same procedure as in Experiment 1, involving an LTM learning phase followed by WM trials; but this time we varied WM load across two levels (two and four pairs), crossed with three levels of LTM load (zero, two, and four pairs). In doing so, we aimed to assess the extent to which WM performance is impervious to changes in LTM load, and if so whether this is consistent across different levels of WM load.
Participants
We collected data from 27 participants (Mage = 23 years, 19 female). We had to diverge from the preregistered sample size of 30 participants, due to the inability to complete in-person testing resulting from the SARS-CoV-2 outbreak. All participants’ first language was German or Swiss-German, and they reported normal or corrected-to-normal vision. Participants signed an informed consent form prior to the study and were debriefed at the end.
Materials and Procedure
Apart from the changes detailed in the following, Experiment 2 was consistent with the Materials and Procedure used in Experiment 1. Here, the WM trials included all possible factorial combinations of two or four WM pairs, and zero, two, or four LTM pairs, resulting in overall set sizes of two (LTM0+WM2), 4 (LTM0+WM4, LTM2+WM2), 6 (LTM2+WM4, LTM4+WM2), or eight (LTM4+WM4) pairs per trial. The detailed design can be seen in
Data Analysis
WM Task Performance
The data from the WM task were analyzed using BGLMM implemented in the R package brms (
As a next step of the analyses, we again focused specifically on performance for new pairs, and investigated whether memory for those new pairs varied between trials that included LTM pairs and trials that did not. The fixed effects were again WM load (i.e., two or four) and LTM load (zero, two, or four). As described previously, this analysis helps us to distinguish between possible explanations for a main effect of LTM load in the overall analysis (see Experiment 1’s Data Analysis subsection).
Furthermore, to replicate the finding of Experiment 1 that lure type had no effect on WM performance, we focused on memory for the LTM pairs: In doing so, we can investigate whether performance differs when the response option includes a lure taken from a new compared to a lure taken from a LTM pair. The fixed-effects here were WM load (two vs. four), LTM load (two vs. four) and lure type (new [taken from a new pair of that trial] versus LTM [taken from a LTM pair of that trial]). This analysis helps us determine whether the presence of LTM lures—which, by definition, can only occur in LTM-available trials—has a noticeable effect on the manipulations of interest assessed in the previously described analysis.
As for Experiment 1, we calculated BFs to estimate the strength of evidence for the null or the alternative hypothesis, by comparing the evidence for a model including a factor of interest against a model omitting the effect.
LTM Task Performance
Equivalently to the WM task, and to Experiment 1, the data of the LTM task were analyzed using BGLMMs implemented in the R package brms (
WM Task Performance
The analysis further revealed evidence of better performance when the trial included 2 rather than 4 new pairs (main effect of WM load: BF10 = 9.28 × 10
Again, our second analysis was aimed at distinguishing between possible explanations for a main effect of LTM load. We therefore focused specifically on correct recall of new pairs. If the performance advantage in the LTM-available conditions (LTM Load 2 or 4) relative to matched set-size trials without LTM pairs in the previous analysis resulted from WM workload being outsourced to LTM, we would expect to find a similar effect here. On the other hand, if the advantage resulted from generally better memory for LTM pairs (vs. new pairs), no effect should be present in this analysis. Consistent with the former idea we found the same pattern of results as the analysis including both new and LTM pairs: Evidence for an interaction effect (BF01 = 1.52 × 10
Again, pairwise comparisons of the interaction effect revealed that memory performance for new pairs was better in LTM-available conditions relative to matched set-size trials without LTM pairs (WM load 4 vs. WM Load 2 + LTM Load 2; BF10 = 4.63).
The third question was whether the presence of LTM lures—which, by definition, can only occur in LTM-available (LTM load 2, and 4) trials—has a noticeable effect on the manipulations of interest assessed in the previously-described analysis. The BGLMM focusing on the LTM pairs revealed evidence against an effect of lure type (BF10 = 2.83 × 10
LTM Task Performance
Performance in the LTM test for all the pairs presented across the experiment is shown in
The results from Experiment 2 were consistent with the main finding from Experiment 1: Where possible, people will make use of LTM representations to more efficiently allocate WM capacity to novel information. This was evident in the fact that WM cued recall performance was superior for conditions in which some of the to-be-remembered pairs had previously been learned in the initial phase of the experiment (e.g., 2 new pairs + 2 LTM pairs), relative to conditions where the same number of novel pairs had to be remembered (e.g., 4 new pairs). Notably, when LTM pairs were available, recall of the new pairs was better, suggesting that this effect does not simply reflect a performance boost on the prelearned pairs, but instead involves an effective reallocation of WM resources.
One novel finding that our fully crossed design revealed was that the addition of LTM pairs did have a negative effect on performance for trials with only two new pairs. Specifically, WM recall was worse in the condition with two new and two LTM pairs than in the condition with two new pairs only. By contrast, in trials with four new pairs, adding LTM load had no deleterious effects—and in fact, even seems to have improved performance in one instance (i.e., correct recall was more frequent with four new and two LTM pairs than with four new pairs only). What might explain such a pattern of results? We suspect that this might have occurred due to a “gating” of access to WM (e.g.,
Taken together, results of Experiment 2 add to the evidence of the first experiment suggesting that individuals can outsource workload to LTM to optimize performance, but also suggest a WM system with a gate to LTM that negotiates the exchange of information between WM and LTM depending on the current cognitive need. We further investigated the flexibility of this hypothetical gate, and the extent to which it can be operated strategically, in the following experiment.
In our first two experiments, information learned in the initial phase was completely reliable, such that participants could safely neglect to encode it during the WM phase, and rely instead on LTM. However, in real-world contexts information stored in LTM may in fact conflict with the demands of tasks requiring WM. For instance, consider a teacher trying to learn students’ names on the first day of a new school year. The teacher manages to remember who is sitting in which seat, but after an activity the students sit down again in different positions. Even after asking a student for their name again, if the teacher then tries to use their seating position to remember it, relying on LTM will hinder, rather than help, their ability to do so. Essentially, this is a result of PI from LTM (the old student-position pairing) in a WM task (trying to remember the updated student-position pairing). In Experiment 3 we modified our existing method to incorporate this possibility, such that some LTM information was unreliable for the purposes of the WM task. In doing so, we aimed to assess how changing the utility of LTM pairs affected participants’ tendency to outsource WM workload. As in the previous experiments, we first had participants learn arbitrary word pairs, and then had them complete a WM task that also incorporated word pairs. In some WM trials, all the pairs were novel, whereas in others, words from the learning phase were included in the memory sets. However, in this instance not all of these LTM pairs were consistent with those presented in the learning phase: Some were one LTM word paired with a novel associate, others were two words from the learning phase that had previously been paired with different associates, and the remainder retained their original pairings. Further, the original associates of the recombined pairs were always presented as lures during the WM test. Thus, here we could (a) compare memory for intact versus recombined pairs, to assess the influence of PI on performance; and (b) compare memory for trials with versus without LTM pairs, to assess the influence of unreliable LTM information on participants’ tendency to outsource load from WM.
Three patterns of results seemed plausible to us (see
Preregistration documents, incorporating design, data collection, and analysis plans, are available on the Open Science Framework at
Participants
We collected data from 99 participants online via Prolific (Mage = 26.28 years, 40 female). Based on our preregistered exclusion criteria, we excluded 16 participants for spending less than 10 s reading the instructions, and another four participants as their overall response accuracy across all conditions was greater than 2 standard deviations below the overall mean. Therefore, the final sample included 79 participants. As preregistered, we also excluded data from any trials with response times < 500 ms, as participants are unlikely to be able to read the cue word and identify the appropriate response in such a short duration. All participants’ first language was German or Swiss-German. Participants gave informed consent prior to the study and were debriefed at the end.
Materials and Procedure
The study consisted of two phases: an LTM learning phase in which subjects were presented with 24 pairs of concrete words presented sequentially, for 4 s per pair, and a subsequent WM task phase with sequential encoding of either two, four, or six word-pairs. The WM task followed the procedure from Experiments 1 and 2 apart from the following changes: participants were to choose one out of four response options: the target (previously paired with the cue), another item which was paired with a different word in the same trial (within trial intrusion probe), a LTM lure (in case of PI condition: the word, which was originally paired with the cue, in case of no PI: a word from another LTM pair) or a new item. The WM trials either consisted of new pairs only (LTM Load 0, WM Load 2 or 4), or comprised two or four new pairs plus the two LTM pairs (LTM Load 2), adding up to the respective Set Sizes 2, 4, and 6. Further, in 2/3 of the trials with LTM load of two, interference was introduced in two different ways (see
Investigations of PI have traditionally used a paired-associate learning procedure, in which interference is created by holding cues constant, with the responses being changed between two lists (A–B, A–D). Performance in this interference condition is compared with that in a control condition for which both cues and responses are changed between lists (A–B, C–D). We followed this implementation of PI and added the implementation of a new associate to investigate the influence of only the binding and the cue being stored in LTM (compared to when both words and the binding are in LTM). The detailed design can be seen in
Data Analysis
The data from the WM task were analyzed using BGLMM implemented in the R package brms (
As a next step of the analyses, we again focused on performance for new pairs only and investigated whether memory for those new pairs varied between trials which included intact or rearranged/new associated LTM pairs and trials that did not. The fixed effects were again WM load (i.e., zero, two, or four) by condition (LTM Load 0 vs. LTM Load 2/intact vs. LTM Load 2/PI). This analysis helped us to distinguish between possible explanations for a main effect of condition in the overall analysis. If a hypothetical performance advantage in the LTM Load 2/intact trials in that analysis results from WM workload being outsourced to LTM, we would expect to find a similar effect here. On the other hand, if an advantage results from generally better memory for LTM pairs (vs. new pairs), no effect should be present in this analysis.
Furthermore, to investigate whether PI effects differed between the two PI manipulations (rearranged vs. new associated pairs), we computed planned pairwise comparisons between the two conditions for both LTM pairs as well as new pairs only. This analysis helped us determine whether the way we manipulate PI (if only the binding and the cue is stored in LTM compared to when both words and the binding are in LTM) determines the size of the effect.
Results were most consistent with the second of these options. Performance in trials with a greater WM load (i.e., WM Load 4) was subject to effects of PI (WM Load × Condition interaction: BF10 = 16.30). Specifically, adding LTM load had a different effect depending on whether intact or rearranged LTM pairs were used. Performance was unaffected by increasing LTM load at high WM load (four pairs) when intact pairs were used (LTM Load 2/intact vs. LTM Load 0 condition; BF10 = .13). However, with unreliable LTM information (in the LTM Load 2/PI condition), WM was worse than in the intact LTM load and 0 LTM load condition for both WM loads (LTM Load 2/intact vs Load 2./PI: BF10 = 11.23, and BF10 = 2.32 × 10
The analysis further revealed evidence of better performance when the trial included 2 rather than 4 new pairs (main effect of WM load: BF10 = 23.25). There was also evidence for a main effect of condition (BF10 = 8.73 × 10
Next, we focused on new pairs only and investigated whether performance in remembering those new pairs varied between trials which included intact or rearranged/new associated LTM pairs and trials that did not. As can be seen in
Lastly, we were interested in whether PI effects differed between the two PI manipulations (rearranged vs. new associated pairs). Planned pairwise comparisons between the two conditions for both LTM as well as new pairs revealed evidence against a difference (BF10 = .04 [BF01= 28.51]).
In Experiment 3, we evaluated the effect of making LTM information unreliable on the benefit people obtained from it. This allowed us to assess how flexibly participants could use (or not use) LTM to support WM functioning when memory load reaches capacity limits: Whether they (a) outsourced load to LTM when LTM pairs were intact but not when they were rearranged, (b) obligatorily outsourced load to LTM with performance suffering when pairs were rearranged, or (c) did not outsource load to LTM at all when information held there was sometimes unreliable. Results were most consistent with the second of these options, with strong PI effects on the LTM pairs only, as reflected in worse performance for rearranged compared to intact LTM and new pairs, independent of set size. At the same time our results supported the prediction of option 2 for the effects on new pairs, as WM for new pairs was unaffected by manipulations of PI/PF (see
The goal of this study was to investigate to what extent performance in a WM task is assisted by episodic LTM for subsets of the memoranda. From our results we can draw three major conclusions: First, performance in a WM task is enhanced when LTM representations are available and reliable. When they can, people will draw on previously learned LTM representations, and further they are able to benefit from an effective reallocation of WM resources, freeing capacity for new information and resulting in better WM performance overall, and better memory for new pairs, in particular. Second, the enhancement of performance is limited to situations where some negotiation of information between WM and LTM is already necessary. And third, performance enhancements resulting from LTM representations are dependent on the reliability of the information in LTM.
As noted in the introduction, the idea that LTM contributes to WM performance is not novel. Current models of WM such as the three Embedded Components model (
Our findings add to existing demonstrations of the contribution of episodic LTM to WM performance, and reveal a novel aspect of the phenomenon: That the contribution from LTM, and thereby the exchange of information between the two memory systems, is inevitable once WM capacity is exceeded. By providing prelearned episodic LTM traces for parts of the memoranda, we highlighted the conditions under which information already stored in LTM enters and supports WM.
It is important to note at this point that we do not assume that participants in our experiments maintained all items (only) in WM during trials with larger set sizes. In contrast, we explicitly assume a close interaction of WM and LTM with people probably encoding items into episodic LTM in some form and drawing from existing (semantic) LTM representations naturally. This information in LTM can then contribute to performance in a WM task. Our conclusions are founded on the idea that there is some contribution of LTM to task performance even in cases where memory stimuli consist of novel pairs. Essentially, if large numbers of word pairs were capable of being successfully retained in WM, without any contribution from LTM, then we would expect to see limited or no improvement in performance when the same number of items consists of a mixture of prelearned and novel pairs (e.g., 2 LTM + 4 new pairs), relative to novel pairs only (e.g., six new pairs). We assume this improvement originates in the superior LTM representations for prelearned pairs than for novel pairs. Thus, when the LTM pairs become unreliable (in our PI conditions in Experiment 3), the improvement vanishes.
Our results further suggest a second conclusion, that whether or not an LTM representation is used in a WM task is controlled by a gating mechanism: if the representation in WM is sufficient to solve the task, people rely on it exclusively. If it is not sufficient, the LTM representation is retrieved instead. This inference is based on the distinction between the outcome when LTM pairs are added to two WM pairs (i.e., performance gets worse), and when LTM pairs are added to four or more WM pairs. Thus, here we have made an assumption about when the use of LTM is necessary (i.e., when there are more than two pairs), and when it is not (i.e., when there are 2 pairs). This assumption is grounded in the patterns evident in the data—namely, the different effects of adding LTM pairs to two and more than two novel pairs—and so we feel it is justified. We are certainly not wedded to the idea of a hard limit to WM of two pairs (or any other number of pairs, for that matter).
This second conclusion is further consistent with work by
In the following section we elaborate on the idea that, by opening the gate to LTM and starting to interact with its content, the WM system also becomes susceptible to interference caused by LTM information, leading to general performance costs.
The results of Experiment 2 revealed that retrieval from LTM - either at test or during maintenance - is subject to some performance costs. When memory load in a given trial was sufficiently low (WM Load 2, LTM Load 0) participants seem to have been able to rely exclusively on WM, and this yielded the best performance; yet when the trial additionally included two LTM pairs, performance deteriorated. This can be explained in two ways: either (a) subjects were able to encode these novel pairs into LTM, resulting in a dual-code of information (redundant WM and LTM representations to support performance) as these easy trials required less of their cognitive resources (see
We can think of four potential nonexclusive mechanisms underlying this cost of exchanging information between WM and LTM: (a) interference, (b) a time-cost, (c) a limited cognitive resource, or (d) process-switching costs. First, opening the gate to LTM entails a higher probability of interfering information entering the focus of attention—this entails task unrelated thoughts (
Last but not least, another underlying cause of a cost of exchanging information between WM and LTM could be a process-switching cost, analogous to those identified as affecting WM updating (e.g.,
There are at least two possible points at which people could draw on previously learned LTM representations and thereby benefit in our and other WM paradigms—solely at test, or during maintenance as well. Specifically, subjects could either (a) simply retrieve these LTM representations at the WM test, or (b) they could form an additional representation which is maintained in WM. At test, subjects then draw from both the WM and LTM representations, resulting in better WM performance. For instance, separate representations in WM and LTM may provide redundancy in case one of the representations is forgotten or inaccessible (i.e., a redundancy gain; e.g.,
Whereas in Experiment 2 the beneficial effect of LTM on WM performance did not simply reflect a performance boost on the prelearned pairs, but instead involved an effective reallocation of WM resources, in Experiment 3, trials containing LTM pairs only showed a performance boost on the prelearned pairs themselves, and not the novel pairs. This suggests that when a pair was forgotten, LTM increased the probability of a correct response when the pair was intact but decreased it when the pair had been rearranged. This indicates that the flexibility with which individuals can negotiate between WM and LTM may depend on the cognitive cost of identifying when LTM is and is not reliable: When this identification becomes too difficult, people simply prioritise the use of their limited WM resources to maintain information that LTM cannot help with.
Recent research by
A different mechanism was recently proposed by
Nevertheless, in one respect our results seem to conflict with the recent findings investigating the effect of PI on variants of the Hebb paradigm by
In light of these differences, we view our findings as an addition to, rather than a contradiction of, this work. Specifically, our experiments show that the benefit of available LTM representations arises from subjects simply retrieving these prelearned LTM representations at the WM test, which then suffer from typical interference effects.
One striking side-finding of our Experiments 1 and 2 was that, although increases in set size through LTM unavailable new pairs had a detrimental effect on WM performance, no such set size effect was evident in the LTM data. This replicates recent findings from a similar relational recognition task using word-word pairs undertaken by
These two findings support the notion that there is a separation between representations in WM and LTM reflecting their different functions. As we mentioned in our introduction, we understand WM as a system consisting of a set of mechanisms—a system that interacts with and relies on LTM (e.g.,
Our present findings of a set size effect in WM but not LTM, in combination with the effects of proactive facilitation and interference which were specific to the LTM pairs and not the new WM pairs, support this notion.
One explanation of these findings would be that, at encoding, separate WM and LTM traces are formed. The WM trace is then subject to interference between bindings from additional pairs added in the trial, which leads to the demonstrated detrimental effect of increases in set size (
Taken together, although our data do not allow decisive insight into the question of how the mechanisms of WM are implemented, the above results support our initial assumption that there is a separation between representations in WM and LTM reflecting their different functions.
Our study has shown that the presence and use of reliable LTM representations frees capacity for maintaining additional information in WM. Furthermore, a gating mechanism between WM and LTM controls the information passing from one system to the other, with initial costs to opening the gate. The benefit of available LTM representations arises from subjects simply retrieving these LTM representations at the (WM) test, which then suffer from typical interference effects. However, the flexibility with which individuals can negotiate between WM and LTM may depend on the cognitive cost of identifying when LTM is and is not reliable.
Adam, K. C. S., Robison, M. K., & Vogel, E. K. (2018). Contralateral delay activity tracks fluctuations in working memory contralateral delay activity tracks fluctuations in working memory performance. Journal of Cognitive Neuroscience, 30(9), 1229–1240. 10.1162/jocn_a_01233
Atkinson, R. C., & Shiffrin, R. M. (1968). Human memory: A proposed system and its control processes. Psychology of Learning and Motivation, 2, 89–195. 10.1016/S0079-7421(08)60422-3
Baddeley, A. D. (1986). Working memory. Current Biology, 20(4), 136–140. 10.1016/j.cub.2009.12.014
Baddeley, A. D. (2012). Working memory: Theories, models, and controversies. Annual Review of Psychology, 63, 1–29. 10.1146/annurev-psych-120710-100422
Baddeley, A. D., Hitch, G. J., & Allen, R. J. (2009). Working memory and binding in sentence recall. Journal of Memory and Language, 61(3), 438–456. 10.1016/j.jml.2009.05.004
Badre, D. (2012). Opening the gate to working memory. Proceedings of the National Academy of Sciences of the United States of America, 109(49), 19878–19879. 10.1073/pnas.1216902109
Balota, D. A. (1983). Automatic semantic activation and episodic memory encoding. Journal of Verbal Learning and Verbal Behavior, 22(1), 88–104. 10.1016/S0022-5371(83)80008-5
Barr, D. J., Levy, R., Scheepers, C., & Tily, H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3), 255–278. 10.1016/j.jml.2012.11.001
Barrouillet, P., & Camos, V. (2015). Working memory: Loss and reconstruction. Psychology Press.
Bartsch, L. M., Loaiza, V. M., & Oberauer, K. (2019). Does limited working memory capacity underlie age differences in associative long-term memory?Psychology and Aging, 34(2), 268–281. 10.1037/pag0000317
Bürkner, P.-C. (2017). brms: An R package for Bayesian multilevel models using Stan. Journal of Statistical Software, 80(1), 1–28. 10.18637/jss.v080.i01
Bürkner, P.-C. (2018). Advanced Bayesian multilevel modeling with the R package brms. The R Journal, 10(1), 395–411. 10.32614/rj-2018-017
Carpenter, B., Gelman, A., Hoffman, M. D., Lee, D., Goodrich, B., Betancourt, M., Brubaker, M., Guo, J., Li, P., & Riddell, A. (2017). Stan: A probabilistic programming language. Journal of Statistical Software, 76(1), 1–32. 10.18637/jss.v076.i01
Chen, Z., & Cowan, N. (2005). Chunk limits and length limits in immediate recall: A reconciliation. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(6), 1235–1249. 10.1037/0278-7393.31.6.1235
Chen, Z., & Cowan, N. (2009). Core verbal working-memory capacity: The limit in words retained without covert articulation. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 62(7), 1420–1429. 10.1080/17470210802453977
Cowan, N. (1988). Evolving conceptions of memory storage, selective attention, and their mutual constraints within the human information-processing system. Psychological Bulletin, 104(2), 163–191.
Cowan, N. (1995). Attention and memory: An integrated framework. Oxford University Press.
Cowan, N. (2001). The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behavioral and Brain Sciences, 24(1), 87–114. 10.1017/s0140525x01003922
Cowan, N. (2006). Working memory capacity. Psychology Press. 10.4324/9780203342398
Cowan, N. (2008). What are the differences between long-term, short-term, and working memory?Progress in Brain Research, 169(07), 323–338. 10.1016/S0079-6123(07)00020-9
Craik, F. I. M. (1971). Age differences in recognition memory. The Quarterly Journal of Experimental Psychology, 23(3), 316–323. 10.1080/14640746908401827
Crowder, R. G. (1982). The demise of short-term memory. Acta Psychologica, 50(3), 291–323.
De Schrijver, S., & Barrouillet, P. (2017). Consolidation and restoration of memory traces in working memory. Psychonomic Bulletin & Review, 24(5), 1651–1657. 10.3758/s13423-017-1226-7
De Simoni, C., & von Bastian, C. C. (2018). Working memory updating and binding training: Bayesian evidence supporting the absence of transfer. 10.1037/xge0000453
Engle, R. W., Cantor, J., & Carullo, J. J. (1992). Individual differences in working memory and comprehension: A test of four hypotheses. Journal of Experimental Psychology: Learning, Memory, and Cognition, 18(5), 972–992. 10.1037/0278-7393.18.5.972
Ericsson, K. A., & Kintsch, W. (1995). Long-term working memory. Psychological Review, 102(2), 211–245. 10.1037/0033-295X.102.2.211
Eriksson, J., Vogel, E. K., Lansner, A., Bergström, F., & Nyberg, L. (2015). Neurocognitive architecture of working memory. Neuron, 88(1), 33–46. 10.1016/j.neuron.2015.09.020
Farrell, S. (2012). Temporal clustering and sequencing in short-term memory and episodic memory. Psychological Review, 119(2), 223–271. 10.1037/a0027371
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013). Bayesian data analysis (3rd edition). Chapman & Hall/CRC.
Gerver, D. (1969). The effects of source language presentation rate on the performance of simultaneous conference interpreters. Proceedings of the 2nd Louisville Conference on Rate and/or Frequency Controlled Speech (pp. 162–184). University of Louisville.
Gronau, Q. F., Singmann, H., & Wagenmakers, E. J. (2020). Bridgesampling: An R package for estimating normalizing constants. Journal of Statistical Software. Advance online publication. 10.18637/jss.v092.i10
Huebner, G. M., & Gegenfurtner, K. R. (2011). The efficiency of encoding: Limits of information transfer into memory. Attention, Perception & Psychophysics, 73(5), 1503–1521. 10.3758/s13414-011-0120-z
Hulme, C., Maughan, S., & Brown, G. D. A. (1991). Memory for familiar and unfamiliar words: Evidence for a long-term memory contribution to short-term memory span. Journal of Memory and Language, 30(6), 685–701. 10.1016/0749-596X(91)90032-F
Hulme, C., Roodenrys, S., Brown, G., & Mercer, R. (1995). The role of long-term memory mechanisms in memory span. British Journal of Psychology, 86(4), 527–536. 10.1111/j.2044-8295.1995.tb02570.x
Hulme, C., Roodenrys, S., Schweickert, R., Brown, G. D. A., Martin, S., & Stuart, G. (1997). Word-frequency effects on short-term memory tasks: Evidence for a redintegration process in immediate serial recall. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23(5), 1217–1232. 10.1037/0278-7393.23.5.1217
Jackson, M. C., & Raymond, J. E. (2008). Familiarity enhances visual working memory for faces. Journal of Experimental Psychology: Human Perception and Performance, 34(3), 556–568. 10.1037/0096-1523.34.3.556
Jefferies, E., Frankish, C. R., & Lambon Ralph, M. A. (2006). Lexical and semantic binding in verbal short-term memory. Journal of Memory and Language, 54(1), 81–98. 10.1016/j.jml.2005.08.001
Kessler, Y., & Oberauer, K. (2014). Working memory updating latency reflects the cost of switching between maintenance and updating modes of operation. Journal of Experimental Psychology: Learning, Memory, and Cognition, 40(3), 738–754. 10.1037/a0035545
Kowialiewski, B., & Majerus, S. (2018). The non-strategic nature of linguistic long-term memory effects in verbal short-term memory. Journal of Memory and Language, 101, 64–83. 10.1016/j.jml.2018.03.005
Lewis-Peacock, J. A., & Postle, B. R. (2008). Temporary activation of long-term memory supports working memory. Journal of Neuroscience, 28(35), 8765–8771. 10.1523/JNEUROSCI.1953-08.2008
Loaiza, V. M., & Camos, V. (2016). Does controlling for temporal parameters change the levels-of-processing effect in working memory?Advances in Cognitive Psychology, 12(1), 2–9. 10.5709/acp-0182-3
Loaiza, V. M., McCabe, D. P., Youngblood, J. L., Rose, N. S., & Myerson, J. (2011). The influence of levels of processing on recall from working memory and delayed recall tasks. Journal of Experimental Psychology: Learning, Memory, and Cognition, 37(5), 1258–1263. 10.1037/a0023923
Luck, S. J., & Vogel, E. K. (1997). The capacity of visual working memory for features and conjunctions. Nature, 390(6657), 279–281. 10.1038/36846
McCabe, D. P. (2008). The role of covert retrieval in working memory span tasks: Evidence from delayed recall tests. Journal of Memory and Language, 58(2), 480–494. 10.1016/j.jml.2007.04.004
Melton, A. W. (1963). Implications of short-term memory for a general theory of memory. Journal of Verbal Learning and Verbal Behavior, 2(1), 1–21.
Miller, J. (1982). Divided attention: Evidence for coactivation with redundant signals. Cognitive Psychology, 14(2), 247–279. 10.1016/0010-0285(82)90010-X
Miller, E. K., & Cohen, J. D. (2001). An integrative theory of prefrontal cortex function. Annual Review of Neuroscience, 24, 167–202. 10.1146/annurev.neuro.24.1.167
Mızrak, E., & Oberauer, K. (2020). Working memory recruits long-term memory when it is beneficial: Evidence from the Hebb Effect. PsyArXiv, 1, 44. 10.31234/osf.io/ryqj9
Nairne, J. S. (1990). A feature model of immediate memory. Memory & Cognition, 18(3), 251–269.
Nairne, J. S. (2002). Remembering over the short-term: The case against the standard model. Annual Review of Psychology, 53(1), 53–81.
Norris, D., Kalm, K., & Hall, J. (2019). Chunking and redintegration in verbal short-term memory. Journal of Experimental Psychology: Learning, Memory, and Cognition, 46(5), 872–893.
O’Reilly, R. C., & Frank, M. J. (2006). Making working memory work: A computational model of learning in the prefrontal cortex and basal ganglia. Neural Computation, 18(2), 283–328. 10.1162/089976606775093909
Oberauer, K. (2002). Access to information in working memory: Exploring the focus of attention. Journal of Experimental Psychology: Learning, Memory, and Cognition, 28(3), 411–421.
Oberauer, K. (2009). Design for a working memory. Psychology of Learning and Motivation, 51, 45–100.
Oberauer, K., Awh, E., & Sutterer, D. W. (2017). The role of long-term memory in a test of visual working memory: Proactive facilitation but no proactive interference. Journal of Experimental Psychology: Learning, Memory, and Cognition, 43(1), 1–22. 10.1037/xlm0000302
Oberauer, K., Farrell, S., Jarrold, C., & Lewandowsky, S. (2016). What limits working memory capacity?Psychological Bulletin, 142(7), 758–799. 10.1037/bul0000046
Oberauer, K. (2019). Working memory capacity limits memory for bindings. Journal of Cognition, 2(1), 40. 10.5334/joc.79
Perham, N., Marsh, J. E., & Jones, D. M. (2009). Short article: syntax and serial recall: how language supports short-term memory for order. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 62(7), 1285–1293. 10.1080/17470210802635599
Poirier, M., & Saint-Aubin, J. (1995). Memory for related and unrelated words: Further evidence on the influence of semantic factors in immediate serial recall. The Quarterly Journal of Experimental Psychology Section A, 48(2), 384–404. 10.1080/14640749508401396
Popov, V., & Reder, L. M. (2020). Frequency effects on memory: A resource-limited theory. Psychological Review, 127(1), 1–46. 10.1037/rev0000161
Raaijmakers, J. G. W., & Shiffrin, R. M. (1980). SAM: A theory of probabilistic search of associative memory. Psychology of Learning and Motivation, 14, 207–262.
Raghavachari, S., Kahana, M. J., Rizzuto, D. S., Caplan, J. B., Kirschen, M. P., Bourgeois, B., Madsen, J. R., & Lisman, J. E. (2001). Gating of human theta oscillations by a working memory task. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience, 21(9), 3175–3183. 10.1523/JNEUROSCI.21-09-03175.2001
Ranganath, C. (2006). Working memory for visual objects: Complementary roles of inferior temporal, medial temporal, and prefrontal cortex. Neuroscience, 139(1), 277–289. 10.1016/j.neuroscience.2005.06.092
Ranganath, C., & Blumenfeld, R. S. (2005). Doubts about double dissociations between short- and long-term memory. Trends in Cognitive Sciences, 9(8), 374–380. 10.1016/j.tics.2005.06.009
Ranganath, C., Cohen, M. M. X., & Brozinsky, C. C. J. (2005). Working memory maintenance contributes to long-term memory formation: Neural and behavioral evidence. Journal of Cognitive Neuroscience, 17(7), 994–1010. 10.1162/0898929054475118
Romani, C., Mcalpine, S., & Martin, R. C. (2008). Concreteness effects in different tasks: Implications for models of short-term memory. Quarterly Journal of Experimental Psychology: Human Experimental Psychology, 61(2), 292–323. 10.1080/17470210601147747
Rose, N. S., & Craik, F. I. M. (2012). A processing approach to the working memory/long-term memory distinction: Evidence from the levels-of-processing span task. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38(4), 1019–1029. 10.1037/a0026976
Rose, N. S., Craik, F. I. M., & Buchsbaum, B. R. (2015). Levels of processing in working memory: Differential involvement of fronto-temporal networks. Journal of Cognitive Neuroscience, 27(3), 522–532. 10.1162/jocn_a_00738
Rouder, J. N., Morey, R. D., Speckman, P. L., & Province, J. M. (2012). Default Bayes factors for ANOVA designs. Journal of Mathematical Psychology, 56(5), 356–374. 10.1016/j.jmp.2012.08.001
Schielzeth, H., & Forstmeier, W. (2009). Conclusions beyond support: Overconfident estimates in mixed models. Behavioral Ecology, 20(2), 416–420. 10.1093/beheco/arn145
Schurgin, M. W., Cunningham, C. A., Egeth, H. E., Brady, T. F., & Hall, M. (2018). Visual long-term memory can replace active maintenance in visual working memory. bioRxiv. 10.1101/381848
Shaw, M. L. (1982). Attending to multiple sources of information: I. The integration of information in decision making. Cognitive Psychology, 14(3), 353–409. 10.1016/0010-0285(82)90014-7
Shiffrin, R. M., & Atkinson, R. C. (1969). Storage and retrieval processes in long-term memory. Psychological Review, 76(2), 179–193. 10.1037/h0027277
Smallwood, J., & Schooler, J. W. (2015). The science of mind wandering: Empirically navigating the stream of consciousness. Annual Review of Psychology, 66, 487–518. 10.1146/annurev-psych-010814-015331
Souza, A. S., & Oberauer, K. (2017). Time to process information in working memory improves episodic memory. Journal of Memory and Language, 96, 155–167. 10.1016/j.jml.2017.07.002
Tamez, E., Myerson, J., & Hale, S. (2008). Learning, working memory, and intelligence revisited. Behavioural Processes, 78(2), 240–245. 10.1016/j.beproc.2008.01.008
Tamez, E., Myerson, J., & Hale, S. (2012). Contributions of associative learning to age and individual differences in fluid intelligence. Intelligence, 40(5), 518–529. 10.1016/j.intell.2012.04.004
Thalmann, M., Souza, A. S., & Oberauer, K. (2019). How does chunking help working memory?Journal of Experimental Psychology: Learning, Memory, and Cognition, 45(1), 37–55. 10.1037/xlm0000578
Tulving, E. (1972). Episodic and semantic memory. In E.Tulving & W.Donaldson (Eds.), Organization of memory (pp. 381–403). Academic Press.
Unsworth, N. (2010). Interference control, working memory capacity, and cognitive abilities: A latent variable analysis. Intelligence, 38(2), 255–267. 10.1016/j.intell.2009.12.003
Unsworth, N., & Engle, R. W. (2006a). Simple and complex memory spans and their relation to fluid abilities: Evidence from list-length effects. Journal of Memory and Language, 54(1), 68–80. 10.1016/j.jml.2005.06.003
Unsworth, N., & Engle, R. W. (2006b). A temporal–contextual retrieval account of complex span: An analysis of errors. Journal of Memory and Language, 54(3), 346–362. 10.1016/j.jml.2005.11.004
Unsworth, N., & Engle, R. W. (2007a). On the division of short-term and working memory: An examination of simple and complex span and their relation to higher order abilities. Psychological Bulletin, 133(6), 1038–1066. 10.1037/0033-2909.133.6.1038
Unsworth, N., & Engle, R. W. (2007b). The nature of individual differences in working memory capacity: Active maintenance in primary memory and controlled search from secondary memory. Psychological Review, 114(1), 104–132. 10.1037/0033-295X.114.1.104
Verhaeghen, P., Vandenbroucke, V., & Dierckx, A. (1998). Growing slower and less accurate: Adult age differences in time-accuracy functions for recall and recognition from episodic memory. Experimental Aging Research, 24(1), 3–19. 10.1080/036107398244337
Verschooren, S., Kessler, Y., & Egner, T. (2020). Evidence for a single mechanism gating perceptual and long-term memory information into working memory. PsyArXiv.
Walker, I., & Hulme, C. (1999). Concrete words are easier to recall than abstract words: Evidence for a semantic contribution to short-term serial recall. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25(5), 1256
Watkins, M. J. (1977). The intricacy of memory span. Memory & Cognition, 5(5), 529–534. 10.3758/BF03197396
Wilhelm, O., Hildebrandt, A., & Oberauer, K. (2013). What is working memory capacity, and how can we measure it?Frontiers in Psychology, 4, 433. 10.3389/fpsyg.2013.00433
Xie, W., & Zhang, W. (2017). Familiarity increases the number of remembered Pokémon in visual short-term memory. Memory & Cognition, 45(4), 677–689. 10.3758/s13421-016-0679-7
Zhang, W., & Luck, S. J. (2008). Discrete fixed-resolution representations in visual working memory. Nature, 453(7192), 233–235. 10.1038/nature06860
Submitted: July 14, 2020 Revised: February 4, 2021 Accepted: February 4, 2021