Treffer: Information-Limited Parallel Processing in Difficult Heterogeneous Covert Visual Search
University of Southern California, United States
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
FRANCIS
Weitere Informationen
Difficult visual search is often attributed to time-limited serial attention operations, although neural computations in the early visual system are parallel. Using probabilistic search models (Dosher, Han, & Lu, 2004) and a full time-course analysis of the dynamics of covert visual search, we distinguish unlimited capacity parallel versus serial search mechanisms. Performance is measured for difficult and error-prone searches among heterogeneous background elements and for easy and accurate searches among homogeneous background elements. Contrary to the claims of time-limited serial attention, searches in heterogeneous backgrounds instead exhibited nearly identical search dynamics for display sizes up to 12 items. A review and new analyses indicate that most difficult as well as easy visual searches operate as an unlimited-capacity parallel analysis over the visual field within a single eye fixation, which suggests limitations in the availability of information, not temporal bottlenecks in analysis or comparison. Serial properties likely reflect overt attention expressed in eye movements.
Information-Limited Parallel Processing in Difficult Heterogeneous Covert Visual Search
<cn> <bold>By: Barbara Anne Dosher</bold>>
> <bold>Songmei Han</bold>
>
> <bold>Zhong-Lin Lu</bold>
>
<bold>Acknowledgement: </bold>This research was supported by the National Institute of Mental Health and the Air Force Office of Scientific Research, and by the Summer Fellowship of the Institute for Mathematical Behavioral Sciences, by the National Eye Institute.
Visual search for a target among distractor elements—finding a particular object among many others—is of both theoretical and practical significance. Neural computations in early visual cortex represent visual inputs simultaneously at distinct retinal locations. Visual search for a specific target in the visual field requires the further analysis and identification of each display element as a target or distractor, a process reflecting both attention and decision (Treisman, 1982; Treisman & Gelade, 1980; Verghese, 2001). One of the most widely studied paradigms in cognitive science, visual search has also been investigated in neurophysiology (Chelazzi, Miller, Duncan, & Desimone, 2001; Reynolds & Desimone, 2001), in computational neuroscience (Corchs & Deco, 2001), and in cognitive psychology (Neisser, 1967; Sperling, Budiansky, Spivak, & Johnson, 1971; Treisman & Gelade, 1980; Treisman & Gormican, 1988; Wolfe, 2003; Wolfe & Friedman-Hill, 1992) and has applications to practical situations in screening and human operator environments.
One central theoretical issue is whether human observers are characterized by serial or parallel search processing architectures, often associated with the processing of complex and basic visual features, respectively (Dosher, 1998; Treisman & Gelade, 1980). Most behavioral assays of search efficiency evaluate either search time in freely viewed displays (i.e., Treisman & Gelade, 1980) or search accuracy in time-limited displays (i.e., Palmer, 1994). Search times that increase substantially with added display elements are usually associated with a covert serial processing architecture (Sternberg, 1966), yet in fact are ambiguous and may reflect parallel processes (Theois, 1973; Townsend & Ashby, 1983). Mathematical analyses show that increased average response time (or decreased accuracy) for larger displays do not, by themselves, distinguish the serial or parallel architecture of visual analysis (see Palmer, Verghese, & Pavel, 2000, and Sperling & Dosher, 1986, for reviews). On the other hand, some search accuracy experiments express the performance as thresholds (either contrast or feature differences) corresponding to a criterion accuracy (e.g., Allen & Humphreys, 2007), and focus on the relationship between thresholds and display size. Search accuracy and thresholds can be directly related to one another within theoretical contexts of observer models (e.g., Lu & Dosher, 2008), and are considered briefly in the discussion. These studies provide important insights into processing capacity (limited vs. unlimited) but not the temporal architecture (parallel vs. serial) of search. Detection theory–based analyses of decision uncertainty may test for whether ultimately—perhaps at a delayed processing time—accuracy is consistent with unlimited capacity (Shaw, 1982; Shaw & Shaw, 1997). However, such analyses cannot answer questions about the time course of processing.
A true test of the full architecture of visual search requires joint evaluation of both the accuracy and the temporal properties of search. Here, speed–accuracy trade-off methods (e.g., Dosher, 1976; Reed, 1973) were used to trace the full time course of processing and assess the serial or parallel nature of visual search. Two previous studies (Dosher, Han, & Lu, 2004; McElree & Carrasco, 1999) measured the full time course of visual search for different display sizes in demanding searches and made conflicting claims. McElree and Carrasco (1999) argued that search for a conjunction target defined by color and form (i.e., red triangle among red squares and green triangles), often associated with serial search, was consistent with a limited-capacity parallel process. In contrast, the time course of a difficult asymmetry search (Dosher et al., 2004), also often associated with serial search, was consistent with an unlimited capacity parallel process. A recent study of isoluminant color search (Sanhti & Reeves, 2004) used a model and response-time analysis and also argued for parallel processing. Which result is the more typical? Is pure parallel processing only rarely characteristic of difficult search, or do most cases of difficult visual search engage capacity-limited parallel or serial processes?
This article focuses on time-course measurements and concludes that the classification of individual elements as target or distractors is carried out in parallel across the visual field within any single episode of information acquisition (eye fixation), even in many very difficult searches. Here we study difficult heterogeneous distractor search and easy homogeneous distractor search to evaluate the generality of unlimited capacity parallel mechanisms. The time-course accuracy functions, hits, and false alarms, as well as the previous results on asymmetry searches (Dosher et al., 2004) and a reanalysis of conjunction search data (McElree & Carrasco, 1999), all show that within a single eye fixation, unlimited capacity parallel processing characterized visual search for a wide class of tasks. These results are consistent with an analysis of multiple-target search (Thornton & Gilden, 2007) and with recent computational models of eye fixations in search that assume parallel processing across the field within a single glance (Najemnik & Geisler, 2005).
Parallel and Serial Visual Search
>
In serial architectures (Figure 1A), as the number of elements in a visual display increases, more elements must be searched one after the other in sequence to find a target. The visual search terminates as soon as a target is found, or continues through the entire display to decide that a target is not present. Serial searches show reduced accuracy and slowed time course (Figure 1B) for displays with more objects, as well as slower response times and lower accuracies in a standard response-time measurements (x symbols) (Sternberg, 1975). Many models of visual search invoke serial processes associated with the capacity-limited deployment of attention, including feature integration theory (Treisman & Gelade, 1980), selective search models (Dosher, 1998; Egeth, Virzi, & Garbart, 1984), guided search models (Cave & Wolfe, 1990; Wolfe, 1994, 2003), and others. In unlimited capacity parallel architectures (Figure 1C; e.g., Dosher, Han, & Lu, 2004; McElree & Dosher, 1989, 1993; Ratcliff, 1978; Townsend & Ashby, 1983, all objects are processed simultaneously, but with variable completion times. Again, search terminates when a target is found or after all items are evaluated when a target is not found. Parallel searches may or may not show noticeable reductions in ultimate accuracy, but exhibit very similar time courses (Figure 1D) for larger and smaller displays. Even if standard response times and error rates (o symbols) show increases with display size, especially if the target is absent, they may nonetheless be consistent with a parallel time course.
>
><anchor name="fig1"></anchor>
Pairs of response times and accuracies (RT and
The full time course of visual search is measured in a speed–accuracy trade-off paradigm in which processing time of the observer is manipulated and performance accuracy is the dependent measure. In the cued-response speed–accuracy trade-off (SAT) paradigm (e.g., Dosher, 1976), the observer is interrupted by a cue to respond (such as a brief tone) and is required to respond as quickly as possible. Usually 6 to 8 interruption times, or cue lags, are used to measure the full time course of information accumulation. Performance accuracy, usually
Probabilistic Models of Visual Search
>
The time course data are evaluated using a
The limiting accuracy for each condition depends on the probability of correctly classifying the target,
While time–accuracy (or SAT) functions may reflect either a continuous accrual of information over time, or the cumulative distribution of completion times of a discrete process (e.g., Dosher, 1976, 1979, 1981), the probabilistic parallel and serial models of visual search are developed here as completion time models. However, this analysis is also consistent with continuous diffusion information accumulators in which information becomes available only when a decision or classification boundary is reached (i.e., Ratcliff, 1998; Thornton & Gilden, 2007), in which case the distribution
Heterogeneous and Homogeneous Search
>
Duncan and Humphreys (1989) were among the first to systematize the observation that search efficiency depended both upon the similarity of the target and distractors and on the heterogeneity of different distractors. In homogeneous conditions, all distractor elements are identical. In heterogeneous conditions, distractor elements are of at least two types. Homogeneous visual search is usually associated with pre-attentive parallel evaluation. Even when the target is known, as in the current case, heterogeneous visual search is usually claimed to involve attention-demanding serial search processes (Duncan & Humphreys, 1989; Rosenholtz, 2001; Wolfe, Friedman-Hill, Stewart, & O'Connell, 1992). In order to guarantee that only covert information processing is evaluated, the SAT study uses time-limited displays to guard against eye movements during search. All the experiments and analyses testing signal detection accounts of search accuracy (without reaction time) also use brief time-limited displays (e.g., Palmer, 1994; Palmer et al., 2000).
In these experiments, the target was always a line of shallow right tilt (8° clockwise of vertical). In homogeneous search, all distractors had a sharper right tilt (25° clockwise of vertical). In heterogeneous search, distractors were equally often tilted sharp right or somewhat left (25° clockwise or 15° counterclockwise of vertical). So, a single boundary in orientation space separated the target and distractors in the homogeneous condition; the heterogeneous condition required multiple or nonlinear boundaries in orientation space, although some researchers might claim a unique category for the target as a steep angle (Wolfe, Friedman-Hill, et al., 1992). Annular search layouts controlled for eccentricity (Carrasco, Evert, Chang, & Katz, 1995; Carrasco, McLean, Katz, & Frieder, 1998) and density effects (see Dosher, Han, & Lu, 2004). A sample trial and layout are illustrated in Figure 2.
>
><anchor name="fig2"></anchor>
Experiment 1 documented the typical reaction time effects of display size for heterogeneous search in a standard response-time paradigm with display until response (Wolfe, Friedman-Hill, et al., 1992). Experiment 2 evaluated time course of heterogeneous and homogeneous visual search using the cued-response SAT paradigm for brief displays of 100 ms or 50 ms. Experiment 3 measured response times for the practiced observers of Experiment 2 in standard response time and in brief displays.
General Method
> <h31 id="xhp-36-5-1128-d287e429">Observers</h31>
In Experiment 1, 10 observers participated in a 1-hr session for undergraduate course credit. Four new observers participated in Experiments 2–3 and were paid for their service. Experiment 2 required a series of 10–12 sessions, while Experiment 3 required 4 sessions. Observers reported normal or corrected-to-normal vision.
<h31 id="xhp-36-5-1128-d287e433">Stimuli</h31>The target was a shallow line tilted 8° clockwise of vertical. Homogeneous distractors were tilted 25° clockwise of vertical. Heterogeneous distractors were either tilted 25° clockwise of vertical or tilted 15° counterclockwise of vertical, with equal probability. These values were chosen on the basis of piloting to optimize asymptotic accuracies in the SAT data. The dark tilted lines were rendered as gray-scale images with anti-aliasing on a 32 × 32 pixel grid displayed on a Leading Edge Technology 1230V monitor controlled by a Vista image board on a PC computer. A special circuit combined two output channels to produce 4096 grey levels (12 bits), linearized to yield 256 programmable luminance levels. The tilted lines were specified as the minimum luminance (1 cd/m<sups>2</sups>), with the background luminance of 71 cd/m<sups>2</sups> (luminance range 1 cd/m<sups>2</sups> to 144 cd/m<sups>2</sups>). The lines were rendered in regions subtending 0.98 × 0.98 degrees at a viewing distance of approximately 60 cm, and were arranged on a 4.12° radius with 15 possible equally spaced positions on the annulus, randomly rotated on each trial. Elements of displays of size 4, 8, and 12 consisted of one, two, or three sets of four adjacent locations, with a space between sets (Figures 2A–2C), thus equating eccentricity and the density for all displays (one half of elements are adjacent to a space, and one half are internal items). Lastly, the position of each element was randomly “jittered” (a uniform distribution from –4 to +4 pixels) in the horizontal and vertical directions to introduce some irregularity into global contour cues.
<h31 id="xhp-36-5-1128-d287e452">Design</h31>Homogeneous and heterogeneous displays were tested in separate blocks and alternated. For all display sizes, half the trials included a target and half did not. In heterogeneous trials, the two types of distractors appeared in randomized locations in the display; a target replaces one of these items when it is present. Trials with different display size and target presence or absence were presented in a random order within blocks.
Experiment 1 tested display sizes of 4, 8, and 12 for target-present and target-absent conditions for separate blocks of heterogeneous and homogeneous displays. Blocks were 480 trials with 80 trials per condition per subject. The displays remained on until the observer responded.
Experiment 2 tested display sizes of 4 and 12 for target-present and target-absent conditions for separate blocks of heterogeneous and homogeneous displays. Processing time was manipulated with 7 cue delays of 0.0, 0.050, 0.150, 0.300, 0.500, 1.150, and 1.800 s after display offset, for net cue delays from stimulus onset of these values plus the display duration. All trial types within a block were tested in random order. After one or two sessions of SAT training with display duration set at 150 ms, observers participated for six sessions (
Experiment 3 tested display sizes of 4, 8, and 12 for target-present and target-absent conditions for separate blocks of heterogeneous and homogeneous displays. In one condition, the displays remained on until the observer responded. In another, a brief display of 50 ms was used. Observers ran one session each of the homogeneous and heterogeneous search tasks, four sessions total, to yield a sample size per observer per condition of 80 in each condition.
<h31 id="xhp-36-5-1128-d287e464">Procedure</h31>On each trial, a fixation plus appeared for 250 ms, followed by the test display. For Experiment 1 and the free response version of Experiment 3, the display remained available until the observer responded by pressing the
Percentage of
Both the probabilistic serial and parallel search models (see the Appendix) and an exponential model were fit to time–accuracy
where the 
where 
is the observed mean,
with degrees of freedom
where <anchor name="eq5"></anchor>
is the product over
Results
> <h31 id="xhp-36-5-1128-d287e667">Experiment 1 Results</h31>
Response times increased with display size for the heterogeneous condition, whereas display size had minimal effects in the homogeneous condition (Figure 3A). All main effects and interactions of distractor condition, number of display elements, and target presence were significant for response times (all
>
><anchor name="fig3"></anchor>
The slopes of heterogeneous target-present and target-absent displays were 113 ms and 184 ms per item, respectively. The slopes of homogeneous displays were 0 ms and 22 ms, respectively. Homogeneous search was more accurate than was heterogeneous search, with display sizes of 4, 8, and 12 yielding 83.8, 97.3, and 83.8 percentage correct, and 92.8, 89.7, and 89.2 percentage correct, respectively. With the exception of the relatively high error rates for the target-absent heterogeneous distractor conditions, the primary differences were reflected in reaction times.
These results replicate in annular displays the previous reports for distractor heterogeneity conditions (e.g., Wolfe, Friedman-Hill, et al., 1992). The high slopes in the heterogeneous distractor condition would typically be interpreted as the consequence of attention-demanding serial processing. The small slopes of the homogeneous condition would typically be associated with pre-attentive parallel processing. The results in this free-viewing condition may in part reflect overt information acquisition through movement of the eyes.
<h31 id="xhp-36-5-1128-d287e880">Experiment 2 Results</h31><bold>Speed–accuracy trade-off functions</bold>
Experiment 2 tested the full time course of visual search with the cued-response speed–accuracy trade-off. The average time–accuracy functions (
>
><anchor name="fig4"></anchor>
The data for the 100-ms (performed first) and 50-ms (performed second) display conditions are quite similar. These conditions may differ slightly because they reflect different display durations, and hence visual availability, but also different stages of practice. The 50-ms display duration allows earlier cues to respond. We treat the two sets of data as independent replications. Exponential fits of time–accuracy data (
>
><anchor name="tbl1"></anchor>
Importantly, in no case did the quality of fit significantly improve by allowing the larger display size to slow the search speed (exponential rate parameter;
<bold>Probabilistic parallel search models</bold>
In the probabilistic parallel search model, visual search involves identifying each element of the display as a target or as a distractor. The items are processed in parallel, starting simultaneously and with a common time distribution. At any given time, the observer will respond “yes” if at least one item has been identified (correctly or in error) as a target or “no” if all items have been (correctly or in error) identified as distractors, or may guess “yes” in the absence of information. The predicted time course of the target-present decision is controlled by the distribution of times for individual comparisons (drawn from an
The probabilistic parallel search models were fit separately to average
An eight-parameter (plus one preset parameter) model, (4
For the 100-ms data, the estimated parameters were as follows: identification probabilities
>
><anchor name="fig5"></anchor>
The probabilistic parallel search model constrains the relationships of the asymptotic search accuracies of the two display sizes because the asymptotic accuracies are derived from the probabilities of correct identification of the single target and the numbers of distractors for display sizes of 4 and 12. In the descriptive exponential model, the asymptotic accuracies for the display sizes of 4 and 12 were simply estimated independently to maximize fit. The asymptotic accuracy for display sizes 4 and 12 are well fit by the decision model in the probabilistic model, which embodies the classic uncertainty calculations for these display sizes in search accuracy experiments (Eckstein, 1998; Palmer et al., 2000).
Distractor identification in the heterogeneous displays was estimated to be notably poorer than that of the homogeneous displays, although all the heterogeneous distractors were at least as dissimilar from the target (see Hodsoll & Humphreys, 2005, and Rosenholtz, 2001, for related discussions). The probabilistic parallel search model applies directly to homogeneous search, but could be an approximation to heterogeneous search. If the two types of distractors in heterogeneous search were to differ substantially in identification accuracy or temporal parameters, then a more complex model that distinguishes the two would be required. Counting all of the combinatoric instances of such a model would be so complex that it would likely be implemented by simulation rather than derivation. That the probabilistic parallel search model fit the data from the heterogeneous condition quite well suggests that the two types of distractor were similar in identification time and accuracy.
In summary, the probabilistic parallel search model provided a good account of the time course and asymptotic accuracies of visual search, not just for homogeneous search—generally associated with parallel processes—ut also in the case of heterogeneous search—generally cited as a classic example of serial search.
<bold>Probabilistic serial search models</bold>
In this section, the complementary test evaluates the probabilistic serial search model and its ability to fit the time-course data. The probabilistic serial model is an exact analog to the probabilistic parallel model, except that the identification of items occurs in series, one after the other, in random order. All of the model parameters remain the same, except that the temporal properties of the search are determined by the gamma distribution with time parameter τ characterizing the processing of each individual item, and a variable number of stages determined by the (random) order of processing each item, and the probabilities of doing so accurately. See the Appendix for details and equations. The probabilistic serial model, which incorporates the consequence of identification errors into the time-course predictions, moderates the strongly slowed time-course predictions of previous versions of the serial model in visual search (e.g., McElree & Carrasco, 1999).
We gave the probabilistic serial model every possible chance to work by fitting a fully saturated serial model with all different parameters for homogeneous and heterogeneous searches, allowing the greatest possible freedom to this model (four independent probabilities of element identification accuracy, independent estimates of guessing bias, independent time constants, and independent time intercepts for the homogeneous and heterogeneous conditions, 4
>
><anchor name="fig6"></anchor>
Further relaxing the relationship in asymptotic levels, in which the modeled asymptotic performance is constrained by the signal detection uncertainty relationship between the two displays sizes, still failed to achieve the quality of fit of the parallel models. This 8
Even assuming that homogeneous displays reflect parallel search and heterogeneous displays reflect a standard serial search would not solve the poor fit of the serial model because the fit to the heterogeneous data is independent of the fit of the homogeneous data. The sum of squared errors for the mixed homogeneous-parallel, heterogeneous-serial model exceeded that of the parallel model by a factor of more than 2 for both 100-ms data and 50-ms data. The parallel search model provides a good account of both homogeneous and homogeneous search, while serial search gives a poor account of both. These results were independently replicated in the data for the 100-ms displays and the data for the 50-ms displays.
<bold>Compensatory-rate serial model</bold>
Is there any condition under which a serial model could account well for the time-course data? One such case, suggested by a reviewer, might be a paradoxical
It seems obvious, however, that such a paradoxical compensatory processing model should overcome the incompatibility in time courses of display sizes 4 and 12. We implemented the compensatory-rate probabilistic serial model by incorporating not just independent speeds of processing for the two search types (homogeneous and heterogeneous), but also for each display size within search type. All other aspects of the probabilistic serial model are retained. Thus, the model includes 4 independent probabilities of element identification accuracy, independent estimates of guessing bias, independent time constants for the homogeneous and heterogeneous conditions for display sizes 4 and 12, and independent time intercepts for the homogeneous and heterogeneous conditions (4
<bold>Probabilistic parallel model and percent yes data</bold>
The previous sections documented that the probabilistic parallel model of processing provided a good account of the time course of discrimination (
The fact that the 8
The best-fitting probabilistic parallel model for the average hit and false alarm data is shown in Figure 7. The percentage of
>
><anchor name="fig7"></anchor>
The identification probabilities for the homogeneous condition are essentially identical (and very high) for the two display sizes, so the differences focus on the heterogeneous condition. The four independent identification probabilities for the heterogeneous conditions (targets and distractors in display sizes 4 and 12) can be equivalently remapped in terms of a signal detection situation with four free parameters. The mean and standard deviation of the evidence distribution for distractors are set to 0 and 1, respectively, corresponding with standard scaling assumptions of the signal detection theory. The mean of the evidence distribution for targets is
In summary, the probabilistic parallel search model provided an excellent account of the hit and false alarm data over the full time course of visual search as well as an excellent account of the
<bold>Time-limited display RT data</bold>
The response time and errors for the 50-ms time-limited displays under a response time protocol are shown in Figure 3B for the practiced observers who had previously participated in Experiment 2. Response times were relatively fast for these time-limited displays. Reaction time slopes were all –1 ms to 1 ms; however, the response time levels were sensitive to both distractor type and target presence. Error rates in the time-limited displays also depend on distractor type and target presence, and there is a modest increase in errors of about 0.4% per display element in homogeneous conditions and of about 0.8% in heterogeneous conditions. The level of performance is consistent with the performance in the 50-ms time-limited time-course data. Indeed, the relevant response time–accuracy points from these data are very close to points generated in the time-controlled testing protocol. These relatively flat functions of display size seem to differ for similar brief displays of Santhi and Reeves (2004) for isoluminant color stimuli. The reasons for this difference are not clear.
The details of the ANOVA for response times are as follows: distractor condition,
<bold>Free-viewing RT data</bold>
Figure 3C shows the response times and error rates for standard response time conditions, in which the stimulus was available until response, for the practiced observers of Experiment 2. The performance of the practiced observers is faster and more accurate than that of the unpracticed observers in Experiment 1, yet the general pattern of the data was equivalent. With only four observers, all main effects and interactions, including those assessing the effects of display size, were significant in the response time data (
The response time slopes were 2 ms and 1.5 ms for homogeneous conditions, respectively, for target-present and target-absent displays; and were 57 ms and 120 ms for heterogeneous conditions, respectively, for target-present and target-absent conditions. Practice improves, but does not fundamentally alter, the processes for visual search.
General Discussion
> <h31 id="xhp-36-5-1128-d287e1628">Empirical Summary</h31>
Experiment 1 documented the classic heterogeneity effect in a standard, free-viewing reaction time paradigm in annular displays controlling for target and distractor eccentricity. Response times increased substantially with the number of distractors in heterogeneous distractor conditions, but only slightly in homogeneous conditions. Attention-demanding serial search processes is widely associated with the pattern of performance in heterogeneous displays.
In Experiment 2, the time course of visual search was measured using speed–accuracy trade-off methods in time-limited displays (100 ms or 50 ms). A probabilistic parallel model provided an excellent account of both the constrained relation of the asymptotic performance and the common time course of visual search with display sizes of 4 and 12. The probabilistic parallel model also provided an excellent fit to the hit and false alarm pattern underlying the
In contrast, a fully elaborated 10-parameter probabilistic serial search model provided a relatively poor account of the data. The parallel model is consistent with closely equivalent temporal dynamics over the various conditions, with a small slowing regardless of display size due to heterogeneity. The serial model predicts slowing in dynamics for larger displays, a phenomenon that was not observed in the data. Even adding to the freedom of this model to allow unconstrained fits to the asymptotic levels for display sizes 12 and 4, with 14 parameters, failed to substantially improve the quality of the fits and isolated the failures firmly in dynamic aspects of the time-course functions.
Relatively high asymptotic levels in homogeneous search and much lower accuracy in heterogeneous search were related to differences in item identification probabilities. The relative accuracies of the display sizes of 4 and 12, however, were then fully constrained by these estimates in both models.
A compensatory-rate serial model in which each item is processed approximately 3 times more quickly in 12-element displays than in 4-element displays provided nearly but not quite as good a fit as did the parallel model. This paradoxical model violates the assumptions of all common tests of serial models of equal efficiency in all display sizes, or lower efficiency in larger display sizes if capacity limits are evoked. Even if this model achieved an equal fit, the parallel model would be preferred on the grounds of simplicity. The simpler parallel model accounts for the data with 8 free parameters rather than the 12-parameter compensatory serial model. However, certain unusual serial models such as the compensatory rate model cannot be completely ruled out.
Experiment 3 documented that highly practiced SAT observers showed the same pattern of performance as that of unpracticed observers in a standard response time paradigm. There was no evidence that the extended practice altered the form or mechanisms of search, although those mechanisms became somewhat more efficient with faster response times and fewer errors. The data from the time-limited response time paradigm were consistent with the SAT data.
<h31 id="xhp-36-5-1128-d287e1644">Relation to Other Findings</h31>The probabilistic parallel search model provided a good quantitative fit of the time course data, consistent with the suggestive analysis based on descriptive exponential models. The explicit model is important because it assesses the constraints on performance for different display sizes, incorporating asymptotic constraints and the effects of classification errors on the time course of visual search. The model accounts well for the asymptotic accuracies for display size of 4 and 12 using the same parameters
The current asymptotic accuracy effects are consistent with a series of findings focused on search accuracy in time-limited displays (Palmer, 1994, 1995; Palmer, Ames, & Lindsey, 1993; Palmer et al., 2000). In that literature, effects of display size were consistent with statistical uncertainty within the framework of a signal detection framework, and not consistent with models that proposed capacity limitations, for a wide range of searches with targets defined by primary features such as line length, brightness, or orientation. This is also true for the portion of the search accuracy literature (e.g., Palmer, 1994) measuring the effects of display size as feature or contrast thresholds, the difference needed to achieve a given accuracy. These are two different ways of expressing the same accuracy effects, which can be directly related via observer models such as the perceptual template model (see Lu & Dosher, 2008, for a review).
Overall, then, the uncertainty constraints of the search accuracy experiments of Palmer and others (Palmer, 1994, 1995; Palmer, Ames, & Lindsey, 1993; Palmer et al., 2000) are consistent with the asymptotic accuracies of the current and other time-accuracy studies. The current results go beyond the search accuracy literature in providing a consistent account of the temporal properties of visual search, not just the ultimate, asymptotic accuracy levels.
The current results join our earlier results on the time course of visual search asymmetries (O in Cs; Dosher et al., 2004). This difficult search asymmetry also generated large display size effects in standard response time, yet the probabilistic parallel model of visual search gave an excellent account of the time course of search for brief displays. The current results extend the parallel model from a simple case of feature search to the more complex heterogeneous search conditions.
Thus, parallel processing has been documented in two cases that have classically been associated with serial deployment of covert attention. However, the previous study of McElree and Carrasco (1999), the first to apply cued-response speed–accuracy trade-off methods to visual search, argued that a pure unlimited capacity parallel model cannot account for the temporal dynamics of conjunction search, and suggested instead that conjunction search invoked a limited-capacity parallel search.
Combining the results of the current study, the results of Dosher et al. (2004), and the results of McElree and Carrasco (1999) could suggest a qualitative difference between difficult search conditions of heterogeneous search and difficult asymmetric search that exhibit parallel search processes on the one hand, and conjunction searches that exhibit some form of capacity limitations on the other hand. The definition of this boundary between purely parallel processes and other forms as involving conjunction search would be complicated under Wolfe, Friedman-Hill, et al.'s (1992) suggestion that some cases of heterogeneous search are implicitly conjunction searches in which the target is the conjunction of being tilted right and having steep (or shallow) tilt. However, the conclusions of McElree and Carrasco (1999) were based on the presence of small but significant slowing in dynamics of conjunction search as assessed in exponential fits rather than a direct fit of an explicit parallel model. We fit the data of McElree and Carrasco (personal communication, March, 2007) with the probabilistic parallel and serial search models, and found that, counter to the initial interpretation, the probabilistic parallel search model provided quite a good account of these time-course data and asymptotic accuracy data for conjunction search. The fit of the probabilistic parallel search model to the McElree and Carrasco conjunction data is shown in Figure 8.
>
><anchor name="fig8"></anchor>
This new finding, based on direct fitting of explicit models, considerably simplifies the meta-pattern of results. All cases in which the full time-course of visual search has been measured, for brief visual displays that eliminate eye movements, are consistent with parallel processing architectures, although the difficult conditions may yield low accuracies. We are not claiming that all difficult visual searches will exhibit unlimited-capacity parallel processing, although we have yet to find a documented case of clear serial processing using time-course analysis. There may, however, be some cases in future investigations that do show covert serial processes.
These time-course results and the probabilistic parallel and serial models (Dosher et al., 2004) are closely related to the modeling development and observations of Thornton and Gilden (2007) in a multitarget search paradigm. Their analysis used four-location search, varied the number of targets, and examined the response time and error patterns of search within the context of an unlimited-capacity parallel processing model. This model simulated the various outcomes on the basis of independent parallel accumulation of evidence to a criterion for each display element, and is essentially equivalent to the probabilistic parallel model with the distribution of element comparison times (Dosher, Han, & Lu, 2004). The analysis of the multitarget paradigm also led Thornton and Gilden to conclude that a wide range of difficult search tasks were consistent with parallel search. The multitarget data, together with the time-course investigations, provide converging evidence for the widespread explanatory adequacy of parallel processes in visual analysis. This is also generally consistent with recent analyses of increasing
Models that routinely incorporate attention-demanding serial processing architectures face challenges in accounting for the visual search data in time-limited displays. This includes the feature integration model (Treisman, 1993; Treisman & Gelade, 1980; Treisman & Gormican, 1988), selective search models (Dosher, 1998; Egeth et al., 1984), and guided search models (Cave & Wolfe, 1990; Wolfe, 1994, 2003). Each of these models ascribes effects of display size on reaction time to the serial deployment of covert attention over the display in free-viewing conditions. The feature integration model (Treisman & Gelade, 1980) assumes serial search over the display elements, or groups of elements, and is directly tested with the probabilistic serial search model here. Selective search models (Dosher, 1998; Egeth et al., 1984) restrict serial searches to particular subsets of stimuli (e.g., the red items). The guided search models (Wolfe, 1994, 2003) also serially search selected subsets of stimuli, in this case defined by a more complex salience ordering. Finally, recursive rejection models (Humphreys & Muller, 1993) would require elaboration or modification to account for full time course data. In short, a number of models of covert attention, especially those involving serial search operations, appear to be simply inconsistent with the time course results, and others would require elaboration and further evaluation.
Visual search models are often applied to the response times and accuracies of free-viewing search paradigms, yet make claims about covert attention processes. A comparison of the search reaction times in freely viewed displays and those in time-limited displays leads us to believe that eye movements must play a considerable role in the former (e.g., Geisler & Chou, 1995; Motter & Belky, 1998). The current analysis of the temporal properties of search in time-limited displays suggests that processing within a single episode of information acquisition is parallel, at least for the clear (unmasked) displays used in these studies.
This conclusion is consistent with all the data on time course of visual search, with the data in a multiple-target paradigm, and it is also consistent with recent proposed models of eye movements during search for small targets in cluttered fields (Najemnik & Geisler, 2005). These models are based on an assumption of parallel uptake of information across the visual field, modulated by eccentricity, and compute a region that is expected to yield the most new information as the location of the next eye fixation. The eye movement system works together with visual processing of the information available over the visual field.
Conclusions
>
Covert attention is deployed in parallel over the items in the visual field in heterogeneous searches studied here, in asymmetry searches (Dosher et al., 2004), and in conjunction search (McElree & Carrasco, 1999). A probabilistic parallel search model (Dosher et al., 2004) provided an excellent account of the time course and asymptotic accuracy of search in all these cases. The time-course of search for different display sizes is consistent with the combination of classifications, some of them errors, of all the display items embodied in the probabilistic parallel model. Visual search is information limited, not limited with temporally serial processing within an eye movement. These results converge with an analysis of multiple-target searches (Thornton & Gilden, 2007) and with recent analyses of eye movements in visual search (Najemnik & Geisler, 2005). Some extremely difficult versions of search, or of search within masked or noisy displays (e.g., Dosher & Lu, 2000), may require close scrutiny of the targets and hence serial processes, but it is an open question whether even such examples would engender serial processes within a single eye fixation or information acquisition episode.
References
<anchor name="c1"></anchor>Allen, H. A., & Humphreys, G. W. (2007). A psychophysical investigation into the preview benefit in visual search.
Carrasco, M., Evert, D. L., Chang, I., & Katz, S. M. (1995). The eccentricity effect: Target eccentricity affects performance on conjunction searches.
Carrasco, M., McLean, T. L., Katz, S. M., & Frieder, K. S. (1998). Feature asymmetries in visual search: Effects of display duration, target eccentricity, orientation and spatial frequency.
Cave, K. R., & Wolfe, J. M. (1990). Modeling the role of parallel processing in visual search.
Chelazzi, L., Miller, E. K., Duncan, J., & Desimone, R. (2001). Responses of neurons in macaque area V4 during memory-guided visual search.
Corchs, S., & Deco, G. (2001). A neurodynamical model for selective visual attention using oscillators.
Dosher, B. (1976). The retrieval of sentences from memory: A speed–accuracy study.
Dosher, B. (1979). Empirical approaches to information processing: Speed–accuracy tradeoff or reaction time.
Dosher, B. (1981). The effect of delay and interference: A speed–accuracy study.
Dosher, B., Han, S., & Lu, Z.-L. (2004). Parallel processing in visual search asymmmetry.
Dosher, B., & Lu, Z.-L. (2000). Noise exclusion in spatial attention.
Dosher, B. A., & McElree, B. (1992). Memory search: Retrieval processes in short-term and long-term recognition. In L. R.Squire (Ed.),
Duncan, J., & Humphreys, G. W. (1989). Visual search and stimulus similarity.
Eckstein, M. P. (1998). The lower visual search efficiency for conjunctions is due to noise and not serial attentional processing.
Egeth, H., Virzi, R. A., & Garbart, H. (1984). Searching for conjunctively defined targets.
Geisler, W. S., & Chou, K. L. (1995). Separation of low-level and high-level factors in complex tasks: Visual search.
Hodsoll, J. P., & Humphreys, G. W. (2005). The effect of target foreknowledge on visual search for categorically separable orientation targets.
Humphreys, G. W., & Muller, H. J. (1993). Search via recursive rejection (SERR): A connectionist model of visual search.
Lu, Z.-L., & Dosher, B. (2008). Characterizing observers using external noise and observer models: Assessing internal representations with external noise.
Macmillan, N. A., & Creelman, C. D. (2005).
McElree, B., & Carrasco, M. (1999). The temporal dynamics of visual search: Evidence for parallel processing in feature and conjunction search.
McElree, B., & Dosher, B. (1989). Serial position and set size in short-term memory: Time course of recognition.
McElree, B., & Dosher, B. (1993). Serial retrieval processing in the recovery of order information.
Motter, B. C., & Belky, E. J. (1998). The zone of focal attention during active visual search.
Najemnik, J., & Geisler, W. S. (2005). Optimal eye movement strategies in visual search.
Neisser, U. (1967).
Palmer, J. (1994). Set-size effects in visual search: The effect of attention is independent of the stimulus for simple tasks.
Palmer, J. (1995). Attention in visual search: Distinguishing four causes of a set-size effect.
Palmer, J., Ames, C. T., & Lindsey, D. T. (1993). Measuring the effect of attention on simple visual search.
Palmer, J., Verghese, P., & Pavel, M. (2000). The psychophysics of visual search.
Ratcliff, R. (1978). A theory of memory retrieval.
Reed, A. V. (1973). Speed–accuracy trade-off in recognition memory.
Reynolds, J. H., & Desimone, R. (2001). Neural mechanisms of attentional selection. In Braun, J., Koch, C. & Davis, J. L. (Eds.),
Rosenholtz, R. (2001). Visual search for orientation among heterogeneous distractors: Experimental results and implications for signal-detection theory models of search.
Santhi, N., & Reeves, A. (2004). The roles of distractor coherence and target certainty on visual search: A signal detection model.
Shaw, M. L. (1982). Attending to multiple sources of information: I. The integration of information in decision making.
Shaw, M. L., & Shaw, P. (1977). Optimal allocation of cognitive resources to spatial locations.
Sperling, G., Budiansky, J., Spivak, J. G., & Johnson, M. C. (1971). Extremely rapid visual search: The maximum rate of scanning letters for the presence of a numeral.
Sperling, G., & Dosher, B. (1986). Strategy and optimization in human information processing. In K. R.Boff, L.Kaufman, & J. P.Thomas (Eds.),
Sperling, G., & Weichselgartner, E. (1995). Episodic theory of the dynamics of spatial attention.
Sternberg, S. (1966). High speed scanning in human memory.
Sternberg, S. (1975). Memory-scanning: New findings and current controversies.
Sutter, A., & Graham, N. V. (1995). Investigating simple and complex mechanisms in texture segregation using the speed–accuracy tradeoff method.
Sutter, A., & Hwang, D. (1999). A comparison of the dynamics of simple (Fourier) and complex (non-Fourier) mechanisms in texture segregation.
Theois, J. (1973). Reaction time measurement in the study of memory processes: Theory and data. In G. H.Bower (Ed.),
Thornton, T. K., & Gilden, D. L. (2007). Parallel and serial processes in visual search.
Townsend, J. T., & Ashby, F. G. (1983).
Townsend, J. T., & Nozawa, G. (1997). Serial exhaustive models can violate the race model inequality: Implications for architecture and capacity.
Treisman, A. (1982). Perceptual grouping and attention in visual search for features and for objects.
Treisman, A. (1993). Representing visual objects.
Treisman, A., & Gelade, G. (1980). A feature integration theory of attention.
Treisman, A., & Gormican, S. (1988). Feature analysis in early vision: Evidence from search asymmetries.
Verghese, P. (2001). Visual search and attention: A signal detection theory approach.
Wannacott, T. H., & Wannacott, R. J. (1981).
Wolfe, J. M. (1994). Guided search 2.0: A revised model of visual search.
Wolfe, J. M. (2003). Moving towards solutions to some enduring controversies in visual search.
Wolfe, J. M., & Friedman-Hill, S. R. (1992). Visual search for orientation: The role of angular relations between targets and distractors.
Wolfe, J. M., Friedman-Hill, S. R., Stewart, M. I., & O'Connell, K. M. (1992). The role of categorization in visual search for orientation.
This appendix presents the equations for probabilistic serial and parallel search models developed by Dosher, Lu, and Han (2004). These model predictions include a single identification accuracy for non-targets. As such, these models are directly applicable to homogeneous search conditions, and are a first-order approximation for heterogeneous search conditions. See Dosher, Han, and Lu (2004) for a more detailed model development.
Figure 1A illustrates a probabilistic serial search model in which each item in a display is searched successively in a random order. The serial model implements a probabilistic weighting rule that incorporates errors—both misses of the target and false alarms to distractor items—and determines both the completion time and accuracy of the search. Observers begin in a neutral information state. A positive information state is entered when an item, correctly or incorrectly, is identified as a target. The negative information state is entered when all items, correctly or incorrectly, are identified as distractors.
Let PT and PD be the probability of correctly identifying a target and a distractor, respectively, and N be the display size. Finally, G(t|τ,α), the gamma distribution with time constant τ and number of stages α (defined later), characterizes the finishing time distributions as a function of time from the onset of the display, t.
For target-present displays, the probability of entering the positive information state (correctly or in error) by time t following display onset is <anchor name="eq6"></anchor>
Here, m is an index for the order or position in which the location containing the target is searched, and k is an index for calculating the combinatorics of errors at various positions.
For target-present displays, the probability of entering the negative information-state is <anchor name="eq7"></anchor>
For target-absent displays, the probability of entering the positive information state is <anchor name="eq8"></anchor>
where m is the first process in which a distractor is incorrectly identified as a target.
For target-absent displays, the probability of entering the negative information state is <anchor name="eq9"></anchor>
Figure 1C illustrates a probabilistic parallel search model in which each item in a display is searched in parallel, beginning at the same time but with independent finishing times drawn from a distribution G(t|τ,a). PT and PD are the probabilities of correct target and distractor identification. This is a parallel model with unlimited-capacity dynamics, in that the speed of processing individual items does not depend upon the number of elements in the display.
For the target-present displays, the probability of entering a positive information state is <anchor name="eq10"></anchor>
where m is the number of distractors that are misidentified as targets and the weighting factors reflect the combinatorics on the completion order of those processes.
For the target-present displays, the probability of entering a negative information state is <anchor name="eq11"></anchor>
For target-absent displays, the probability of entering a positive information state is <anchor name="eq12"></anchor>
For target-absent displays, the probability of entering a negative information state is <anchor name="eq13"></anchor>
For both parallel and serial models, the probabilities of yes and no responses is used to calculate a composite (overall) d′ performance accuracy. The probability of yes and no responses is calculated by assuming that the observers say “yes” when in the positive information state, say “no” when in the negative information state, and otherwise guess with probability g: <anchor name="eq14"></anchor>
and <anchor name="eq15"></anchor>
A predicted measure of bias-fee accuracy, d′ for the model is derived from the predicted hit and false alarm rates as a function of processing time, d′ = Z(Pyes) – Z(1 – Pno).
The cumulative density function of the gamma distribution, G(t|τ,a), in the time-course equations is <anchor name="eq16"></anchor>
else <anchor name="eq17"></anchor>
(This may be generalized to include a shift by a base time δ.)