jagomart
digital resources
picture1_Chinese Alphabet Pdf 99782 | Wang18 Comparing The Minimum Spatial Frequency Content For Recognizing Chinese And Alphabet Characters


 150x       Filetype PDF       File size 0.84 MB       Source: legge.psych.umn.edu


File: Chinese Alphabet Pdf 99782 | Wang18 Comparing The Minimum Spatial Frequency Content For Recognizing Chinese And Alphabet Characters
journal of vision 2018 18 1 1 1 13 1 comparing the minimum spatial frequency content for recognizing chinese and alphabet characters department of biomedical engineering university of minnesota minneapolis ...

icon picture PDF Filetype PDF | Posted on 21 Sep 2022 | 3 years ago
Partial capture of text on file.
           Journal of Vision (2018) 18(1):1, 1–13                                                                                                                      1
           Comparing the minimum spatial-frequency content for
           recognizing Chinese and alphabet characters
                                                                                               Department of Biomedical Engineering,
                                                                                  University of Minnesota, Minneapolis, MN, USA
                                                                               Present address: Athinoula A. Martinos Center for
                                                                                    Biomedical Imaging, Department of Radiology,
                                                                          Massachusetts General Hospital and Harvard Medical
           Hui Wang                                                                                        School, Charlestown, MA, USA                             $
                                                                             Department of Psychology University of Minnesota,
           Gordon E. Legge                                                                                             Minneapolis, MN, USA                         $
           Visual blur is a common problem that causes difficulty in                           Introduction
           pattern recognition for normally sighted people under
           degraded viewing conditions (e.g., near the acuity limit,
           when defocused, or in fog) and also for people with                                   Character recognition is a prerequisite for reading
           impaired vision. For reliable identification, the spatial                         and is typically a fast and accurate visual process. It
           frequency content of an object needs to extend up to or                           becomes difficult under degraded visual conditions,
           exceed a minimum value in units of cycles per object,                             suchasreadingsmallsymbolsatalongdistanceorwith
           referred to as the critical spatial frequency. In this study,                     optical defocus, and is especially difficult in patients
           we investigated the critical spatial frequency for                                with severe low vision. The spatial-frequency properties
           alphabet and Chinese characters, and examined the                                 of letter recognition have been widely explored.
           effect of pattern complexity. The stimuli were divided                            Previous studies show that the visual system utilizes a
           into seven categories based on their perimetric                                   spatial frequency of 1–3 cycles per letter (CPL) for
           complexity, including the lowercase and uppercase
           alphabet letters, and five groups of Chinese characters.                          reliable identification (Alexander, Xie, & Derlacki,
           Wefound that the critical spatial frequency significantly                         1994; Chung, Legge, & Tjan, 2002; Ginsburg, 1978;
           increased with complexity, from 1.01 cycles per                                   Gold, Bennett, & Sekuler, 1999; Legge, Pelli, Rubin, &
           character for the simplest group to 2.00 cycles per                               Schleske, 1985; Parish & Sperling, 1991; Solomon &
           character for the most complex group of Chinese                                   Pelli, 1994), with the optimal spatial frequency de-
           characters. A second goal of the study was to test a                              pending somewhat on the angular size of letters (Majaj,
           space-bandwidth invariance hypothesis that would                                  Pelli, Kurshan, & Palomares, 2002). Kwon and Legge
           represent a tradeoff between the critical spatial                                 (2011) reported that accurate letter identification is
           frequency and the number of adjacent patterns that can                            possible with letters containing spatial frequencies only
           be recognized at one time. We tested this hypothesis by                           upto0.9CPL.Theseauthorsappliedlowpassfiltersto
           comparing the critical spatial frequencies in cycles per                          images of letters and faces and obtained psychometric
           character from the current study and visual-span sizes in                         functions showing recognition performance (percent
           number of characters (measured by Wang, He, & Legge,                              correct) as a function of the cutoff frequency of the
           2014) for sets of characters with different complexities.                         filters. They referred to the minimal spatial-frequency
           For the character size (1.28) we used in the study, we                            requirement for pattern recognition (with 80% accura-
           found an invariant product of approximately 10 cycles,                            cy) as the critical spatial frequency.
           which may represent a capacity limitation on visual                                   Chinese characters differ from alphabetic characters
           pattern recognition.
                                                                                             in having a wider range of pattern complexities.
                                                                                             Studying Chinese character recognition may elucidate
                                                                                             the connection between pattern recognition and pattern
                                                                                             complexity. The goal of our study was to determine the
                                                                                             critical-frequency requirements for Chinese characters,
                                                                                             and to examine the effect of pattern complexity.
           Citation: Wang, H. & Legge, G. E. (2018). Comparing the minimum spatial-frequency content for recognizing Chinese and
           alphabet characters. Journal of Vision, 18(1):1, 1–13, https://doi.org/10.1167/18.1.1.
           https://doi.org/10.1167/18.1.1                       Received April 11, 2017; published January 2, 2018           ISSN 1534-7362 Copyright 2018 The Authors
                                This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
   Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
            Journal of Vision (2018) 18(1):1, 1–13                                   Wang & Legge                                                                            2
            Critical cutoff frequencies can be expressed in both                                complexity increases (Wang et al., 2014). If critical
            retinal spatial frequency (cycles per degree) or image-                             frequencies are found to increase with complexity, it is
            based spatial-frequency (cycles per character; CPC). In                             possible that the product of critical frequency and
            this paper, we will usually refer to spatial frequencies                            visual-span size may be constant, representing a form
            (including cutoff frequencies) in cycles per character.                             of capacity limitation on visual pattern recognition. In
            Anexception will be our consideration of the effects of                             the context of this paper, we refer to the bandwidth of
            the contrast sensitivity function (CSF) in the Discus-                              the low-pass filter as the range from zero to the critical
            sion.                                                                               frequency. For simplicity, we used the term bandwidth
               Previous studies have shown that the acuity limit for                            instead of the critical frequency in our hypothesis.
            recognizing Chinese characters with more strokes                                        The study of character recognition has important
            requires larger size (Cai, Chi, & You, 2001; Chi, Cai, &                            practical implications for reading performance. It is
            You, 2003; Huang & Hsu, 2005). Chinese characters                                   known that a critical frequency is required for
            with more strokes also have higher contrast thresholds                              uncompromised reading speed in alphabet reading
            (Yen & Liu, 1972) and longer response times (Yu &                                   (Kwon&Legge,2012).Therefore,studying the spatial-
            Cao, 1992). However, reports on the spatial frequency                               frequency requirements for Chinese characters may be
            properties of Chinese character recognition are scarce.                             relevant to Chinese reading under low-resolution
            Chen, Yeh, and Lin (2001) adopted the critical-band–                                conditions including low vision. It may also have
            maskingparadigmusedbySolomonandPelli(1994)to                                        practical applications in designing reading material for
            investigate the best central frequencies for Chinese                                difficult viewing conditions.
            characters. They tested Chinese characters with 3 to 21
            strokes, and reported an average spatial frequency of
            approximately 8 CPC. The study however, did not take                                  Methods
            the variation of complexities into account, and did not
            investigate the minimal spatial-frequency requirements
            for Chinese character recognition.                                                  Subjects
               In this study, we explored the critical spatial-
            frequency requirements for alphabet and Chinese                                         Six college students (three men, three women) with
            characters, and examined the effect of complexity on                                normal or corrected-to-normal vision participated in
            these requirements. As the more complex characters                                  the experiments. They were all native Chinese speakers,
            have broader spatial-frequency spectra than the simple                              originally educated in the simplified Chinese script
            characters, they may require higher spatial frequency                               system, and all had more than 10 years education in
            for character recognition. We divided alphabet char-                                English. The subjects signed an Internal Review Board
            acters and Chinese characters into categories, based on                             (IRB) approved consent form before the experiments.
            ranges of complexity values, using the perimetric
            complexity metric (Arnoult & Attneave, 1956; Pelli,
            Burns, Farell, & Moore-Page, 2006). The perimetric                                  Stimulus sets
            complexity of a symbol is defined as its perimeter
            squared divided by its ink area. We showed                                          The stimulus characters were lowercase (LL) and
            previously (Wang et al., 2014) that the perimetric                                  uppercase (UL) alphabet letters in the Arial font, and
            complexity metric has high correlation with other                                   simplified Chinese characters in the Heiti font in which
            complexity metrics, such as the number of strokes, the                              all the strokes have the same width.
            stroke frequency (Majaj et al., 2002; Zhang, Zhang,                                     The 700 most frequently used Chinese characters
            Xue, Liu, & Yu, 2007) and the skeleton method                                       (State Language Work Committee, 1992) were divided
            (Bernard & Chung, 2011). For each complexity                                        into five nonoverlapping groups based on their
            category, we measured recognition performance for                                   perimetric complexity values (Pelli et al., 2006).
            sets of 26 characters as a function of the cutoff                                   Twenty-six characters whose complexity values were
            frequency of low-pass filters.                                                       close to the mean of the group were selected to form
               Asecond goal of this study was to test an empirical                              five sets of symbols (C1–C5). Characters with very high
            hypothesis of a tradeoff between the critical frequency                             or low similarity were excluded from the stimulus sets.
            for character recognition and the visual span for                                   Ameasure of similarity for the characters in each set
            character recognition; we term this the space-band-                                 was computed using a normalized Euclidean distance
            width invariance hypothesis. The visual span is the                                 method (Wang et al., 2014).
            number of characters that can be recognized without                                     To determine whether subjects familiarity with the
            moving the eyes. We have examined the size of the                                   characters affected their performance, we included a
            visual span for alphabet letters and Chinese characters,                            groupofChinesecharacterswithlowerusagefrequency
            and discovered that the visual span size decreases as                               in text but comparable in complexity with characters in
    Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
            Journal of Vision (2018) 18(1):1, 1–13                                   Wang & Legge                                                                            3
                                                                                                       f ¼        1         ð1Þ
                                                                                                                  
                                                                                                            1þ r 2n
                                                                                                                    c
                                                                                                where r is the radius of the components in the
                                                                                                frequency domain, c is the radius of the cutoff
                                                                                                frequency, and n is the order of the filter. Figure 2A
                                                                                                demonstrates the response function of the low-pass
                                                                                                filter in the spatial-frequency domain.
                                                                                                    To test the recognition accuracy as a function of
                                                                                                blurring levels, six cutoff frequencies were selected for
                                                                                                each stimulus set while character size remained
                                                                                                constant. A demonstration of the characters with and
                                                                                                without low-pass filtering is shown in Figure 2. The sets
                                                                                                of filter cutoffs used for the eight complexity groups
                                                                                                were chosen based on recognition performance in pilot
            Figure 1. Representative characters from the eight stimulus sets                    runs. We ensured that the cutoffs were selected so that
            (LL, UL, C1–C5, and C30). The complexity gradually increases in                     recognition accuracy spanned a wide range, and the
            the first seven rows (from LL to C5). The bottom row (C30)                          psychometric function exhibited a clear transition from
            shows a group with comparable complexity to C3, but lower                           low to high performance accuracy. The cutoffs used for
            familiarity.                                                                        each stimulus set are summarized in Table 2.
            the group C3. We did this by identifying the next 700
            most frequent Chinese characters and divided them                                   Image display
            into five complexity groups as well, based on the same
            complexity metric. Twenty-six characters were selected                                  The stimuli were displayed on a 19 in. CRT monitor
            to comprise a comparison group (C30), which had                                     (refresh rate: 75 Hz, resolution: 1280 3 960). The
            comparable complexity with C3 but lower frequency                                   luminance of the blurred images on the screen was
            and presumably lower familiarity. The pattern com-                                  mapped onto 256 gray levels. The background of the
            plexity in the 1,400 most frequently used characters                                image was set to the gray level 127, corresponding to a
                                                                                                                                       2
            covers most of the complexity range across all                                      meanluminance of 40 cd/m . Luminance of the display
            simplified Chinese characters. Remaining characters                                  monitor was made linear using an 8-bit lookup table in
            with even higher complexities are rarely used in                                    conjunction with photometric readings from a Konica
            ordinary reading. Five representative characters from                               Minolta CS-100 Chroma Meter (Konica Minolta
            each stimulus set are shown in Figure 1. Statistics of the                          Sensing Americas, Inc., Ramsey, NJ). The image
            perimetric complexity values for each stimulus set are                              luminance values were mapped onto the values stored
            given in Table 1.                                                                   in the lookup table for the display. The character image
                                                                                                was displayed at the center of the screen. The stimulus
                                                                                                symbol was created and controlled using MATLAB
            Low-pass filtering                                                                  (MathWorks, Natick, MA) and Psychophysics Tool-
                                                                                                box extensions (Brainard, 1997; Pelli, 1997; Kleiner et
                                                                                                al., 2007), running on a Mac Pro computer (Apple,
               Ablack character was generated on a gray back-                                   Cupertino, CA).
            ground and stored as a grayscale image. The size of the
            image was 2503250 pixels, and the size of the
            characters (height of Chinese characters and x-height of                            Procedure
            alphabet letters) subtended 1.28 visual angle at a
            viewing distance of 40 cm. The image was blurred                                        Each subject participated in three test sessions on
            through a third order Butterworth low-pass filter (f)                                three days. One session consisted of eight blocks: seven
            given by the following equation:                                                    blocks with varied complexity levels (LL, UL, C1–C5),
            Group                                LL              UL              C1              C2               C3               C4               C5                C30
            Complexity mean (SD)           48.6 (11.7)      66.5 (17.9)      98.0 (6.3)     136.9 (2.3)      176.6 (4.3)      216.2 (5.0)      280.1 (33.7)      182.0 (5.2)
            Table 1. Perimetric complexity measures for the stimulus sets. Note: LL, lowercase letter; UL, uppercase letter; C1–C5, five sets of
            Chinese characters from the simplest to the most complex; C30, Chinese character group of comparable complexity with C3 but less
            familiarity.
    Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
            Journal of Vision (2018) 18(1):1, 1–13                                   Wang & Legge                                                                            4
            Figure 2. (A) The response function of the third-order Butterworth filter in the spatial frequency domain.The arrow indicates a cutoff
            frequency of 1.5 cycles per character (CPC) for a 18 letter size. The filter’s cutoff is defined as the frequency at half amplitude. (B)
            Demonstration of low-pass filtered Chinese characters from the five complexity categories. The right column shows the unfiltered
            character.
            and one block with complexity equivalent to C3 but                                  center of the screen. In each trial, a character was
            lower character familiarity (C30). In each block, there                             presented for 200 ms at fixation. After that, the display
            were 25 trials for each of six cutoffs forming a total of                                                                                                      2,
                                                                                                became uniform at the background level of 40 cd/m
            150 trials. The stimulus symbol was randomly selected                               and the subject was asked to report the character. The
            from the 26-character set, and the order of the cutoff                              experimenter recorded the responses, and the subject
            frequencies presented was shuffled. The resulting                                    clicked the mouse to start the next trial. A reference
            psychometric functions for a given complexity category                              page was available, showing the 26 symbols in the
            were therefore based on 450 trials (six cutoff frequen-                             current category, if the subject had trouble recalling the
            cies and 75 trials per cutoff frequency). The orders of                             characters in the set. Subjects rarely responded with
            the blocks were counterbalanced between sessions and                                characters outside of the stimulus category (,1% of
            subjects.                                                                           trials.) The 26 unfiltered characters were tested at the
               The subject was shown the 26 unfiltered symbols on                                end of every block in order to evaluate the baseline
            a hard copy page before the start of a block and urged                              performance for recognition. Performance on the
            to restrict responses to the stimulus set. During test                              unfiltered stimuli was at the ceiling value of 100%.
            trials, the subject was directed to fixate on a cross at the                             Achin rest was used during the test to reduce head
                                                                                                movements and to maintain the viewing distance.
                                                                                                Practice trials, including all the stimulus sets and the
            Group           f1         f2          f3          f4         f5          f6        filter cutoffs, were provided at the beginning of the test.
            LL            0.78        1.02        1.27       1.49        1.80       2.16
            UL            0.78        1.02        1.27       1.49        1.80       2.16        Data analysis
            C1            0.78        1.02        1.27       1.49        1.80       2.16
            C2            0.92        1.18        1.42       1.63        1.94       2.34            The character recognition accuracy was plotted
            C3/C30        1.08        1.32        1.57       1.79        2.1        2.52        against the cutoff frequencies for each stimulus set.
            C4            1.24        1.44        1.73       1.94        2.28       2.66        Cumulative Gaussian functions (Wichmann & Hill,
            C5            1.30        1.54        1.87       2.09        2.46       2.82        2001)wereusedtofittheplotswiththeleast-square
            Table 2. Butterworth filter cutoff frequencies (in cycles per                       criterion. The critical spatial frequency was estimated
            character; CPC) used for recognition tests with the seven                           from the psychometric function, and defined as the
            complexity categories. Note: LL, lowercase letter; UL, uppercase                    cutoff frequency yielding 80% correct responses. It is
            letter; C1–C5, five sets of Chinese characters from the simplest                    noted that the guessing level of the psychometric
            to the most complex; C30, Chinese character group of                                functionsis1/26¼3.85% for all the groups, because
            comparable complexity with C3 but less familiarity.                                 there are 26 stimuli in each complexity set. Figure 3
    Downloaded From: https://jov.arvojournals.org/pdfaccess.ashx?url=/data/journals/jov/936669/ on 01/07/2019
The words contained in this file might help you see if this file matches what you are looking for:

...Journal of vision comparing the minimum spatial frequency content for recognizing chinese and alphabet characters department biomedical engineering university minnesota minneapolis mn usa present address athinoula a martinos center imaging radiology massachusetts general hospital harvard medical hui wang school charlestown ma psychology gordon e legge visual blur is common problem that causes difficulty in introduction pattern recognition normally sighted people under degraded viewing conditions g near acuity limit when defocused or fog also with character prerequisite reading impaired reliable identification typically fast accurate process it an object needs to extend up becomes difcult exceed value units cycles per suchasreadingsmallsymbolsatalongdistanceorwith referred as critical this study optical defocus especially patients we investigated severe low properties examined letter have been widely explored effect complexity stimuli were divided previous studies show system utilizes i...

no reviews yet
Please Login to review.