Similar literature
Found 20 similar articles (search time: 31 ms)
1.
We present an off-line cursive word recognition system based completely on neural networks: reading models and models of early visual processing. The first stage (normalization) preprocesses the input image in order to reduce letter position uncertainty; the second stage (feature extraction) is based on the feedforward model of orientation selectivity; the third stage (letter pre-recognition) is based on a convolutional neural network, and the last stage (word recognition) is based on the interactive activation model.

2.
When a static textured background is covered and uncovered by a moving bar of the same mean luminance we can clearly see the motion of the bar. Texture-defined motion provides an example of a naturally occurring second-order motion. Second-order motion sequences defeat standard spatio-temporal energy models of motion perception. It has been proposed that second-order stimuli are analysed by separate systems, operating in parallel with luminance-defined motion processing, which incorporate identifiable pre-processing stages that make second-order patterns visible to standard techniques. However, the proposal of multiple paths to motion analysis remains controversial. Here we describe the behaviour of a model that recovers both luminance-defined and an important class of texture-defined motion. The model also accounts for the induced motion that is seen in some texture-defined motion sequences. We measured the perceived direction and speed of both the contrast envelope and induced motion in the case of a contrast modulation of static noise textures. Significantly, the model predicts the perceived speed of the induced motion seen at second-order texture boundaries. The induced motion investigated here appears distinct from classical induced effects resulting from motion contrast or the movement of a reference frame.

3.
Gilet E, Diard J, Bessière P. PLoS ONE, 2011, 6(6): e20387
In this paper, we study the collaboration of perception and action representations involved in cursive letter recognition and production. We propose a mathematical formulation for the whole perception-action loop, based on probabilistic modeling and Bayesian inference, which we call the Bayesian Action-Perception (BAP) model. Being a model of both perception and action processes, the purpose of this model is to study the interaction of these processes. More precisely, the model includes a feedback loop from motor production, which implements an internal simulation of movement. Motor knowledge can therefore be involved during perception tasks. In this paper, we formally define the BAP model and show how it solves the following six varied cognitive tasks using Bayesian inference: i) letter recognition (purely sensory), ii) writer recognition, iii) letter production (with different effectors), iv) copying of trajectories, v) copying of letters, and vi) letter recognition (with internal simulation of movements). We present computer simulations of each of these cognitive tasks, and discuss experimental predictions and theoretical developments.

4.
Band-spectrum noise has been shown to suppress the visual perception of printed letters. The suppression exhibits a specific dependence on the spatial frequency of the noise, and the frequency domain of most effective inhibition has been related to the size of the letters. In this paper, we address two important questions that were left open by previous studies: (1) Is the observed effect specific to text, and which parameters determine the domain of most effective suppression? (2) What is the origin of the effect in terms of underlying neural processes? We conduct a series of psychophysical experiments that demonstrate that the frequency domain of most effective inhibition depends on the stroke width of the letter rather than on the letter size. These experiments also demonstrate that the effect is not specific to the recognition of letters but also applies to other objects and even to single bars. We attribute the observed effect to nonclassical receptive field (non-CRF) inhibition in visual area V1. This mechanism has previously been suggested as the possible origin of various other perceptual effects. We introduce computational models of two types of cell that incorporate non-CRF inhibition, which are based on Gabor energy filters extended by surround suppression of two kinds: isotropic and anisotropic. The computational models confirm previous qualitative explanations of perceptual effects, such as orientation contrast pop-out, reduced saliency of lines embedded in gratings, and reduced saliency of contours surrounded by textures. We apply the computational models to the images used in the psychophysical experiments. The computational results show a dependence of the inhibition effect on the spatial frequency of the noise that is similar to the suppression effect measured in the psychophysical experiments. The experimental results and their explanation give further support to the idea of a possible functional role of non-CRF inhibition in the separation of contour from texture information and the mediation of object contours to higher cortical areas.

5.
Four experiments examined the ability of respondents to identify letters that were displayed on an LED array with flashes lasting little more than a microsecond. The first experiment displayed each letter with a single, simultaneous flash of all the dots forming the letter and established the relation of flash intensity to the probability of letter identification. The second experiment displayed the letters with multiple flashes at different frequencies to determine the probability that the sequence of flashes would be perceived as fused. The third experiment displayed the letters at a frequency that was above the flicker-fusion frequency, varying flash intensity to establish the amount needed to elicit a given probability of letter identification. The fourth experiment displayed each letter twice, once at a frequency where no flicker was perceived and also with steady light emission. The intensity of each flash was fixed and the steady intensity was varied; respondents were asked to judge whether the fused-flicker display and the steady display appeared to be the same brightness. Steady intensity was about double the average flash intensity where the two conditions were perceived as being equal in brightness. This is at odds with the Talbot-Plateau law, which predicts that these two values should be equal. The law was formulated relative to a flash lasting half of each period, so it is surprising that it comes this close to being correct where the flash occupies only a millionth of the total period.
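
The comparison reported in item 5 can be made concrete with a small sketch. The Talbot-Plateau law predicts that a fused flash train matches a steady light whose intensity equals the flash intensity multiplied by the duty cycle. The function name and all numeric stimulus values below (flash intensity, a 100 Hz flash rate) are hypothetical and are not taken from the paper; only the roughly twofold ratio comes from the abstract.

```python
def talbot_plateau_prediction(flash_intensity, flash_duration_s, period_s):
    """Time-averaged intensity of a periodic flash train.

    Under the Talbot-Plateau law, a fused flash train should match a steady
    light of this intensity.
    """
    if not 0 < flash_duration_s <= period_s:
        raise ValueError("flash duration must be positive and at most one period")
    duty_cycle = flash_duration_s / period_s
    return flash_intensity * duty_cycle


# Hypothetical stimulus values, chosen only for illustration.
flash_intensity = 50_000.0   # arbitrary units
flash_duration_s = 1e-6      # about one microsecond, as in the abstract
period_s = 1 / 100           # assumed 100 Hz flash rate (not from the paper)

predicted_match = talbot_plateau_prediction(flash_intensity, flash_duration_s, period_s)

# The abstract reports a matched steady intensity of about double the average.
reported_ratio = 2.0
observed_match = reported_ratio * predicted_match

print(f"steady match predicted by the law: {predicted_match:.3f}")
print(f"approximate steady match reported: {observed_match:.3f}")
```

With a duty cycle this small, a twofold departure from the prediction is modest relative to the many orders of magnitude separating flash intensity from average intensity, which is the point the abstract's final sentence makes.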

6.
Children often make letter reversal errors when first learning to read and write, even for letters whose reversed forms do not appear in normal print. However, the brain basis of such letter reversal in children learning to read is unknown. The present study compared the neuroanatomical correlates (via functional magnetic resonance imaging) and the electrophysiological correlates (via event-related potentials or ERPs) of this phenomenon in children, aged 5–12, relative to young adults. When viewing reversed letters relative to typically oriented letters, adults exhibited widespread occipital, parietal, and temporal lobe activations, including activation in the functionally localized visual word form area (VWFA) in left occipito-temporal cortex. Adults exhibited significantly greater activation than children in all of these regions; children only exhibited such activation in a limited frontal region. Similarly, on the P1 and N170 ERP components, adults exhibited significantly greater differences between typical and reversed letters than children, who failed to exhibit significant differences between typical and reversed letters. These findings indicate that adults distinguish typical and reversed letters in the early stages of specialized brain processing of print, but that children do not recognize this distinction during the early stages of processing. Specialized brain processes responsible for early stages of letter perception that distinguish between typical and reversed letters may develop slowly and remain immature even in older children who no longer produce letter reversals in their writing.

7.
Summary: Due to the manner in which the English language is used, words exhibit strong internal constraints on letters, but some additional constraint may be imposed by the context in which words appear. In order to estimate the internal constraints of words and the overall effect of context, an experiment was carried out using 225 human subjects who predicted letters in each of the first four positions within words, both with and without context prior to the words. It was found that as more letters at the beginning of words are given, prediction of the following letters increases monotonically, but the increase is not smooth. Prediction of the third letter of words given the first two letters is only a little better than prediction of the second letter given only the first. This effect may be explained by the probable combinations of vowels and consonants at the beginning of words. Letters in the first two positions show no improvement due to long context but prediction of later letters is increased by such context so that prediction rises smoothly from the initial letter to the fourth letter. Also, the type of word in which the letters are to be predicted affects the prediction, function words showing more constraint on letters than content words. The difference between function and content words does not take effect, however, until the first two letters of the word are given. Using the prediction data from words preceded by long context, extrapolations of constraint out to the tenth letter were obtained. From the values of constraint at the first ten letter positions it was possible to estimate the maximum unilateral sequential constraint in English. A value of about 48% was obtained which compares with previous estimates of 50%. A further evaluation of the overall effect of context indicates that about 81% of the constraint in English is contained within the words themselves, and the other 19% is due to any additional context.
This paper is based on a dissertation submitted to the Department of Psychology, The Johns Hopkins University, in partial fulfillment of the requirements for the Ph.D. degree. The research was done under Contract Nonr-248(55) between the Office of Naval Research and The Johns Hopkins University. This is Report No. 13 under that contract. Reproduction in whole or in part is permitted for any purpose of the United States Government. During the period of this investigation the author was a National Institutes of Health Fellow. The author wishes to thank Wendell R. Garner for his encouragement and advice.

8.
A right-handed patient, aged 72, manifested alexia without agraphia, a right homonymous hemianopia and an impaired ability to identify visually presented objects. He was completely unable to read words aloud and severely deficient in naming visually presented letters. He responded to orthographic familiarity in the lexical decision tasks of the Psycholinguistic Assessments of Language Processing in Aphasia (PALPA) rather than to the lexicality of the letter strings. He was impaired at deciding whether two letters of different case (e.g., A, a) are the same, though he could detect real letters from made-up ones or from their mirror image. Consequently, his core deficit in reading was posited at the level of the abstract letter identifiers. When asked to trace a letter with his right index finger, kinesthetic facilitation enabled him to read letters and words aloud. Though he could use intact motor representations of letters in order to facilitate recognition and reading, the slow, sequential and error-prone process of reading letter by letter made him abandon further training.

9.
Adult subjects were asked to recognize a hierarchical visual stimulus (a letter) while their attention was drawn to either the global or local level of the stimulus. Event-related potentials (ERP) and psychophysical indices (reaction time and percentage of correct responses) were measured. An analysis of psychophysical indices showed the global level precedence effect, i.e., the increase in recognition time for a small letter when this letter is part of an incongruent stimulus. An analysis of ERP components showed level-related (global vs. local) differences in the timing and topography of the brain organization of perceptual processing and regulatory mechanisms of attention. Visual recognition at the local level was accompanied by (1) stronger activation of the visual associative areas (Pz and T6) at the stage of sensory features analysis (P1 ERP component), (2) involvement mainly of inferior temporal cortices of the right hemisphere (T6) at the stage of sensory categorization (P2 ERP component), and (3) involvement of prefrontal cortex of the right hemisphere at the stage of the selection of the relevant features of the target (N2 ERP component). Visual recognition at the global level was accompanied by (1) pronounced involvement of mechanisms of early sensory selection (N1 ERP component), (2) prevailing activation of parietal cortex of the right hemisphere (P4) at the stage of sensory categorization (P2 ERP component) as well as at the stage of the target stimulus identification (P3 ERP component). It is suggested that perception at the global level of the hierarchical stimulus is related primarily to the analysis of the spatial features of the stimulus in the dorsal visual system whereas the perception at the local level primarily involves an analysis of the object-related features in the ventral visual system.

10.
OBJECTIVE: To study delays between sending referral letters and the outpatient appointment and to assess the content of referral and reply letters, their educational value, and the extent to which questions asked are answered by reply letters. DESIGN: Retrospective review of referrals to 16 consultant orthopaedic surgeons at five hospitals, comprising 288 referral letters with corresponding replies, by scoring contents of letters. SETTING: Orthopaedic teaching hospitals in Nottingham, Derby, and Mansfield. MAIN OUTCOME MEASURES: Weighted scores of contents of referral and reply letters, assessment of their educational value, and responses to questions in referral letters. RESULTS: Median outpatient delay was 23.4 weeks. There was no significant decrease in waiting time if the referral letter was marked "urgent" but a significantly greater delay (p < 0.01) if referrals were directed to an unnamed consultant. The content score was generally unsatisfactory for both referrals and replies, and there was no correlation between the content scores of the referral letter and its reply (r = 0.13). Items of education were rare in the referral letters (8/288; 3%) and significantly more common in replies (75/288; 26%) (p ≪ 0.001). Senior registrars were significantly more likely to attempt education than other writers (p < 0.02). Education in replies was significantly related to increased length of the letter (p < 0.05) and was more likely to occur if the referral was addressed to a named consultant (p < 0.03). Forty-eight (17%) referral letters asked questions, of which 21 (44%) received a reply. No factor was found to influence the asking of or replying to questions. CONCLUSIONS: The potential for useful communication in the referral letter and in the reply from orthopaedic surgeons is being missed at a number of levels. The content is often poor, the level of mutual education is low, and the use of the referral letter to determine urgency is deficient. Most questions asked by general practitioners are not answered.

11.
Calculations have been made of the change with time in the tritiated thymidine labelling index of erythroid cells in normal and anaemic rats, on the basis of two different models of the erythroid system. In the first model it is assumed that cells pass from one stage of maturation to the next at all phases of the cell cycle, whereas in the second model the cells can only progress to the next stage when they reach a certain point in the cell cycle. The changes in labelling index predicted on the basis of these two models are markedly different, especially in the non-dividing stages of the system, and the change in labelling index as a function of time therefore provides an experimental method for distinguishing between the two models. The experimental data favour the model in which cells cross compartmental boundaries at all stages of maturation. Some important consequences of this model are discussed.

12.
The centerpiece of this document is an unanswered letter of appeal from the author to Professor Roderick MacKinnon of the Rockefeller University dated November 17, 2003. The aim of the appeal is summarized in the title of this communication. In addition to the 2003 letter, there are also two follow-up letters in this communication, each containing a copy of the 2003 letter and each repeating the appeal. The follow-up letters, dated February 22, 2008 and April 2, 2008 respectively, were also unanswered. To make sure that these letters reached their destination, each was certified with delivery time and date affirmed. Thus the February 22 letter was delivered on February 24 by the US Postal Service. Two copies of the April 2 follow-up letter were sent. The first copy was delivered by Federal Express on April 4. The second copy of the April 2 letter was delivered by the US Postal Service on the same day. Thus, all told, three additional copies of the 2003 letter were delivered to, and must be in the hands of, Professor MacKinnon. All these efforts were made to make certain that Professor MacKinnon's refusal to answer my registered 2003 letter was not due to his not having received a copy of that letter.

13.
Reading speed is dramatically reduced when readers cannot use their central vision. This is because low visual acuity and crowding negatively impact letter recognition in the periphery. In this study, we designed a new font (referred to as the Eido font) in order to reduce inter-letter similarity and consequently to increase peripheral letter recognition performance. We tested this font by running five experiments that compared the Eido font with the standard Courier font. Letter spacing and x-height were identical for the two monospaced fonts. Six normally-sighted subjects used only their peripheral vision to perform two reading-aloud tasks (with eye movements), a letter recognition task (without eye movements), a word recognition task (without eye movements) and a lexical decision task. Results show that reading speed was not significantly different between the Eido and the Courier font when subjects had to read single sentences with a round simulated gaze-contingent central scotoma (10° diameter). In contrast, Eido significantly decreased perceptual errors in peripheral crowded letter recognition (-30% errors on average for letters briefly presented at 6° eccentricity) and in peripheral word recognition (-32% errors on average for words briefly presented at 6° eccentricity).

14.
We investigated the relationships between conception rates (CRs) at first service in Japanese Holstein heifers (i.e. animals that had not yet had their first calf) and cows and their test-day (TD) milk yields. Data included records of artificial insemination (AI) for heifers and cows that had calved for the first time between 2000 and 2008 and their TD milk yields at 6 through 305 days in milk (DIM) from first through third lactations. CR was defined as a binary trait for which first AI was a failure or success. A threshold-linear animal model was applied to estimate genetic correlations between CRs of heifers or cows and TD milk yield at various lactation stages. Two-trait genetic analyses were performed for every combination of CR and TD milk yield by using the Bayesian method with Gibbs sampling. The posterior means of the heritabilities of CR were 0.031 for heifers, 0.034 for first-lactation cows and 0.028 for second-lactation cows. Heritabilities for TD milk yield increased from 0.324 to 0.433 with increasing DIM but decreased slightly after 210 DIM during first lactation. These heritabilities from the second and third lactations were higher during late stages of lactation than during early stages. Posterior means of the genetic correlations between heifer CR and all TD yields were positive (range, 0.082 to 0.287), but those between CR of cows and milk yields during first or second lactation were negative (range, −0.121 to −0.250). Therefore, during every stage of lactation, selection in the direction of increasing milk yield may reduce CR in cows. The genetic relationships between CR and lactation curve shape were quite weak, because the genetic correlations between CR and TD milk yield were constant during the lactation period.

15.
We present a protein fold recognition method, MANIFOLD, which uses the similarity between target and template proteins in predicted secondary structure, sequence and enzyme code to predict the fold of the target protein. We developed a non-linear ranking scheme in order to combine the scores of the three different similarity measures used. For a difficult test set of proteins with very little sequence similarity, the program predicts the fold class correctly in 34% of cases. This is a more than twofold increase in accuracy compared with sequence-based methods such as PSI-BLAST or GenTHREADER, which score 13-14% correct first hits for the same test set. The functional similarity term increases the prediction accuracy by up to 3% compared with using the combination of secondary structure similarity and PSI-BLAST alone. We argue that using functional and secondary structure information can increase the fold recognition beyond sequence similarity.

16.
Our visual system segments images into objects and background. Figure-ground segregation relies on the detection of feature discontinuities that signal boundaries between the figures and the background and on a complementary region-filling process that groups together image regions with similar features. The neuronal mechanisms for these processes are not well understood and it is unknown how they depend on visual attention. We measured neuronal activity in V1 and V4 in a task where monkeys either made an eye movement to texture-defined figures or ignored them. V1 activity predicted the timing and the direction of the saccade if the figures were task relevant. We found that boundary detection is an early process that depends little on attention, whereas region filling occurs later and is facilitated by visual attention, which acts in an object-based manner. Our findings are explained by a model with local, bottom-up computations for boundary detection and feedback processing for region filling.

17.
Adult subjects were asked to recognize a hierarchical visual stimulus (a letter) while their attention was drawn to either the global or local level of the stimulus. Event-related potentials (ERP) and behavioral indices (reaction time and percentage of correct responses) were measured. An analysis of behavioral indices showed the global level precedence effect, i.e. the increase in recognition time for a small letter when this letter is part of an incongruent stimulus. An analysis of ERP components showed level-related (global vs. local) differences in the timing and topography of the brain organization of perceptual processing and regulatory mechanisms of attention. Visual recognition at the local level was accompanied by (1) stronger activation of the visual associative areas (Pz and T6) at the stage of sensory features analysis (P1 ERP component), (2) involvement mainly of inferior temporal cortices of the right hemisphere (T6) at the stage of sensory categorization (P2 ERP component), and (3) involvement of prefrontal cortex of the right hemisphere at the stage of selection of the relevant features of the target (N2 ERP component). Visual recognition at the global level was accompanied by (1) pronounced involvement of mechanisms of early sensory selection (N1 ERP component), (2) prevailing activation of parietal cortex of the right hemisphere (P4) at the stage of sensory categorization (P2 ERP component) as well as at the stage of the target stimulus identification (P3 ERP component). We suggest that perception of the hierarchical stimulus at the global level is related primarily to the analysis of its spatial features in the dorsal visual system whereas the perception at the local level primarily involves an analysis of the object-related features in the ventral visual system.

18.

Background

The question of how the brain encodes letter position in written words has attracted increasing attention in recent years. A number of models have recently been proposed to accommodate the fact that transposed-letter stimuli like jugde or caniso are perceptually very close to their base words.

Methodology

Here we examined how letter position coding is attained in the tactile modality via Braille reading. The idea is that Braille word recognition may provide more serial processing than the visual modality, and this may produce differences in the input coding schemes employed to encode letters in written words. To that end, we conducted a lexical decision experiment with adult Braille readers in which the pseudowords were created by transposing/replacing two letters.

Principal Findings

We found a word-frequency effect for words. In addition, unlike parallel experiments in the visual modality, we failed to find any clear signs of transposed-letter confusability effects. This dissociation highlights the differences between modalities.

Conclusions

The present data argue against models of letter position coding that assume that transposed-letter effects (in the visual modality) occur at a relatively late, abstract locus.
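
The stimulus manipulation described in item 18 (pseudowords created by transposing or replacing two letters) can be illustrated with a short sketch. The function names are hypothetical, the base words "judge" and "casino" are inferred from the transposed examples in the abstract, and the replacement letters are arbitrary; none of this is the authors' actual stimulus-generation procedure.

```python
def transpose_letters(word, i, j):
    """Return the word with the letters at 0-based positions i and j swapped."""
    if not (0 <= i < len(word) and 0 <= j < len(word)):
        raise IndexError("positions must lie inside the word")
    letters = list(word)
    letters[i], letters[j] = letters[j], letters[i]
    return "".join(letters)


def replace_letters(word, replacements):
    """Return the word with the letters at the given positions replaced.

    `replacements` maps 0-based positions to new letters; this yields the
    replacement-letter control for a transposed-letter pseudoword.
    """
    letters = list(word)
    for position, new_letter in replacements.items():
        if not 0 <= position < len(word):
            raise IndexError("positions must lie inside the word")
        letters[position] = new_letter
    return "".join(letters)


# Adjacent transposition: assumed base word "judge".
adjacent = transpose_letters("judge", 2, 3)

# Non-adjacent transposition: assumed base word "casino".
non_adjacent = transpose_letters("casino", 2, 4)

# Replacement control at the same two positions (letters chosen arbitrarily).
control = replace_letters("judge", {2: "p", 3: "b"})

print(adjacent, non_adjacent, control)
```

A transposed-letter confusability effect would show up as more "word" responses to the transposed items than to the replacement controls; the abstract reports no clear sign of this in Braille.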

19.
An influential theory of mammalian vision, known as the efficient coding hypothesis, holds that early stages in the visual cortex attempt to form an efficient coding of ecologically valid stimuli. Although numerous authors have successfully modelled some aspects of early vision mathematically, closer inspection has found substantial discrepancies between the predictions of some of these models and observations of neurons in the visual cortex. In particular, analysis of linear-non-linear models of simple cells using Independent Component Analysis has found a strong bias towards features on the horopter. In order to investigate the link between the information content of binocular images, mathematical models of complex cells and physiological recordings, we applied Independent Subspace Analysis to binocular image patches in order to learn a set of complex-cell-like models. We found that these complex-cell-like models exhibited a wide range of binocular disparity-discriminability, although only a minority exhibited high binocular discrimination scores. However, in common with the linear-non-linear model case, we found that feature detection was limited to the horopter, suggesting that current mathematical models are limited in their ability to explain the functionality of the visual cortex.

20.
A model of local image encoding is described which explicitly incorporates quantitative data about the number density, bandwidth and receptive field organisation of neurons involved in motion detection. The model solves the problem of extracting local velocity on the basis of inputs tuned to spatiotemporal frequency and sensitive to contrast. The spatiotemporally tuned, opponent motion filters are followed by a compressive non-linearity and comprise a first stage. The inter-stage signals are interpreted as those from single neurons and the second stage is modelled as a neural-network layer. The second stage uses semilinear units and models the effect of lateral, on-centre off-surround, intralayer connections. Characterisation of the first stage leads to a clarification of the concept of the psychophysical channel and its relation to physiological data. The quantitative parametrisation of the model allows the simulation of several psychophysical phenomena which are reported in a companion paper.
