| Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers -> Monitor Keywords |
|
Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layersRelated Patent Categories: Image Analysis, Pattern RecognitionMethod and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers description/claimsThe Patent Description & Claims data below is from USPTO Patent Application 20060204097, Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers. Brief Patent Description - Full Patent Description - Patent Application Claims BACKGROUND OF THE INVENTION [0001] The invention relates to a method and system for implementing successive multi-layered feature recognition in N-dimensional space in which recognition cells are dynamically generated to accommodate the input data and are adapted during the recognition process, wherein the recognition cells are structured into groups which have specific recognition features assigned to them. [0002] Pattern and object recognition by means of successive computation steps generally begins with the loading of an input dataset into a pre-defined set of input variables or cells which constitute the lowest recognition layer. During the recognition process, each cell in higher recognition layers generates a response based on the values or responses of a selected subset of cells in lower layers (receptive field). The number of layers used, the sizes of the receptive fields, and the rule used by each cell to compute its response vary depending on the type of information to be recognized, that is, the complexity and number of the patterns, and the intermediate features that must be recognized to successfully identify the pattern. Sufficiently fine-grained intermediate features, overlapping receptive fields, and strongly converging data paths enable distortion-invariant and position-tolerant recognition. [0003] The structure and dimension of the recognition layers are generally fixed during the recognition process, requiring that each layer contain enough recognition cells to fill the N dimensions of the recognition space in the required resolution. For cases where N>2 the resulting large number of cells makes a computation of the cell responses unfeasible. The process becomes inefficient particularly where the input data is sparsely distributed throughout a large input space. [0004] It is the object of the present invention to provide a method by which the responses of the relevant cells in the higher recognition layers can be efficiently calculated without performing trivial calculations that contribute nothing to the solution. SUMMARY OF THE INVENTION [0005] In a method and a system for the implementation of multi-layered network object recognition in N-dimensional space, the structure of a neural recognition network is dynamically generated and adapted to recognize an object. The layers of the network are capable of recognizing key features of the input data by using evaluation rules to establish a hierarchical structure that can adapt to data position and orientation, varying data densities, geometrical scaling, and faulty or missing data. Based on successive hierarchical feature recognition and synthesis, sufficient relevant recognition cells are generated to enable data processing and information propagation through the recognition network without generating or computing unnecessary irrelevant cells. [0006] Before any data is processed, a network hierarchy is defined and constructed comprising a succession of recognition layers which each contain a collection of nodes or cells. The number of cells in a layer varies during processing, and each layer is initially equipped with zero cells. Each layer recognizes a specific group of key features, and the cells in the layer are used to represent the presence and characteristics of the features. The layers are interlinked by ownership-type relationships in which a cell in a given layer is said to own a group of cells in a subordinate layer, and this subordinate group of cells is said to constitute the receptive field of the superordinate cell. This owner-subordinate relationship is repeated between each pair of successive layers from the lowest input layer to the final uppermost recognition layer. This final layer represents the answer to the overall recognition problem in that its cells recognize the key features that uniquely classify the pattern imposed on the input layer. [0007] All layers have the following properties: [0008] 1. Each layer represents an abstraction level of the object to be recognized, with the complexity of the abstraction being greater in the higher layers. [0009] 2. Each layer is equipped with a set of key features that characterize an aspect or property of the object to be recognized, with said features being computable by examining the properties and responses of a subset of cells in the subordinate layer. [0010] 3. Each layer is equipped with a rule or algorithm for determining whether a subordinate layer cell contributes positively or negatively to the recognition process carried out in said layer, specifically, a means of determining whether the inclusion of a given subordinate cell in the receptive field of a given superordinate cell is advantageous to the recognition function of the superordinate cell. [0011] All cells have the further following properties: [0012] 1. A 1-dimensional response vector (fuzzy polarization vector) which indicates the cell's recognition of the features which are key to the layer containing the cell. [0013] 2. A collection of pointers representing references to a subset of cells in the layer subordinate to the layer containing said cell, collectively called receptive field, and a corresponding collection of weights. [0014] 3. A single pointer representing a reference to a cell in the layer superordinate to the layer containing said cell, called owning cell. [0015] 4. Variables containing computed geometric information such as unit normal vector, centroid, and orientation direction. [0016] During the computation process, cells in the various hierarchical layers may be created and destroyed, linked and unlinked with owners, and assigned to, and removed from, receptive fields in an iterative convergent process that can implement neural network recognition techniques that result in a final collection of cells that adequately represents the object to be recognized in the various hierarchical layers. This final structure is both a hierarchical map of the input data as well as a network capable of recognizing the key features of the input data. Information flows between neighboring layers in both bottom-up and top-down directions during the convergence process: superordinate cells extract features by evaluating the properties of the subordinate cells in their receptive fields, and subordinate cells base their membership decisions, receptive field sizes, and evaluation parameters on information from the superordinate layer cells. Recognition occurs when the top-down signals and bottom-up signals are sufficiently mutually fortifying to establish a persistent stable activation level of all involved cells; at this point the iterative process has converged to a solution. [0017] Overall solution convergence is driven by cells being grouped into receptive fields if they have converging interests, that is, if they represent the same thing, as can be seen from their recognition vectors, which must be converging with the owner. [0018] The computation process is initiated by transferring the input data into the lowest hierarchical layer (the input or zeroth layer). Typical input data may consist of simply a list of coordinates and a physical property that has been measured at that coordinate. In this case, a single input cell may be generated for each input data point, fully representing the input data with zero information loss. [0019] The next superordinate layer is constructed by means of an appropriate rule or algorithm for grouping input layer cells. In principle, the input layer cells are divided into a number of groups, a new layer 1 cell is generated for each group, the receptive field pointers of the layer 1 cell are set to point to the cells in the group, and the cells in the group become owned by the new layer 1 cell. This initial grouping into receptive fields is repeated as each new layer is constructed, and, although the iterative solution process converges despite ill-defined initial groupings, a well-planned initial grouping speeds convergence. [0020] When the first two layers have been constructed, the iterative solution process may begin, and may continue throughout and after the successive construction of the third to last layers. The iterative process may implement whatever neural network recognition techniques are appropriate to the key features of the relevant layer, generally simple analytical formulae in the lower layers, and more complex pattern recognition algorithms in the higher layers, but always driving toward a stable state of mutual inter-layer reinforcement. In addition, during the iterative solution process, receptive field sizes and recognition parameters are adjusted, cells may reselect the receptive field to which they belong based on new information from higher layers, and cells may even find there is no existing receptive field they wish to join. Superordinate cells are created as required to overtake ownership of orphaned cells, and are destroyed when their receptive fields have atrophied. [0021] The flexible dynamic structure has the following advantages: [0022] Cells are generated only where there is data to recognize [0023] Receptive field sizes are dynamic and membership criteria are based on rules, allowing receptive fields to adapt to varying data densities and geometric scaling of key features (the network is scale invariant) [0024] Since, through the process of mutual reinforcement, the response of superordinate cells is activated by the presence of merely sufficient supporting subordinate cells, not all conceivable supporting cells, the recognition process succeeds despite faulty, noisy. or missing data, or even despite errors in the recognition of a minority of cells in lower layers (the network is fault-tolerant) [0025] Since recognition cells are constructed where the data is found, a spatial translation of input data has no effect whatever on the ability of the network to recognize the assigned patterns (the network's recognition is position-invariant) [0026] Since the network is fault-tolerant at each recognition layer, the network is capable of compensating for each way in which the data may differ from the recognition ideal (the network's recognition is distortion-invariant). DESCRIPTION OF A PREFERRED EMBODIMENT [0027] A typical problem well suited for this network algorithm is 3-dimensional object recognition using as input data a set of 3-dimensional coordinate points representing random points on the surface of the object to be recognized. Such a dataset could be generated, for example by a laser scanning device capable of detecting and outputting the 3-dimensional coordinates of its scan points. Scan data generated in an industrial plant setting for the purposes of documentation, computation, and analysis has a usefulness that is directly related to the degree or complexity of recognition, or intelligence. For example, recognizing the mere presence of a surface is accomplished by a laser scan, but is not very useful; to recognize that surface as a part of a cylinder is better, but to recognize it as part of a pipe of a certain size with specific end connections allows the generation of useful CAE models of existing plants. In the current implementation, the input data consists of a 3-dimensional laser scan of a section of an industrial plant. [0028] The current implementation uses 5 layers abstracted as follows (that is, what a cell from each layer represents): [0029] Layer 0: 3-D Point [0030] Layer 1: Nearly flat surface patch [0031] Layer 2: Curved or flat surface fragment [0032] Layer 3: Geometric primitive (cylinder, box, edge, sphere) [0033] Layer 4: Final 3-d object Continue reading about Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers... Full patent description for Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers Brief Patent Description - Full Patent Description - Patent Application Claims Click on the above for other options relating to this Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers patent application. ### 1. Sign up (takes 30 seconds). 2. Fill in the keywords to be monitored. 3. Each week you receive an email with patent applications related to your keywords. Start now! - Receive info on patent apps like Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers or other areas of interest. ### Previous Patent Application: Apparatus, method, and computer program for analyzing document layout Next Patent Application: Image processing apparatus, image processing method, and storage medium Industry Class: Image analysis ### FreshPatents.com Support Thank you for viewing the Method and system for implementing n-dimensional object recognition using dynamic adaptive recognition layers patent info. IP-related news and info Results in 0.20629 seconds Other interesting Feshpatents.com categories: Qualcomm , Schering-Plough , Schlumberger , Seagate , Siemens , Texas Instruments , 174 |
* Protect your Inventions * US Patent Office filing
PATENT INFO |
|